How to Interview a Data Scientist
Daniel Tunkelang
Director of Data Science, LinkedIn
Recruiting Solutions
Drew Conway’s Venn Diagram




GOAL




Specification for a Data Scientist

§  Implements algorithms.
§  Analyzes data.
§  Thinks product.

What about culture, communication, and curiosity?

Hold that thought…

What can you learn from an interview?




Interviewing is a last resort.

Alternatives?

Only hire people you’ve worked with.




Hire interns. Convert to full-time. Profit!




Try before you buy: short-term contracts.




Alternatives are at best a partial solution.

§  Only hiring people you’ve worked with doesn’t scale.
   –  And traps you in a locally optimal monoculture.


§  Interns are great! But they are a significant investment.
   –  Managing interns well is a productivity gamble.
   –  Most interns have at least a year of school left.
   –  Not all interns will make your bar. You won’t always make theirs.


§  Try before you buy: nice in theory.
   –  Adverse selection bias when other offers are permanent roles.
   –  Creates bureaucracy.


Can we at least make interviews natural?




Spend a day working together.




Take-home assignment.




Review candidate’s previous work.




High-fructose corn syrup is 100% natural.

§  Working sessions are difficult to set up.
   –  No more natural than a final exam.
   –  High variance, and very difficult to calibrate performance.


§  Take-home assignments are great for the employer.
   –  But they are a significant investment for the candidate.
   –  Adverse selection bias if other companies don’t require them.
   –  Creates an incentive to cheat if they are a significant part of the hiring process.


§  Previous work is like natural experiments.
   –  Always good to review a candidate’s previous work.
   –  But not always possible to find work with high predictive value.



So you gotta do interviews. But how?




Three Principles

1.  Keep it real.

2.  No gotchas.

3.  Maybe = no.




Keeping It Real




Test basic coding with FizzBuzz questions.

        multiple of 3 -> Fizz
        multiple of 5 -> Buzz
        multiple of 15 -> FizzBuzz

   1, 2, Fizz, 4, Buzz, Fizz,
   7, 8, Fizz, Buzz, 11, Fizz,
   13, 14, FizzBuzz, 16, …
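
To give a sense of the bar this kind of screen sets, the rules above fit in a few lines of Python. This is only a sketch: the function names are hypothetical and not from the talk, and a candidate's version in any language would serve the same purpose.

```python
from typing import List


def fizzbuzz(n: int) -> str:
    """Return the FizzBuzz token for one positive integer."""
    # Check 15 first: a multiple of 15 is also a multiple of 3 and of 5.
    if n % 15 == 0:
        return "FizzBuzz"
    if n % 3 == 0:
        return "Fizz"
    if n % 5 == 0:
        return "Buzz"
    return str(n)


def fizzbuzz_sequence(limit: int) -> List[str]:
    """Return the tokens for 1..limit inclusive."""
    return [fizzbuzz(i) for i in range(1, limit + 1)]


print(", ".join(fizzbuzz_sequence(16)))
```

Run as written, this prints the same sixteen items shown on the slide. The usual stumbling block is the order of the checks: testing for 3 or 5 before 15 never produces "FizzBuzz".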

Whiteboards suck for coding.




      http://ericleads.com/2012/10/how-to-conduct-a-better-coding-interview/


Don’t ask pointless algorithm questions.




             implement




Use real-world algorithms questions.



        bigdatascientist


         Did you mean:
         big data scientist
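
One way a candidate might approach the "Did you mean" example is to split the query into known words with dynamic programming. The sketch below is an assumption about what a reasonable answer could look like, not the approach used in any real product. The vocabulary, its counts, and the function names are all made up for illustration.

```python
import math
from functools import lru_cache
from typing import List, Optional

# Hypothetical word counts, made up for illustration only.
WORD_COUNTS = {
    "big": 500, "data": 800, "scientist": 120,
    "a": 1000, "dat": 1, "science": 200,
}
TOTAL = sum(WORD_COUNTS.values())


def segment(text: str) -> Optional[List[str]]:
    """Return the most probable split of text into known words, or None."""

    @lru_cache(maxsize=None)
    def best(start: int):
        # Best (log-probability, words) for text[start:], or None if no split exists.
        if start == len(text):
            return (0.0, ())
        candidates = []
        for end in range(start + 1, len(text) + 1):
            word = text[start:end]
            rest = best(end) if word in WORD_COUNTS else None
            if rest is not None:
                score = math.log(WORD_COUNTS[word] / TOTAL) + rest[0]
                candidates.append((score, (word,) + rest[1]))
        return max(candidates) if candidates else None

    result = best(0)
    return list(result[1]) if result is not None else None


def did_you_mean(query: str) -> Optional[str]:
    """Suggest a spaced version of query, or None if there is nothing to suggest."""
    words = segment(query)
    if words is None or len(words) < 2:
        return None
    return " ".join(words)


print(did_you_mean("bigdatascientist"))  # big data scientist
```

Scoring by word frequency is what lets "data" beat the equally valid split "dat" + "a". A question like this leaves plenty of room for partial credit: a candidate can start with greedy matching, notice where it fails, and then move to scoring alternatives.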

Ask candidates to design your products.




Keeping it real is also a great sell.

§  Similar Profiles
§  People You May Know

But no gotchas.




Gotchas reduce the signal-to-noise ratio.

§  Avoid problems where success hinges on a single insight.
   –  Good interview problems offer lots of room for partial credit.
   –  Making a key insight often reflects experience, not intelligence.


§  Don’t test a candidate’s knowledge of a niche technique.
   –  Unless that niche technique is critical to job performance.
   –  And can’t be learned on the job as part of on-boarding.


§  Be a hard interviewer, but don’t be an asshole.
   –  An interview is not a stress-test to see where candidates break.
   –  Interviews communicate your values to the candidate.


Maybe = no.




Commit to binary interview outcomes.

§  Forced choice, so interviewers don’t take the easy way out.
   –  Just like having 4 choices instead of 5 on a rating scale.
   –  Encourages interviewers to take their role seriously.


§  Each team member is a critical filter.
   –  Two no’s or one strong no is a no.
   –  All weak yes’s is a no.


§  Short-circuit candidates early in the process.
   –  Resume and phone screening should be aggressive.
   –  Onsite interviews should have ~50% chance of leading to offers.
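
The panel rule above can be sketched as a small function. Several details here are assumptions rather than anything stated in the talk: the four labels on the rating scale, the idea that a single plain "no" does not block an offer by itself, and the reading of "all weak yes's is a no" as "at least one strong yes is required". The names are hypothetical.

```python
from enum import Enum
from typing import List


class Vote(Enum):
    # Assumed four-point scale with no "maybe" in the middle.
    STRONG_NO = "strong no"
    NO = "no"
    WEAK_YES = "weak yes"
    STRONG_YES = "strong yes"


def should_make_offer(votes: List[Vote]) -> bool:
    """Apply the panel rule to one candidate's interview votes."""
    if not votes:
        return False  # no signal at all is treated as a no
    if Vote.STRONG_NO in votes:
        return False  # one strong no is a no
    noes = sum(1 for v in votes if v is Vote.NO)
    if noes >= 2:
        return False  # two no's is a no
    # Assumption: without any strong yes, the panel is lukewarm, so no offer.
    return Vote.STRONG_YES in votes


print(should_make_offer([Vote.WEAK_YES] * 4))                         # False
print(should_make_offer([Vote.STRONG_YES, Vote.WEAK_YES, Vote.NO]))   # True
```

Writing the rule down this way makes the "maybe = no" principle concrete: every path that lacks clear enthusiasm ends in no offer.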


But what about culture, communication, and curiosity?

All are must-haves.

Every interview evaluates all three.

Remember Your Goal




Three Principles

1.  Keep it real.
  –  Avoid whiteboard coding. Filter with FizzBuzz.
  –  Use real-world algorithms questions.
  –  Ask candidates to design your products.
2.  No gotchas.
  –  Gotchas reduce the signal-to-noise ratio.
3.  Maybe = no.
  –  Bad hires suck. Be conservative.
  –  Trust your team.

Thank you!
