SlideShare ist ein Scribd-Unternehmen logo
1 von 33
Patterns for
Successful Data Science
Projects
Bill Chambers
@bllchmbrs 2018-04-24
2
Introductions
3
About Me
4
About you
• Data Scientists?
• Data engineers?
• Data team leads?
Scoping Initiatives Looking to Grow
The Context of Your Org/Team
•Your company has a dozen or so
models in production, but now you
want to scale to hundreds/
thousands in the next year.
•Your company is scoping ML
initiatives right now, with little (if
any) ML in production.
Organizational Patterns Technical Patterns
6 Patterns for DS Projects
•Hierarchy of Needs
•Simple
•Track
•Value
•Alignment
•Discipline
Deep dive into each pattern and
apply it to data science projects
Organizational Patterns
in Data Science Projects
8
Value
n. the regard that something is held to deserve;
the importance, worth,
or usefulness of something
9
“Data science isn’t woven into our culture;
it is our culture. We started with it at the heart of
the business, rather than adding it to a traditional
organizational structure, and built the company’s
algorithms around our clients and their needs.”
Katrina Lake
CEO Stitch Fix
- HBR
10
“Having senior level support is very valuable. Our
CEO in particular is a great supporter of machine
learning and sees it as a fundamental part of our
future.”
- Matt Fryer, Chief Data Science Officer,
11
Alignment
n. arrangement in a straight line, or in correct or
appropriate relative positions
n. a position of agreement or alliance
Theory (job description)
•PhD in Computer Science, Computer Engineering, Mathematics...
•5 years of real world or research experience in data science
•Experience with Big Data technologies such as Hadoop, Cassandra etc.
•Experience in model development and life-cycle-management
•Programming skills in various languages (C++, Scala, Java, R) with
proficiency in Python and/or C++
•Understanding of Machine Learning, e.g.: linear/logistics regression
discriminant analysis, bagging, random forest, SVM, neural nets
•Knowledge and skills in the use of current state of the art machine
learning frameworks such as Scikit-Learn, H2O, Keras, TensorFlow and
Spark, etc.
Practice (on the job)
•How many daily active users do we
have?
•What’s our monthly churn?
•How many people are using
_______ feature?
•Can you build a data pipeline?
•We’re considering A/B testing, can
you write up a report on it?
Alignment in the context of DS
projects•The project is formally prioritized
–Funded and staffed appropriately
•You have the infrastructural resources to achieve the mission
•You have runway and cover from your leadership to get where you need
to go
The organization has alignment on
the value data science provides.
15
Discipline
n. a rule or system of rules governing conduct or
activity
Discipline in data science…
•Figure out what you’re going to do and execute at a high standard.
•focus on the results, not just on the tasks.
•Define phases and demonstrate results along the way.
•Don’t just stir the data to get the answer you want.
Make data science a discipline.
Technical Patterns
in Data Science Projects
19
• Defines a theory for
human motivation
(Abraham Maslow, 1943)
• Each base in the pyramid
must be supported
before one can move
onto the next
Maslow’s Hierarchy of Needs
20
Data-Driven Company Hierarchy of
Needs • stable and repeatable
• trackable
• parts of the workflow are automated
production
data science
• simple use cases
• done as one-offs
• little repeatability
ad hoc data science
• stable and repeatable
• high level of abstraction
data pipelines
• ad hoc
• no centralization of data
• little repeatability
data access
• low cognitive load for infra
• simple for end users & devs
• line of sight to automation
data infrastructure
21
Data Science Hierarchy of Needs
Source - Monica Rogati
22
How to approach this pyramid?
• stable and repeatable
• trackable
• parts of the workflow are automated
production
data science
• simple use cases
• done as one-offs
• little repeatability
ad hoc data science
• stable and repeatable
• high level of abstraction
data pipelines
• ad hoc
• no centralization of data
• little repeatability
data access
• low cognitive load for infra
• simple for end users & devs
• line of sight to automation
reliable data infrastructure
1
2
3
4
5
23
ML-System Anti-Patterns
“Hidden Technical Debt in Machine Learning Systems”, Google NIPS
2015
• Glue Code
• Lots of glue code to tie OSS/generic components together.
• Pipeline Jungles
• When pipelines evolve organically, they can become hard to
maintain.
• Abstraction Debt
• A general problem in ML, lots of different abstractions.
24
Simple
n. plain, basic, or uncomplicated in form, nature, or
design; without much decoration or ornamentation
25
KISS Principle
Keep it simple stupid
Kelly Johnson
1910-1990
F-117 Nighthawk
U2 Spy Plane
SR-71 Blackbird
26
Keeping it Simple in ML
Rules of Machine Learning: Best
Practices for ML Engineering
- Martin Zinkevich, Research Scientist @
Google
Rule #1: (Before Machine Learning)
Don’t be afraid to launch a product without
machine learning
Rule #4: (Your First Pipeline)
Keep the first model SIMPLE and get the
infrastructure right
27
Source – Uber Engineering
Example:
COTA: Improving Uber Customer Care with
NLP & Machine Learning
28
Track
n.
the act or process of following something or someone
n. Precise and continuous position-
finding of targets by radar, optical, or other means.
29
Hardest part of ML Systems isn’t ML
“Hidden Technical Debt in Machine Learning Systems”, Google NIPS
2015
30
MLflow Components
30
Trackin
g
Record and query
experiments:
code,
data, config,
results
Project
s
Packaging format
for reproducible
runs
on any platform
Model
s
General model
format
that supports diverse
deployment tools
Notebooks
Local Apps
Cloud Jobs
Tracking Server
UI
API
MLflow Tracking
Python or
REST API
Organizational Patterns Technical Patterns
6 Patterns for DS Projects
•Hierarchy of Needs
•Simplify
•Track
•Value
•Alignment
•Discipline
Thank you

Weitere ähnliche Inhalte

Was ist angesagt?

NDC Oslo : A Practical Introduction to Data Science
NDC Oslo : A Practical Introduction to Data ScienceNDC Oslo : A Practical Introduction to Data Science
NDC Oslo : A Practical Introduction to Data ScienceMark West
 
KDD 2019 IADSS Workshop - How Data Scientists can bridge the gap between Data...
KDD 2019 IADSS Workshop - How Data Scientists can bridge the gap between Data...KDD 2019 IADSS Workshop - How Data Scientists can bridge the gap between Data...
KDD 2019 IADSS Workshop - How Data Scientists can bridge the gap between Data...IADSS
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Domino Data Lab
 
Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino Data Lab
 
KDD 2019 IADSS Workshop - Skills to Master Machine Learning and Data Science ...
KDD 2019 IADSS Workshop - Skills to Master Machine Learning and Data Science ...KDD 2019 IADSS Workshop - Skills to Master Machine Learning and Data Science ...
KDD 2019 IADSS Workshop - Skills to Master Machine Learning and Data Science ...IADSS
 
Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...
Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...
Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...Ali Alkan
 
What Managers Need to Know about Data Science
What Managers Need to Know about Data ScienceWhat Managers Need to Know about Data Science
What Managers Need to Know about Data ScienceAnnie Flippo
 
Why Data Science Projects Fail?
Why Data Science Projects Fail?Why Data Science Projects Fail?
Why Data Science Projects Fail?Ethan Ram
 
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkNYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkVivian S. Zhang
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects FailSense Corp
 
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...IADSS
 
Back to Square One: Building a Data Science Team from Scratch
Back to Square One: Building a Data Science Team from ScratchBack to Square One: Building a Data Science Team from Scratch
Back to Square One: Building a Data Science Team from ScratchKlaas Bosteels
 
Data science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi PeriasamyData science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi PeriasamyPeter Kua
 
Knowi Overview: NoSQL Analytics and Business Intelligence
Knowi Overview:  NoSQL Analytics and Business IntelligenceKnowi Overview:  NoSQL Analytics and Business Intelligence
Knowi Overview: NoSQL Analytics and Business IntelligenceKnowi
 
Data Scientist Toolbox
Data Scientist ToolboxData Scientist Toolbox
Data Scientist ToolboxAndrei Savu
 
Bio-IT 2014: 'Capturing Value from Collaborative Research with the IDBS Trans...
Bio-IT 2014: 'Capturing Value from Collaborative Research with the IDBS Trans...Bio-IT 2014: 'Capturing Value from Collaborative Research with the IDBS Trans...
Bio-IT 2014: 'Capturing Value from Collaborative Research with the IDBS Trans...IDBS
 
1140 track 1 weiss_using his mac
1140 track 1 weiss_using his mac1140 track 1 weiss_using his mac
1140 track 1 weiss_using his macRising Media, Inc.
 
Data Scientist vs Data Analyst vs Data Engineer - Role & Responsibility, Skil...
Data Scientist vs Data Analyst vs Data Engineer - Role & Responsibility, Skil...Data Scientist vs Data Analyst vs Data Engineer - Role & Responsibility, Skil...
Data Scientist vs Data Analyst vs Data Engineer - Role & Responsibility, Skil...Simplilearn
 

Was ist angesagt? (20)

NDC Oslo : A Practical Introduction to Data Science
NDC Oslo : A Practical Introduction to Data ScienceNDC Oslo : A Practical Introduction to Data Science
NDC Oslo : A Practical Introduction to Data Science
 
KDD 2019 IADSS Workshop - How Data Scientists can bridge the gap between Data...
KDD 2019 IADSS Workshop - How Data Scientists can bridge the gap between Data...KDD 2019 IADSS Workshop - How Data Scientists can bridge the gap between Data...
KDD 2019 IADSS Workshop - How Data Scientists can bridge the gap between Data...
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field
 
Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...
 
KDD 2019 IADSS Workshop - Skills to Master Machine Learning and Data Science ...
KDD 2019 IADSS Workshop - Skills to Master Machine Learning and Data Science ...KDD 2019 IADSS Workshop - Skills to Master Machine Learning and Data Science ...
KDD 2019 IADSS Workshop - Skills to Master Machine Learning and Data Science ...
 
Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...
Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...
Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...
 
What Managers Need to Know about Data Science
What Managers Need to Know about Data ScienceWhat Managers Need to Know about Data Science
What Managers Need to Know about Data Science
 
Why Data Science Projects Fail?
Why Data Science Projects Fail?Why Data Science Projects Fail?
Why Data Science Projects Fail?
 
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkNYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects Fail
 
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...
 
Back to Square One: Building a Data Science Team from Scratch
Back to Square One: Building a Data Science Team from ScratchBack to Square One: Building a Data Science Team from Scratch
Back to Square One: Building a Data Science Team from Scratch
 
So you want to be a Data Scientist?
So you want to be a Data Scientist?So you want to be a Data Scientist?
So you want to be a Data Scientist?
 
Data science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi PeriasamyData science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi Periasamy
 
"Selling" Open Source 101
"Selling" Open Source 101"Selling" Open Source 101
"Selling" Open Source 101
 
Knowi Overview: NoSQL Analytics and Business Intelligence
Knowi Overview:  NoSQL Analytics and Business IntelligenceKnowi Overview:  NoSQL Analytics and Business Intelligence
Knowi Overview: NoSQL Analytics and Business Intelligence
 
Data Scientist Toolbox
Data Scientist ToolboxData Scientist Toolbox
Data Scientist Toolbox
 
Bio-IT 2014: 'Capturing Value from Collaborative Research with the IDBS Trans...
Bio-IT 2014: 'Capturing Value from Collaborative Research with the IDBS Trans...Bio-IT 2014: 'Capturing Value from Collaborative Research with the IDBS Trans...
Bio-IT 2014: 'Capturing Value from Collaborative Research with the IDBS Trans...
 
1140 track 1 weiss_using his mac
1140 track 1 weiss_using his mac1140 track 1 weiss_using his mac
1140 track 1 weiss_using his mac
 
Data Scientist vs Data Analyst vs Data Engineer - Role & Responsibility, Skil...
Data Scientist vs Data Analyst vs Data Engineer - Role & Responsibility, Skil...Data Scientist vs Data Analyst vs Data Engineer - Role & Responsibility, Skil...
Data Scientist vs Data Analyst vs Data Engineer - Role & Responsibility, Skil...
 

Ähnlich wie Patterns for Successful Data Science Projects (Spark AI Summit)

Data Science Training and Placement
Data Science Training and PlacementData Science Training and Placement
Data Science Training and PlacementAkhilGGM
 
Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)SayyedYusufali
 
Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)SayyedYusufali
 
Data science training in hydpdf converted (1)
Data science training in hydpdf  converted (1)Data science training in hydpdf  converted (1)
Data science training in hydpdf converted (1)SayyedYusufali
 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?DIGITALSAI1
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification courseKumarNaik21
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)SayyedYusufali
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabadVamsiNihal
 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabadsaitejavella
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training HyderabadNithinsunil1
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabadVamsiNihal
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)SayyedYusufali
 
data science training and placement
data science training and placementdata science training and placement
data science training and placementSaiprasadVella
 
online data science training
online data science trainingonline data science training
online data science trainingDIGITALSAI1
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabadVamsiNihal
 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabadVamsiNihal
 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in HyderabadKumarNaik21
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training HyderabadNithinsunil1
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification courseKumarNaik21
 
Keynote: Graphs in Government_Lance Walter, CMO
Keynote:  Graphs in Government_Lance Walter, CMOKeynote:  Graphs in Government_Lance Walter, CMO
Keynote: Graphs in Government_Lance Walter, CMONeo4j
 

Ähnlich wie Patterns for Successful Data Science Projects (Spark AI Summit) (20)

Data Science Training and Placement
Data Science Training and PlacementData Science Training and Placement
Data Science Training and Placement
 
Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)
 
Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)
 
Data science training in hydpdf converted (1)
Data science training in hydpdf  converted (1)Data science training in hydpdf  converted (1)
Data science training in hydpdf converted (1)
 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification course
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabad
 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabad
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
data science training and placement
data science training and placementdata science training and placement
data science training and placement
 
online data science training
online data science trainingonline data science training
online data science training
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabad
 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in Hyderabad
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification course
 
Keynote: Graphs in Government_Lance Walter, CMO
Keynote:  Graphs in Government_Lance Walter, CMOKeynote:  Graphs in Government_Lance Walter, CMO
Keynote: Graphs in Government_Lance Walter, CMO
 

Kürzlich hochgeladen

Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...gajnagarg
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...Elaine Werffeli
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...amitlee9823
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...amitlee9823
 
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...gajnagarg
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraGovindSinghDasila
 
➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men 🔝Sambalpur🔝 Esc...
➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men  🔝Sambalpur🔝   Esc...➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men  🔝Sambalpur🔝   Esc...
➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men 🔝Sambalpur🔝 Esc...amitlee9823
 
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...gajnagarg
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...amitlee9823
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Pooja Nehwal
 

Kürzlich hochgeladen (20)

Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
 
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men 🔝Sambalpur🔝 Esc...
➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men  🔝Sambalpur🔝   Esc...➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men  🔝Sambalpur🔝   Esc...
➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men 🔝Sambalpur🔝 Esc...
 
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 

Patterns for Successful Data Science Projects (Spark AI Summit)

  • 1. Patterns for Successful Data Science Projects Bill Chambers @bllchmbrs 2018-04-24
  • 4. 4 About you • Data Scientists? • Data engineers? • Data team leads?
  • 5. Scoping Initiatives Looking to Grow The Context of Your Org/Team •Your company has a dozen or so models in production, but now you want to scale to hundreds/ thousands in the next year. •Your company is scoping ML initiatives right now, with little (if any) ML in production.
  • 6. Organizational Patterns Technical Patterns 6 Patterns for DS Projects •Hierarchy of Needs •Simple •Track •Value •Alignment •Discipline Deep dive into each pattern and apply it to data science projects
  • 8. 8 Value n. the regard that something is held to deserve; the importance, worth, or usefulness of something
  • 9. 9 “Data science isn’t woven into our culture; it is our culture. We started with it at the heart of the business, rather than adding it to a traditional organizational structure, and built the company’s algorithms around our clients and their needs.” Katrina Lake CEO Stitch Fix - HBR
  • 10. 10 “Having senior level support is very valuable. Our CEO in particular is a great supporter of machine learning and sees it as a fundamental part of our future.” - Matt Fryer, Chief Data Science Officer,
  • 11. 11 Alignment n. arrangement in a straight line, or in correct or appropriate relative positions n. a position of agreement or alliance
  • 12. Theory (job description) •PhD in Computer Science, Computer Engineering, Mathematics... •5 years of real world or research experience in data science •Experience with Big Data technologies such as Hadoop, Cassandra etc. •Experience in model development and life-cycle-management •Programming skills in various languages (C++, Scala, Java, R) with proficiency in Python and/or C++ •Understanding of Machine Learning, e.g.: linear/logistics regression discriminant analysis, bagging, random forest, SVM, neural nets •Knowledge and skills in the use of current state of the art machine learning frameworks such as Scikit-Learn, H2O, Keras, TensorFlow and Spark, etc.
  • 13. Practice (on the job) •How many daily active users do we have? •What’s our monthly churn? •How many people are using _______ feature? •Can you build a data pipeline? •We’re considering A/B testing, can you write up a report on it?
  • 14. Alignment in the context of DS projects•The project is formally prioritized –Funded and staffed appropriately •You have the infrastructural resources to achieve the mission •You have runway and cover from your leadership to get where you need to go The organization has alignment on the value data science provides.
  • 15. 15 Discipline n. a rule or system of rules governing conduct or activity
  • 16.
  • 17. Discipline in data science… •Figure out what you’re going to do and execute at a high standard. •focus on the results, not just on the tasks. •Define phases and demonstrate results along the way. •Don’t just stir the data to get the answer you want. Make data science a discipline.
  • 18. Technical Patterns in Data Science Projects
  • 19. 19 • Defines a theory for human motivation (Abraham Maslow, 1943) • Each base in the pyramid must be supported before one can move onto the next Maslow’s Hierarchy of Needs
  • 20. 20 Data-Driven Company Hierarchy of Needs • stable and repeatable • trackable • parts of the workflow are automated production data science • simple use cases • done as one-offs • little repeatability ad hoc data science • stable and repeatable • high level of abstraction data pipelines • ad hoc • no centralization of data • little repeatability data access • low cognitive load for infra • simple for end users & devs • line of sight to automation data infrastructure
  • 21. 21 Data Science Hierarchy of Needs Source - Monica Rogati
  • 22. 22 How to approach this pyramid? • stable and repeatable • trackable • parts of the workflow are automated production data science • simple use cases • done as one-offs • little repeatability ad hoc data science • stable and repeatable • high level of abstraction data pipelines • ad hoc • no centralization of data • little repeatability data access • low cognitive load for infra • simple for end users & devs • line of sight to automation reliable data infrastructure 1 2 3 4 5
  • 23. 23 ML-System Anti-Patterns “Hidden Technical Debt in Machine Learning Systems”, Google NIPS 2015 • Glue Code • Lots of glue code to tie OSS/generic components together. • Pipeline Jungles • When pipelines evolve organically, they can become hard to maintain. • Abstraction Debt • A general problem in ML, lots of different abstractions.
  • 24. 24 Simple n. plain, basic, or uncomplicated in form, nature, or design; without much decoration or ornamentation
  • 25. 25 KISS Principle Keep it simple stupid Kelly Johnson 1910-1990 F-117 Nighthawk U2 Spy Plane SR-71 Blackbird
  • 26. 26 Keeping it Simple in ML Rules of Machine Learning: Best Practices for ML Engineering - Martin Zinkevich, Research Scientist @ Google Rule #1: (Before Machine Learning) Don’t be afraid to launch a product without machine learning Rule #4: (Your First Pipeline) Keep the first model SIMPLE and get the infrastructure right
  • 27. 27 Source – Uber Engineering Example: COTA: Improving Uber Customer Care with NLP & Machine Learning
  • 28. 28 Track n. the act or process of following something or someone n. Precise and continuous position- finding of targets by radar, optical, or other means.
  • 29. 29 Hardest part of ML Systems isn’t ML “Hidden Technical Debt in Machine Learning Systems”, Google NIPS 2015
  • 30. 30 MLflow Components 30 Trackin g Record and query experiments: code, data, config, results Project s Packaging format for reproducible runs on any platform Model s General model format that supports diverse deployment tools
  • 31. Notebooks Local Apps Cloud Jobs Tracking Server UI API MLflow Tracking Python or REST API
  • 32. Organizational Patterns Technical Patterns 6 Patterns for DS Projects •Hierarchy of Needs •Simplify •Track •Value •Alignment •Discipline