SlideShare a Scribd company logo
1 of 47
Download to read offline
Applied Data Science
Making insights accessible and actionable
PRESENTED BY
Colin Ristig
Product Manager
colin@yhathq.com
Austin Ogilvie
Founder & CEO
a@yhathq.com
Agenda
Quick Intro to Data Science
Understanding the Value Chain
Designing Your Data Science Process
About Us
We help data scientists
build & deploy apps
Founded 2013
Headquarters in NYC
You may know us from
Data
Science
in 30 seconds
Data Science in 30 Seconds
Broadly…
A multidisciplinary field concerning
problem solving using data,
statistics & software.
“ What distinguishes data science itself from
the tools and techniques is the central goal
of deploying effective decision-making
models to a production environment. ”
Data Science is not “Interesting Research”
~ Nina Zumel & John Mount, Practical Data Science with R
It’s about day-to-day problems
Carl wants to watch
a good movie.
And practical, real-world solutions
Carl wants to watch
a good movie.
Hey, Carl.
Check these out!
Explanation isn’t always important
Carl wants to watch
a good movie.
Carl
Cindy
http://courses.washington.edu/css490/2012.Winter/lecture_slides/08b_collaborative_filtering_1_r1.pdf
Carl would like Frozen
because Cindy liked it.
Data
Science
Challenges
30%
Why?
Key obstacles data science teams face
Lack of Understanding
Key obstacles data science teams face
Difficulty of Experimentation
Hey, Trey. Online sales
are down. What can
we do to keep users
engaged and shopping
carts full?
Trey is asked to “look into something”
I’ll look into it.
Hm...cool. Can
you talk to the
dev team?
Here’s what
we should do:
Trey uncovers a bunch of things we didn’t know
Trey hands his work to deployment engineers
“Throw it over the wall” projects
Execs Data Science Application Developers
Common reasons these types of projects stall
- Unclear benefits
- Skepticism about effectiveness
- Too complex to operationalize
- Too time-consuming
- Unclear how to measure ROI
Data
Science
Value Chain
Making data valuable
Collect and display individual records
Structure, link,
metadata, interact, share
Understand,
infer, learn
Drive
value,
change
Clean, aggregate, visualize
Actions
Predictions
Reports
Charts
Records
Extracting value
from data is like any
other value chain.
Value
Like a raw material,
data has no obvious
utility to start out.
Collect and display individual records
Structure, link,
metadata, interact, share
Understand,
infer, learn
Drive
value,
change
Clean, aggregate, visualize
Actions
Predictions
Reports
Charts
Records
Value
Making data valuable
We make it valuable
through sequential
refinement.
Collect and display individual records
Structure, link,
metadata, interact, share
Understand,
infer, learn
Drive
value,
change
Clean, aggregate, visualize
Actions
Predictions
Reports
Charts
Records
Value
Making data valuable
Cost of Creating that Value
Building data products requires lots of work
Cost of Creating that Value
But most of the value is generated at the end
Cost of Creating that Value
Data Teams
Managers
Customers
Everyone has to see past a lot of challenges
Data
Science
Customers
- Consumers
Several types of customers
Carl wants to watch a good movie.
- Consumers
- App Developers
Cambria needs to call credit models from Salesforce.
Several types of customers
Douglas needs 3 AM server outages to stop.
Several types of customers
- Consumers
- App Developers
- Infrastructure Admins
Gordon wants sales reps calling the hottest leads.
Several types of customers
- Consumers
- App Developers
- Infrastructure Admins
- Sales & Marketing
Data
Science
5 Attributes
for Success
1. Focus on the customer
5 Attributes of Successful Data Science Teams
1. Focus on the customer
2. Identify practical constraints
5 Attributes of Successful Data Science Teams
1. Focus on the customer
2. Identify practical constraints
3. Start small but ship quickly
5 Attributes of Successful Data Science Teams
1. Focus on the customer
2. Identify practical constraints
3. Start small but ship quickly
4. Measure the impact
5 Attributes of Successful Data Science Teams
1. Focus on the customer
2. Identify practical constraints
3. Start small but ship quickly
4. Measure the impact
5. Relentless iteration
5 Attributes of Successful Data Science Teams
1. Focus on the customer
2. Identify practical constraints
3. Start small but ship quickly
4. Measure the impact
5. Relentless iteration
5 Attributes of Successful Data Science Teams
Demo
Hm...cool. Can
you talk to the
dev team?
Here’s what
we should do:
Trey uncovers a bunch of things we didn’t know
Trey hands his work to deployment engineers
“Throw it over the wall” projects
Data Science Application Developers
Deploy Models Faster
Data Science Application Developers
Yhat - Applied Data Science - Feb 2016

More Related Content

What's hot

DataTalks #4: Необходимый минимум инструментов для построения своей системы р...
DataTalks #4: Необходимый минимум инструментов для построения своей системы р...DataTalks #4: Необходимый минимум инструментов для построения своей системы р...
DataTalks #4: Необходимый минимум инструментов для построения своей системы р...WG_ Events
 
Flow-based road mapping & options thinking
Flow-based road mapping & options thinkingFlow-based road mapping & options thinking
Flow-based road mapping & options thinkingMatt Barcomb
 
Using flow based road mapping and options
Using flow based road mapping and optionsUsing flow based road mapping and options
Using flow based road mapping and optionsLeanDog
 
Modern testing overview
Modern testing overviewModern testing overview
Modern testing overviewMatt Barcomb
 
Lean development planning using options thinking
Lean development planning using options thinkingLean development planning using options thinking
Lean development planning using options thinkingMatt Barcomb
 
Dataiku data science studio
Dataiku data science studioDataiku data science studio
Dataiku data science studioNorman Poh
 
The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products Dataiku
 
Nadine Schöne, Dataiku. The Complete Data Value Chain in a Nutshell
Nadine Schöne, Dataiku. The Complete Data Value Chain in a NutshellNadine Schöne, Dataiku. The Complete Data Value Chain in a Nutshell
Nadine Schöne, Dataiku. The Complete Data Value Chain in a NutshellIT Arena
 
Practical AI & data science ethics
Practical AI & data science ethicsPractical AI & data science ethics
Practical AI & data science ethicsStephanie Locke
 

What's hot (9)

DataTalks #4: Необходимый минимум инструментов для построения своей системы р...
DataTalks #4: Необходимый минимум инструментов для построения своей системы р...DataTalks #4: Необходимый минимум инструментов для построения своей системы р...
DataTalks #4: Необходимый минимум инструментов для построения своей системы р...
 
Flow-based road mapping & options thinking
Flow-based road mapping & options thinkingFlow-based road mapping & options thinking
Flow-based road mapping & options thinking
 
Using flow based road mapping and options
Using flow based road mapping and optionsUsing flow based road mapping and options
Using flow based road mapping and options
 
Modern testing overview
Modern testing overviewModern testing overview
Modern testing overview
 
Lean development planning using options thinking
Lean development planning using options thinkingLean development planning using options thinking
Lean development planning using options thinking
 
Dataiku data science studio
Dataiku data science studioDataiku data science studio
Dataiku data science studio
 
The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products
 
Nadine Schöne, Dataiku. The Complete Data Value Chain in a Nutshell
Nadine Schöne, Dataiku. The Complete Data Value Chain in a NutshellNadine Schöne, Dataiku. The Complete Data Value Chain in a Nutshell
Nadine Schöne, Dataiku. The Complete Data Value Chain in a Nutshell
 
Practical AI & data science ethics
Practical AI & data science ethicsPractical AI & data science ethics
Practical AI & data science ethics
 

Viewers also liked

Electron - Build desktop apps using javascript
Electron - Build desktop apps using javascriptElectron - Build desktop apps using javascript
Electron - Build desktop apps using javascriptAustin Ogilvie
 
Ggplot in python
Ggplot in pythonGgplot in python
Ggplot in pythonAjay Ohri
 
Building a Beer Recommender with Yhat (PAPIs.io - November 2014)
Building a Beer Recommender with Yhat (PAPIs.io - November 2014)Building a Beer Recommender with Yhat (PAPIs.io - November 2014)
Building a Beer Recommender with Yhat (PAPIs.io - November 2014)Austin Ogilvie
 
American Century (Revolution Analytics Customer Day)
American Century (Revolution Analytics Customer Day)American Century (Revolution Analytics Customer Day)
American Century (Revolution Analytics Customer Day)Revolution Analytics
 
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...European Data Forum
 
Using R for Analyzing Loans, Portfolios and Risk: From Academic Theory to Fi...
Using R for Analyzing Loans, Portfolios and Risk:  From Academic Theory to Fi...Using R for Analyzing Loans, Portfolios and Risk:  From Academic Theory to Fi...
Using R for Analyzing Loans, Portfolios and Risk: From Academic Theory to Fi...Revolution Analytics
 
Using R for Social Media and Sports Analytics
Using R for Social Media and Sports AnalyticsUsing R for Social Media and Sports Analytics
Using R for Social Media and Sports AnalyticsAjay Ohri
 
Hadley verse
Hadley verseHadley verse
Hadley verseAjay Ohri
 
Analyze this
Analyze thisAnalyze this
Analyze thisAjay Ohri
 
Python at yhat (august 2013)
Python at yhat (august 2013)Python at yhat (august 2013)
Python at yhat (august 2013)Austin Ogilvie
 
Table of Useful R commands.
Table of Useful R commands.Table of Useful R commands.
Table of Useful R commands.Dr. Volkan OBAN
 
Analyzing mlb data with ggplot
Analyzing mlb data with ggplotAnalyzing mlb data with ggplot
Analyzing mlb data with ggplotAustin Ogilvie
 
What is r in spanish.
What is r in spanish.What is r in spanish.
What is r in spanish.Ajay Ohri
 
Kush stats alpha
Kush stats alpha Kush stats alpha
Kush stats alpha Ajay Ohri
 
Summer school python in spanish
Summer school python in spanishSummer school python in spanish
Summer school python in spanishAjay Ohri
 
Logical Fallacies
Logical FallaciesLogical Fallacies
Logical FallaciesAjay Ohri
 
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014Austin Ogilvie
 
Training in Analytics and Data Science
Training in Analytics and Data ScienceTraining in Analytics and Data Science
Training in Analytics and Data ScienceAjay Ohri
 

Viewers also liked (20)

Electron - Build desktop apps using javascript
Electron - Build desktop apps using javascriptElectron - Build desktop apps using javascript
Electron - Build desktop apps using javascript
 
Ggplot in python
Ggplot in pythonGgplot in python
Ggplot in python
 
Building a Beer Recommender with Yhat (PAPIs.io - November 2014)
Building a Beer Recommender with Yhat (PAPIs.io - November 2014)Building a Beer Recommender with Yhat (PAPIs.io - November 2014)
Building a Beer Recommender with Yhat (PAPIs.io - November 2014)
 
ggplot for python
ggplot for pythonggplot for python
ggplot for python
 
American Century (Revolution Analytics Customer Day)
American Century (Revolution Analytics Customer Day)American Century (Revolution Analytics Customer Day)
American Century (Revolution Analytics Customer Day)
 
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
 
Using R for Analyzing Loans, Portfolios and Risk: From Academic Theory to Fi...
Using R for Analyzing Loans, Portfolios and Risk:  From Academic Theory to Fi...Using R for Analyzing Loans, Portfolios and Risk:  From Academic Theory to Fi...
Using R for Analyzing Loans, Portfolios and Risk: From Academic Theory to Fi...
 
Using R for Social Media and Sports Analytics
Using R for Social Media and Sports AnalyticsUsing R for Social Media and Sports Analytics
Using R for Social Media and Sports Analytics
 
Hadley verse
Hadley verseHadley verse
Hadley verse
 
Analyze this
Analyze thisAnalyze this
Analyze this
 
Python at yhat (august 2013)
Python at yhat (august 2013)Python at yhat (august 2013)
Python at yhat (august 2013)
 
Table of Useful R commands.
Table of Useful R commands.Table of Useful R commands.
Table of Useful R commands.
 
Analyzing mlb data with ggplot
Analyzing mlb data with ggplotAnalyzing mlb data with ggplot
Analyzing mlb data with ggplot
 
What is r in spanish.
What is r in spanish.What is r in spanish.
What is r in spanish.
 
Kush stats alpha
Kush stats alpha Kush stats alpha
Kush stats alpha
 
Rcpp
RcppRcpp
Rcpp
 
Summer school python in spanish
Summer school python in spanishSummer school python in spanish
Summer school python in spanish
 
Logical Fallacies
Logical FallaciesLogical Fallacies
Logical Fallacies
 
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014
 
Training in Analytics and Data Science
Training in Analytics and Data ScienceTraining in Analytics and Data Science
Training in Analytics and Data Science
 

Similar to Yhat - Applied Data Science - Feb 2016

Where the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information AccessWhere the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information AccessInside Analysis
 
Webinar: AI as a Shared Service by Salesforce Senior Director of Product
Webinar: AI as a Shared Service by Salesforce Senior Director of ProductWebinar: AI as a Shared Service by Salesforce Senior Director of Product
Webinar: AI as a Shared Service by Salesforce Senior Director of ProductProduct School
 
AI as a Shared Service by Salesforce Senior Director of Product
AI as a Shared Service by Salesforce Senior Director of ProductAI as a Shared Service by Salesforce Senior Director of Product
AI as a Shared Service by Salesforce Senior Director of ProductProduct School
 
ANIn Coimbatore Sep 2023 | Agile for data science by Venkatesa Prasanna Selvaraj
ANIn Coimbatore Sep 2023 | Agile for data science by Venkatesa Prasanna SelvarajANIn Coimbatore Sep 2023 | Agile for data science by Venkatesa Prasanna Selvaraj
ANIn Coimbatore Sep 2023 | Agile for data science by Venkatesa Prasanna SelvarajAgileNetwork
 
Four Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics StrategyFour Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics StrategyArcadia Data
 
What makes an effective data team?
What makes an effective data team?What makes an effective data team?
What makes an effective data team?Snowplow Analytics
 
Smarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with AutomationSmarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with AutomationInside Analysis
 
#bluecruxtalks crash course - Part 1 - Master Data Factories.pdf
#bluecruxtalks crash course - Part 1 - Master Data Factories.pdf#bluecruxtalks crash course - Part 1 - Master Data Factories.pdf
#bluecruxtalks crash course - Part 1 - Master Data Factories.pdfBluecrux
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality RightDATAVERSITY
 
#bluecruxtalks in May: Building master data factories, together
#bluecruxtalks in May: Building master data factories, together#bluecruxtalks in May: Building master data factories, together
#bluecruxtalks in May: Building master data factories, togetherBluecrux
 
Driving Customer Loyalty with Azure Machine Learning
Driving Customer Loyalty with Azure Machine LearningDriving Customer Loyalty with Azure Machine Learning
Driving Customer Loyalty with Azure Machine LearningCCG
 
DataOps: Nine steps to transform your data science impact Strata London May 18
DataOps: Nine steps to transform your data science impact  Strata London May 18DataOps: Nine steps to transform your data science impact  Strata London May 18
DataOps: Nine steps to transform your data science impact Strata London May 18Harvinder Atwal
 
When Downtime Isn’t an Option: Performance Optimization Analytics in the Era ...
When Downtime Isn’t an Option: Performance Optimization Analytics in the Era ...When Downtime Isn’t an Option: Performance Optimization Analytics in the Era ...
When Downtime Isn’t an Option: Performance Optimization Analytics in the Era ...CA Technologies
 
Analytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopCCG
 
Cubodrom profile
Cubodrom profileCubodrom profile
Cubodrom profilecubodrom
 
Leverage Data Strategy as a Catalyst for Innovation
Leverage Data Strategy as a Catalyst for InnovationLeverage Data Strategy as a Catalyst for Innovation
Leverage Data Strategy as a Catalyst for InnovationGlorium Tech
 
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...Joe Lamantia
 
UX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT USA Presentation: Joe Lamantia, Bottomline TechnologiesUX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT USA Presentation: Joe Lamantia, Bottomline TechnologiesUX STRAT
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 

Similar to Yhat - Applied Data Science - Feb 2016 (20)

The coding portion of Data Science
The coding portion of Data ScienceThe coding portion of Data Science
The coding portion of Data Science
 
Where the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information AccessWhere the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information Access
 
Webinar: AI as a Shared Service by Salesforce Senior Director of Product
Webinar: AI as a Shared Service by Salesforce Senior Director of ProductWebinar: AI as a Shared Service by Salesforce Senior Director of Product
Webinar: AI as a Shared Service by Salesforce Senior Director of Product
 
AI as a Shared Service by Salesforce Senior Director of Product
AI as a Shared Service by Salesforce Senior Director of ProductAI as a Shared Service by Salesforce Senior Director of Product
AI as a Shared Service by Salesforce Senior Director of Product
 
ANIn Coimbatore Sep 2023 | Agile for data science by Venkatesa Prasanna Selvaraj
ANIn Coimbatore Sep 2023 | Agile for data science by Venkatesa Prasanna SelvarajANIn Coimbatore Sep 2023 | Agile for data science by Venkatesa Prasanna Selvaraj
ANIn Coimbatore Sep 2023 | Agile for data science by Venkatesa Prasanna Selvaraj
 
Four Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics StrategyFour Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics Strategy
 
What makes an effective data team?
What makes an effective data team?What makes an effective data team?
What makes an effective data team?
 
Smarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with AutomationSmarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with Automation
 
#bluecruxtalks crash course - Part 1 - Master Data Factories.pdf
#bluecruxtalks crash course - Part 1 - Master Data Factories.pdf#bluecruxtalks crash course - Part 1 - Master Data Factories.pdf
#bluecruxtalks crash course - Part 1 - Master Data Factories.pdf
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality Right
 
#bluecruxtalks in May: Building master data factories, together
#bluecruxtalks in May: Building master data factories, together#bluecruxtalks in May: Building master data factories, together
#bluecruxtalks in May: Building master data factories, together
 
Driving Customer Loyalty with Azure Machine Learning
Driving Customer Loyalty with Azure Machine LearningDriving Customer Loyalty with Azure Machine Learning
Driving Customer Loyalty with Azure Machine Learning
 
DataOps: Nine steps to transform your data science impact Strata London May 18
DataOps: Nine steps to transform your data science impact  Strata London May 18DataOps: Nine steps to transform your data science impact  Strata London May 18
DataOps: Nine steps to transform your data science impact Strata London May 18
 
When Downtime Isn’t an Option: Performance Optimization Analytics in the Era ...
When Downtime Isn’t an Option: Performance Optimization Analytics in the Era ...When Downtime Isn’t an Option: Performance Optimization Analytics in the Era ...
When Downtime Isn’t an Option: Performance Optimization Analytics in the Era ...
 
Analytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual Workshop
 
Cubodrom profile
Cubodrom profileCubodrom profile
Cubodrom profile
 
Leverage Data Strategy as a Catalyst for Innovation
Leverage Data Strategy as a Catalyst for InnovationLeverage Data Strategy as a Catalyst for Innovation
Leverage Data Strategy as a Catalyst for Innovation
 
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
 
UX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT USA Presentation: Joe Lamantia, Bottomline TechnologiesUX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 

More from Austin Ogilvie

2013 - Yhat - YC app.pdf
2013 - Yhat - YC app.pdf2013 - Yhat - YC app.pdf
2013 - Yhat - YC app.pdfAustin Ogilvie
 
Yhat 2017 Investor Deck
Yhat 2017 Investor DeckYhat 2017 Investor Deck
Yhat 2017 Investor DeckAustin Ogilvie
 
Finding Lanes for Self-Driving Cars - PyData Berlin Jul 2017- Ross Kippenbroc...
Finding Lanes for Self-Driving Cars - PyData Berlin Jul 2017- Ross Kippenbroc...Finding Lanes for Self-Driving Cars - PyData Berlin Jul 2017- Ross Kippenbroc...
Finding Lanes for Self-Driving Cars - PyData Berlin Jul 2017- Ross Kippenbroc...Austin Ogilvie
 
Applied Data Science with Yhat
Applied Data Science with YhatApplied Data Science with Yhat
Applied Data Science with YhatAustin Ogilvie
 
Predictive Models for Production Apps with Yhat
Predictive Models for Production Apps with YhatPredictive Models for Production Apps with Yhat
Predictive Models for Production Apps with YhatAustin Ogilvie
 

More from Austin Ogilvie (6)

2013 - Yhat - YC app.pdf
2013 - Yhat - YC app.pdf2013 - Yhat - YC app.pdf
2013 - Yhat - YC app.pdf
 
2013 05-27-yhat-about
2013 05-27-yhat-about2013 05-27-yhat-about
2013 05-27-yhat-about
 
Yhat 2017 Investor Deck
Yhat 2017 Investor DeckYhat 2017 Investor Deck
Yhat 2017 Investor Deck
 
Finding Lanes for Self-Driving Cars - PyData Berlin Jul 2017- Ross Kippenbroc...
Finding Lanes for Self-Driving Cars - PyData Berlin Jul 2017- Ross Kippenbroc...Finding Lanes for Self-Driving Cars - PyData Berlin Jul 2017- Ross Kippenbroc...
Finding Lanes for Self-Driving Cars - PyData Berlin Jul 2017- Ross Kippenbroc...
 
Applied Data Science with Yhat
Applied Data Science with YhatApplied Data Science with Yhat
Applied Data Science with Yhat
 
Predictive Models for Production Apps with Yhat
Predictive Models for Production Apps with YhatPredictive Models for Production Apps with Yhat
Predictive Models for Production Apps with Yhat
 

Recently uploaded

Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfStefano Stabellini
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalLionel Briand
 
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdfInnovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdfYashikaSharma391629
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)jennyeacort
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfDrew Moseley
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprisepreethippts
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsSafe Software
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsChristian Birchler
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identityteam-WIBU
 
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptxReal-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptxRTS corp
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 

Recently uploaded (20)

Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdf
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive Goal
 
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdfInnovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdf
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdf
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprise
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data Streams
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identity
 
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptxReal-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 

Yhat - Applied Data Science - Feb 2016

  • 1. Applied Data Science Making insights accessible and actionable PRESENTED BY Colin Ristig Product Manager colin@yhathq.com Austin Ogilvie Founder & CEO a@yhathq.com
  • 2. Agenda Quick Intro to Data Science Understanding the Value Chain Designing Your Data Science Process
  • 4. We help data scientists build & deploy apps
  • 6. You may know us from
  • 8. Data Science in 30 Seconds Broadly… A multidisciplinary field concerning problem solving using data, statistics & software.
  • 9. “ What distinguishes data science itself from the tools and techniques is the central goal of deploying effective decision-making models to a production environment. ” Data Science is not “Interesting Research” ~ Nina Zumel & John Mount, Practical Data Science with R
  • 10. It’s about day-to-day problems Carl wants to watch a good movie.
  • 11. And practical, real-world solutions Carl wants to watch a good movie. Hey, Carl. Check these out!
  • 12. Explanation isn’t always important Carl wants to watch a good movie. Carl Cindy http://courses.washington.edu/css490/2012.Winter/lecture_slides/08b_collaborative_filtering_1_r1.pdf Carl would like Frozen because Cindy liked it.
  • 14. 30%
  • 15. Why?
  • 16. Key obstacles data science teams face Lack of Understanding
  • 17. Key obstacles data science teams face Difficulty of Experimentation
  • 18. Hey, Trey. Online sales are down. What can we do to keep users engaged and shopping carts full? Trey is asked to “look into something” I’ll look into it.
  • 19. Hm...cool. Can you talk to the dev team? Here’s what we should do: Trey uncovers a bunch of things we didn’t know
  • 20. Trey hands his work to deployment engineers
  • 21. “Throw it over the wall” projects Execs Data Science Application Developers
  • 22. Common reasons these types of projects stall - Unclear benefits - Skepticism about effectiveness - Too complex to operationalize - Too time-consuming - Unclear how to measure ROI
  • 24. Making data valuable Collect and display individual records Structure, link, metadata, interact, share Understand, infer, learn Drive value, change Clean, aggregate, visualize Actions Predictions Reports Charts Records Extracting value from data is like any other value chain. Value
  • 25. Like a raw material, data has no obvious utility to start out. Collect and display individual records Structure, link, metadata, interact, share Understand, infer, learn Drive value, change Clean, aggregate, visualize Actions Predictions Reports Charts Records Value Making data valuable
  • 26. We make it valuable through sequential refinement. Collect and display individual records Structure, link, metadata, interact, share Understand, infer, learn Drive value, change Clean, aggregate, visualize Actions Predictions Reports Charts Records Value Making data valuable
  • 27. Cost of Creating that Value Building data products requires lots of work
  • 28. Cost of Creating that Value But most of the value is generated at the end
  • 29. Cost of Creating that Value Data Teams Managers Customers Everyone has to see past a lot of challenges
  • 31. - Consumers Several types of customers Carl wants to watch a good movie.
  • 32. - Consumers - App Developers Cambria needs to call credit models from Salesforce. Several types of customers
  • 33. Douglas needs 3 AM server outages to stop. Several types of customers - Consumers - App Developers - Infrastructure Admins
  • 34. Gordon wants sales reps calling the hottest leads. Several types of customers - Consumers - App Developers - Infrastructure Admins - Sales & Marketing
  • 36. 1. Focus on the customer 5 Attributes of Successful Data Science Teams
  • 37. 1. Focus on the customer 2. Identify practical constraints 5 Attributes of Successful Data Science Teams
  • 38. 1. Focus on the customer 2. Identify practical constraints 3. Start small but ship quickly 5 Attributes of Successful Data Science Teams
  • 39. 1. Focus on the customer 2. Identify practical constraints 3. Start small but ship quickly 4. Measure the impact 5 Attributes of Successful Data Science Teams
  • 40. 1. Focus on the customer 2. Identify practical constraints 3. Start small but ship quickly 4. Measure the impact 5. Relentless iteration 5 Attributes of Successful Data Science Teams
  • 41. 1. Focus on the customer 2. Identify practical constraints 3. Start small but ship quickly 4. Measure the impact 5. Relentless iteration 5 Attributes of Successful Data Science Teams
  • 42. Demo
  • 43. Hm...cool. Can you talk to the dev team? Here’s what we should do: Trey uncovers a bunch of things we didn’t know
  • 44. Trey hands his work to deployment engineers
  • 45. “Throw it over the wall” projects Data Science Application Developers
  • 46. Deploy Models Faster Data Science Application Developers