Executive Briefing
Why managing machines is harder than you think
Peter Skomoroch - @peteskomoroch
Strata Data Conference, London - May 1, 2019
Background: Machine Learning & Data Products
Peter Skomoroch
@peteskomoroch
• Co-Founder and CEO of SkipFlag, Enterprise AI
startup acquired in 2018 by Workday
• 18+ years building machine learning products
• Principal Data Scientist, ran Data Products team at
LinkedIn. ML & Search at MIT, AOL, ProfitLogic
• Co-Host of O’Reilly AI Bots Podcast, Startup Advisor
Better, Faster Decisions at Scale
• Machine learning drove massive growth
at consumer internet companies over the
last decade
• A wave of AI startups and vertical
machine learning applications has
emerged across other industries
• For many problems, machine learning
makes better, faster, and more
repeatable decisions at scale
• Amazon, Google, and Microsoft are now
re-organizing themselves around AI
Data Products
Automated systems that collect and learn from data to
make user-facing decisions with machine learning
Machine Learning Projects are Hard
• The transition to machine learning will be about 100x harder than the
transition to mobile
• Companies that adopt an experimental culture can still succeed
• Some of the biggest challenges are organizational, not technical
• Data-driven companies like Google and Facebook have a strategic
advantage in building ML products, based on their data & compute assets,
large user populations, tracking & instrumentation, and AI talent
If you only do things where you know
the answer in advance, your company
goes away.
Jeff Bezos
Founder, Chairman & CEO of Amazon.com
Experimental Culture
• Machine learning shifts
engineering from a deterministic
process to a probabilistic one
• Take intelligent risks
• Most successful ML products are
experiments at massive scale
• Companies driven by analytics
and experimental insights are
more likely to succeed
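One way to make "experiments at massive scale" concrete is a significance check on an A/B test. The sketch below is illustrative only: it applies a standard two-proportion z-test to made-up conversion counts, and the function name and numbers are assumptions, not taken from the talk.

```python
import math


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Return (z, two-sided p-value) for the difference in conversion rates.

    Assumes both groups are large and the pooled rate is strictly
    between 0 and 1 (otherwise the standard error is zero).
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Standard normal CDF expressed through the error function.
    cdf = 0.5 * (1 + math.erf(abs(z) / math.sqrt(2)))
    p_value = 2 * (1 - cdf)
    return z, p_value


# Hypothetical experiment: control (A) vs. new ranking model (B).
z, p = two_proportion_z_test(conv_a=1000, n_a=20000, conv_b=1100, n_b=20000)
print(f"z = {z:.2f}, p = {p:.4f}")
```

A team would typically fix the sample size and the decision threshold before the experiment starts, rather than checking repeatedly until the result looks significant.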
Data Pipelines & Analytics Before AI
Credit: @mrogati
ML Algorithms Need Lots of Labelled Data
Common Crawl: ~4B pages monthly
Combined Pools of Data Give Better Results
https://www.flickr.com/photos/nakrnsm/3814916578
• Learning patterns across large
numbers of customers is the
power behind recommendations
from companies like Amazon and
Netflix
• The more precise or nuanced a
prediction, the more data will
need to be pooled
• You need large amounts of
labelled training data
• Transfer learning may help push
these limits further
Democratize Data Access
• Allow teams across your company to combine real data to improve
their product areas, design with data, and discover new insights
• Share derived data and input features for ML models across teams
• At LinkedIn we had a rich repository of signals like connection
strength, inferred skills, and other datasets that greatly accelerated
new product development
• Empower small teams to build things
quickly and compound returns on
feature engineering & derived data
See https://www.confluent.io/ebook/i-heart-logs-event-data-stream-processing-and-data-integration/
Product Management for Machine Learning
Image source: Martin Eriksson https://www.mindtheproduct.com/2011/10/what-exactly-is-a-product-manager/
• A Data Product Manager (PM)
has core product skills (strategy,
roadmaps, prioritization, etc.)
along with an intuitive grasp of
ML
• They help identify and prioritize
the highest value applications for
machine learning and do what it
takes to make them successful
Good ML Product Managers Have Data Expertise
• Know the difference between easy, hard, and impossible machine
learning problems
• Even if something is feasible from a machine learning perspective,
the level of effort may not justify building the feature
• Know your company’s data inside and out, including quality issues,
limitations, biases, and gaps that need to be addressed
• Develop an intuitive understanding of your company’s data and how
it can be used to solve customer problems
Apply ML to a Metric the Business Cares About
Machine Learning Product Development
1. Verify you are solving the right problem
2. Theory + model design (in parallel with UI design)
3. Data collection, labelling, and cleaning
4. Feature engineering, model training, offline validation
5. Model deployment, monitoring & large scale training
• Iterate: repeat process, refine live model & improve
• 80% of effort and gains come from iterations after shipping v1.0
• Use derived data from the system to build new products
ML Adds Uncertainty to Product Roadmaps
• PMs are often uncomfortable with expensive ideas that have an
uncertain probability of success
• Many organizations will struggle to justify the expense of projects that
require significant research investment upfront
• Some ML products may need to be split into time-boxed projects that
get to market in a shorter time frame
• What can you productize now vs. much later on?
• Keep track of dependencies on other teams and have a “Plan B”
Every single company I've worked at
and talked to has the same problem
without a single exception so far —
poor data quality, especially tracking
data
Ruslan Belkin
VP of Engineering, Salesforce.com
Data Quality & Standardization
• Guide user input when you can
• Use auto suggest fields
• Validate user inputs, emails
• Collect user tags, votes, ratings
• Track impressions, queries, clicks
• Sessionize logs
• Disambiguate and annotate
entities (company names,
locations, etc.)
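As an illustration of the "sessionize logs" item, the sketch below groups raw tracking events into per-user sessions by splitting on gaps of inactivity. The event layout `(user_id, timestamp_seconds, action)` and the 30-minute threshold are assumptions chosen for the example, not details from the talk.

```python
from collections import defaultdict

SESSION_GAP_SECONDS = 30 * 60  # assumed inactivity threshold


def sessionize(events, gap=SESSION_GAP_SECONDS):
    """Group (user_id, timestamp_seconds, action) events into sessions.

    Returns a list of (user_id, [(timestamp, action), ...]) tuples,
    one per session, with events in time order.
    """
    by_user = defaultdict(list)
    for user_id, ts, action in events:
        by_user[user_id].append((ts, action))

    sessions = []
    for user_id, user_events in by_user.items():
        user_events.sort()
        current = [user_events[0]]
        for prev, cur in zip(user_events, user_events[1:]):
            # A long pause closes the current session and starts a new one.
            if cur[0] - prev[0] > gap:
                sessions.append((user_id, current))
                current = []
            current.append(cur)
        sessions.append((user_id, current))
    return sessions


# Hypothetical tracking log.
log = [
    ("u1", 0, "search"),
    ("u1", 60, "click"),
    ("u2", 10, "impression"),
    ("u1", 4000, "search"),
]
sessions = sessionize(log)
for user_id, session_events in sessions:
    print(user_id, session_events)
```

Sessions built this way can feed derived features such as queries per session or clicks after a search.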
Testing Machine Learning Products
• Algorithm work that drags on without integration in the product where it can
be seen and tested by real users is risky
• Ship a complete MVP in production ASAP, benchmark, and iterate
• Beware unintended consequences from seemingly small product changes
• Remember the prototype is not the product - see what happens when you
use a more realistic data set or scale up your inputs
• Real-world data changes over time; ensure your model tests and
benchmarks keep up with changes in the underlying data
• Machine learning systems tend to fail in unexpected ways
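One simple way to notice that the underlying data has changed is to compare the distribution of an input feature at training time with its distribution in production. The sketch below uses the population stability index (PSI) as the comparison; the choice of PSI, the equal-width binning, and the sample data are assumptions for illustration rather than the speaker's method.

```python
import math


def psi(baseline, current, bins=10, eps=1e-6):
    """Population stability index between two samples of one numeric feature.

    Bins are equal-width over the baseline range; values outside that
    range are clamped into the first or last bin. Empty bins are floored
    at eps so the logarithm stays defined.
    """
    lo, hi = min(baseline), max(baseline)
    width = (hi - lo) / bins or 1.0  # fall back to 1.0 if the baseline is constant

    def proportions(values):
        counts = [0] * bins
        for v in values:
            idx = int((v - lo) / width)
            idx = min(max(idx, 0), bins - 1)
            counts[idx] += 1
        return [max(c / len(values), eps) for c in counts]

    expected = proportions(baseline)
    actual = proportions(current)
    return sum((a - e) * math.log(a / e) for e, a in zip(expected, actual))


# Hypothetical feature values at training time vs. in production.
training = [i / 100 for i in range(100)]
production = [0.5 + i / 200 for i in range(100)]

print(f"no drift:   {psi(training, training):.3f}")
print(f"with drift: {psi(training, production):.3f}")
```

A score near zero means the two samples look alike; larger values suggest that benchmarks and tests built on the older data may no longer reflect what the model sees.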
Look at Your Input Data & Prediction Errors
Flywheel Effects & Data Products
• Users generate data as a side effect of
using most software products
• That data, in turn, can improve the
product’s algorithms and enable new
types of recommendations, leading to
more data
• These “flywheels” get better the more
customers use them, leading to unique
competitive moats
• This works well in platforms, networks or
marketplaces where value compounds
* https://medium.freecodecamp.org/the-business-implications-of-machine-learning-11480b99184d
Final Thoughts
• Machine learning products are hard to
build, but within reach of teams who
invest in data infrastructure
• Some of the biggest challenges are
organizational, not technical
• Good product leaders are a key factor in
shipping successful ML products
• Find a machine learning application with
a direct connection to a metric your
organization values and ship it
Send me questions! @peteskomoroch
Q&A / Discussion

Weitere ähnliche Inhalte

Was ist angesagt?

Applications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesApplications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesT.S. Lim
 
Machine learning for social media analytics
Machine learning for  social media analyticsMachine learning for  social media analytics
Machine learning for social media analyticsJenya Terpil
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfPremNaraindas1
 
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...ssuser4edc93
 
Responsible Generative AI
Responsible Generative AIResponsible Generative AI
Responsible Generative AICMassociates
 
Ai startups in healthcare By Dr.Mahboob Khan
Ai startups in healthcare By Dr.Mahboob Khan Ai startups in healthcare By Dr.Mahboob Khan
Ai startups in healthcare By Dr.Mahboob Khan Healthcare consultant
 
Generative AI Risks & Concerns
Generative AI Risks & ConcernsGenerative AI Risks & Concerns
Generative AI Risks & ConcernsAjitesh Kumar
 
AIDRC_Generative_AI_TL_v5.pdf
AIDRC_Generative_AI_TL_v5.pdfAIDRC_Generative_AI_TL_v5.pdf
AIDRC_Generative_AI_TL_v5.pdfThierry Lestable
 
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬VINCI Digital - Industrial IoT (IIoT) Strategic Advisory
 
LLM presentation final
LLM presentation finalLLM presentation final
LLM presentation finalRuth Griffin
 
AI in Marketing
AI in MarketingAI in Marketing
AI in Marketingvganti
 
Artificial Intelligence Virtual Assistants & Chatbots
Artificial Intelligence Virtual Assistants & ChatbotsArtificial Intelligence Virtual Assistants & Chatbots
Artificial Intelligence Virtual Assistants & ChatbotsaNumak & Company
 
An overview of Google PaLM 2
An overview of Google PaLM 2An overview of Google PaLM 2
An overview of Google PaLM 2Benjaminlapid1
 
Future of AI - 2023 07 25.pptx
Future of AI - 2023 07 25.pptxFuture of AI - 2023 07 25.pptx
Future of AI - 2023 07 25.pptxGreg Makowski
 
Digital Transformation for Insurance and Underwriting Processes - Caroly Mart...
Digital Transformation for Insurance and Underwriting Processes - Caroly Mart...Digital Transformation for Insurance and Underwriting Processes - Caroly Mart...
Digital Transformation for Insurance and Underwriting Processes - Caroly Mart...SigortaTatbikatcilariDernegi
 
The Future is in Responsible Generative AI
The Future is in Responsible Generative AIThe Future is in Responsible Generative AI
The Future is in Responsible Generative AISaeed Al Dhaheri
 

Was ist angesagt? (20)

Applications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesApplications of Big Data Analytics in Businesses
Applications of Big Data Analytics in Businesses
 
Intro to LLMs
Intro to LLMsIntro to LLMs
Intro to LLMs
 
Machine learning for social media analytics
Machine learning for  social media analyticsMachine learning for  social media analytics
Machine learning for social media analytics
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
 
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
 
Responsible Generative AI
Responsible Generative AIResponsible Generative AI
Responsible Generative AI
 
Ai startups in healthcare By Dr.Mahboob Khan
Ai startups in healthcare By Dr.Mahboob Khan Ai startups in healthcare By Dr.Mahboob Khan
Ai startups in healthcare By Dr.Mahboob Khan
 
Generative AI
Generative AIGenerative AI
Generative AI
 
Generative AI Risks & Concerns
Generative AI Risks & ConcernsGenerative AI Risks & Concerns
Generative AI Risks & Concerns
 
Generative AI.pptx
Generative AI.pptxGenerative AI.pptx
Generative AI.pptx
 
AIDRC_Generative_AI_TL_v5.pdf
AIDRC_Generative_AI_TL_v5.pdfAIDRC_Generative_AI_TL_v5.pdf
AIDRC_Generative_AI_TL_v5.pdf
 
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
 
LLM presentation final
LLM presentation finalLLM presentation final
LLM presentation final
 
AI in Marketing
AI in MarketingAI in Marketing
AI in Marketing
 
Artificial Intelligence Virtual Assistants & Chatbots
Artificial Intelligence Virtual Assistants & ChatbotsArtificial Intelligence Virtual Assistants & Chatbots
Artificial Intelligence Virtual Assistants & Chatbots
 
An overview of Google PaLM 2
An overview of Google PaLM 2An overview of Google PaLM 2
An overview of Google PaLM 2
 
Future of AI - 2023 07 25.pptx
Future of AI - 2023 07 25.pptxFuture of AI - 2023 07 25.pptx
Future of AI - 2023 07 25.pptx
 
Bias in AI
Bias in AIBias in AI
Bias in AI
 
Digital Transformation for Insurance and Underwriting Processes - Caroly Mart...
Digital Transformation for Insurance and Underwriting Processes - Caroly Mart...Digital Transformation for Insurance and Underwriting Processes - Caroly Mart...
Digital Transformation for Insurance and Underwriting Processes - Caroly Mart...
 
The Future is in Responsible Generative AI
The Future is in Responsible Generative AIThe Future is in Responsible Generative AI
The Future is in Responsible Generative AI
 

Ähnlich wie Executive Briefing: Why managing machines is harder than you think

Product Management for AI
Product Management for AIProduct Management for AI
Product Management for AIPeter Skomoroch
 
Bridging the AI Gap: Building Stakeholder Support
Bridging the AI Gap: Building Stakeholder SupportBridging the AI Gap: Building Stakeholder Support
Bridging the AI Gap: Building Stakeholder SupportPeter Skomoroch
 
How to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPOHow to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPOProduct School
 
Productionising Machine Learning Models
Productionising Machine Learning ModelsProductionising Machine Learning Models
Productionising Machine Learning ModelsTash Bickley
 
How an AI-backed recommendation system can help increase revenue for your onl...
How an AI-backed recommendation system can help increase revenue for your onl...How an AI-backed recommendation system can help increase revenue for your onl...
How an AI-backed recommendation system can help increase revenue for your onl...Skyl.ai
 
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...Skyl.ai
 
Five Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data StrategyFive Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data StrategyPerficient, Inc.
 
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...Comit Projects Ltd
 
Tips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsTips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsAbhishek Sood
 
Getting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide WebinarGetting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide WebinarConcept Searching, Inc
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...DATAVERSITY
 
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...Concept Searching, Inc
 
Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...
Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...
Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...Manju Devadas
 
Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...
Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...
Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...Skyl.ai
 
Pluto7 - Tableau Webinar on enabling Organization to be Data Driven in 201...
Pluto7   -  Tableau Webinar on enabling Organization to be Data Driven in 201...Pluto7   -  Tableau Webinar on enabling Organization to be Data Driven in 201...
Pluto7 - Tableau Webinar on enabling Organization to be Data Driven in 201...Manju Devadas
 
FTFCU - How to Become a Data Driven Organization
FTFCU - How to Become a Data Driven OrganizationFTFCU - How to Become a Data Driven Organization
FTFCU - How to Become a Data Driven OrganizationNaveen Jain
 
Atlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdfAtlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdfSubrat Kumar Dash
 
Scaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsScaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsApplause
 
Analytics in manufacturing
Analytics in manufacturingAnalytics in manufacturing
Analytics in manufacturingSaurav Kumar
 

Ähnlich wie Executive Briefing: Why managing machines is harder than you think (20)

Product Management for AI
Product Management for AIProduct Management for AI
Product Management for AI
 
Bridging the AI Gap: Building Stakeholder Support
Bridging the AI Gap: Building Stakeholder SupportBridging the AI Gap: Building Stakeholder Support
Bridging the AI Gap: Building Stakeholder Support
 
How to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPOHow to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPO
 
Productionising Machine Learning Models
Productionising Machine Learning ModelsProductionising Machine Learning Models
Productionising Machine Learning Models
 
How an AI-backed recommendation system can help increase revenue for your onl...
How an AI-backed recommendation system can help increase revenue for your onl...How an AI-backed recommendation system can help increase revenue for your onl...
How an AI-backed recommendation system can help increase revenue for your onl...
 
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
 
Five Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data StrategyFive Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data Strategy
 
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...
 
Tips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsTips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data Analytics
 
Getting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide WebinarGetting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide Webinar
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
 
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
 
Managing AI Products
Managing AI ProductsManaging AI Products
Managing AI Products
 
Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...
Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...
Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...
 
Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...
Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...
Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...
 
Pluto7 - Tableau Webinar on enabling Organization to be Data Driven in 201...
Pluto7   -  Tableau Webinar on enabling Organization to be Data Driven in 201...Pluto7   -  Tableau Webinar on enabling Organization to be Data Driven in 201...
Pluto7 - Tableau Webinar on enabling Organization to be Data Driven in 201...
 
FTFCU - How to Become a Data Driven Organization
FTFCU - How to Become a Data Driven OrganizationFTFCU - How to Become a Data Driven Organization
FTFCU - How to Become a Data Driven Organization
 
Atlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdfAtlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdf
 
Scaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsScaling Training Data for AI Applications
Scaling Training Data for AI Applications
 
Analytics in manufacturing
Analytics in manufacturingAnalytics in manufacturing
Analytics in manufacturing
 

Mehr von Peter Skomoroch

Managing Machines: The New AI Dev Stack
Managing Machines: The New AI Dev StackManaging Machines: The New AI Dev Stack
Managing Machines: The New AI Dev StackPeter Skomoroch
 
Building Competitive Moats With Data
Building Competitive Moats With DataBuilding Competitive Moats With Data
Building Competitive Moats With DataPeter Skomoroch
 
O'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data ExhaustO'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data ExhaustPeter Skomoroch
 
SF Data Science: Developing Data Products
SF Data Science: Developing Data ProductsSF Data Science: Developing Data Products
SF Data Science: Developing Data ProductsPeter Skomoroch
 
Skills, Reputation, and Search
Skills, Reputation, and SearchSkills, Reputation, and Search
Skills, Reputation, and SearchPeter Skomoroch
 
LinkedIn Endorsements: Reputation, Virality, and Social Tagging
LinkedIn Endorsements: Reputation, Virality, and Social TaggingLinkedIn Endorsements: Reputation, Virality, and Social Tagging
LinkedIn Endorsements: Reputation, Virality, and Social TaggingPeter Skomoroch
 
Developing Data Products
Developing Data ProductsDeveloping Data Products
Developing Data ProductsPeter Skomoroch
 
Practical Problem Solving with Data - Onlab Data Conference, Tokyo
Practical Problem Solving with Data - Onlab Data Conference, TokyoPractical Problem Solving with Data - Onlab Data Conference, Tokyo
Practical Problem Solving with Data - Onlab Data Conference, TokyoPeter Skomoroch
 
Street Fighting Data Science
Street Fighting Data ScienceStreet Fighting Data Science
Street Fighting Data SciencePeter Skomoroch
 
Data Mashups -Data Science Summit
Data Mashups -Data Science SummitData Mashups -Data Science Summit
Data Mashups -Data Science SummitPeter Skomoroch
 
Geo Analytics Tutorial - Where 2.0 2011
Geo Analytics Tutorial - Where 2.0 2011Geo Analytics Tutorial - Where 2.0 2011
Geo Analytics Tutorial - Where 2.0 2011Peter Skomoroch
 
Rapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopRapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopPeter Skomoroch
 
Prototyping Data Intensive Apps: TrendingTopics.org
Prototyping Data Intensive Apps: TrendingTopics.orgPrototyping Data Intensive Apps: TrendingTopics.org
Prototyping Data Intensive Apps: TrendingTopics.orgPeter Skomoroch
 

Mehr von Peter Skomoroch (14)

Managing Machines: The New AI Dev Stack
Managing Machines: The New AI Dev StackManaging Machines: The New AI Dev Stack
Managing Machines: The New AI Dev Stack
 
Building Competitive Moats With Data
Building Competitive Moats With DataBuilding Competitive Moats With Data
Building Competitive Moats With Data
 
O'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data ExhaustO'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data Exhaust
 
SF Data Science: Developing Data Products
SF Data Science: Developing Data ProductsSF Data Science: Developing Data Products
SF Data Science: Developing Data Products
 
Skills, Reputation, and Search
Skills, Reputation, and SearchSkills, Reputation, and Search
Skills, Reputation, and Search
 
LinkedIn Endorsements: Reputation, Virality, and Social Tagging
LinkedIn Endorsements: Reputation, Virality, and Social TaggingLinkedIn Endorsements: Reputation, Virality, and Social Tagging
LinkedIn Endorsements: Reputation, Virality, and Social Tagging
 
Developing Data Products
Developing Data ProductsDeveloping Data Products
Developing Data Products
 
Practical Problem Solving with Data - Onlab Data Conference, Tokyo
Practical Problem Solving with Data - Onlab Data Conference, TokyoPractical Problem Solving with Data - Onlab Data Conference, Tokyo
Practical Problem Solving with Data - Onlab Data Conference, Tokyo
 
Street Fighting Data Science
Street Fighting Data ScienceStreet Fighting Data Science
Street Fighting Data Science
 
Data Mashups -Data Science Summit
Data Mashups -Data Science SummitData Mashups -Data Science Summit
Data Mashups -Data Science Summit
 
Geo Analytics Tutorial - Where 2.0 2011
Geo Analytics Tutorial - Where 2.0 2011Geo Analytics Tutorial - Where 2.0 2011
Geo Analytics Tutorial - Where 2.0 2011
 
Rapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopRapid Data Exploration With Hadoop
Rapid Data Exploration With Hadoop
 
Prototyping Data Intensive Apps: TrendingTopics.org
Prototyping Data Intensive Apps: TrendingTopics.orgPrototyping Data Intensive Apps: TrendingTopics.org
Prototyping Data Intensive Apps: TrendingTopics.org
 
Elasticwulf Pycon Talk
Elasticwulf Pycon TalkElasticwulf Pycon Talk
Elasticwulf Pycon Talk
 

Kürzlich hochgeladen

Beyond the Codes_Repositioning towards sustainable development
Beyond the Codes_Repositioning towards sustainable developmentBeyond the Codes_Repositioning towards sustainable development
Beyond the Codes_Repositioning towards sustainable developmentNimot Muili
 
International Ocean Transportation p.pdf
International Ocean Transportation p.pdfInternational Ocean Transportation p.pdf
International Ocean Transportation p.pdfAlejandromexEspino
 
digital Human resource management presentation.pdf
digital Human resource management presentation.pdfdigital Human resource management presentation.pdf
digital Human resource management presentation.pdfArtiSrivastava23
 
internship thesis pakistan aeronautical complex kamra
internship thesis pakistan aeronautical complex kamrainternship thesis pakistan aeronautical complex kamra
internship thesis pakistan aeronautical complex kamraAllTops
 
Gautam Buddh Nagar Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Gautam Buddh Nagar Call Girls 🥰 8617370543 Service Offer VIP Hot ModelGautam Buddh Nagar Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Gautam Buddh Nagar Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNitya salvi
 
Siliguri Escorts Service Girl ^ 9332606886, WhatsApp Anytime Siliguri
Siliguri Escorts Service Girl ^ 9332606886, WhatsApp Anytime SiliguriSiliguri Escorts Service Girl ^ 9332606886, WhatsApp Anytime Siliguri
Siliguri Escorts Service Girl ^ 9332606886, WhatsApp Anytime Siligurimeghakumariji156
 
How Software Developers Destroy Business Value.pptx
How Software Developers Destroy Business Value.pptxHow Software Developers Destroy Business Value.pptx
How Software Developers Destroy Business Value.pptxAaron Stannard
 
The Psychology Of Motivation - Richard Brown
The Psychology Of Motivation - Richard BrownThe Psychology Of Motivation - Richard Brown
The Psychology Of Motivation - Richard BrownSandaliGurusinghe2
 
Persuasive and Communication is the art of negotiation.
Persuasive and Communication is the art of negotiation.Persuasive and Communication is the art of negotiation.
Persuasive and Communication is the art of negotiation.aruny7087
 
Information Technology Project Management, Revised 7th edition test bank.docx
Information Technology Project Management, Revised 7th edition test bank.docxInformation Technology Project Management, Revised 7th edition test bank.docx
Information Technology Project Management, Revised 7th edition test bank.docxssuserf63bd7
 
Marketing Management 16th edition by Philip Kotler test bank.docx
Marketing Management 16th edition by Philip Kotler test bank.docxMarketing Management 16th edition by Philip Kotler test bank.docx
Marketing Management 16th edition by Philip Kotler test bank.docxssuserf63bd7
 
W.H.Bender Quote 62 - Always strive to be a Hospitality Service professional
W.H.Bender Quote 62 - Always strive to be a Hospitality Service professionalW.H.Bender Quote 62 - Always strive to be a Hospitality Service professional
W.H.Bender Quote 62 - Always strive to be a Hospitality Service professionalWilliam (Bill) H. Bender, FCSI
 
Safety T fire missions army field Artillery
Safety T fire missions army field ArtillerySafety T fire missions army field Artillery
Safety T fire missions army field ArtilleryKennethSwanberg
 

Executive Briefing: Why managing machines is harder than you think

  • 1. Executive Briefing Why managing machines is harder than you think Peter Skomoroch - @peteskomoroch Strata Data Conference, London - May 1, 2019
  • 2. Background: Machine Learning & Data Products Peter Skomoroch @peteskomoroch • Co-Founder and CEO of SkipFlag, Enterprise AI startup acquired in 2018 by Workday • 18+ years building machine learning products • Principal Data Scientist, ran Data Products team at LinkedIn. ML & Search at MIT, AOL, ProfitLogic • Co-Host of O’Reilly AI Bots Podcast, Startup Advisor
  • 3. Better, Faster Decisions at Scale • Machine learning drove massive growth at consumer internet companies over the last decade • A wave of AI startups and vertical machine learning applications has emerged across other industries • For many problems, machine learning makes better, faster, and more repeatable decisions at scale • Amazon, Google, and Microsoft are now reorganizing themselves around AI
  • 4. Data Products: Automated systems that collect and learn from data to make user-facing decisions with machine learning
  • 5. Machine Learning Projects are Hard • The transition to machine learning will be about 100x harder than the transition to mobile • Companies that adopt an experimental culture can still succeed • Some of the biggest challenges are organizational, not technical • Data-driven companies like Google and Facebook have a strategic advantage building ML products based on their data & compute assets, large user population, tracking & instrumentation, and AI talent
  • 6. Experimental Culture "If you only do things where you know the answer in advance, your company goes away." (Jeff Bezos, Founder, Chairman & CEO of Amazon.com) • Machine Learning shifts engineering from a deterministic process to a probabilistic one • Take intelligent risks • Most successful ML products are experiments at massive scale • Companies driven by analytics and experimental insights are more likely to succeed
  • 7. Data Pipelines & Analytics Before AI Credit: @mrogati
  • 8. ML Algorithms Need Lots of Labelled Data Common Crawl: ~4B pages monthly
  • 9. Combined Pools of Data Give Better Results https://www.flickr.com/photos/nakrnsm/3814916578 • Learning patterns across large numbers of customers is the power behind recommendations from companies like Amazon and Netflix • The more precise or nuanced a prediction, the more data will need to be pooled • You need large amounts of labelled training data • Transfer learning may help push these limits further
  • 10. Democratize Data Access • Allow teams across your company to combine real data to improve their product areas, design with data, and discover new insights • Share derived data and input features for ML models across teams • At LinkedIn we had a rich repository of signals like connection strength, inferred skills, and other datasets that greatly accelerated new product development • Empower small teams to build things quickly and compound returns on feature engineering & derived data See https://www.confluent.io/ebook/i-heart-logs-event-data-stream-processing-and-data-integration/
  • 11. Product Management for Machine Learning Image source: Martin Eriksson https://www.mindtheproduct.com/2011/10/what-exactly-is-a-product-manager/ • A Data Product Manager (PM) has core product skills (strategy, roadmaps, prioritization, etc.) along with an intuitive grasp of ML • They help identify and prioritize the highest value applications for machine learning and do what it takes to make them successful
  • 12. Good ML Product Managers Have Data Expertise • Know the difference between easy, hard, and impossible machine learning problems • Even if something is feasible from a machine learning perspective, the level of effort may not justify building the feature • Know your company’s data inside and out including quality issues, limitations, biases, and gaps that need to be addressed • Develop an intuitive understanding of your company’s data and how it can be used to solve customer problems
  • 13. Apply ML to a Metric the Business Cares About
  • 14. Machine Learning Product Development 1. Verify you are solving the right problem 2. Theory + model design (in parallel with UI design) 3. Data collection, labelling, and cleaning 4. Feature engineering, model training, offline validation 5. Model deployment, monitoring & large scale training • Iterate: repeat process, refine live model & improve • 80% of effort and gains come from iterations after shipping v1.0 • Use derived data from the system to build new products
  • 15. ML Adds Uncertainty to Product Roadmaps • PMs are often uncomfortable with expensive ideas that have an uncertain probability of success • Many organizations will struggle to justify the expense of projects that require significant research investment upfront • Some ML products may need to be split into time-boxed projects that get to market in a shorter time frame • What can you productize now vs. much later on? • Keep track of dependencies on other teams and have a "Plan B"
  • 16. Data Quality & Standardization "Every single company I've worked at and talked to has the same problem without a single exception so far: poor data quality, especially tracking data" (Ruslan Belkin, VP of Engineering, Salesforce.com) • Guide user input when you can • Use auto suggest fields • Validate user inputs, emails • Collect user tags, votes, ratings • Track impressions, queries, clicks • Sessionize logs • Disambiguate and annotate entities (company names, locations, etc.)
  • 17. Testing Machine Learning Products • Algorithm work that drags on without integration in the product where it can be seen and tested by real users is risky • Ship a complete MVP in production ASAP, benchmark, and iterate • Beware unintended consequences from seemingly small product changes • Remember the prototype is not the product: see what happens when you use a more realistic data set or scale up your inputs • Real-world data changes over time, so ensure your model tests and benchmarks keep up with changes in the underlying data • Machine learning systems tend to fail in unexpected ways
  • 18. Look at Your Input Data & Prediction Errors
  • 19. Flywheel Effects & Data Products • Users generate data as a side effect of using most software products • That data, in turn, can improve the product's algorithms and enable new types of recommendations, leading to more data • These "Flywheels" get better the more customers use them, leading to unique competitive moats • This works well in platforms, networks, or marketplaces where value compounds * https://medium.freecodecamp.org/the-business-implications-of-machine-learning-11480b99184d
  • 20. Final Thoughts • Machine learning products are hard to build, but within reach of teams who invest in data infrastructure • Some of the biggest challenges are organizational, not technical • Good product leaders are a key factor in shipping successful ML products • Find a machine learning application with a direct connection to a metric your organization values and ship it Send me questions! @peteskomoroch
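
Slide 16 lists "Sessionize logs" as one step toward better tracking data. As a rough illustration only, the sketch below groups raw click events into per-user sessions. The `sessionize` function, the `(user_id, timestamp)` event shape, and the 30-minute inactivity gap are all assumptions made for this example; none of them come from the talk.

```python
from datetime import datetime, timedelta


def sessionize(events, gap=timedelta(minutes=30)):
    """Group (user_id, timestamp) events into per-user sessions.

    A new session starts whenever a user's next event arrives more than
    `gap` after their previous one. The 30-minute default is an assumed
    convention, not a value taken from the slides.
    """
    sessions = {}
    # Sorting tuples orders events by user first, then by timestamp.
    for user_id, ts in sorted(events):
        user_sessions = sessions.setdefault(user_id, [])
        if user_sessions and ts - user_sessions[-1][-1] <= gap:
            user_sessions[-1].append(ts)  # continue the current session
        else:
            user_sessions.append([ts])  # start a new session
    return sessions


# Hypothetical tracking events for two users.
events = [
    ("alice", datetime(2019, 5, 1, 9, 0)),
    ("alice", datetime(2019, 5, 1, 9, 10)),
    ("alice", datetime(2019, 5, 1, 11, 0)),
    ("bob", datetime(2019, 5, 1, 9, 5)),
]
sessions = sessionize(events)
```

In this toy input, the two morning events for "alice" fall into one session and her 11:00 event starts a second one. A production pipeline would more likely run the same logic in a stream or batch processing system than in memory.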