the 17 ways ML / AI can
FAIL
Brian Lange, Partner + Data Scientist
who is this guy?
who are these folks?
TODAY
- A quick framework for the lifecycle of a
data product
- A multitude of failure modes we’ve seen
throughout that lifecycle
- Some ways those failures can be avoided
Lifecycle of a ML/AI product/project
- conception: The idea for a product or project is born.
- experimentation: A data team tries to make it happen.
- productionizing: The experiment becomes more reliable and integrated.
- usage: The product is used by people who didn’t design it.
conception experimentation productionizing usage ∞
Let’s get to the
FAILures
Failure of Imagination
The opportunity never gets dreamed of in the first place
How to avoid:
- Leadership is aware of and excited about
opportunities with data
- Data scientists are involved early in idea generation
- Data team has diverse backgrounds
Failure by Overimagination
The idea isn’t something that can actually be
done, even using creative/cutting-edge methods
How to avoid:
- Leadership is data literate
- Data team is involved in conception stage
Failure by Inability
There’s nobody at the organization with the skills to
even begin experimenting
How to avoid:
- Hire data scientists! Preferably self-starters
Failure of Collection
The data needed to make the project happen
doesn’t exist yet
How to avoid:
- Collect as much as possible from as early as
possible, or
- Be patient and let the data roll in after defining
what you need
Failure of Access
The data needed to make the project happen
exists, but something prevents us from
accessing it or we’re not aware of it.
How to avoid:
- Good data governance/documentation
- Hire data scientists skilled in pulling from many
possible types of sources
- High-ranking leadership supports the project, cuts
through red tape/gatekeepers
Failure of Investment
Funding/resources are cut before the experiment
reaches its conclusion
How to avoid:
- Leadership/data team knows how to budget for
data projects
- Project has enough potential value to justify risk/investment
Failure of Data Richness
There isn’t enough data to train a model
effectively, or the data we have doesn’t have
enough “signal” in it
How to avoid:
- This is hard. You don’t necessarily know until you try
- Experienced data scientists can at least screen out
the projects where there obviously isn’t enough data or signal
Failure of Rigor
The data team thinks they got great results, but
they’re not actually good.
How to avoid:
- Data team (and preferably leadership) are skeptical
and know how to properly use metrics
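One common way a result looks great without being good is accuracy on imbalanced data. The following is a minimal sketch, not the speaker's method: the labels, the predictions, and the helper names (`accuracy`, `majority_baseline_accuracy`, `recall`) are all made up for illustration. It compares a model against the trivial "always predict the most common class" baseline and also checks recall.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def majority_baseline_accuracy(y_true):
    """Accuracy obtained by always predicting the most common label."""
    return Counter(y_true).most_common(1)[0][1] / len(y_true)


def recall(y_true, y_pred, positive=1):
    """Share of actual positives that the model predicted as positive."""
    preds_for_positives = [p for t, p in zip(y_true, y_pred) if t == positive]
    if not preds_for_positives:
        return 0.0
    return sum(p == positive for p in preds_for_positives) / len(preds_for_positives)


# Hypothetical held-out labels: 95 negatives followed by 5 positives.
y_true = [0] * 95 + [1] * 5
# Hypothetical model output: every negative is right, only 1 of 5 positives is found.
y_pred = [0] * 95 + [1] + [0] * 4

model_acc = accuracy(y_true, y_pred)               # 96 of 100 correct
baseline_acc = majority_baseline_accuracy(y_true)  # 95 of 100 correct
model_recall = recall(y_true, y_pred)              # 1 of 5 positives found

print(f"model accuracy:    {model_acc:.2f}")
print(f"baseline accuracy: {baseline_acc:.2f}")
print(f"model recall:      {model_recall:.2f}")
```

In this made-up case the model is only one point better than doing nothing, and it misses four of five positives, which a single headline accuracy number would hide.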
Failure of Interpretability
The model works great, but you have no idea why
How to avoid:
- This only matters if you need to know.
- Constraints around interpretability should be
defined before experimentation
Failure of IT Cooperation
The infrastructure gatekeepers are busy or
aren’t cooperative
How to avoid:
- Have a great IT team and plan for their involvement
early on, or
- Empower data team to stand up their own
infrastructure and hire for those skills
Failure of Practicality
The thing you devise is too damn complicated
“We evaluated some of the new methods offline but the additional accuracy
gains that we measured did not seem to justify the engineering effort needed to
bring them into a production environment.”
How to avoid:
- Define relevant context of usage before even
building the model, and keep it in mind
- Don’t spend too much time experimenting if you
have something that’s viable
Failure of Business Case
The infrastructure required to put your model into
production is too expensive compared to its utility
How to avoid:
- Focus on applications that are clearly high value
and may have many potential uses
- Hire skilled IT/data engineers who know how to
keep running costs low
- Leadership plays the role of monitoring project viability, or
- Data team is business savvy and plays this role
Failure by Shifting Needs
You took so long that the business case changed
How to avoid:
- Iterate/sprint. Don’t go work on your project in
a closet for 9 months.
- Keep leadership involved with frequent quick
progress updates
Failure of Adaptation
Reality drifts, and the model doesn’t keep up
How to avoid:
- Have a plan for how the system is updated
- Research how often patterns change drastically in your data
- Monitor accuracy over time with periodic quality checks
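A periodic quality check could look roughly like the sketch below. It assumes you can log, for each prediction, the period it was made in plus the eventual true outcome. The record layout, the reference accuracy of 0.9, and the tolerance of 0.05 are hypothetical choices for illustration, not values from the talk.

```python
from collections import defaultdict


def accuracy_by_period(records):
    """records: iterable of (period, y_true, y_pred) tuples.

    Returns {period: accuracy}, with periods in sorted order.
    """
    hits = defaultdict(int)
    totals = defaultdict(int)
    for period, y_true, y_pred in records:
        totals[period] += 1
        hits[period] += int(y_true == y_pred)
    return {p: hits[p] / totals[p] for p in sorted(totals)}


def flag_drift(period_accuracy, reference_accuracy, tolerance=0.05):
    """Periods whose accuracy fell more than `tolerance` below the reference."""
    return [
        period
        for period, acc in period_accuracy.items()
        if acc < reference_accuracy - tolerance
    ]


# Hypothetical prediction log: accuracy decays from 90% to 80% to 60%.
records = (
    [("2024-01", 1, 1)] * 9 + [("2024-01", 1, 0)] * 1
    + [("2024-02", 1, 1)] * 8 + [("2024-02", 0, 1)] * 2
    + [("2024-03", 1, 1)] * 6 + [("2024-03", 0, 1)] * 4
)

per_period = accuracy_by_period(records)
drifted = flag_drift(per_period, reference_accuracy=0.9, tolerance=0.05)

print(per_period)
print("periods needing attention:", drifted)
```

The reference accuracy would normally come from the evaluation done before launch. How often to run the check depends on how quickly patterns change in your data.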
Failure by Side Effects
Your product has unintended consequences
“Black defendants were often predicted to be at a higher risk of recidivism than
they actually were. Our analysis found that black defendants who did not
recidivate over a two-year period were nearly twice as likely to be misclassified as
higher risk compared to their white counterparts (45 percent vs. 23 percent).”
Source: ProPublica, How We Analyzed the COMPAS Recidivism Algorithm
How to avoid:
- Dedicate some time up front to exploring potential implications
- Consider where your training data comes from
- Write test code that looks at model outcomes across different
groups of users
- Avoid collecting unnecessary PII, and safeguard/encrypt what
you have
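The suggestion to write test code that looks at model outcomes across different groups of users could be sketched as follows. This is one possible check among many: it compares false positive rates per group, the kind of gap the COMPAS analysis quoted earlier describes. The group labels, the synthetic rows, and the 0.10 threshold are assumptions made up for illustration, and which fairness metric is appropriate depends on the product.

```python
from collections import defaultdict


def false_positive_rate_by_group(rows):
    """rows: iterable of (group, actual, predicted), where 1 means flagged.

    Returns {group: FP / (FP + TN)}, computed only over rows whose actual
    outcome was negative (0).
    """
    false_positives = defaultdict(int)
    negatives = defaultdict(int)
    for group, actual, predicted in rows:
        if actual == 0:
            negatives[group] += 1
            false_positives[group] += int(predicted == 1)
    return {g: false_positives[g] / negatives[g] for g in negatives}


def max_rate_gap(rates):
    """Largest difference in rate between any two groups."""
    return max(rates.values()) - min(rates.values())


# Synthetic rows, NOT real data: 100 actual negatives per group,
# with 30 wrongly flagged in group "a" and 10 in group "b".
rows = (
    [("a", 0, 1)] * 30 + [("a", 0, 0)] * 70
    + [("b", 0, 1)] * 10 + [("b", 0, 0)] * 90
)

rates = false_positive_rate_by_group(rows)
gap = max_rate_gap(rates)
MAX_ALLOWED_GAP = 0.10  # hypothetical threshold; choose one per product
gap_is_acceptable = gap <= MAX_ALLOWED_GAP

print(rates)
print(f"gap: {gap:.2f}, acceptable: {gap_is_acceptable}")
```

Run against held-out predictions as part of a test suite, a check like this turns a disparity into a failing test instead of something discovered after launch.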
Failure of Morality
Your product works well, but towards an immoral purpose
How to avoid:
- Dedicate some time up front to exploring potential
implications
- Invest in learning about ethics in design, data, technology
- Don’t work for evil organizations
Failure of Design
Nobody uses your product
How to avoid:
- Use design methods to ensure your product solves a real need
- Focus on the interfaces and test them with users
- Experiment to determine how much people trust your product
and/or think it’s creepy
[Figure: Lifecycle of a ML/AI product/project, with a FAIL costing $ and 🕑]
[Figure: Lifecycle of a ML/AI product/project, with a FAIL costing $$$$$$$$ and 🕑 🕑 🕑 🕑 🕑 🕑]
Process choices to minimize failure
conception experimentation productionizing usage (or testing)
- Put prototypes in front of real users to learn and refine
- Involve data team as early as possible
- Prioritize ideas with clear value
- Think about potential implications and context
- Move fast, overcommunicate
Team choices to minimize failure
Leadership
- Recognize potential value of data products and support them with resources and authority
- Are data literate enough to evaluate work and contribute ideas
- Keep tabs on team process and help them navigate shifting business realities
Data Team
- Creative and business savvy, can generate ideas for new data projects
- Diverse analytical backgrounds
- Able to pull, merge, clean data from a variety of sources
- Can accurately evaluate their own work
- Have either design skills or dedicated designers, to ensure the usefulness of a product
IT
- Data literate, supportive, and unconstrained, OR provide self-serve resources to a dedicated IT role on data team
- Able to estimate and minimize costs for productionizing models
- Optionally, can assist data team in more software-heavy aspects
Culture choices to minimize failure
- Invest in training the team about ethics, privacy,
and security, and encourage conversations about it
- Encourage every role to focus on the needs of
individual users and the business
- Allow ideas for new projects to emerge from the
data team as well as leadership
- Make the default answer to “can I have that data/computing power?” yes
Don’t let perfect be the enemy of good
- Checking all these boxes makes projects run smoother
- It’s exceedingly rare to check all these boxes
- Failure of Even Starting is a failure to learn or make
progress at all
Quick plugs
- Becoming a Data Scientist/Growing a Data Science Team: Metis (thisismetis.com)
- Increasing Data Literacy in Leadership: data-science-for-managers.datascope.co
- Hiring a data science team to help you dream up projects and make them happen: Datascope (datascope.co)
@bjlange
thanks!
questions?
brian [at] datascope.co
slides will be posted on Twitter later!

The 17 Ways Machine Learning and AI Can Fail

  • 1. the 17 ways ML / AI can FAIL Brian Lange, Partner + Data Scientist
  • 2. who is this guy?
  • 3. who are these folks?
  • 4. TODAY - A quick framework for the lifecycle of a data product - A multitude of failure modes we’ve seen throughout that lifecycle - Some ways those failures can be avoided
  • 5. Lifecycle of a ML/AI product/project
  • 6. conception The idea for a product or project is born. Lifecycle of a ML/AI product/project
  • 7. conception experimentation A data team tries to make it happen. Lifecycle of a ML/AI product/project
  • 8. conception experimentation productionizing The experiment becomes more reliable and integrated. Lifecycle of a ML/AI product/project
  • 9. conception experimentation productionizing usage The product is used by people who didn’t design it. Lifecycle of a ML/AI product/project
  • 10. conception experimentation productionizing usage Lifecycle of a ML/AI product/project ∞
  • 11. Let’s get to the FAILures
  • 12. conception experimentation productionizing usage ∞ Lifecycle of a ML/AI product/project
  • 14. conception experimentation productionizing usage Failure of Imagination The opportunity never gets dreamed of in the first place How to avoid: - Leadership that is aware and excited about opportunities w/ data - Data scientists are involved early in idea generation - Data team has diverse backgrounds
  • 15. conception experimentation productionizing usage Failure by Overimagination The idea isn’t something that can actually be done, even using creative/cutting-edge methods How to avoid: - Leadership is data literate - Data team is involved in conception stage
  • 16. conception experimentation productionizing usage Failure by Inability There’s nobody at the organization with the skills to even begin experimenting How to avoid: - Hire data scientists! Preferably self-starters
  • 17. conception experimentation productionizing usage Failure of Collection The data needed to make the project happen doesn’t exist yet How to avoid: - Collect as much as possible from as early as possible, or - Be patient and let the data roll in after defining what you need
  • 18. conception experimentation productionizing usage Failure of Access The data needed to make the project happen exists, but something prevents us from accessing it or we’re not aware of it. How to avoid: - Good data governance/documentation - Hire data scientists skilled in pulling from many possible types of sources
  • 19. conception experimentation productionizing usage Failure of Access The data needed to make the project happen exists, but something prevents us from accessing it or we’re not aware of it. How to avoid: - High-ranking leadership supports the project, cuts through red tape/gatekeepers
  • 20. conception experimentation productionizing usage Failure of Investment Funding/resources are cut before the experiment reaches its conclusion How to avoid: - Leadership/data team knows how to budget for data projects - Project has enough potential value to justify risk/ investment
  • 21. conception experimentation productionizing usage Failure of Data Richness There isn’t enough data to train a model effectively, or the data we have doesn’t have enough “signal” in it How to avoid: - This is hard. Don’t necessarily know until you try - Experienced data scientists can at least screen out the projects where there obviously isn’t enough
  • 22. conception experimentation productionizing usage Failure of Rigor The data team thinks they got great results, but they’re not actually good. How to avoid: - Data team (and preferably leadership) are skeptical and know how to properly use metrics
  • 23. conception experimentation productionizing usage Failure of Interpretability The model works great, but you have no idea why How to avoid: - This only matters if you need to know. - Constraints around interpretability should be defined before experimentation
  • 24. conception experimentation productionizing usage Failure of IT Cooperation The infrastructure gatekeepers are busy or aren’t cooperative How to avoid: - Have a great IT team and plan for their involvement early on, or - Empower data team to stand up their own infrastructure and hire for those skills
  • 25. conception experimentation productionizing usage Failure of Practicality The thing you devise is too damn complicated “We evaluated some of the new methods offline but the additional accuracy gains that we measured did not seem to justify the engineering effort needed to bring them into a production environment.”
  • 26. conception experimentation productionizing usage Failure of Practicality The thing you devise is too damn complicated How to avoid: - Define relevant context of usage before even building the model, and keep it in mind - Don’t spend too much time experimenting if you have something that’s viable
  • 27. conception experimentation productionizing usage Failure of Business Case The infrastructure required to put your model into production is too expensive compared to its utility How to avoid: - Focus on applications that are clearly high value and may have many potential uses - Hire skilled IT/data engineers who know how to keep running costs low
  • 28. Failure of Business Case: The infrastructure required to put your model into production is too expensive compared to its utility. How to avoid: - Leadership plays the role of monitoring project viability, or - Data team is business savvy and plays this role
  • 29. Failure by Shifting Needs: You took so long that the business case changed. How to avoid: - Iterate/sprint. Don’t go work on your project in a closet for 9 months. - Keep leadership involved with frequent quick progress updates
  • 30. Failure of Adaptation: Reality drifts, and the model doesn’t keep up. How to avoid: - Have a plan for how the system is updated - Research how often patterns change drastically in your data - Monitor accuracy over time with periodic quality checks
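A periodic quality check can be as simple as tracking accuracy over the most recent labeled predictions and flagging when it drops. The following is a minimal sketch under assumptions not in the slides: the class name `AccuracyMonitor`, the window size, and the threshold are all hypothetical, and it presumes ground-truth labels eventually become available for production predictions.

```python
from collections import deque


class AccuracyMonitor:
    """Tracks accuracy over a sliding window of recent predictions."""

    def __init__(self, window_size=100, min_accuracy=0.8):
        # deque with maxlen drops the oldest outcome when a new one arrives
        self.window = deque(maxlen=window_size)
        self.min_accuracy = min_accuracy

    def record(self, prediction, actual):
        """Store whether a single prediction turned out to be correct."""
        self.window.append(prediction == actual)

    def accuracy(self):
        """Accuracy over the current window, or None if nothing is recorded."""
        if not self.window:
            return None
        return sum(self.window) / len(self.window)

    def needs_attention(self):
        """True once the window is full and accuracy is below the threshold."""
        window_full = len(self.window) == self.window.maxlen
        return window_full and self.accuracy() < self.min_accuracy


# Illustrative run: the model starts out right, then reality drifts.
monitor = AccuracyMonitor(window_size=4, min_accuracy=0.75)
for _ in range(4):
    monitor.record(prediction=1, actual=1)
healthy = monitor.needs_attention()
for _ in range(2):
    monitor.record(prediction=1, actual=0)
drifted = monitor.needs_attention()
```

Waiting for a full window before alerting is a design choice that avoids noisy alarms on the first few observations; a real system would likely also want time-based windows and alerting hooks.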
  • 31. Failure by Side Effects: Your product has unintended consequences. “Black defendants were often predicted to be at a higher risk of recidivism than they actually were. Our analysis found that black defendants who did not recidivate over a two-year period were nearly twice as likely to be misclassified as higher risk compared to their white counterparts (45 percent vs. 23 percent).” Source: How We Analyzed the COMPAS Recidivism Algorithm
  • 32. Failure by Side Effects: Your product has unintended consequences. How to avoid: - Dedicate some time up front to exploring potential implications - Consider where your training data comes from - Write test code that looks at model outcomes across different groups of users
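Test code that looks at outcomes across groups could, for example, compare false positive rates, which is the kind of disparity quoted from the COMPAS analysis. This is a hedged sketch: the function names, the record layout `(group, actual, predicted)`, and the choice of false positive rate as the metric are assumptions for illustration, and the right fairness metric depends on the application.

```python
from collections import defaultdict


def false_positive_rate_by_group(records):
    """Per-group false positive rate.

    records: iterable of (group, actual, predicted) with boolean labels.
    Groups with no actual negatives are omitted from the result.
    """
    false_positives = defaultdict(int)
    negatives = defaultdict(int)
    for group, actual, predicted in records:
        if not actual:
            negatives[group] += 1
            if predicted:
                false_positives[group] += 1
    return {g: false_positives[g] / n for g, n in negatives.items()}


def max_fpr_gap(records):
    """Largest difference in false positive rate between any two groups."""
    rates = false_positive_rate_by_group(records)
    if not rates:
        return 0.0
    return max(rates.values()) - min(rates.values())


# Made-up records purely for illustration: (group, actual, predicted)
records = [
    ("a", False, True),
    ("a", False, False),
    ("b", False, False),
    ("b", False, False),
    ("b", True, True),
]
rates = false_positive_rate_by_group(records)
gap = max_fpr_gap(records)
```

In a test suite, a check such as `assert max_fpr_gap(records) < tolerance` (with a tolerance the team agrees on) would fail the build when one group is misclassified much more often than another.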
  • 33. Failure by Side Effects: Your product has unintended consequences. How to avoid: - Avoid collecting unnecessary PII, and safeguard/encrypt what you have
  • 34. Failure of Morality: Your product works well, but towards an immoral purpose. How to avoid: - Dedicate some time up front to exploring potential implications - Invest in learning about ethics in design, data, technology - Don’t work for evil organizations
  • 35. Failure of Design: Nobody uses your product. How to avoid: - Use design methods to ensure your product solves a real need - Focus on the interfaces and test them with users - Experiment to determine how much people trust your product and/or think it’s creepy
  • 36. Lifecycle of a ML/AI product/project [figure: lifecycle timeline (conception, experimentation, productionizing, usage, ∞) annotated with $, 🕑, and FAIL]
  • 37. Lifecycle of a ML/AI product/project [figure: lifecycle timeline (conception, experimentation, productionizing, usage, ∞) annotated with $$$$$$$$, 🕑 🕑 🕑 🕑 🕑 🕑, and FAIL]
  • 38. Process choices to minimize failure (conception, experimentation, productionizing, usage): - Put prototypes in front of real users to learn and refine - Involve data team as early as possible - Prioritize ideas with clear value - Think about potential implications and context - Move fast, overcommunicate
  • 40. Process choices to minimize failure: conception, experimentation, productionizing, usage (or testing)
  • 42. Team choices to minimize failure. Leadership: - Recognize potential value of data products and support them with resources and authority - Are data literate enough to evaluate work and contribute ideas - Keep tabs on team process and help them navigate shifting business realities. Data Team: - Creative and business savvy, can generate ideas for new data projects - Diverse analytical backgrounds - Able to pull, merge, clean data from a variety of sources - Can accurately evaluate their own work - Have either design skills or dedicated designers, to ensure the usefulness of a product. IT: - Data literate, supportive, and unconstrained, OR provide self-serve resources to a dedicated IT role on data team - Able to estimate and minimize costs for productionizing models - Optionally, can assist data team in more software-heavy aspects
  • 43. Culture choices to minimize failure: - Invest in training the team about ethics, privacy, and security, and encourage conversations about it - Encourage every role to focus on the needs of individual users and the business - Allow ideas for new projects to emerge from the data team as well as leadership - Make the default answer to “can I have that data/computing power?” yes
  • 44. Don’t let perfect be the enemy of good: - Checking all these boxes makes projects run more smoothly - It’s exceedingly rare to check all these boxes - Failure of Even Starting is a failure to learn or make progress at all
  • 45. Quick plugs: - Becoming a Data Scientist/Growing a Data Science Team: Metis (thisismetis.com) - Increasing Data Literacy in Leadership: data-science-for-managers.datascope.co - Hiring a data science team to help you dream up projects and make them happen: Datascope (datascope.co)