SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Downloaden Sie, um offline zu lesen
BRIDGING THE GAP
FROM DATA SCIENCE TO
SERVICE
32ND PYDATA LONDON MEETUP
Daniel F Moisset - dmoisset@ /machinalis.com @dmoisset
ABOUT ME
Hi! I'm Daniel Moisset!
I work at Machinalis
A special thanks to Marcos Spontón who also works there and inspired most of this talk.
WARNING: THIS IS NOT A TECH
TALK!
In other words:
THIS TALK IS NOT ABOUT ALGORITHMS,
MODELS, TOOLS, OR USE CASES
In event di ferent words:
THIS TALK IS ABOUT PEOPLE
SO, RAISIN BREAD
By je freyw (Mmm...raisin bread) [ ],CC BY 2.0 via Wikimedia Commons
Machine Learning development is like the raisins in a
raisin bread... you need the bread first. But, it's just a
few tiny raisins but without it you would just have
plain bread
— I don't really know who, but I love the analogy
WHO WANTS RAISIN BREAD
Di ferent organizations use your services:
1. Large companies with a live product and data, but without enough
expertise/manpower in DS: «we'd like to add some raisins to our
bread»
2. Small start-up, with maybe just a prototype, that want to get to
production-ready scalable MVP: «We want some bread». And «it
should have raisins now/at some point in the future»
IS THAT WHAT THEY ACTUALLY
NEED?
“All the cool kids are doing it” is not good enough reason.
— Seen on the internet
Raisin cookies that look like chocolate chip cookies are
the main reason I have trust issues
PART I: COMMUNICATING
WITH THE CUSTOMER
IT'S NOT JUST SOFTWARE
DEVELOPMENT!
It also has a heavy R&D component
Higher uncertainty
Results are probabilistic
THERE'S A PAPER ABOUT IT ≠ A
PRODUCT
The distance may not be something coverable today.
MODELS ARE AN ASSET
Investing time on it is not a “necessary evil”
What's produced on a modelling phase is a critical component
A model emerges from the client data and constraints, so it is
unique to the client and an advantage over competitors.
MACHINE LEARNING ≠
CLAIRVOYANCE
Garbage in, Garbage out
The solution may not be clear; you may be unsure of what problem
is more important; but your business goal should be clear. Data
Science will not make it clear for you.
AGREEING ON METRICS
Explain what are you measuring and why
Explain what are the baselines and how much you think you can
improve
Connect these to the business goals.
A PICTURE IS WORTH A
THOUSAND WORDS
Visualize your proposal.
Be minimalistic.
Use o f the shelf tools for a proposal.
PART II: PROVIDING THE
SERVICE
THE SERVICE IS THE END, DATA
SCIENCE IS THE MEANS
Do not fall in love with the challenge
JUST OUT OF THE BOX MAY BE
ENOUGH
You should always be asking yourself:
1. Have I already covered the expectations?
2. Will an improved result here actually improve value?
MEASURE TWICE, CUT ONCE
Get a look at the object of analysis before starting work. Has it desirable
qualities?
1. Manageable size?
2. It's in an accessible representation?
3. Does it have a reasonable distribution?
4. ...
INVOLVE THE PO
Validate your assumptions with a person familiar with your domain
1. Are there contradictions between your assumptions and their
knowledge?
2. Are there contradictions between the data you already have and
their knowledge?
Keep learning about the business side, encourage your business
counterpart to learn to talk with Data Scientists.
PART OF YOUR SERVICE IS NOT
DS
Make sure you use the right tools and people in each area
PART III: WORKING AS A TEAM
SHARE INFORMATION
Basic descriptive statistics should be shared with all involved, even the
non DS. People in a team must be aware of what's important and
what's not.
SHARE UNCERTAINTY
There are a lot of tradeo fs to make regarding milestones and
deadlines. People can plan better (and have contingency plans) if they
know what parts of the project have higher risks.
IT'S OK TO BUILD FLIMSY CODE,
AS LONG AS IT'S NOT
SOFTWARE
code: programming text that runs on a computer
so tware: programming text that is part of a deliverable.
There are di ferences:
code does not necessarily need tests.
code does not necessarily need to follow other processes.
sometimes the outputs of your code are deliverable and may have
to be treated specially.
THE DISCUSSION IS
JUST BEGINNING
I'D LOVE TO HEAR ABOUT WHAT YOU'VE
LEARNED ELSEWHERE
THANKS!
ANY QUESTIONS?
You can find me at twitter (@dmoisset) or by email (dmoisset@machinalis.com)

Weitere ähnliche Inhalte

Andere mochten auch

Conférence numérique éducatif - semaine de l'innovation
Conférence numérique éducatif - semaine de l'innovationConférence numérique éducatif - semaine de l'innovation
Conférence numérique éducatif - semaine de l'innovationJean-Baptiste Lesaulnier
 
2017 ZRAY SPORTS
2017 ZRAY SPORTS2017 ZRAY SPORTS
2017 ZRAY SPORTSSophia Cui
 
(株)自治体構想による三根庁舎旧議場の利活用
(株)自治体構想による三根庁舎旧議場の利活用(株)自治体構想による三根庁舎旧議場の利活用
(株)自治体構想による三根庁舎旧議場の利活用隆志 杉山
 
E2D3で地図を作画してみよう
E2D3で地図を作画してみようE2D3で地図を作画してみよう
E2D3で地図を作画してみようShigeo Ueda
 
How a CDCL SAT solver works
How a CDCL SAT solver worksHow a CDCL SAT solver works
How a CDCL SAT solver worksMasahiro Sakai
 
Marigo Raftopoulos for Gamification World Congress, Barcelona 2015
Marigo Raftopoulos for Gamification World Congress, Barcelona 2015Marigo Raftopoulos for Gamification World Congress, Barcelona 2015
Marigo Raftopoulos for Gamification World Congress, Barcelona 2015Dr. Marigo Raftopoulos
 
顔認識アルゴリズム:Constrained local model を調べてみた
顔認識アルゴリズム:Constrained local model を調べてみた顔認識アルゴリズム:Constrained local model を調べてみた
顔認識アルゴリズム:Constrained local model を調べてみたJotaro Shigeyama
 
神に近づくx/net/context (Finding God with x/net/context)
神に近づくx/net/context (Finding God with x/net/context)神に近づくx/net/context (Finding God with x/net/context)
神に近づくx/net/context (Finding God with x/net/context)guregu
 
Basculement du monde et géopolitique du monde
Basculement du monde et géopolitique du mondeBasculement du monde et géopolitique du monde
Basculement du monde et géopolitique du mondeJean-François Fiorina
 
298885937-Us-Naval-Incompetence
298885937-Us-Naval-Incompetence298885937-Us-Naval-Incompetence
298885937-Us-Naval-IncompetenceAgha A
 
Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...
Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...
Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...Andre Stern, CTS
 
Gentooプリインストールなノートパソコンの話
Gentooプリインストールなノートパソコンの話Gentooプリインストールなノートパソコンの話
Gentooプリインストールなノートパソコンの話Takuto Matsuu
 

Andere mochten auch (15)

Conférence numérique éducatif - semaine de l'innovation
Conférence numérique éducatif - semaine de l'innovationConférence numérique éducatif - semaine de l'innovation
Conférence numérique éducatif - semaine de l'innovation
 
Making The Most Of Internship
Making The Most Of Internship  Making The Most Of Internship
Making The Most Of Internship
 
2017 ZRAY SPORTS
2017 ZRAY SPORTS2017 ZRAY SPORTS
2017 ZRAY SPORTS
 
(株)自治体構想による三根庁舎旧議場の利活用
(株)自治体構想による三根庁舎旧議場の利活用(株)自治体構想による三根庁舎旧議場の利活用
(株)自治体構想による三根庁舎旧議場の利活用
 
E2D3で地図を作画してみよう
E2D3で地図を作画してみようE2D3で地図を作画してみよう
E2D3で地図を作画してみよう
 
GUIA PARA SALIR DE LA PRECARIEDAD LABORAL
GUIA PARA SALIR DE LA PRECARIEDAD LABORALGUIA PARA SALIR DE LA PRECARIEDAD LABORAL
GUIA PARA SALIR DE LA PRECARIEDAD LABORAL
 
How a CDCL SAT solver works
How a CDCL SAT solver worksHow a CDCL SAT solver works
How a CDCL SAT solver works
 
Marigo Raftopoulos for Gamification World Congress, Barcelona 2015
Marigo Raftopoulos for Gamification World Congress, Barcelona 2015Marigo Raftopoulos for Gamification World Congress, Barcelona 2015
Marigo Raftopoulos for Gamification World Congress, Barcelona 2015
 
顔認識アルゴリズム:Constrained local model を調べてみた
顔認識アルゴリズム:Constrained local model を調べてみた顔認識アルゴリズム:Constrained local model を調べてみた
顔認識アルゴリズム:Constrained local model を調べてみた
 
神に近づくx/net/context (Finding God with x/net/context)
神に近づくx/net/context (Finding God with x/net/context)神に近づくx/net/context (Finding God with x/net/context)
神に近づくx/net/context (Finding God with x/net/context)
 
Basculement du monde et géopolitique du monde
Basculement du monde et géopolitique du mondeBasculement du monde et géopolitique du monde
Basculement du monde et géopolitique du monde
 
298885937-Us-Naval-Incompetence
298885937-Us-Naval-Incompetence298885937-Us-Naval-Incompetence
298885937-Us-Naval-Incompetence
 
Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...
Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...
Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...
 
Hair Extension Courses Manchester
Hair Extension Courses ManchesterHair Extension Courses Manchester
Hair Extension Courses Manchester
 
Gentooプリインストールなノートパソコンの話
Gentooプリインストールなノートパソコンの話Gentooプリインストールなノートパソコンの話
Gentooプリインストールなノートパソコンの話
 

Ähnlich wie Bridging the gap from data science to service

Future of IT preso
Future of IT presoFuture of IT preso
Future of IT presoLorna Garey
 
Digital transformation studies linkedin
Digital transformation studies linkedinDigital transformation studies linkedin
Digital transformation studies linkedinClaudete Mello
 
Data (by itself) Is Not Enough
Data (by itself) Is Not EnoughData (by itself) Is Not Enough
Data (by itself) Is Not EnoughCory Treffiletti
 
Information modelling (Stefan Berner): Extract
Information modelling (Stefan Berner): ExtractInformation modelling (Stefan Berner): Extract
Information modelling (Stefan Berner): Extractvdf Hochschulverlag AG
 
The Handy Guide to Cashing in the Currency of Networking
The Handy Guide to Cashing in the Currency of NetworkingThe Handy Guide to Cashing in the Currency of Networking
The Handy Guide to Cashing in the Currency of NetworkingHubilo
 
Exploring the Business Decision to Use Cloud Computing
Exploring the Business Decision to Use Cloud ComputingExploring the Business Decision to Use Cloud Computing
Exploring the Business Decision to Use Cloud ComputingDana Gardner
 
Digital Transformation Failure
Digital Transformation FailureDigital Transformation Failure
Digital Transformation FailureFrederik Bernard
 
Marketing Your Tech Talent
Marketing Your Tech TalentMarketing Your Tech Talent
Marketing Your Tech Talentdeirdrestraughan
 
Activities Computing - reading comprehension
Activities Computing - reading comprehensionActivities Computing - reading comprehension
Activities Computing - reading comprehensionCintia Santos
 
Prepare a wow demo - extreme365 2020
Prepare a wow demo  - extreme365 2020Prepare a wow demo  - extreme365 2020
Prepare a wow demo - extreme365 2020Nico Fernandez
 
Open Web Technologies and You - Durham College Student Integration Presentation
Open Web Technologies and You - Durham College Student Integration PresentationOpen Web Technologies and You - Durham College Student Integration Presentation
Open Web Technologies and You - Durham College Student Integration Presentationdarryl_lehmann
 
Accessibility Buy-In for Inclusive Product Week
Accessibility Buy-In for Inclusive Product WeekAccessibility Buy-In for Inclusive Product Week
Accessibility Buy-In for Inclusive Product WeekKat K. Richards
 
Why do most machine learning projects never make it to production
Why do most machine learning projects never make it to productionWhy do most machine learning projects never make it to production
Why do most machine learning projects never make it to productionCameron Vetter
 
Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...
Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...
Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...Dana Gardner
 
New Era Of Corporate Communications Riaan Vanmeulen Fnb
New Era Of Corporate Communications Riaan Vanmeulen   FnbNew Era Of Corporate Communications Riaan Vanmeulen   Fnb
New Era Of Corporate Communications Riaan Vanmeulen Fnbguest22cb1ea7
 
Interview for saby upadhyay
Interview for  saby upadhyayInterview for  saby upadhyay
Interview for saby upadhyayCameronDonovan
 

Ähnlich wie Bridging the gap from data science to service (20)

Future of IT preso
Future of IT presoFuture of IT preso
Future of IT preso
 
Digital transformation studies linkedin
Digital transformation studies linkedinDigital transformation studies linkedin
Digital transformation studies linkedin
 
Data (by itself) Is Not Enough
Data (by itself) Is Not EnoughData (by itself) Is Not Enough
Data (by itself) Is Not Enough
 
2014 Technical Communication Conference Program
2014 Technical Communication Conference Program2014 Technical Communication Conference Program
2014 Technical Communication Conference Program
 
Information modelling (Stefan Berner): Extract
Information modelling (Stefan Berner): ExtractInformation modelling (Stefan Berner): Extract
Information modelling (Stefan Berner): Extract
 
Matchbox presentation
Matchbox presentation Matchbox presentation
Matchbox presentation
 
The Handy Guide to Cashing in the Currency of Networking
The Handy Guide to Cashing in the Currency of NetworkingThe Handy Guide to Cashing in the Currency of Networking
The Handy Guide to Cashing in the Currency of Networking
 
Exploring the Business Decision to Use Cloud Computing
Exploring the Business Decision to Use Cloud ComputingExploring the Business Decision to Use Cloud Computing
Exploring the Business Decision to Use Cloud Computing
 
Digital Transformation Failure
Digital Transformation FailureDigital Transformation Failure
Digital Transformation Failure
 
Marketing Your Tech Talent
Marketing Your Tech TalentMarketing Your Tech Talent
Marketing Your Tech Talent
 
Activities Computing - reading comprehension
Activities Computing - reading comprehensionActivities Computing - reading comprehension
Activities Computing - reading comprehension
 
slide3.pptx
slide3.pptxslide3.pptx
slide3.pptx
 
Prepare a wow demo - extreme365 2020
Prepare a wow demo  - extreme365 2020Prepare a wow demo  - extreme365 2020
Prepare a wow demo - extreme365 2020
 
Open Web Technologies and You - Durham College Student Integration Presentation
Open Web Technologies and You - Durham College Student Integration PresentationOpen Web Technologies and You - Durham College Student Integration Presentation
Open Web Technologies and You - Durham College Student Integration Presentation
 
Boursiquot "Privacy and The Effective Search Experience"
Boursiquot "Privacy and The Effective Search Experience"Boursiquot "Privacy and The Effective Search Experience"
Boursiquot "Privacy and The Effective Search Experience"
 
Accessibility Buy-In for Inclusive Product Week
Accessibility Buy-In for Inclusive Product WeekAccessibility Buy-In for Inclusive Product Week
Accessibility Buy-In for Inclusive Product Week
 
Why do most machine learning projects never make it to production
Why do most machine learning projects never make it to productionWhy do most machine learning projects never make it to production
Why do most machine learning projects never make it to production
 
Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...
Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...
Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...
 
New Era Of Corporate Communications Riaan Vanmeulen Fnb
New Era Of Corporate Communications Riaan Vanmeulen   FnbNew Era Of Corporate Communications Riaan Vanmeulen   Fnb
New Era Of Corporate Communications Riaan Vanmeulen Fnb
 
Interview for saby upadhyay
Interview for  saby upadhyayInterview for  saby upadhyay
Interview for saby upadhyay
 

Kürzlich hochgeladen

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 

Kürzlich hochgeladen (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 

Bridging the gap from data science to service

  • 1. BRIDGING THE GAP FROM DATA SCIENCE TO SERVICE 32ND PYDATA LONDON MEETUP Daniel F Moisset - dmoisset@ /machinalis.com @dmoisset
  • 2. ABOUT ME Hi! I'm Daniel Moisset! I work at Machinalis A special thanks to Marcos Spontón who also works there and inspired most of this talk.
  • 3. WARNING: THIS IS NOT A TECH TALK! In other words: THIS TALK IS NOT ABOUT ALGORITHMS, MODELS, TOOLS, OR USE CASES In event di ferent words: THIS TALK IS ABOUT PEOPLE
  • 4. SO, RAISIN BREAD By je freyw (Mmm...raisin bread) [ ],CC BY 2.0 via Wikimedia Commons
  • 5. Machine Learning development is like the raisins in a raisin bread... you need the bread first. But, it's just a few tiny raisins but without it you would just have plain bread — I don't really know who, but I love the analogy
  • 6. WHO WANTS RAISIN BREAD Di ferent organizations use your services: 1. Large companies with a live product and data, but without enough expertise/manpower in DS: «we'd like to add some raisins to our bread» 2. Small start-up, with maybe just a prototype, that want to get to production-ready scalable MVP: «We want some bread». And «it should have raisins now/at some point in the future»
  • 7. IS THAT WHAT THEY ACTUALLY NEED? “All the cool kids are doing it” is not good enough reason. — Seen on the internet Raisin cookies that look like chocolate chip cookies are the main reason I have trust issues
  • 9. IT'S NOT JUST SOFTWARE DEVELOPMENT! It also has a heavy R&D component Higher uncertainty Results are probabilistic
  • 10. THERE'S A PAPER ABOUT IT ≠ A PRODUCT The distance may not be something coverable today.
  • 11. MODELS ARE AN ASSET Investing time on it is not a “necessary evil” What's produced on a modelling phase is a critical component A model emerges from the client data and constraints, so it is unique to the client and an advantage over competitors.
  • 12. MACHINE LEARNING ≠ CLAIRVOYANCE Garbage in, Garbage out The solution may not be clear; you may be unsure of what problem is more important; but your business goal should be clear. Data Science will not make it clear for you.
  • 13. AGREEING ON METRICS Explain what are you measuring and why Explain what are the baselines and how much you think you can improve Connect these to the business goals.
  • 14. A PICTURE IS WORTH A THOUSAND WORDS Visualize your proposal. Be minimalistic. Use o f the shelf tools for a proposal.
  • 15. PART II: PROVIDING THE SERVICE
  • 16. THE SERVICE IS THE END, DATA SCIENCE IS THE MEANS Do not fall in love with the challenge
  • 17. JUST OUT OF THE BOX MAY BE ENOUGH You should always be asking yourself: 1. Have I already covered the expectations? 2. Will an improved result here actually improve value?
  • 18. MEASURE TWICE, CUT ONCE Get a look at the object of analysis before starting work. Has it desirable qualities? 1. Manageable size? 2. It's in an accessible representation? 3. Does it have a reasonable distribution? 4. ...
  • 19. INVOLVE THE PO Validate your assumptions with a person familiar with your domain 1. Are there contradictions between your assumptions and their knowledge? 2. Are there contradictions between the data you already have and their knowledge? Keep learning about the business side, encourage your business counterpart to learn to talk with Data Scientists.
  • 20. PART OF YOUR SERVICE IS NOT DS Make sure you use the right tools and people in each area
  • 21. PART III: WORKING AS A TEAM
  • 22. SHARE INFORMATION Basic descriptive statistics should be shared with all involved, even the non DS. People in a team must be aware of what's important and what's not.
  • 23. SHARE UNCERTAINTY There are a lot of tradeo fs to make regarding milestones and deadlines. People can plan better (and have contingency plans) if they know what parts of the project have higher risks.
  • 24. IT'S OK TO BUILD FLIMSY CODE, AS LONG AS IT'S NOT SOFTWARE code: programming text that runs on a computer so tware: programming text that is part of a deliverable. There are di ferences: code does not necessarily need tests. code does not necessarily need to follow other processes. sometimes the outputs of your code are deliverable and may have to be treated specially.
  • 25. THE DISCUSSION IS JUST BEGINNING I'D LOVE TO HEAR ABOUT WHAT YOU'VE LEARNED ELSEWHERE
  • 26. THANKS! ANY QUESTIONS? You can find me at twitter (@dmoisset) or by email (dmoisset@machinalis.com)