SlideShare ist ein Scribd-Unternehmen logo
1 von 53
Product Management for
Data Science Products
Aloysius Lim, Director of AI Products, Eureka Analytics
aloysius@eureka.ai
Eureka Seminar Series on
Building Data Science Products at Scale
2 Oct 2018
2
Eureka Seminar Series
Building Data Science Products at Scale
2 Oct Product Management for Data Science Products
15 Oct Technology Choices for Data Science Products
23 Oct Building Targeted Advertising Product
12 Nov Alternative Data and Algorithms for Risk Modelling Product
3 Dec Spatial Temporal Data and Algorithms for Mobility Intelligence
Product
Sign Up
and get updates
on our Meetup page
3
Tentative Schedule
6.30 pm to 8.30 pm
WeWork @ 71 Robinson Road
What are Data Science products?
4
What are Data Science products?
5
https://enterprise.foursquare.com/products/places
What are Data Science products?
6
https://www.creditsesame.com/blog/credit/credit-bureau-guide-what-the-differences-are-between-equifax-transunion-experian/
What are Data Science products?
7
https://www.spotify.com/us/discoverweekly/
What are Data Science products?
8
What are Data Science products?
9
Data as Product Data Science-Powered
Products & Services
Data Science
Software & Tools
https://www.datarobot.com/product/
http://www.moorinsightsstrategy.com/nvidia-
gpu-cloud-its-not-what-you-may-think-it-is/
Every industry is using Data Science in its products and services
Adobe
Airbnb
AlephD
Alibaba
Amazon
Amazon Web Services
AMD
Ant Financial
ASOS.com
Astound
Baidu
Boeing
CareerBuilder
Channel 4 Television
Charlotte-Mecklenburg Police
Department
City of Denver, CO
City of Syracuse, NY
CognitiveScale
Comcast
Comodo Security Solutions
comScore
Cray
Criteo
Didi Chuxing
Dstillery
Dynatrace
Facebook
Flipkart
Fox Chase Cancer Center
Galois
Google
Huawei
IBM
iFLYTEK
Intel
JD.com
KD Consulting
LineZone Data
LinkedIn
Microsoft
Mobike
NEC
NetEase
Netflix
NTT
Nvidia
Oath
Pinterest
Pittsburgh Bureau of Fire
risQ
Roku
S&P
SAS
SciSports
SecretarĂ­a de Hacienda
Distrital
ShopRunner
Snap
Sutter Health
Symantec
Tableau
Tata Consultancy Services
Tencent
Textkernel
The Globe and Mail
The Lab at DC
Thomson Reuters
Three Bridges Capital
Translational MRI
Two Sigma Investments
Uber
United States Census Bureau
Vatican Secret Archives
Workday
Yahoo
Zhejiang Cainiao Supply Chain
Management
10
Non-academic organizations with papers at KDD 2018, a top data science research conference
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
Data Science Product Management in practice
11
What does “scale” mean?
Organization Scale
Revenue Scale
Customer Scale
From Blitzscaling by Reid Hoffman & Chris Yeh
12
https://hbr.org/2016/04/blitzscaling
What makes it challenging to build Data Science products at scale?
A Product Manager’s perspective
Getting to a precise and correct articulation of the
Data Science problem
Designing a solution when feasibility and
performance are not certain, and require further
research
Lack of data, or lack of control over data collection
and quality
Building a product robust to local differences (data,
geopolitical, cultural, behavioral, etc.)
Ensuring privacy of consumers
Preparing for (possibly unknown) future shifts in
data distributions
How can the PM define the correct problem for the
team to solve?
How can the PM plan the roadmap and product
requirements if the solution is not known?
How can the PM find creative solutions to work with
data limitations?
How can the PM identify these issues, and design a
product that works everywhere?
How can the PM design privacy into the product?
How can the PM identify potentially catastrophic
changes in the external environment, and respond in
an agile manner?
13
There is no “best” way to do Product Management
But here’s one framework
“A startup is a human institution
designed to deliver a new product or
service under conditions of extreme
uncertainty.”
Extreme Uncertainty: “situations that
cannot be modeled, are not clear-cut,
and where the risk is not necessarily
large – it’s just not yet known.”
– Eric Ries
14
https://mm1.com/about-us/newsroom/publications/poster-lean-startup/
http://www.startuplessonslearned.com/
2010/06/what-is-startup.html
How we do Agile
15
Sprint Sprint Sprint Sprint Sprint Sprint Sprint
Planning
Backlog
Planning
Backlog
Retro-
spective
Retro-
spective
Retro-
spective
Retro-
spective
Retro-
spective
Retro-
spective
Artefacts Artefacts Artefacts Artefacts Artefacts ArtefactsContinuous
Integration
Scrum
2-week sprints
Every 2 to 3 months
What does Data Science look like in practice?
Example: Microsoft Team Data Science Process
16
https://docs.microsoft.com/en-us/azure/machine-learning/team-data-science-process/overview
Case Example
Eureka Customer Acquisition
Targeted marketing
using mobile operator (“telco”) data
to generate leads for clients.
17
18
CEO: “Your mission, should you choose to accept it…”
Other business models
• Insights / reports (e.g. Experian
Mosaic)
• Data APIs (e.g. Foursquare)
• B2C products and services (e.g.
Spotify)
• B2B products and services (e.g.
Criteo)
• Machine Learning APIs (e.g.
Azure / AWS / GCP)
• SaaS (e.g. DataRobot)
• Etc.
19
Global
Advertisers
10+ Telcos
in 10
countries
Campaign
channels
Team
Data
Infra-
structure
Direct
Global
Contracts
Cost per lead
Cost per acquired customer
Revenue share to telco
Campaign costs
Dev & ops costs
Acquire
new
customers
Access to 1
billion
mobile
subscribers
Developme
nt
Deploymen
t
Scoring
Campaigns
Eureka Customer
Acquisition
https://strategyzer.com/canvas/business-model-canvas
20
Who are the stakeholders (not only customers),
and what are their needs?
21
Internal Stakeholders
Scaling Challenge
One product
Many advertisers & telcos
Many stakeholders
Many contexts
(language, geopolitical, cultural)
Advertisers
“Help me get high quality leads.”
“I can’t give you data about
existing customers.”
Telcos
“Help me to monetize my data.”
“I have 200m subscribers.”
“Data cannot leave my firewall.”
“I can only give you 1 month of
daily aggregated data.”
“My data comes in this format.”
Investors
“Is your technology
patentable?”
“Is your technology
defensible against
competitors?”
Senior Execs
“We need to demonstrate
revenue in 2 months!”
“We can only hire 10 people.”
“We need to deploy in 10
countries in the next 6 months.”
Team
“I want to publish my
work.”
Regulators
“You must comply with data protection laws!”
“I must approve any data shared with
partners.”
Subscribers
“I only want to receive ads I’m
interested in.”
“Is my privacy being protected?”
Icons from www.flaticon.com
22
What are the Data Science problems?
23
Business Problem:
Find good leads for product X
Data Science Problem:
Create customer segments
for different product
categories
Product X
Select relevant segment
Run campaign
Collect responses /
conversions
Telco data
Segment A
Soccer players
Segment B
Mums, >1 children
Segment C
Watch enthusiasts
Challenges
How to create “good” segments?
• Rules: How do we know they work?
• Supervised: What training data to use?
• Clustering: How to interpret the clusters?
How to measure the quality of the segments?
How to scale to thousands of segments across telcos?
How to select the best segment for each
campaign?
What if no matching segment can be found?
What if the “best” segment responds poorly to
the campaign?
How else can we frame the problem?
24
Business Problem:
Find good leads for product X
Data Science Problem:
Find behavioral lookalikes of
existing customers, iteratively
What features are useful and easy to compute?
Product X
Create Seed group
Existing customers
Heuristic rules
Random
Collect responses /
conversions
Behavioral
data features
Mathematically
defined
Run lookalike
model
Run campaign on
Target group
Update&TuneSeedgroup
Telco data
How to select a good seed group?
How to find lookalikes?
How to update / tune seed group?
Framing the problem differently
could lead to a very different product!
25
Business Problem:
Find good leads for product X
Data Science Problem:
Create customer segments
for different product
categories
Data Science Problem:
Find behavioral lookalikes of
existing customers, iteratively
Product X
Select relevant segment
Run campaign
Collect responses /
conversions
Telco data
Segment A
Soccer players
Segment B
Mums, >1 children
Segment C
Watch enthusiasts
Product X
Create Seed group
Existing customers
Heuristic rules
Random
Collect responses /
conversions
Behavioral
data features
Mathematically
defined
Run lookalike
model
Run campaign on
Target group
Update&TuneSeedgroup
Telco data
“Customer segmentation engine”
“Behavioral lookalike engine”
It is critical to define the Data Science problem well
One more step to the Data Science process
26
Data Science
Problem
Definition
27
Will it work?
28
Most critical risks to test
Does it work?
Can telco data predict campaign response?
Is lookalike an effective method for campaign targeting?
Does it work in multiple settings?
Different telcos, data sources, cultures
Can it scale?
1m, 10m, 100m subscribers
29
Build – Measure – Learn helps to manage extreme uncertainty
Applying Agile in Data Science product development
30
Literature Review
Artefact: Research report
Analysis & Experimentation
Artefact: Vignette or report
MVP Development
Artefacts: Code, tests & docs
We cannot solve all problems at once!
1. Break the problem down
2. Identify most critical hypotheses
3. Plan a research, analysis or experimental task to
test the hypothesis
4. Time-bound the problem
Build – Measure – Learn
Conduct literature review
Hypothesis
Mathematically defined behavioral
features from telco data are useful for
identifying campaign targets.
Artefact: Research Report
31
Build – Measure – Learn
Conduct literature review
Hypothesis
A scalable and effective algorithm for
lookalike modeling exists that can be
used for campaign scoring.
Artefact: Research Report
32
Build – Measure – Learn
Perform analysis
Hypothesis
Feature X (e.g. Radius of Gyration)
can reveal meaningful patterns in
human behavior.
Artefact: Analysis Vignette
33
Correlations
• Radius of Gyration and Total Distance are moderately correlated
• Total Distance and Distinct Count are strongly correlated
• Radius of gyration is more correlated with Distinct Count than to Total Count
Radius of Gyration and Total Distance
• 50% of MSISDN move withn a radius of 24 km,but traveled up to 630km of
total distance within a month, this includs 11% of MSISDN did not move
• 75% of MSISDN move within a radius of 97 km and made up to 3264km total
distance.
• 90% move within 280km, total distance of 9431km, and 95% move within a
radius of 440km and total distance of 15294km.
• There are MSISDN with radius of gyration that is about the whole north-south
distance of the country, 1598km.
• Radius of gyration is computed as follows:
Eureka Feature Engine Run on
CDR Data
Analysts: Ying Li
2018-08-06
Problem Statement
Upon the completion of running Eureka Feature Engine MVP in
telco’s Zeppelin environment, we want to analyze the results to
1. Inspect and verify the engine output for feasibility and
correctness
2. Explore if any insights can be gained for telco or Eureka
3. Demonstrate an analysis artifact through this Vignette
Methodology Description
• Run Scala/SQL/R EDA to understand the data and the
distributions of features produced by Eureka Feature Engine
• Run cumulative distributions
• Run correlation on different features
• Spot any worthy data nuggets
• Spot any suspicious data issues
Summary
• Feature Engine covered the entire
user base, computation looks
correctly directionally
• Meaningful analysis can be
conducted regarding user behavior
• Some data issues were identified to
be further debugged
Call to Actions
1. Seek feedback from commercial
team
2. Build clustering models to
investigate feasibility of
unsupervised learning
3. Debug the data issues
4. Continue develop with richer
feature set
5. Compare ”truth about the data”
(e.g., median radius of gyration”)
against “truth about the world”
(e.g., commute distance for people
in this city)
Cumulative Distribution by Radius of Gyration
Cumulative Distribution by Total Distance
Total Distance Total Count Distinct Count
Radius of Gyration 0.456928567 0.332110891 0.478864808
Total Distance 0.727597715 0.745295476
Total Count 0.772572346
Caveat
• Distance computation was done as point to point distance
between consecutive cell tower locations on the trajectory
• Usage of any insights from this analysis at current state is not
advised as the computation engine is MVP for demonstrating
feasibility and hence not fully tested for accuracy
Build – Measure – Learn
Build and test MVP
Hypothesis
Product works in the real world!
Artefact: Code, tests, docs
Offline experimental results
Online (live deployment) results
34
Collect responses /
conversions
Behavioral data
features
Run lookalike
model
Lookalike group
Offline evaluation
Lift = 4.8%
Random group
Online evaluation
Lift = 5.2%
Build – Measure – Learn
Build and test MVP
Hypothesis
Product works in multiple telcos and
countries, with different data,
environment and behavior
Artefact: Deployed MVP
Offline experimental results
Online (live deployment) results
35
Build – Measure – Learn
36
Literature Review
Artefact: Research report
Analysis & Experimentation
Artefact: Vignette or report
MVP Development
Artefacts: Code, tests & docs
Collect responses /
conversions
Behavioral data
features
Run lookalike
model
Lookalike group
Offline evaluation
Lift = 4.8%
Random group
Online evaluation
Lift = 5.2%
Build – Measure – Learn in real settings as soon as possible
“Best model” is not always best!
“If you followed the Prize competition, you might be wondering
what happened with the final Grand Prize ensemble that won the
$1M two years later. This is a truly impressive compilation and
culmination of years of work, blending hundreds of predictive
models to finally cross the finish line. We evaluated some of the
new methods offline but the additional accuracy gains that we
measured did not seem to justify the engineering effort needed
to bring them into a production environment. Also, our focus on
improving Netflix personalization had shifted to the next level by
then.”
37
https://medium.com/netflix-techblog/netflix-recommendations-beyond-the-5-stars-part-1-55838468f429
Measuring performance
38
39
What does this statement mean?
“Our model can achieve 70% accuracy.”
Is this good? How can we tell?
40
It depends on the application
“Our model can achieve 70% accuracy.”
41
Disease detection rate
Not good, possibly catastrophic
Ad click through rate
Astounding!
Too good to be true
It depends on what one means by “accuracy”
“Our model can achieve 70% accuracy.”
Definition of accuracy
% of correct predictions in all classes
42
It depends on what one means by “accuracy”
43
“Our model can achieve 70% accuracy.”
Sometimes, people mean Precision
True Positives / predicted Positives
“70% of people predicted to like
soccer actually like soccer.”
Sometimes, people mean Recall
True Positives / actual Positives
“Our model can detect 70% of soccer
lovers.”
Sometimes, they mean
something else altogether!
What to measure?
OFFLINE
Model performance
Accuracy, precision, recall, AUROC, Gini, lift, etc.
Privacy
k-anonymity, differential privacy
Compute performance
Compute time, memory / CPU utilization, storage
ONLINE
Model performance
Privacy
Compute performance
Response behavior
Click through rate, API utilization
Business outcomes
Revenue, risk reduction, cost savings
44
You build what you measure
45
If the wrong measurement is used
The wrong thing will be built in the
next iteration
What is the right performance metric for campaigns?
One possibility:
Area Under ROC Curve (AUROC)
• Captures overall classification
performance
• 1 is best, 0.5 is random, 0 is worst
• Easy to compare different models
46
TruePositiveRate
False Positive Rate
This model has the
bigger AUROC
What is the right performance metric for campaigns?
But when we run a campaign, we
really only care about the True
Positive Rate for the users with the
highest scores
So a better measure would be
Lift@n%
E.g. Lift@5% if we want to target top
5% of targets
47
TruePositiveRate
False Positive Rate
This model has the
higher lift@5%
The real test is when we measure performance online
48
Offline (modeling phase)
Estimated Lift@1%: 10.2X
Target size: 100,000
Lookalike: 40,000 Random: 30,000 Heuristic rules: 30,000
CTR: 3.82% CTR: 1.95% CTR: 1.58%
Actual Lift: 2.0X Actual Lift: 0.8X
Actual results from one of
our campaigns using
lookalike modeling
People
49
Superpowers for a Data Science Product Manager
50
Data Science
Intuition for data
Statistics
Machine learning
Data quality
Computer Science
Algorithms
Complexity (Big O)
Distributed Computing
Mode of Working
Hands on, “feel” the data
Detail orientation
Logical, critical thinking
Not afraid to be critical
about one’s own
assumptions
Product
Management
Our team structure: Scrum teams with all the skills required to
research, develop and ship a User Story
51
Cross-functional team allows diverse
perspectives to be considered from the
get-go
Algo performance vs engineering cost
Tech choices
Pipeline design
Etc.
Team members are empowered and
required to work across the spectrum of
tasks from data science to engineering
“Know all the basics,
but be expert in a few areas”
Data Scientists write unit tests
Engineers read research papers
You vs Me
Recipe for a Data Science product team
(adjust as needed)
1 PM 2 Data Scientists 3 Engineers
PM
Data
Scientist
Data
Scientist
Engineer
Engineer
Engineer
Key takeaways
Data Science is being used in more products and services, in all industries
Build – Measure – Learn to work with extreme uncertainty, unknown unknowns
Get the right framing of the Data Science problem, and measurement of results
Test with customers
52
Thank You
Please help us by giving us your feedback
goo.gl/uTuuHr
53

Weitere ähnliche Inhalte

Was ist angesagt?

5 Ways to Enhance the Customer Experience with Gainsight PX
5 Ways to Enhance the Customer Experience with Gainsight PX5 Ways to Enhance the Customer Experience with Gainsight PX
5 Ways to Enhance the Customer Experience with Gainsight PXProduct School
 
How to Run Product Discovery Experiments in FinTech
How to Run Product Discovery Experiments in FinTechHow to Run Product Discovery Experiments in FinTech
How to Run Product Discovery Experiments in FinTechProduct School
 
A/B Testing for New Product Launches by Booking.com Sr PM
A/B Testing for New Product Launches by Booking.com Sr PMA/B Testing for New Product Launches by Booking.com Sr PM
A/B Testing for New Product Launches by Booking.com Sr PMProduct School
 
Product management with analytics
Product management with analyticsProduct management with analytics
Product management with analyticsDaryl Hemeon
 
Aligning Product & Customer Success Teams to Fuel Growth by Gainsight Product...
Aligning Product & Customer Success Teams to Fuel Growth by Gainsight Product...Aligning Product & Customer Success Teams to Fuel Growth by Gainsight Product...
Aligning Product & Customer Success Teams to Fuel Growth by Gainsight Product...Product School
 
Data-Driven Product Management by Shutterfly Director of Product
Data-Driven Product Management by Shutterfly Director of ProductData-Driven Product Management by Shutterfly Director of Product
Data-Driven Product Management by Shutterfly Director of ProductProduct School
 
Ethical AI Products, With Equity in Mind by Mozilla Product Leader
 Ethical AI Products, With Equity in Mind by Mozilla Product Leader Ethical AI Products, With Equity in Mind by Mozilla Product Leader
Ethical AI Products, With Equity in Mind by Mozilla Product LeaderProduct School
 
Intro to PM Execution Interviews by Facebook Product Leader
Intro to PM Execution Interviews by Facebook Product LeaderIntro to PM Execution Interviews by Facebook Product Leader
Intro to PM Execution Interviews by Facebook Product LeaderProduct School
 
How to Manage Open Source Product by Github Sr. PM
How to Manage Open Source Product by Github Sr. PMHow to Manage Open Source Product by Github Sr. PM
How to Manage Open Source Product by Github Sr. PMProduct School
 
Leverage Data for Product Decisions by Google Prod. S&O Lead
Leverage Data for Product Decisions by Google Prod. S&O LeadLeverage Data for Product Decisions by Google Prod. S&O Lead
Leverage Data for Product Decisions by Google Prod. S&O LeadProduct School
 
Measuring Success of Data Platforms by Booking.com Product Leader
Measuring Success of Data Platforms by Booking.com Product LeaderMeasuring Success of Data Platforms by Booking.com Product Leader
Measuring Success of Data Platforms by Booking.com Product LeaderProduct School
 
How to Think Big, Start Small and Fail Fast by Google PM Lead
How to Think Big, Start Small and Fail Fast by Google PM LeadHow to Think Big, Start Small and Fail Fast by Google PM Lead
How to Think Big, Start Small and Fail Fast by Google PM LeadProduct School
 
How to be a Good Machine Learning PM by Google Product Manager
How to be a Good Machine Learning PM by Google Product ManagerHow to be a Good Machine Learning PM by Google Product Manager
How to be a Good Machine Learning PM by Google Product ManagerProduct School
 
Neglecting Users in Enterprise Apps by Microsoft Sr PM
Neglecting Users in Enterprise Apps by Microsoft Sr PMNeglecting Users in Enterprise Apps by Microsoft Sr PM
Neglecting Users in Enterprise Apps by Microsoft Sr PMProduct School
 
Data Analytics for better Product Decision Making by Mixpanel PM
Data Analytics for better Product Decision Making by Mixpanel PMData Analytics for better Product Decision Making by Mixpanel PM
Data Analytics for better Product Decision Making by Mixpanel PMProduct School
 
The Types of TPM Roles by Facebook Product Leader
The Types of TPM Roles by Facebook Product LeaderThe Types of TPM Roles by Facebook Product Leader
The Types of TPM Roles by Facebook Product LeaderProduct School
 
Building an Amazing Relationship Between Product Management and Marketing
Building an Amazing Relationship Between Product Management and MarketingBuilding an Amazing Relationship Between Product Management and Marketing
Building an Amazing Relationship Between Product Management and MarketingProductPlan
 
Tesla’s Referral Program: Viral Growth by fmr Tesla Product Leader
Tesla’s Referral Program: Viral Growth by fmr Tesla Product LeaderTesla’s Referral Program: Viral Growth by fmr Tesla Product Leader
Tesla’s Referral Program: Viral Growth by fmr Tesla Product LeaderProduct School
 
Defining Success Metrics for Your Product by Google Product Leader
Defining Success Metrics for Your Product by Google Product LeaderDefining Success Metrics for Your Product by Google Product Leader
Defining Success Metrics for Your Product by Google Product LeaderProduct School
 

Was ist angesagt? (20)

5 Ways to Enhance the Customer Experience with Gainsight PX
5 Ways to Enhance the Customer Experience with Gainsight PX5 Ways to Enhance the Customer Experience with Gainsight PX
5 Ways to Enhance the Customer Experience with Gainsight PX
 
How to Run Product Discovery Experiments in FinTech
How to Run Product Discovery Experiments in FinTechHow to Run Product Discovery Experiments in FinTech
How to Run Product Discovery Experiments in FinTech
 
A/B Testing for New Product Launches by Booking.com Sr PM
A/B Testing for New Product Launches by Booking.com Sr PMA/B Testing for New Product Launches by Booking.com Sr PM
A/B Testing for New Product Launches by Booking.com Sr PM
 
Product management with analytics
Product management with analyticsProduct management with analytics
Product management with analytics
 
Aligning Product & Customer Success Teams to Fuel Growth by Gainsight Product...
Aligning Product & Customer Success Teams to Fuel Growth by Gainsight Product...Aligning Product & Customer Success Teams to Fuel Growth by Gainsight Product...
Aligning Product & Customer Success Teams to Fuel Growth by Gainsight Product...
 
Data-Driven Product Management by Shutterfly Director of Product
Data-Driven Product Management by Shutterfly Director of ProductData-Driven Product Management by Shutterfly Director of Product
Data-Driven Product Management by Shutterfly Director of Product
 
Analytics for Startups
Analytics for StartupsAnalytics for Startups
Analytics for Startups
 
Ethical AI Products, With Equity in Mind by Mozilla Product Leader
 Ethical AI Products, With Equity in Mind by Mozilla Product Leader Ethical AI Products, With Equity in Mind by Mozilla Product Leader
Ethical AI Products, With Equity in Mind by Mozilla Product Leader
 
Intro to PM Execution Interviews by Facebook Product Leader
Intro to PM Execution Interviews by Facebook Product LeaderIntro to PM Execution Interviews by Facebook Product Leader
Intro to PM Execution Interviews by Facebook Product Leader
 
How to Manage Open Source Product by Github Sr. PM
How to Manage Open Source Product by Github Sr. PMHow to Manage Open Source Product by Github Sr. PM
How to Manage Open Source Product by Github Sr. PM
 
Leverage Data for Product Decisions by Google Prod. S&O Lead
Leverage Data for Product Decisions by Google Prod. S&O LeadLeverage Data for Product Decisions by Google Prod. S&O Lead
Leverage Data for Product Decisions by Google Prod. S&O Lead
 
Measuring Success of Data Platforms by Booking.com Product Leader
Measuring Success of Data Platforms by Booking.com Product LeaderMeasuring Success of Data Platforms by Booking.com Product Leader
Measuring Success of Data Platforms by Booking.com Product Leader
 
How to Think Big, Start Small and Fail Fast by Google PM Lead
How to Think Big, Start Small and Fail Fast by Google PM LeadHow to Think Big, Start Small and Fail Fast by Google PM Lead
How to Think Big, Start Small and Fail Fast by Google PM Lead
 
How to be a Good Machine Learning PM by Google Product Manager
How to be a Good Machine Learning PM by Google Product ManagerHow to be a Good Machine Learning PM by Google Product Manager
How to be a Good Machine Learning PM by Google Product Manager
 
Neglecting Users in Enterprise Apps by Microsoft Sr PM
Neglecting Users in Enterprise Apps by Microsoft Sr PMNeglecting Users in Enterprise Apps by Microsoft Sr PM
Neglecting Users in Enterprise Apps by Microsoft Sr PM
 
Data Analytics for better Product Decision Making by Mixpanel PM
Data Analytics for better Product Decision Making by Mixpanel PMData Analytics for better Product Decision Making by Mixpanel PM
Data Analytics for better Product Decision Making by Mixpanel PM
 
The Types of TPM Roles by Facebook Product Leader
The Types of TPM Roles by Facebook Product LeaderThe Types of TPM Roles by Facebook Product Leader
The Types of TPM Roles by Facebook Product Leader
 
Building an Amazing Relationship Between Product Management and Marketing
Building an Amazing Relationship Between Product Management and MarketingBuilding an Amazing Relationship Between Product Management and Marketing
Building an Amazing Relationship Between Product Management and Marketing
 
Tesla’s Referral Program: Viral Growth by fmr Tesla Product Leader
Tesla’s Referral Program: Viral Growth by fmr Tesla Product LeaderTesla’s Referral Program: Viral Growth by fmr Tesla Product Leader
Tesla’s Referral Program: Viral Growth by fmr Tesla Product Leader
 
Defining Success Metrics for Your Product by Google Product Leader
Defining Success Metrics for Your Product by Google Product LeaderDefining Success Metrics for Your Product by Google Product Leader
Defining Success Metrics for Your Product by Google Product Leader
 

Ă„hnlich wie Eureka Analytics Seminar Series - Product Management for Data Science Products

Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBala Iyer
 
Intro to Artificial Intelligence w/ Target's Director of PM
 Intro to Artificial Intelligence w/ Target's Director of PM Intro to Artificial Intelligence w/ Target's Director of PM
Intro to Artificial Intelligence w/ Target's Director of PMProduct School
 
Enable Advanced Analytics with Hadoop and an Enterprise Data Hub
Enable Advanced Analytics with Hadoop and an Enterprise Data HubEnable Advanced Analytics with Hadoop and an Enterprise Data Hub
Enable Advanced Analytics with Hadoop and an Enterprise Data HubCloudera, Inc.
 
Borys Pratsiuk "How to be NVidia partner"
Borys Pratsiuk "How to be NVidia partner"Borys Pratsiuk "How to be NVidia partner"
Borys Pratsiuk "How to be NVidia partner"Lviv Startup Club
 
How AI is revolutionizing the world
How AI is revolutionizing the worldHow AI is revolutionizing the world
How AI is revolutionizing the worldSK Reddy
 
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...Joe Lamantia
 
UX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT USA Presentation: Joe Lamantia, Bottomline TechnologiesUX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT USA Presentation: Joe Lamantia, Bottomline TechnologiesUX STRAT
 
Capitalize On Social Media With Big Data Analytics
Capitalize On Social Media With Big Data AnalyticsCapitalize On Social Media With Big Data Analytics
Capitalize On Social Media With Big Data AnalyticsHassan Keshavarz
 
I Love APIs Europe 2015: Business Sessions
I Love APIs Europe 2015: Business SessionsI Love APIs Europe 2015: Business Sessions
I Love APIs Europe 2015: Business SessionsApigee | Google Cloud
 
Research Presentation: How Numbers are Powering the Next Era of Marketing
Research Presentation: How Numbers are Powering the Next Era of MarketingResearch Presentation: How Numbers are Powering the Next Era of Marketing
Research Presentation: How Numbers are Powering the Next Era of MarketingMediaPost
 
Creating Big Data Success with the Collaboration of Business and IT
Creating Big Data Success with the Collaboration of Business and ITCreating Big Data Success with the Collaboration of Business and IT
Creating Big Data Success with the Collaboration of Business and ITEdward Chenard
 
Building the Analytics Capability
Building the Analytics CapabilityBuilding the Analytics Capability
Building the Analytics CapabilityBala Iyer
 
How Customer Experience is Driving Application Modernization
How Customer Experience is Driving Application ModernizationHow Customer Experience is Driving Application Modernization
How Customer Experience is Driving Application ModernizationNuxeo
 
Big Data Goes to Work - Liberating Latent Value in a Connected World - P.Coffee
Big Data Goes to Work - Liberating Latent Value in a Connected World - P.CoffeeBig Data Goes to Work - Liberating Latent Value in a Connected World - P.Coffee
Big Data Goes to Work - Liberating Latent Value in a Connected World - P.CoffeePeter Coffee
 
Big Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonBig Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonSocietyConsulting
 
Unlocking the Value of Big Data in 2014
Unlocking the Value of Big Data in 2014Unlocking the Value of Big Data in 2014
Unlocking the Value of Big Data in 2014Dun & Bradstreet
 
CSCMP 2014 :exploring scm big data cscmp
CSCMP 2014 :exploring scm big data cscmpCSCMP 2014 :exploring scm big data cscmp
CSCMP 2014 :exploring scm big data cscmpRobert Glenn Richey, Jr.
 
Opportunities with data science
Opportunities with data scienceOpportunities with data science
Opportunities with data scienceAshiq Rahman
 
Applied_Data_Science_Presented_by_Yhat
Applied_Data_Science_Presented_by_YhatApplied_Data_Science_Presented_by_Yhat
Applied_Data_Science_Presented_by_YhatCharlie Hecht
 

Ă„hnlich wie Eureka Analytics Seminar Series - Product Management for Data Science Products (20)

Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the Marketspace
 
Intro to Artificial Intelligence w/ Target's Director of PM
 Intro to Artificial Intelligence w/ Target's Director of PM Intro to Artificial Intelligence w/ Target's Director of PM
Intro to Artificial Intelligence w/ Target's Director of PM
 
Enable Advanced Analytics with Hadoop and an Enterprise Data Hub
Enable Advanced Analytics with Hadoop and an Enterprise Data HubEnable Advanced Analytics with Hadoop and an Enterprise Data Hub
Enable Advanced Analytics with Hadoop and an Enterprise Data Hub
 
Borys Pratsiuk "How to be NVidia partner"
Borys Pratsiuk "How to be NVidia partner"Borys Pratsiuk "How to be NVidia partner"
Borys Pratsiuk "How to be NVidia partner"
 
How AI is revolutionizing the world
How AI is revolutionizing the worldHow AI is revolutionizing the world
How AI is revolutionizing the world
 
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
 
UX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT USA Presentation: Joe Lamantia, Bottomline TechnologiesUX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
 
Capitalize On Social Media With Big Data Analytics
Capitalize On Social Media With Big Data AnalyticsCapitalize On Social Media With Big Data Analytics
Capitalize On Social Media With Big Data Analytics
 
I Love APIs Europe 2015: Business Sessions
I Love APIs Europe 2015: Business SessionsI Love APIs Europe 2015: Business Sessions
I Love APIs Europe 2015: Business Sessions
 
Research Presentation: How Numbers are Powering the Next Era of Marketing
Research Presentation: How Numbers are Powering the Next Era of MarketingResearch Presentation: How Numbers are Powering the Next Era of Marketing
Research Presentation: How Numbers are Powering the Next Era of Marketing
 
Creating Big Data Success with the Collaboration of Business and IT
Creating Big Data Success with the Collaboration of Business and ITCreating Big Data Success with the Collaboration of Business and IT
Creating Big Data Success with the Collaboration of Business and IT
 
Building the Analytics Capability
Building the Analytics CapabilityBuilding the Analytics Capability
Building the Analytics Capability
 
How Customer Experience is Driving Application Modernization
How Customer Experience is Driving Application ModernizationHow Customer Experience is Driving Application Modernization
How Customer Experience is Driving Application Modernization
 
Big Data Goes to Work - Liberating Latent Value in a Connected World - P.Coffee
Big Data Goes to Work - Liberating Latent Value in a Connected World - P.CoffeeBig Data Goes to Work - Liberating Latent Value in a Connected World - P.Coffee
Big Data Goes to Work - Liberating Latent Value in a Connected World - P.Coffee
 
Big Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonBig Data Meetup by Chad Richeson
Big Data Meetup by Chad Richeson
 
Unlocking the Value of Big Data in 2014
Unlocking the Value of Big Data in 2014Unlocking the Value of Big Data in 2014
Unlocking the Value of Big Data in 2014
 
CSCMP 2014 :exploring scm big data cscmp
CSCMP 2014 :exploring scm big data cscmpCSCMP 2014 :exploring scm big data cscmp
CSCMP 2014 :exploring scm big data cscmp
 
Opportunities with data science
Opportunities with data scienceOpportunities with data science
Opportunities with data science
 
Identifying the new frontier of big data as an enabler for T&T industries: Re...
Identifying the new frontier of big data as an enabler for T&T industries: Re...Identifying the new frontier of big data as an enabler for T&T industries: Re...
Identifying the new frontier of big data as an enabler for T&T industries: Re...
 
Applied_Data_Science_Presented_by_Yhat
Applied_Data_Science_Presented_by_YhatApplied_Data_Science_Presented_by_Yhat
Applied_Data_Science_Presented_by_Yhat
 

KĂĽrzlich hochgeladen

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel AraĂşjo
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 

KĂĽrzlich hochgeladen (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 

Eureka Analytics Seminar Series - Product Management for Data Science Products

  • 1. Product Management for Data Science Products Aloysius Lim, Director of AI Products, Eureka Analytics aloysius@eureka.ai Eureka Seminar Series on Building Data Science Products at Scale 2 Oct 2018
  • 2. 2
  • 3. Eureka Seminar Series Building Data Science Products at Scale 2 Oct Product Management for Data Science Products 15 Oct Technology Choices for Data Science Products 23 Oct Building Targeted Advertising Product 12 Nov Alternative Data and Algorithms for Risk Modelling Product 3 Dec Spatial Temporal Data and Algorithms for Mobility Intelligence Product Sign Up and get updates on our Meetup page 3 Tentative Schedule 6.30 pm to 8.30 pm WeWork @ 71 Robinson Road
  • 4. What are Data Science products? 4
  • 5. What are Data Science products? 5 https://enterprise.foursquare.com/products/places
  • 6. What are Data Science products? 6 https://www.creditsesame.com/blog/credit/credit-bureau-guide-what-the-differences-are-between-equifax-transunion-experian/
  • 7. What are Data Science products? 7 https://www.spotify.com/us/discoverweekly/
  • 8. What are Data Science products? 8
  • 9. What are Data Science products? 9 Data as Product Data Science-Powered Products & Services Data Science Software & Tools https://www.datarobot.com/product/ http://www.moorinsightsstrategy.com/nvidia- gpu-cloud-its-not-what-you-may-think-it-is/
  • 10. Every industry is using Data Science in its products and services Adobe Airbnb AlephD Alibaba Amazon Amazon Web Services AMD Ant Financial ASOS.com Astound Baidu Boeing CareerBuilder Channel 4 Television Charlotte-Mecklenburg Police Department City of Denver, CO City of Syracuse, NY CognitiveScale Comcast Comodo Security Solutions comScore Cray Criteo Didi Chuxing Dstillery Dynatrace Facebook Flipkart Fox Chase Cancer Center Galois Google Huawei IBM iFLYTEK Intel JD.com KD Consulting LineZone Data LinkedIn Microsoft Mobike NEC NetEase Netflix NTT Nvidia Oath Pinterest Pittsburgh Bureau of Fire risQ Roku S&P SAS SciSports SecretarĂ­a de Hacienda Distrital ShopRunner Snap Sutter Health Symantec Tableau Tata Consultancy Services Tencent Textkernel The Globe and Mail The Lab at DC Thomson Reuters Three Bridges Capital Translational MRI Two Sigma Investments Uber United States Census Bureau Vatican Secret Archives Workday Yahoo Zhejiang Cainiao Supply Chain Management 10 Non-academic organizations with papers at KDD 2018, a top data science research conference Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
  • 11. Data Science Product Management in practice 11
  • 12. What does “scale” mean? Organization Scale Revenue Scale Customer Scale From Blitzscaling by Reid Hoffman & Chris Yeh 12 https://hbr.org/2016/04/blitzscaling
  • 13. What makes it challenging to build Data Science products at scale? A Product Manager’s perspective Getting to a precise and correct articulation of the Data Science problem Designing a solution when feasibility and performance are not certain, and require further research Lack of data, or lack of control over data collection and quality Building a product robust to local differences (data, geopolitical, cultural, behavioral, etc.) Ensuring privacy of consumers Preparing for (possibly unknown) future shifts in data distributions How can the PM define the correct problem for the team to solve? How can the PM plan the roadmap and product requirements if the solution is not known? How can the PM find creative solutions to work with data limitations? How can the PM identify these issues, and design a product that works everywhere? How can the PM design privacy into the product? How can the PM identify potentially catastrophic changes in the external environment, and respond in an agile manner? 13
  • 14. There is no “best” way to do Product Management But here’s one framework “A startup is a human institution designed to deliver a new product or service under conditions of extreme uncertainty.” Extreme Uncertainty: “situations that cannot be modeled, are not clear-cut, and where the risk is not necessarily large – it’s just not yet known.” – Eric Ries 14 https://mm1.com/about-us/newsroom/publications/poster-lean-startup/ http://www.startuplessonslearned.com/ 2010/06/what-is-startup.html
  • 15. How we do Agile 15 Sprint Sprint Sprint Sprint Sprint Sprint Sprint Planning Backlog Planning Backlog Retro- spective Retro- spective Retro- spective Retro- spective Retro- spective Retro- spective Artefacts Artefacts Artefacts Artefacts Artefacts ArtefactsContinuous Integration Scrum 2-week sprints Every 2 to 3 months
  • 16. What does Data Science look like in practice? Example: Microsoft Team Data Science Process 16 https://docs.microsoft.com/en-us/azure/machine-learning/team-data-science-process/overview
  • 17. Case Example Eureka Customer Acquisition Targeted marketing using mobile operator (“telco”) data to generate leads for clients. 17
  • 18. 18
  • 19. CEO: “Your mission, should you choose to accept it…” Other business models • Insights / reports (e.g. Experian Mosaic) • Data APIs (e.g. Foursquare) • B2C products and services (e.g. Spotify) • B2B products and services (e.g. Criteo) • Machine Learning APIs (e.g. Azure / AWS / GCP) • SaaS (e.g. DataRobot) • Etc. 19 Global Advertisers 10+ Telcos in 10 countries Campaign channels Team Data Infra- structure Direct Global Contracts Cost per lead Cost per acquired customer Revenue share to telco Campaign costs Dev & ops costs Acquire new customers Access to 1 billion mobile subscribers Developme nt Deploymen t Scoring Campaigns Eureka Customer Acquisition https://strategyzer.com/canvas/business-model-canvas
  • 20. 20
  • 21. Who are the stakeholders (not only customers), and what are their needs? 21 Internal Stakeholders Scaling Challenge One product Many advertisers & telcos Many stakeholders Many contexts (language, geopolitical, cultural) Advertisers “Help me get high quality leads.” “I can’t give you data about existing customers.” Telcos “Help me to monetize my data.” “I have 200m subscribers.” “Data cannot leave my firewall.” “I can only give you 1 month of daily aggregated data.” “My data comes in this format.” Investors “Is your technology patentable?” “Is your technology defensible against competitors?” Senior Execs “We need to demonstrate revenue in 2 months!” “We can only hire 10 people.” “We need to deploy in 10 countries in the next 6 months.” Team “I want to publish my work.” Regulators “You must comply with data protection laws!” “I must approve any data shared with partners.” Subscribers “I only want to receive ads I’m interested in.” “Is my privacy being protected?” Icons from www.flaticon.com
  • 22. 22
  • 23. What are the Data Science problems? 23 Business Problem: Find good leads for product X Data Science Problem: Create customer segments for different product categories Product X Select relevant segment Run campaign Collect responses / conversions Telco data Segment A Soccer players Segment B Mums, >1 children Segment C Watch enthusiasts Challenges How to create “good” segments? • Rules: How do we know they work? • Supervised: What training data to use? • Clustering: How to interpret the clusters? How to measure the quality of the segments? How to scale to thousands of segments across telcos? How to select the best segment for each campaign? What if no matching segment can be found? What if the “best” segment responds poorly to the campaign?
  • 24. How else can we frame the problem? 24 Business Problem: Find good leads for product X Data Science Problem: Find behavioral lookalikes of existing customers, iteratively What features are useful and easy to compute? Product X Create Seed group Existing customers Heuristic rules Random Collect responses / conversions Behavioral data features Mathematically defined Run lookalike model Run campaign on Target group Update&TuneSeedgroup Telco data How to select a good seed group? How to find lookalikes? How to update / tune seed group?
  • 25. Framing the problem differently could lead to a very different product! 25 Business Problem: Find good leads for product X Data Science Problem: Create customer segments for different product categories Data Science Problem: Find behavioral lookalikes of existing customers, iteratively Product X Select relevant segment Run campaign Collect responses / conversions Telco data Segment A Soccer players Segment B Mums, >1 children Segment C Watch enthusiasts Product X Create Seed group Existing customers Heuristic rules Random Collect responses / conversions Behavioral data features Mathematically defined Run lookalike model Run campaign on Target group Update&TuneSeedgroup Telco data “Customer segmentation engine” “Behavioral lookalike engine”
  • 26. It is critical to define the Data Science problem well One more step to the Data Science process 26 Data Science Problem Definition
  • 28. 28
  • 29. Most critical risks to test Does it work? Can telco data predict campaign response? Is lookalike an effective method for campaign targeting? Does it work in multiple settings? Different telcos, data sources, cultures Can it scale? 1m, 10m, 100m subscribers 29
  • 30. Build – Measure – Learn helps to manage extreme uncertainty Applying Agile in Data Science product development 30 Literature Review Artefact: Research report Analysis & Experimentation Artefact: Vignette or report MVP Development Artefacts: Code, tests & docs We cannot solve all problems at once! 1. Break the problem down 2. Identify most critical hypotheses 3. Plan a research, analysis or experimental task to test the hypothesis 4. Time-bound the problem
  • 31. Build – Measure – Learn Conduct literature review Hypothesis Mathematically defined behavioral features from telco data are useful for identifying campaign targets. Artefact: Research Report 31
  • 32. Build – Measure – Learn Conduct literature review Hypothesis A scalable and effective algorithm for lookalike modeling exists that can be used for campaign scoring. Artefact: Research Report 32
  • 33. Build – Measure – Learn Perform analysis Hypothesis Feature X (e.g. Radius of Gyration) can reveal meaningful patterns in human behavior. Artefact: Analysis Vignette 33 Correlations • Radius of Gyration and Total Distance are moderately correlated • Total Distance and Distinct Count are strongly correlated • Radius of gyration is more correlated with Distinct Count than to Total Count Radius of Gyration and Total Distance • 50% of MSISDN move withn a radius of 24 km,but traveled up to 630km of total distance within a month, this includs 11% of MSISDN did not move • 75% of MSISDN move within a radius of 97 km and made up to 3264km total distance. • 90% move within 280km, total distance of 9431km, and 95% move within a radius of 440km and total distance of 15294km. • There are MSISDN with radius of gyration that is about the whole north-south distance of the country, 1598km. • Radius of gyration is computed as follows: Eureka Feature Engine Run on CDR Data Analysts: Ying Li 2018-08-06 Problem Statement Upon the completion of running Eureka Feature Engine MVP in telco’s Zeppelin environment, we want to analyze the results to 1. Inspect and verify the engine output for feasibility and correctness 2. Explore if any insights can be gained for telco or Eureka 3. Demonstrate an analysis artifact through this Vignette Methodology Description • Run Scala/SQL/R EDA to understand the data and the distributions of features produced by Eureka Feature Engine • Run cumulative distributions • Run correlation on different features • Spot any worthy data nuggets • Spot any suspicious data issues Summary • Feature Engine covered the entire user base, computation looks correctly directionally • Meaningful analysis can be conducted regarding user behavior • Some data issues were identified to be further debugged Call to Actions 1. Seek feedback from commercial team 2. Build clustering models to investigate feasibility of unsupervised learning 3. Debug the data issues 4. Continue develop with richer feature set 5. Compare ”truth about the data” (e.g., median radius of gyration”) against “truth about the world” (e.g., commute distance for people in this city) Cumulative Distribution by Radius of Gyration Cumulative Distribution by Total Distance Total Distance Total Count Distinct Count Radius of Gyration 0.456928567 0.332110891 0.478864808 Total Distance 0.727597715 0.745295476 Total Count 0.772572346 Caveat • Distance computation was done as point to point distance between consecutive cell tower locations on the trajectory • Usage of any insights from this analysis at current state is not advised as the computation engine is MVP for demonstrating feasibility and hence not fully tested for accuracy
  • 34. Build – Measure – Learn Build and test MVP Hypothesis Product works in the real world! Artefact: Code, tests, docs Offline experimental results Online (live deployment) results 34 Collect responses / conversions Behavioral data features Run lookalike model Lookalike group Offline evaluation Lift = 4.8% Random group Online evaluation Lift = 5.2%
  • 35. Build – Measure – Learn Build and test MVP Hypothesis Product works in multiple telcos and countries, with different data, environment and behavior Artefact: Deployed MVP Offline experimental results Online (live deployment) results 35
  • 36. Build – Measure – Learn 36 Literature Review Artefact: Research report Analysis & Experimentation Artefact: Vignette or report MVP Development Artefacts: Code, tests & docs Collect responses / conversions Behavioral data features Run lookalike model Lookalike group Offline evaluation Lift = 4.8% Random group Online evaluation Lift = 5.2%
  • 37. Build – Measure – Learn in real settings as soon as possible “Best model” is not always best! “If you followed the Prize competition, you might be wondering what happened with the final Grand Prize ensemble that won the $1M two years later. This is a truly impressive compilation and culmination of years of work, blending hundreds of predictive models to finally cross the finish line. We evaluated some of the new methods offline but the additional accuracy gains that we measured did not seem to justify the engineering effort needed to bring them into a production environment. Also, our focus on improving Netflix personalization had shifted to the next level by then.” 37 https://medium.com/netflix-techblog/netflix-recommendations-beyond-the-5-stars-part-1-55838468f429
  • 39. 39
  • 40. What does this statement mean? “Our model can achieve 70% accuracy.” Is this good? How can we tell? 40
  • 41. It depends on the application “Our model can achieve 70% accuracy.” 41 Disease detection rate Not good, possibly catastrophic Ad click through rate Astounding! Too good to be true
  • 42. It depends on what one means by “accuracy” “Our model can achieve 70% accuracy.” Definition of accuracy % of correct predictions in all classes 42
  • 43. It depends on what one means by “accuracy” 43 “Our model can achieve 70% accuracy.” Sometimes, people mean Precision True Positives / predicted Positives “70% of people predicted to like soccer actually like soccer.” Sometimes, people mean Recall True Positives / actual Positives “Our model can detect 70% of soccer lovers.” Sometimes, they mean something else altogether!
  • 44. What to measure? OFFLINE Model performance Accuracy, precision, recall, AUROC, Gini, lift, etc. Privacy k-anonymity, differential privacy Compute performance Compute time, memory / CPU utilization, storage ONLINE Model performance Privacy Compute performance Response behavior Click through rate, API utilization Business outcomes Revenue, risk reduction, cost savings 44
  • 45. You build what you measure 45 If the wrong measurement is used The wrong thing will be built in the next iteration
  • 46. What is the right performance metric for campaigns? One possibility: Area Under ROC Curve (AUROC) • Captures overall classification performance • 1 is best, 0.5 is random, 0 is worst • Easy to compare different models 46 TruePositiveRate False Positive Rate This model has the bigger AUROC
  • 47. What is the right performance metric for campaigns? But when we run a campaign, we really only care about the True Positive Rate for the users with the highest scores So a better measure would be Lift@n% E.g. Lift@5% if we want to target top 5% of targets 47 TruePositiveRate False Positive Rate This model has the higher lift@5%
  • 48. The real test is when we measure performance online 48 Offline (modeling phase) Estimated Lift@1%: 10.2X Target size: 100,000 Lookalike: 40,000 Random: 30,000 Heuristic rules: 30,000 CTR: 3.82% CTR: 1.95% CTR: 1.58% Actual Lift: 2.0X Actual Lift: 0.8X Actual results from one of our campaigns using lookalike modeling
  • 50. Superpowers for a Data Science Product Manager 50 Data Science Intuition for data Statistics Machine learning Data quality Computer Science Algorithms Complexity (Big O) Distributed Computing Mode of Working Hands on, “feel” the data Detail orientation Logical, critical thinking Not afraid to be critical about one’s own assumptions Product Management
  • 51. Our team structure: Scrum teams with all the skills required to research, develop and ship a User Story 51 Cross-functional team allows diverse perspectives to be considered from the get-go Algo performance vs engineering cost Tech choices Pipeline design Etc. Team members are empowered and required to work across the spectrum of tasks from data science to engineering “Know all the basics, but be expert in a few areas” Data Scientists write unit tests Engineers read research papers You vs Me Recipe for a Data Science product team (adjust as needed) 1 PM 2 Data Scientists 3 Engineers PM Data Scientist Data Scientist Engineer Engineer Engineer
  • 52. Key takeaways Data Science is being used in more products and services, in all industries Build – Measure – Learn to work with extreme uncertainty, unknown unknowns Get the right framing of the Data Science problem, and measurement of results Test with customers 52
  • 53. Thank You Please help us by giving us your feedback goo.gl/uTuuHr 53