SlideShare ist ein Scribd-Unternehmen logo
1 von 68
Downloaden Sie, um offline zu lesen
Pinterest
Iterative
supervised
clustering
Adancebetweendata
scienceandmachinelearning
DrJuneAndrews—September2016
ExplorePinterest’scontent
Questionourunderstanding
Inspirethefuture
Agenda
1
2
3
ExplorePinterest’scontent
Questionourunderstanding
Inspirethefuture
Agenda
1
2
3
Clothing
Cooking
Decorating
Beauty
Teaching
Carpentry
Cars
Animated GIFs
Electronics
Stereos
Fashion
Sewing
Articles
Painting
Photography
Nature
Cute cats
Tattoos
Hair
Microscopy
TV shows
Apps
Self help
Motorcycles
Chairs
Fashion
Travel
Garden
Chairs
Food
Linksare

behind

everyPin
Howareusersengaging

withlinkdomains?
2:50 PM 100%
Tool Pros Cons
Cluster algorithms
(SVM, K-Means, Spectral)
• Considers all users
• Accurate
• Tough to communicate
• Definitions change over time
User experience studies • Deep knowledge
• Captures the immeasurable
• Costly
• Considers few users
Domain expert hypothesis • Human interpretable • Inaccurate
Tool Pros Cons
Cluster algorithms
(SVM, K-Means, Spectral)
• Considers all users
• Accurate
• Tough to communicate
• Definitions change over time
User experience studies • Deep knowledge
• Captures the immeasurable
• Costly
• Considers few users
Domain expert hypothesis • Human interpretable • Inaccurate
Currentclusteranalysis
Cleanandloaddataintofavoriteclusteringalgorithm
Buildvisualizationsontopofclusters
Fiddlewithparametersinclusteringalgorithm
Addhumanlabelstoeachcluster
Sharehumaninterpretationofclusters
1
2
3
4
5
Currentclusteranalysis
Cleanandloaddataintofavoriteclusteringalgorithm
Buildvisualizationsontopofclusters
Fiddlewithparametersinclusteringalgorithm
Addhumanlabelstoeachcluster
Sharehumaninterpretationofclusters
1
2
3
4
5
Fatal flaw
Humanintheloopcomputing
Community membership identification from small
seed sets (Kloumann & Kleinberg)
T
Domain Expert
Favorite
Clustering
Algorithm
Humanintheloopcomputing
When machine confidence dips, engage with domain
expert
T
Domain Expert
Favorite
Clustering
Algorithm
?
T
Unsure
Confident
Humanintheloopcomputing
When machine confidence dips, engage with domain
expert
T
Domain Expert
Favorite
Clustering
Algorithm
T
T
Unsure
Confident
?
Humanintheloopcomputing
Domain expert determines when labeling is done
T
Domain Expert
Favorite
Clustering
Algorithm
T
Thats all!
Currentanalysismethodology
Cleanandloaddataintofavoriteclusteringalgorithm
Buildvisualizationsontopofclusters
Fiddlewithparametersinclusteringalgorithm
Addhumanlabelstoeachcluster
Sharehumaninterpretationofclusters
1
2
3
4
5
Humanintheloopcomputing
Stage 1: Machine clusters data
Favorite
Clustering
Algorithm
Humanintheloopcomputing
Stage 2: Domain expert creates 1 human interpretable
cluster
Domain Expert
Humanintheloopcomputing
Stage 3: Remove human labeled clusters and iterate
Favorite
Clustering
Algorithm
Domain Expert
How are users engaging
with link domains?
•Forasamplesetoflinkdomains
we’reinterestedin:
• AllPincreatesintheirfirstyearonPinterest
• AllrepinsintheirfirstyearonPinterest
• 100klinkdomainssampledtotal
Linksarebehind
everyPin
2:50 PM 100%
Python
Notebook
Provides guided iteration
Python
Notebook
Sample visualization 

for each cluster
Python
Notebook
Pin creates Repins
Few Many
Many
Few
Iteration1
Title Dark content
Description Fewer than 2 Pins a week on average
Examples Noisy low quality content
Iteration2
42% of domains left
Few Many Few Some Few Many
0 0 0 0 0 0
Cluster 1 Cluster 3Cluster 2
Pin creates Repins Pin creates RepinsPin creates Repins
Description
Domains with few Pins, but
these Pins thrive in the
Pinterest ecosystem
Calculation
def
detect_pinterest_specials(domain_engagement):
ratio = domain_engagement.n_repins / max(1.0,
float(domain_engagement.n_pin_creates))
return domain_engagement.n_pin_creates <= X
and ratio >= Y
Examples Fashion and impulse sites
Iteration2
Pinterest specials
Few
Pinterest specials
Repins
Many
0 0
Pin creates
Iteration3
33% of domains left
Few Few Few Some Few Many
0 0 0 0 0 0
Cluster 1 Cluster 3Cluster 2
Pin creates Repins Pin creates RepinsPin creates Repins
Iteration3
Steady growth
Description
Active Pin creates and
steady growth throughout
the year
Calculation
def detect_steady_growth(domain_engagement):
(growth_rate, intercept) =
np.polyfit(range(len(domain_engagement.monthly_repins)
), domain_engagement.monthly_repins,1)
return months_pins_created >= X and growth_rate >= Y
Examples Recipe and DIY sites
Some
Steady growth
Repins
Many
0 0
Pin creates
Iteration4
25% of domains left
Few Some Many Some Few Some
0 0 0 0 0 0
Cluster 1 Cluster 3Cluster 2
Pin creates Repins Pin creates RepinsPin creates Repins
Iteration4
Slow growth
Description Similar to steady growth,
but not as fast
Calculation
def detect_steady_growth(domain_engagement):
(growth_rate, intercept) = np.podef
detect_steady_growth(domain_engagement):
(growth_rate, intercept) =
np.polyfit(range(len(domain_engagement.monthly_repins)),
domain_engagement.monthly_repins,1)
return months_pins_created >= X and growth_rate >=
Ylyfit(range(len(domain_engagement.monthly_repins)),
domain_engagement.monthly_repins,1)
return months_pins_created >= X and growth_rate >= Y
Examples Little lower quality recipe 

and DIY sites
Few
Slow growth
Repins
Many
0 0
Pin creates
Iteration5
Churning
Description Slowly fade through the year
Calculation
def detect_churning(domain_engagement):
(repin_growth, intercept) = np.polyfit(
range(len(domain_engagement.monthly_repins) - 2),
domain_engagement.monthly_repins[2:],
1)
(pin_create_growth, intercept) = np.polyfit(
range(len(domain_engagement.monthly_repins) - 2),
domain_engagement.monthly_pin_creates[2:],
1)
return repin_growth < 0 and pin_create_growth < 0
Examples Fashion sale 

and click bait sites
Few
Churning
Repins
Many
0 0
Pin creates
Iteration6
Yearly
Description Slowly fade through the year
Calculation
def detect_churning(domain_engagement):
(repin_growth, intercept) = np.polyfit(
range(len(domain_engagement.monthly_repins) - 2),
domain_engagement.monthly_repins[2:],
1)
(pin_create_growth, intercept) = np.polyfit(
range(len(domain_engagement.monthly_repins) - 2),
domain_engagement.monthly_pin_creates[2:],
1)
return repin_growth < 0 and pin_create_growth < 0
Examples Seasonal fashion, 

such as snow boots
Few
Yearly
Pin creates Repins
Many
0 0
Iteration7
Late bloomer
Description Peak mid year
Calculation
def detect_late_bloomer(domain_engagement):
(concavity, pin_growth, intercept) = np.polyfit(
range(len(domain_engagement.monthly_repins) - 2),
[r + p for (r, p) in zip(domain_engagement.monthly_repins[2:],
domain_engagement.monthly_pin_creates[2:])],
2)
return concavity < 0
Examples Blogs that get off to a slow
start
Few
Pinterest late bloomer
Pin creates Repins
Many
0 0
Clusters
•Darkcontent
•Pinterestspecials
•Steadygrowth
•Slowgrowth
•Churning
•Yearly
•Latebloomer
ExplorePinterest’scontent
Questionourunderstanding
Inspirethefuture
Agenda
1
2
3
Doesasking
twiceyield
thesame
answer?
Shouldweclusteragain?
2:50 PM 100%
Costofreplicatinganalysisis
leavingotherbusiness
opportunitiesonthetable
2:50 PM 100%
Data
scienceis
expensive
Unknown
2:50 PM 100%
Wouldit
makea
difference?
Replication
Crisisin
Psychology
Silberzahn & Ahlmann; Crowdsourced research: Many hands make tight work
NatureAugust2015
Crowd
sourced
studyon
redcards

insoccer
Silberzahn & Ahlmann; Crowdsourced research: Many hands make tight work
NatureOctober2015
TheNewYorkTimesonpredictingthepresidency
September, 2016
Cohn; We Gave Four Good Pollsters the Same Raw Data. They Had Four Different Results.
…butwe’veloweredthecost!
2:50 PM 100%
Data
scienceis
expensive
…9datascientistsand

machinelearningengineers.
Samedata,sameUI,sameday.
Everyonefinishedin~1hour.
…so

wedidit
again
Modelsarealworldsituation
withlimitedresources
9ishuge!
weretheresultsthesame?
Everythingwas
thesame
Baseline
clusters
Results e Results l Results d Results m Results z Results b Results k
Dark content
Pinterest specials
Steady growth
Slow growth
Churning
Yearly
Late bloomer
Existingclustersasourbaseline
Baseline
clusters
Results e Results l Results d Results m Results z Results b Results k
Dark content Unpopular (95%) Trailing (90%)
Pinterest specials Trailing (100%)
Viral on Pinterest
(98%)
Pin creates drop
off (97%)
Steady growth
Increasing repins
(94%)
Continuous
growth (94%)
Slow growth
Churning
Yearly
Late bloomer
90%Matches
Baseline
clusters
Results e Results l Results d Results m Results z Results b Results k
Dark content Unpopular (95%) Trailing (90%)
Original pinny
(84%)
Pinterest specials Trailing (100%)
Minimal original
Pins (66%)
Viral on Pinterest
(98%)
Pin creates drop
off (97%)
Steady growth
Pinterest viral
content (62%) Other (53%)
Original Pinny
(51%)
Viral on the
internet (69%)
Increasing repins
(94%)
Continuous
growth (94%)
Suspected Save
button high Pin
creates (73%)
Slow growth
Pinterest viral
content (55%)
Original Pinny
(82%)
Viral on the
internet (65%)
Increasing repins
(65%)
Continuous
growth (86%)
Suspected Save
button high Pin
creates (51%)
Churning
Original Pinny
(68%)
Viral on the
internet (53%)
Yearly
Original Pinny
(71%)
Late bloomer
Original Pinny
(71%)
Continuous
growth (55%)
Suspected Save
button high Pin
creates (59%)
50%Matches
Baseline
Clusters
Results e Results l Results d Results m Results z Results b Results k
Dark content Unpopular (95%) Trailing (90%)
Original pinny
(84%)
Pinterest specials Trailing (100%)
Minimal original
Pins (66%)
Viral on Pinterest
(98%)
Pin creates drop
off (97%)
Steady growth
Pinterest viral
content (62%) Other (53%)
Original Pinny
(51%)
Viral on the
internet (69%)
Increasing repins
(94%)
Continuous
growth (94%)
Suspected Save
button high Pin
creates (73%)
Slow growth
Pinterest viral
content (55%)
Original Pinny
(82%)
Viral on the
internet (65%)
Increasing repins
(65%)
Continuous
growth (86%)
Suspected Save
button high Pin
creates (51%)
Churning
Original Pinny
(68%)
Viral on the
internet (53%)
Yearly
Original Pinny
(71%)
Late bloomer
Original Pinny
(71%)
Continuous
growth (55%)
Suspected Save
button high Pin
creates (59%)
50%Matches
Baseline
clusters
Results e Results l Results d Results m Results z Results b Results k
Dark content Unpopular (95%) Trailing (90%)
Original pinny
(84%)
Pinterest specials Trailing (100%)
Minimal original
Pins (66%)
Viral on Pinterest
(98%)
Pin creates drop
off (97%)
Steady growth
Pinterest viral
content (62%) Other (53%)
Original Pinny
(51%)
Viral on the
internet (69%)
Increasing repins
(94%)
Continuous
growth (94%)
Suspected Save
button high Pin
creates (73%)
Slow growth
Pinterest viral
content (55%)
Original Pinny
(82%)
Viral on the
internet (65%)
Increasing repins
(65%)
Continuous
growth (86%)
Suspected Save
button high Pin
creates (51%)
Churning
Original Pinny
(68%)
Viral on the
internet (53%)
Yearly
Original Pinny
(71%)
Late bloomer
Original Pinny
(71%)
Continuous
growth (55%)
Suspected Save
button high Pin
creates (59%)
50%Matches
Baseline
clusters
Results e Results l Results d Results m Results z Results b Results k
Dark content Unpopular (95%) Trailing (90%)
Original pinny
(84%)
Pinterest specials Trailing (100%)
Minimal original
Pins (66%)
Viral on Pinterest
(98%)
Pin creates drop
off (97%)
Steady growth
Pinterest viral
content (62%) Other (53%)
Original Pinny
(51%)
Viral on the
internet (69%)
Increasing repins
(94%)
Continuous
growth (94%)
Suspected Save
button high Pin
creates (73%)
Slow growth
Pinterest viral
content (55%)
Original Pinny
(82%)
Viral on the
internet (65%)
Increasing repins
(65%)
Continuous
growth (86%)
Suspected Save
button high Pin
creates (51%)
Churning
Original Pinny
(68%)
Viral on the
internet (53%)
Yearly
Original Pinny
(71%)
Late bloomer
Original Pinny
(71%)
Continuous
growth (55%)
Suspected Save
button high Pin
creates (59%)
50%Matches
Baseline
clusters
Results e Results l Results d Results m Results z Results b Results k
Yearly Seasonal Throwback Seasonal Annual
Steady growth
Gaining
popularity Increasing repins
Continuous
growth High engagement
Pinterest specials Initial flurry
Minimal original
Pins Viral on Pinterest
Pin create drop
off
Unpopular
domains with
good content
Conceptuallysimilarclusters
But not related in implementation
…Goodvs.bad
Differencesinperspective
Two

rootsof
variations
Signsofsuboptimalclustering
•Leadingwithbiases
•Cherry-picking:responding
toalimitedsubsetofthe
data
Few
Seasonal
Pin creates Repins
Few
0 0
Differences
ofperspective
•Resultsm-Viralgrowthcentric
• ViralonPinterest
• Viralontheinternet
• Lame
•Resultsd-Originalcontentcentric
• PersistentoriginalPins
• MinimaloriginalPins
• OriginalPinny
•Resultsl-Returnoninvestmentcentric
• Underserved
• Draught
• Trailing
Impactimplications
9datascientists

9answers
•Productsdependingonclusterused
• Viralmechanisms
• SpeedingPindemotion
• PromotingunderservedPins
•Forsameproduct,

domainsimpacteddifferfor
• Seasonality
• Steadygrowth
• Pinterestspecials
Bottomline
Itmatterswhichdatascientist
doesananalysis
ExplorePinterest’scontent
Questionourunderstanding
Inspirethefuture
Agenda
1
2
3
Let’saskthehardquestion

andbravetheanswertogether
Whenis

datascience

ahouse

ofcards?
Avalancheof
Resources
Measuringdatascienceimpact
•Experimentalsystemsarenowstandard
•Datascientistsaremoreavailable
•Reproducibleanalysis
•[Now]Fastreplicableanalysis
Utilize
Resources
Experiment
• Recordendtoendfromanalysistoimpact
• Innovateonprocesses
• Borrowideasonreplicationfromscience
• Tailorourtechniques forreplication
Concrete
experiments
Breakdowntheproblem

andbuildup
•NarrowDifferenceinPerception
throughPriminganalysts
•Developarubricofexcellence
•Trainanalystsongenerateddata
•Addprocessstabilizers
Pinterest

isinterested
pin.it/Data
Reachout!
DrJuneAndrews
june@pinterest.com/ DrAndrews/ DrJuneAndrews
Let’sdatascience,

datascience!
Let’scrackthecodeto
systematicinnovation
Thankyou!
Wearehiring!
https://engineering.pinterest.com/
pin.it/Data
Replication in Data Science - A Dance Between Data Science & Machine Learning Strata 2016

Weitere ähnliche Inhalte

Was ist angesagt?

Implementing Data Science
Implementing Data ScienceImplementing Data Science
Implementing Data ScienceNathan Watson
 
2018 02 converged it
2018 02 converged it2018 02 converged it
2018 02 converged itChris Dwan
 
The Art of Getting Buy-In from the Boss
The Art of Getting Buy-In from the BossThe Art of Getting Buy-In from the Boss
The Art of Getting Buy-In from the BossWatershed
 
10 tough decisions donor data migration decisions (Webinar hosted by Bloomera...
10 tough decisions donor data migration decisions (Webinar hosted by Bloomera...10 tough decisions donor data migration decisions (Webinar hosted by Bloomera...
10 tough decisions donor data migration decisions (Webinar hosted by Bloomera...Brandon Fix
 
Estimate and Measure. Minimize work, maximize value. Part 1
Estimate and Measure. Minimize work, maximize value. Part 1Estimate and Measure. Minimize work, maximize value. Part 1
Estimate and Measure. Minimize work, maximize value. Part 1Shiftup
 
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’tAdi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’tAgile Impact
 
Spinetti.david probability and statistics
Spinetti.david probability and statisticsSpinetti.david probability and statistics
Spinetti.david probability and statisticsDavid Spinetti
 
Practical Data Strategies in the real world of poor Data Quality
Practical Data Strategies in the real world of poor Data QualityPractical Data Strategies in the real world of poor Data Quality
Practical Data Strategies in the real world of poor Data QualityAndrew Patricio
 
UXDX 18: Data Enabled Design,
UXDX 18: Data Enabled Design, UXDX 18: Data Enabled Design,
UXDX 18: Data Enabled Design, UXDXConf
 
Data Analysis Goes Wrong by Microsoft Sr PM
Data Analysis Goes Wrong by Microsoft Sr PMData Analysis Goes Wrong by Microsoft Sr PM
Data Analysis Goes Wrong by Microsoft Sr PMProduct School
 
2016 Pittsburgh Data Jam Student Workshop
2016 Pittsburgh Data Jam Student Workshop2016 Pittsburgh Data Jam Student Workshop
2016 Pittsburgh Data Jam Student WorkshopMatthew DeReno
 
Data is worthless if you don;t communicate
Data is worthless if you don;t communicateData is worthless if you don;t communicate
Data is worthless if you don;t communicateAbhi Rana
 
From health persona to societal health uci 131202
From health persona to societal health  uci  131202From health persona to societal health  uci  131202
From health persona to societal health uci 131202Ramesh Jain
 
Data is not facts: The impossibility of being unbiased
Data is not facts: The impossibility of being unbiasedData is not facts: The impossibility of being unbiased
Data is not facts: The impossibility of being unbiasedAndrew Patricio
 
ZOLLHOF Know-How Event You track too much and learn too little
ZOLLHOF Know-How Event You track too much and learn too littleZOLLHOF Know-How Event You track too much and learn too little
ZOLLHOF Know-How Event You track too much and learn too littleZOLLHOF - Tech Incubator
 
How to Hire Data Scientists
How to Hire Data ScientistsHow to Hire Data Scientists
How to Hire Data ScientistsGalvanize
 
Data in development @ Spotify
Data in development @ SpotifyData in development @ Spotify
Data in development @ SpotifyOscar Carlsson
 
Predicting the Future With Microsoft Bing
Predicting the Future With Microsoft BingPredicting the Future With Microsoft Bing
Predicting the Future With Microsoft BingCybera Inc.
 
Simplify your analytics strategy- Palash badjatya
Simplify your analytics strategy- Palash badjatyaSimplify your analytics strategy- Palash badjatya
Simplify your analytics strategy- Palash badjatyaAcropolis Technical Campus
 

Was ist angesagt? (20)

Implementing Data Science
Implementing Data ScienceImplementing Data Science
Implementing Data Science
 
2018 02 converged it
2018 02 converged it2018 02 converged it
2018 02 converged it
 
The Art of Getting Buy-In from the Boss
The Art of Getting Buy-In from the BossThe Art of Getting Buy-In from the Boss
The Art of Getting Buy-In from the Boss
 
10 tough decisions donor data migration decisions (Webinar hosted by Bloomera...
10 tough decisions donor data migration decisions (Webinar hosted by Bloomera...10 tough decisions donor data migration decisions (Webinar hosted by Bloomera...
10 tough decisions donor data migration decisions (Webinar hosted by Bloomera...
 
Estimate and Measure. Minimize work, maximize value. Part 1
Estimate and Measure. Minimize work, maximize value. Part 1Estimate and Measure. Minimize work, maximize value. Part 1
Estimate and Measure. Minimize work, maximize value. Part 1
 
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’tAdi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
 
Spinetti.david probability and statistics
Spinetti.david probability and statisticsSpinetti.david probability and statistics
Spinetti.david probability and statistics
 
Practical Data Strategies in the real world of poor Data Quality
Practical Data Strategies in the real world of poor Data QualityPractical Data Strategies in the real world of poor Data Quality
Practical Data Strategies in the real world of poor Data Quality
 
UXDX 18: Data Enabled Design,
UXDX 18: Data Enabled Design, UXDX 18: Data Enabled Design,
UXDX 18: Data Enabled Design,
 
Data Analysis Goes Wrong by Microsoft Sr PM
Data Analysis Goes Wrong by Microsoft Sr PMData Analysis Goes Wrong by Microsoft Sr PM
Data Analysis Goes Wrong by Microsoft Sr PM
 
2016 Pittsburgh Data Jam Student Workshop
2016 Pittsburgh Data Jam Student Workshop2016 Pittsburgh Data Jam Student Workshop
2016 Pittsburgh Data Jam Student Workshop
 
Data is worthless if you don;t communicate
Data is worthless if you don;t communicateData is worthless if you don;t communicate
Data is worthless if you don;t communicate
 
From health persona to societal health uci 131202
From health persona to societal health  uci  131202From health persona to societal health  uci  131202
From health persona to societal health uci 131202
 
Data is not facts: The impossibility of being unbiased
Data is not facts: The impossibility of being unbiasedData is not facts: The impossibility of being unbiased
Data is not facts: The impossibility of being unbiased
 
ZOLLHOF Know-How Event You track too much and learn too little
ZOLLHOF Know-How Event You track too much and learn too littleZOLLHOF Know-How Event You track too much and learn too little
ZOLLHOF Know-How Event You track too much and learn too little
 
How to Hire Data Scientists
How to Hire Data ScientistsHow to Hire Data Scientists
How to Hire Data Scientists
 
Data in development @ Spotify
Data in development @ SpotifyData in development @ Spotify
Data in development @ Spotify
 
MnSearch Snippets April 2019: Google Data Studio - Steve Slater
MnSearch Snippets April 2019: Google Data Studio - Steve SlaterMnSearch Snippets April 2019: Google Data Studio - Steve Slater
MnSearch Snippets April 2019: Google Data Studio - Steve Slater
 
Predicting the Future With Microsoft Bing
Predicting the Future With Microsoft BingPredicting the Future With Microsoft Bing
Predicting the Future With Microsoft Bing
 
Simplify your analytics strategy- Palash badjatya
Simplify your analytics strategy- Palash badjatyaSimplify your analytics strategy- Palash badjatya
Simplify your analytics strategy- Palash badjatya
 

Andere mochten auch

Start15 homework template-1-miha_in_lina
Start15 homework template-1-miha_in_linaStart15 homework template-1-miha_in_lina
Start15 homework template-1-miha_in_linaMiha Zoubek
 
2 Skridt til en Doven Manhs Kost
2 Skridt til en Doven Manhs Kost2 Skridt til en Doven Manhs Kost
2 Skridt til en Doven Manhs Kosthumoroustempo408
 
5 Tips Mendapatkan Beasiswa Keluar Negeri
5 Tips Mendapatkan Beasiswa Keluar Negeri5 Tips Mendapatkan Beasiswa Keluar Negeri
5 Tips Mendapatkan Beasiswa Keluar NegeriAdinny Paramita
 
Start15 homework template-1-miha_in_lina
Start15 homework template-1-miha_in_linaStart15 homework template-1-miha_in_lina
Start15 homework template-1-miha_in_linaMiha Zoubek
 
Tugas 12 kbds bootcamp medellia kue
Tugas 12 kbds bootcamp   medellia kueTugas 12 kbds bootcamp   medellia kue
Tugas 12 kbds bootcamp medellia kueAdinny Paramita
 
The IWB’s Affordances and Mathematics teachers’ attitudes
The IWB’s Affordances and Mathematics teachers’ attitudesThe IWB’s Affordances and Mathematics teachers’ attitudes
The IWB’s Affordances and Mathematics teachers’ attitudesMuhammad Fraz Khan
 
Character profile and location
Character profile and locationCharacter profile and location
Character profile and locationemily123432
 

Andere mochten auch (15)

Start15 homework template-1-miha_in_lina
Start15 homework template-1-miha_in_linaStart15 homework template-1-miha_in_lina
Start15 homework template-1-miha_in_lina
 
2 Skridt til en Doven Manhs Kost
2 Skridt til en Doven Manhs Kost2 Skridt til en Doven Manhs Kost
2 Skridt til en Doven Manhs Kost
 
Diabetes Ev Doc 2013
Diabetes Ev Doc 2013Diabetes Ev Doc 2013
Diabetes Ev Doc 2013
 
5 Tips Mendapatkan Beasiswa Keluar Negeri
5 Tips Mendapatkan Beasiswa Keluar Negeri5 Tips Mendapatkan Beasiswa Keluar Negeri
5 Tips Mendapatkan Beasiswa Keluar Negeri
 
JavaScript
JavaScriptJavaScript
JavaScript
 
Start15 homework template-1-miha_in_lina
Start15 homework template-1-miha_in_linaStart15 homework template-1-miha_in_lina
Start15 homework template-1-miha_in_lina
 
Usman job cv
Usman job cvUsman job cv
Usman job cv
 
15
1515
15
 
Tugas 12 kbds bootcamp medellia kue
Tugas 12 kbds bootcamp   medellia kueTugas 12 kbds bootcamp   medellia kue
Tugas 12 kbds bootcamp medellia kue
 
The IWB’s Affordances and Mathematics teachers’ attitudes
The IWB’s Affordances and Mathematics teachers’ attitudesThe IWB’s Affordances and Mathematics teachers’ attitudes
The IWB’s Affordances and Mathematics teachers’ attitudes
 
Character profile and location
Character profile and locationCharacter profile and location
Character profile and location
 
WORLD WAR I
WORLD WAR I WORLD WAR I
WORLD WAR I
 
Cognitive Impairment UnAd
Cognitive Impairment UnAdCognitive Impairment UnAd
Cognitive Impairment UnAd
 
Acid suppression UnAd
Acid suppression UnAdAcid suppression UnAd
Acid suppression UnAd
 
Slide tiếng anh
Slide tiếng anhSlide tiếng anh
Slide tiếng anh
 

Ähnlich wie Replication in Data Science - A Dance Between Data Science & Machine Learning Strata 2016

Data Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science EnthusiastsData Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science EnthusiastsKrist Wongsuphasawat
 
Making data visual diy guide to getting started with data visualization
Making data visual diy guide to getting started with data visualizationMaking data visual diy guide to getting started with data visualization
Making data visual diy guide to getting started with data visualizationVisual Resources Association
 
The How and Why of Feature Engineering
The How and Why of Feature EngineeringThe How and Why of Feature Engineering
The How and Why of Feature EngineeringAlice Zheng
 
How to Feed a Data Hungry Organization – by Traveloka Data Team
How to Feed a Data Hungry Organization – by Traveloka Data TeamHow to Feed a Data Hungry Organization – by Traveloka Data Team
How to Feed a Data Hungry Organization – by Traveloka Data TeamTraveloka
 
Dances with unicorns
Dances with unicornsDances with unicorns
Dances with unicornsEspritAgile
 
Semi-Supervised Insight Generation from Petabyte Scale Text Data
Semi-Supervised Insight Generation from Petabyte Scale Text DataSemi-Supervised Insight Generation from Petabyte Scale Text Data
Semi-Supervised Insight Generation from Petabyte Scale Text DataTech Triveni
 
Primer to Machine Learning
Primer to Machine LearningPrimer to Machine Learning
Primer to Machine LearningJeff Tanner
 
Deep Learning: concepts and use cases (October 2018)
Deep Learning: concepts and use cases (October 2018)Deep Learning: concepts and use cases (October 2018)
Deep Learning: concepts and use cases (October 2018)Julien SIMON
 
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆台灣資料科學年會
 
C3P: Context-Aware Crowdsourced Cloud Privacy (at PETS 2014)
C3P: Context-Aware Crowdsourced Cloud Privacy (at PETS 2014)C3P: Context-Aware Crowdsourced Cloud Privacy (at PETS 2014)
C3P: Context-Aware Crowdsourced Cloud Privacy (at PETS 2014)Hamza Harkous
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273Abutest
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273Abutest
 
The math behind big systems analysis.
The math behind big systems analysis.The math behind big systems analysis.
The math behind big systems analysis.Theo Schlossnagle
 
Identify Root Causes – 5 Whys
Identify Root Causes – 5 WhysIdentify Root Causes – 5 Whys
Identify Root Causes – 5 WhysMatt Hansen
 
Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersAlbert Y. C. Chen
 
Thexfactor 160108194702
Thexfactor 160108194702Thexfactor 160108194702
Thexfactor 160108194702Lori Trafford
 
The X factor: The Secret to Better Content Marketing
The X factor: The Secret to Better Content Marketing The X factor: The Secret to Better Content Marketing
The X factor: The Secret to Better Content Marketing Mathew Sweezey
 
BeaconsAI engr 245 lean launchpad stanford 2019
BeaconsAI engr 245 lean launchpad stanford 2019BeaconsAI engr 245 lean launchpad stanford 2019
BeaconsAI engr 245 lean launchpad stanford 2019Stanford University
 
BTech Final Project (1).pptx
BTech Final Project (1).pptxBTech Final Project (1).pptx
BTech Final Project (1).pptxSwarajPatel19
 

Ähnlich wie Replication in Data Science - A Dance Between Data Science & Machine Learning Strata 2016 (20)

Data Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science EnthusiastsData Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science Enthusiasts
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
 
Making data visual diy guide to getting started with data visualization
Making data visual diy guide to getting started with data visualizationMaking data visual diy guide to getting started with data visualization
Making data visual diy guide to getting started with data visualization
 
The How and Why of Feature Engineering
The How and Why of Feature EngineeringThe How and Why of Feature Engineering
The How and Why of Feature Engineering
 
How to Feed a Data Hungry Organization – by Traveloka Data Team
How to Feed a Data Hungry Organization – by Traveloka Data TeamHow to Feed a Data Hungry Organization – by Traveloka Data Team
How to Feed a Data Hungry Organization – by Traveloka Data Team
 
Dances with unicorns
Dances with unicornsDances with unicorns
Dances with unicorns
 
Semi-Supervised Insight Generation from Petabyte Scale Text Data
Semi-Supervised Insight Generation from Petabyte Scale Text DataSemi-Supervised Insight Generation from Petabyte Scale Text Data
Semi-Supervised Insight Generation from Petabyte Scale Text Data
 
Primer to Machine Learning
Primer to Machine LearningPrimer to Machine Learning
Primer to Machine Learning
 
Deep Learning: concepts and use cases (October 2018)
Deep Learning: concepts and use cases (October 2018)Deep Learning: concepts and use cases (October 2018)
Deep Learning: concepts and use cases (October 2018)
 
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
 
C3P: Context-Aware Crowdsourced Cloud Privacy (at PETS 2014)
C3P: Context-Aware Crowdsourced Cloud Privacy (at PETS 2014)C3P: Context-Aware Crowdsourced Cloud Privacy (at PETS 2014)
C3P: Context-Aware Crowdsourced Cloud Privacy (at PETS 2014)
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
 
The math behind big systems analysis.
The math behind big systems analysis.The math behind big systems analysis.
The math behind big systems analysis.
 
Identify Root Causes – 5 Whys
Identify Root Causes – 5 WhysIdentify Root Causes – 5 Whys
Identify Root Causes – 5 Whys
 
Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional Managers
 
Thexfactor 160108194702
Thexfactor 160108194702Thexfactor 160108194702
Thexfactor 160108194702
 
The X factor: The Secret to Better Content Marketing
The X factor: The Secret to Better Content Marketing The X factor: The Secret to Better Content Marketing
The X factor: The Secret to Better Content Marketing
 
BeaconsAI engr 245 lean launchpad stanford 2019
BeaconsAI engr 245 lean launchpad stanford 2019BeaconsAI engr 245 lean launchpad stanford 2019
BeaconsAI engr 245 lean launchpad stanford 2019
 
BTech Final Project (1).pptx
BTech Final Project (1).pptxBTech Final Project (1).pptx
BTech Final Project (1).pptx
 

Mehr von June Andrews

Scaling & Transforming Stitch Fix's Visibility into What Folks will love
Scaling & Transforming Stitch Fix's Visibility into What Folks will loveScaling & Transforming Stitch Fix's Visibility into What Folks will love
Scaling & Transforming Stitch Fix's Visibility into What Folks will loveJune Andrews
 
The Uncanny Valley of ML
The Uncanny Valley of MLThe Uncanny Valley of ML
The Uncanny Valley of MLJune Andrews
 
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...June Andrews
 
Push & Pull History of Data Science in Industry & Academia
Push & Pull History of Data Science in Industry & AcademiaPush & Pull History of Data Science in Industry & Academia
Push & Pull History of Data Science in Industry & AcademiaJune Andrews
 
Counter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsCounter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsJune Andrews
 
Counter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsCounter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsJune Andrews
 

Mehr von June Andrews (9)

Scaling & Transforming Stitch Fix's Visibility into What Folks will love
Scaling & Transforming Stitch Fix's Visibility into What Folks will loveScaling & Transforming Stitch Fix's Visibility into What Folks will love
Scaling & Transforming Stitch Fix's Visibility into What Folks will love
 
The Uncanny Valley of ML
The Uncanny Valley of MLThe Uncanny Valley of ML
The Uncanny Valley of ML
 
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
 
Data Competitive
Data CompetitiveData Competitive
Data Competitive
 
Push & Pull History of Data Science in Industry & Academia
Push & Pull History of Data Science in Industry & AcademiaPush & Pull History of Data Science in Industry & Academia
Push & Pull History of Data Science in Industry & Academia
 
ML Playbook
ML PlaybookML Playbook
ML Playbook
 
Counter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsCounter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of Things
 
Counter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsCounter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of Things
 
Economic Insights
Economic InsightsEconomic Insights
Economic Insights
 

Kürzlich hochgeladen

Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformationAnnie Melnic
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etclalithasri22
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelBoston Institute of Analytics
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfNicoChristianSunaryo
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaManalVerma4
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...ThinkInnovation
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfnikeshsingh56
 
Presentation of project of business person who are success
Presentation of project of business person who are successPresentation of project of business person who are success
Presentation of project of business person who are successPratikSingh115843
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...ThinkInnovation
 

Kürzlich hochgeladen (16)

Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformation
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etc
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdf
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in India
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdf
 
2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use
 
Presentation of project of business person who are success
Presentation of project of business person who are successPresentation of project of business person who are success
Presentation of project of business person who are success
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...
 

Replication in Data Science - A Dance Between Data Science & Machine Learning Strata 2016