SlideShare ist ein Scribd-Unternehmen logo
1 von 31
Kaggle Competitions, New Friends, New
Skills and New Opportunities
Jo-fai (Joe) Chow
Data Scientist
joe@h2o.ai
@matlabulus
Version 2 – Data Science Exeter Meetup
Civil Engineer → Data Scientist
• 2005 - 2015
• Water Engineer
o Consultant for Utilities
• SEAMS (Sheffield)
o EngD Research
• University of Exeter
• XP Solutions (Newbury)
• 2015 - Present
• Data Scientist
o UK Telecom
• Virgin Media
o Silicon Valley
• Domino Data Lab
• H2O.ai
2
About Domino and H2O
3
About This Talk
• What happened
o Things I did since I
started participating in
Kaggle competitions.
o New opportunities –
results of new skills and
friends.
4
First MOOC Experience
• One of the first Massive
Open Online Courses.
o Met some new friends.
o Decided to collaborate
for fun.
o “How about Kaggle?”
o “What is Kaggle?”
5
About Kaggle
• World’s biggest predictive
modelling competition
platform
• 560k members
• Competition types:
o Featured (prize)
o Recruitment
o Playground
o 101
6
First Kaggle Experience
• First time in my life
o Supervised learning
• Random Forest
• Support Vector Machine
• Neural Networks
o Train, Validate & Predict.
o “Is it black magic?”
7
First Kaggle Experience
• Problems
o “Hey Joe, you are a nice
guy but we can’t work
together.”
o “You love MATLAB so
much. You even call
yourself @matlabulous
on twitter!”
o “We prefer R/Python.”
• Results
o I kept using MATLAB
o Lone wolf
o No collaboration 
8
Identifying Skills Gap
• Obvious skills gap:
o Open-source
programming langauges
o Machine learning
techniques
o Collaboration
• Kind of related
o Data visualisation
o Handling large datasets
o Explaining results
• That competition was a good wake up call.
9
From MATLAB to R/Python
MATLAB Python R
Neural Networks ✔️ ✔️ ✔️
Random Forest ✔️ ✔️ ✔️
SVM ✔️ ✔️ ✔️
Other Machine
Learning Libraries
Toolboxes
(commercial + open
source)
Scikit-learn and
many more
CRAN, GitHub
(A LOT!)
Data Visualisation I wasn’t good at it
anyway …
Matplotlib
(plus a lot more
since then)
ggplot2 (WOW!)
(plus a lot more
since then)
10
What can people do with R?
11
James Cheshire, UCL
Link
Paul Butler, Facebook
Link
Filling the Skills Gap
• More MOOC
o Machine Learning
• Andrew Ng (Coursera)
o Data Analysis
• Jeff Leek (Coursera)
• R
o Intro to Programming
• Dave Evans (Udacity)
• Python
• Things I also picked up:
o Linux (Ubuntu)
o Git
o Cloud computing
o HTML / CSS
12
Learning from other Kagglers
• Continuous learning
o Kaggle’s forums and blogs.
o New tools and tricks.
o Many things you cannot
learn from school.
o I am standing on the
shoulders of many
Kagglers.
13
Side Project 1 – Crime Data Viz
shiny::runGitHub("rApps", "woobe",
subdir = "crimemap") 14
http://insidebigdata.com/2013/11/30/
visualization-week-crimemap/
Before I knew it …Using R + crime data from data.gov.uk
Side Project 2 – Data Viz Contest
15
https://github.com/woobe/rugsmaps
While I was obsessed with making maps …
http://blog.revolutionanalytics.com/2014/08/
winner-for-revolution-analytics-user-group-map-contest.html
Side Project 3 – Colour Palette
16
I am also obsessed with colours …
https://github.com/woobe/rPlotter
http://blog.revolutionanalytics.com/
2015/03/color-extraction-with-r.html
#TheDress
Side Project 4 – World Cup 2014
• World Cup 2014 Correct
Score Prediction
o ML vs. my friends
o 10 out of 64 (15.6%)
o Friends’ avg. = 4 (6.3%)
o github.com/
woobe/wc2014
• Euro 2016
o Collecting data right now
o github.com/woobe/
euro2016
17
Open Up Myself
• Before Kaggle/MOOC
o I was drawing a circle
around myself.
o Fear of change.
o Domain-specific problem
solving.
• After Kaggle/MOOC
o Data-driven approach.
o Not a subject matter
expert? No worries 
o Free to try new tools, to
learn and to create.
18
New Opportunities
• LondonR
o First presentation
outside water industry /
academia.
o Very positive feedback.
o Led to other projects.
o bit.ly
/londonr_crimemap
19
New Opportunities
• useR! 2014 (UCLA)
o Presented a poster.
o Met new friends.
o Life-changing event.
o github.com/
woobe/useR_2014
20
New Friends
21
Ramnath Vaidyanathan
htmlwidgets
DataRobot
Nick @
DominoData
Lab
H2O.ai &
John Chambers!
rOpenSci
RStudio
Matt Dowle
data.table (also at H2O.ai)
More Opportunities
• First blog post about
H2O
o Things to try after useR!
– Part1: Deep Learning
with H2O
22
More Opportunities
• Blog post about Domino
and H2O
o I did it for fun. I did not
have any expectation.
o It helped attract
customers to both
Domino and H2O.
23
Becoming a Data Scientist
24
The leap of faith …
London Kagglers Assemble
• London Kaggle Meetup
o Sep 2015
o I met my Kaggle buddy
Mickael Le Gal
o He is a product data
scientist at Tictrac
25
Mickael Joe
London Kagglers Assemble
• Rossmann Store Sales
o We got stuck at top 10% for a long
period.
o Mickael had a breakthrough in feature
engineering with 48 hours to go.
o I re-trained all models and completed
model stacking just a few hours before
the deadline (thanks to Domino Data
Lab).
o Top 2% finish (our best result so far).
26
Joining H2O.ai
27
A call from Sri (CEO) just before Christmas 2015 …
More Opportunities
28
• bit.ly/joe_h2o_talk1
• bit.ly/joe_h2o_talk2
• bit.ly/joe_h2o_talk3
• bit.ly/joe_h2o_talk4
• …
LondonR
PyData Amsterdam
London
Kaggle
Summary of Benefits
• Direct
o Identify data science
skills gap.
o Learn quickly from the
community.
o Expand your network.
o Prepare yourself for real-
life data challenges.
• Indirect
o You also learn non-ML
skills along the way.
o You learn to build small
data products (e.g.
graph, web app, REST
API) and help others gain
insight.
29
Big Thank You!
• University of Exeter
o Prof. Dragan Savic
• Mango Solutions
• RStudio
• Domino Data Lab
• H2O.ai
• London Kaggle Meetup
Organisers
30
1st LondonR Talk
Crime Map Shiny App
bit.ly/londonr_crimemap
2nd LondonR Talk
Domino API Endpoint
bit.ly/1cYbZbF
Any Questions?
• Contact
o joe@h2o.ai
o @matlabulous
o github.com/woobe
• Links (All Slides)
o github.com/h2oai/h2o-
meetups
• H2O in London
o Coming soon!
• Meetups
• Office
o We’re hiring!
o www.h2o.ai/careers
31

Weitere ähnliche Inhalte

Was ist angesagt?

Using H2O AutoML for Kaggle Competitions
Using H2O AutoML for Kaggle CompetitionsUsing H2O AutoML for Kaggle Competitions
Using H2O AutoML for Kaggle CompetitionsSri Ambati
 
Automatic and Interpretable Machine Learning with H2O and LIME
Automatic and Interpretable Machine Learning with H2O and LIMEAutomatic and Interpretable Machine Learning with H2O and LIME
Automatic and Interpretable Machine Learning with H2O and LIMEJo-fai Chow
 
Gradoop: Scalable Graph Analytics with Apache Flink @ Flink & Neo4j Meetup Be...
Gradoop: Scalable Graph Analytics with Apache Flink @ Flink & Neo4j Meetup Be...Gradoop: Scalable Graph Analytics with Apache Flink @ Flink & Neo4j Meetup Be...
Gradoop: Scalable Graph Analytics with Apache Flink @ Flink & Neo4j Meetup Be...Martin Junghanns
 
Information-Rich Programming in F# with Semantic Data
Information-Rich Programming in F# with Semantic DataInformation-Rich Programming in F# with Semantic Data
Information-Rich Programming in F# with Semantic DataSteffen Staab
 
Gephi, Graphx, and Giraph
Gephi, Graphx, and GiraphGephi, Graphx, and Giraph
Gephi, Graphx, and GiraphDoug Needham
 
A Comparison of Different Strategies for Automated Semantic Document Annotation
A Comparison of Different Strategies for Automated Semantic Document AnnotationA Comparison of Different Strategies for Automated Semantic Document Annotation
A Comparison of Different Strategies for Automated Semantic Document AnnotationAnsgar Scherp
 
SF Python Meetup: TextRank in Python
SF Python Meetup: TextRank in PythonSF Python Meetup: TextRank in Python
SF Python Meetup: TextRank in PythonPaco Nathan
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...Oscar Corcho
 
Gradoop: Scalable Graph Analytics with Apache Flink @ FOSDEM 2016
Gradoop: Scalable Graph Analytics with Apache Flink @ FOSDEM 2016Gradoop: Scalable Graph Analytics with Apache Flink @ FOSDEM 2016
Gradoop: Scalable Graph Analytics with Apache Flink @ FOSDEM 2016Martin Junghanns
 
Big data week 2018 - Graph Analytics on Big Data
Big data week 2018 - Graph Analytics on Big DataBig data week 2018 - Graph Analytics on Big Data
Big data week 2018 - Graph Analytics on Big DataChristos Hadjinikolis
 
Shebanq roma-2013-10-01
Shebanq roma-2013-10-01Shebanq roma-2013-10-01
Shebanq roma-2013-10-01Dirk Roorda
 
Apache Spark GraphX highlights.
Apache Spark GraphX highlights. Apache Spark GraphX highlights.
Apache Spark GraphX highlights. Doug Needham
 
RDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactRDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactJean-Paul Calbimonte
 
sparklyr - Jeff Allen
sparklyr - Jeff Allensparklyr - Jeff Allen
sparklyr - Jeff AllenSri Ambati
 
Introduction to Data Science with H2O- Mountain View
Introduction to Data Science with H2O- Mountain ViewIntroduction to Data Science with H2O- Mountain View
Introduction to Data Science with H2O- Mountain ViewSri Ambati
 
Accelerating NLP with Dask and Saturn Cloud
Accelerating NLP with Dask and Saturn CloudAccelerating NLP with Dask and Saturn Cloud
Accelerating NLP with Dask and Saturn CloudSujit Pal
 
Summary of the Stream Reasoning workshop at ISWC 2016
Summary of the Stream Reasoning workshop at ISWC 2016Summary of the Stream Reasoning workshop at ISWC 2016
Summary of the Stream Reasoning workshop at ISWC 2016Daniele Dell'Aglio
 
The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?Frank van Harmelen
 

Was ist angesagt? (20)

ISAX
ISAXISAX
ISAX
 
Using H2O AutoML for Kaggle Competitions
Using H2O AutoML for Kaggle CompetitionsUsing H2O AutoML for Kaggle Competitions
Using H2O AutoML for Kaggle Competitions
 
Shebanq gniezno
Shebanq gnieznoShebanq gniezno
Shebanq gniezno
 
Automatic and Interpretable Machine Learning with H2O and LIME
Automatic and Interpretable Machine Learning with H2O and LIMEAutomatic and Interpretable Machine Learning with H2O and LIME
Automatic and Interpretable Machine Learning with H2O and LIME
 
Gradoop: Scalable Graph Analytics with Apache Flink @ Flink & Neo4j Meetup Be...
Gradoop: Scalable Graph Analytics with Apache Flink @ Flink & Neo4j Meetup Be...Gradoop: Scalable Graph Analytics with Apache Flink @ Flink & Neo4j Meetup Be...
Gradoop: Scalable Graph Analytics with Apache Flink @ Flink & Neo4j Meetup Be...
 
Information-Rich Programming in F# with Semantic Data
Information-Rich Programming in F# with Semantic DataInformation-Rich Programming in F# with Semantic Data
Information-Rich Programming in F# with Semantic Data
 
Gephi, Graphx, and Giraph
Gephi, Graphx, and GiraphGephi, Graphx, and Giraph
Gephi, Graphx, and Giraph
 
A Comparison of Different Strategies for Automated Semantic Document Annotation
A Comparison of Different Strategies for Automated Semantic Document AnnotationA Comparison of Different Strategies for Automated Semantic Document Annotation
A Comparison of Different Strategies for Automated Semantic Document Annotation
 
SF Python Meetup: TextRank in Python
SF Python Meetup: TextRank in PythonSF Python Meetup: TextRank in Python
SF Python Meetup: TextRank in Python
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
 
Gradoop: Scalable Graph Analytics with Apache Flink @ FOSDEM 2016
Gradoop: Scalable Graph Analytics with Apache Flink @ FOSDEM 2016Gradoop: Scalable Graph Analytics with Apache Flink @ FOSDEM 2016
Gradoop: Scalable Graph Analytics with Apache Flink @ FOSDEM 2016
 
Big data week 2018 - Graph Analytics on Big Data
Big data week 2018 - Graph Analytics on Big DataBig data week 2018 - Graph Analytics on Big Data
Big data week 2018 - Graph Analytics on Big Data
 
Shebanq roma-2013-10-01
Shebanq roma-2013-10-01Shebanq roma-2013-10-01
Shebanq roma-2013-10-01
 
Apache Spark GraphX highlights.
Apache Spark GraphX highlights. Apache Spark GraphX highlights.
Apache Spark GraphX highlights.
 
RDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactRDF Stream Processing: Let's React
RDF Stream Processing: Let's React
 
sparklyr - Jeff Allen
sparklyr - Jeff Allensparklyr - Jeff Allen
sparklyr - Jeff Allen
 
Introduction to Data Science with H2O- Mountain View
Introduction to Data Science with H2O- Mountain ViewIntroduction to Data Science with H2O- Mountain View
Introduction to Data Science with H2O- Mountain View
 
Accelerating NLP with Dask and Saturn Cloud
Accelerating NLP with Dask and Saturn CloudAccelerating NLP with Dask and Saturn Cloud
Accelerating NLP with Dask and Saturn Cloud
 
Summary of the Stream Reasoning workshop at ISWC 2016
Summary of the Stream Reasoning workshop at ISWC 2016Summary of the Stream Reasoning workshop at ISWC 2016
Summary of the Stream Reasoning workshop at ISWC 2016
 
The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?
 

Andere mochten auch

Ilu3 DU RIBEIRO - Cristielen Souza
Ilu3 DU RIBEIRO - Cristielen SouzaIlu3 DU RIBEIRO - Cristielen Souza
Ilu3 DU RIBEIRO - Cristielen SouzaCristielen Souza
 
Caffe + H2O - By Cyprien noel
Caffe + H2O - By Cyprien noelCaffe + H2O - By Cyprien noel
Caffe + H2O - By Cyprien noelSri Ambati
 
maxillomandibular relations - terminology
maxillomandibular relations - terminologymaxillomandibular relations - terminology
maxillomandibular relations - terminologyshammasm
 
World wind java sdk in progess
World wind java sdk in progessWorld wind java sdk in progess
World wind java sdk in progessRaffaele de Amicis
 
Fotografia Editorial - Primeiras Fotografias de Moda
Fotografia Editorial - Primeiras Fotografias de ModaFotografia Editorial - Primeiras Fotografias de Moda
Fotografia Editorial - Primeiras Fotografias de ModaAmanda Petry
 
H2O Deep Water - Making Deep Learning Accessible to Everyone
H2O Deep Water - Making Deep Learning Accessible to EveryoneH2O Deep Water - Making Deep Learning Accessible to Everyone
H2O Deep Water - Making Deep Learning Accessible to EveryoneJo-fai Chow
 
Cybersecurity with AI - Ashrith Barthur
Cybersecurity with AI - Ashrith BarthurCybersecurity with AI - Ashrith Barthur
Cybersecurity with AI - Ashrith BarthurSri Ambati
 
Changing Dynamics: Driving Direct Materials Sourcing and Supplier Collaboration
Changing Dynamics: Driving Direct Materials Sourcing and Supplier CollaborationChanging Dynamics: Driving Direct Materials Sourcing and Supplier Collaboration
Changing Dynamics: Driving Direct Materials Sourcing and Supplier CollaborationSAP Ariba
 
Cybersecurity with AI - Ashrith Barthur
Cybersecurity with AI - Ashrith BarthurCybersecurity with AI - Ashrith Barthur
Cybersecurity with AI - Ashrith BarthurSri Ambati
 
Gypsum products /certified fixed orthodontic courses by Indian dental academy
Gypsum products  /certified fixed orthodontic courses by Indian dental academy Gypsum products  /certified fixed orthodontic courses by Indian dental academy
Gypsum products /certified fixed orthodontic courses by Indian dental academy Indian dental academy
 
H2O Big Join Slides
H2O Big Join SlidesH2O Big Join Slides
H2O Big Join SlidesSri Ambati
 
H2O Deep Water - Making Deep Learning Accessible to Everyone
H2O Deep Water - Making Deep Learning Accessible to EveryoneH2O Deep Water - Making Deep Learning Accessible to Everyone
H2O Deep Water - Making Deep Learning Accessible to EveryoneJo-fai Chow
 
Converting Your Legacy Data to S1000D
Converting Your Legacy Data to S1000DConverting Your Legacy Data to S1000D
Converting Your Legacy Data to S1000Ddclsocialmedia
 
Embracing Reactive Streams with Java 9 and Spring 5
Embracing Reactive Streams with Java 9 and Spring 5Embracing Reactive Streams with Java 9 and Spring 5
Embracing Reactive Streams with Java 9 and Spring 5Wilder Rodrigues
 
Best Practices: Cutting Through the Confusion & Avoiding the Pitfalls of Crea...
Best Practices: Cutting Through the Confusion & Avoiding the Pitfalls of Crea...Best Practices: Cutting Through the Confusion & Avoiding the Pitfalls of Crea...
Best Practices: Cutting Through the Confusion & Avoiding the Pitfalls of Crea...dclsocialmedia
 
Mandibular movements /cosmetic dentistry courses
Mandibular movements /cosmetic dentistry coursesMandibular movements /cosmetic dentistry courses
Mandibular movements /cosmetic dentistry coursesIndian dental academy
 
Using Supply Chain Collaboration to Drive Visibility into Supply
Using Supply Chain Collaboration to Drive Visibility into SupplyUsing Supply Chain Collaboration to Drive Visibility into Supply
Using Supply Chain Collaboration to Drive Visibility into SupplySAP Ariba
 
Energía y sostenibilidad
Energía y sostenibilidadEnergía y sostenibilidad
Energía y sostenibilidadGiuliana Lauck
 

Andere mochten auch (20)

Ilu3 DU RIBEIRO - Cristielen Souza
Ilu3 DU RIBEIRO - Cristielen SouzaIlu3 DU RIBEIRO - Cristielen Souza
Ilu3 DU RIBEIRO - Cristielen Souza
 
Caffe + H2O - By Cyprien noel
Caffe + H2O - By Cyprien noelCaffe + H2O - By Cyprien noel
Caffe + H2O - By Cyprien noel
 
Unidad 1 energías renovables
Unidad 1 energías renovablesUnidad 1 energías renovables
Unidad 1 energías renovables
 
maxillomandibular relations - terminology
maxillomandibular relations - terminologymaxillomandibular relations - terminology
maxillomandibular relations - terminology
 
World wind java sdk in progess
World wind java sdk in progessWorld wind java sdk in progess
World wind java sdk in progess
 
Fotografia Editorial - Primeiras Fotografias de Moda
Fotografia Editorial - Primeiras Fotografias de ModaFotografia Editorial - Primeiras Fotografias de Moda
Fotografia Editorial - Primeiras Fotografias de Moda
 
H2O Deep Water - Making Deep Learning Accessible to Everyone
H2O Deep Water - Making Deep Learning Accessible to EveryoneH2O Deep Water - Making Deep Learning Accessible to Everyone
H2O Deep Water - Making Deep Learning Accessible to Everyone
 
Cybersecurity with AI - Ashrith Barthur
Cybersecurity with AI - Ashrith BarthurCybersecurity with AI - Ashrith Barthur
Cybersecurity with AI - Ashrith Barthur
 
Changing Dynamics: Driving Direct Materials Sourcing and Supplier Collaboration
Changing Dynamics: Driving Direct Materials Sourcing and Supplier CollaborationChanging Dynamics: Driving Direct Materials Sourcing and Supplier Collaboration
Changing Dynamics: Driving Direct Materials Sourcing and Supplier Collaboration
 
Cybersecurity with AI - Ashrith Barthur
Cybersecurity with AI - Ashrith BarthurCybersecurity with AI - Ashrith Barthur
Cybersecurity with AI - Ashrith Barthur
 
Gypsum products /certified fixed orthodontic courses by Indian dental academy
Gypsum products  /certified fixed orthodontic courses by Indian dental academy Gypsum products  /certified fixed orthodontic courses by Indian dental academy
Gypsum products /certified fixed orthodontic courses by Indian dental academy
 
H2O Big Join Slides
H2O Big Join SlidesH2O Big Join Slides
H2O Big Join Slides
 
H2O Deep Water - Making Deep Learning Accessible to Everyone
H2O Deep Water - Making Deep Learning Accessible to EveryoneH2O Deep Water - Making Deep Learning Accessible to Everyone
H2O Deep Water - Making Deep Learning Accessible to Everyone
 
Converting Your Legacy Data to S1000D
Converting Your Legacy Data to S1000DConverting Your Legacy Data to S1000D
Converting Your Legacy Data to S1000D
 
Embracing Reactive Streams with Java 9 and Spring 5
Embracing Reactive Streams with Java 9 and Spring 5Embracing Reactive Streams with Java 9 and Spring 5
Embracing Reactive Streams with Java 9 and Spring 5
 
Best Practices: Cutting Through the Confusion & Avoiding the Pitfalls of Crea...
Best Practices: Cutting Through the Confusion & Avoiding the Pitfalls of Crea...Best Practices: Cutting Through the Confusion & Avoiding the Pitfalls of Crea...
Best Practices: Cutting Through the Confusion & Avoiding the Pitfalls of Crea...
 
Mandibular movements /cosmetic dentistry courses
Mandibular movements /cosmetic dentistry coursesMandibular movements /cosmetic dentistry courses
Mandibular movements /cosmetic dentistry courses
 
Using Supply Chain Collaboration to Drive Visibility into Supply
Using Supply Chain Collaboration to Drive Visibility into SupplyUsing Supply Chain Collaboration to Drive Visibility into Supply
Using Supply Chain Collaboration to Drive Visibility into Supply
 
16.occlusal schemes lingualized occlusion
16.occlusal schemes   lingualized occlusion16.occlusal schemes   lingualized occlusion
16.occlusal schemes lingualized occlusion
 
Energía y sostenibilidad
Energía y sostenibilidadEnergía y sostenibilidad
Energía y sostenibilidad
 

Ähnlich wie Kaggle Competitions, New Friends, New Skills and New Opportunities

From Kaggle to H2O - The True Story of a Civil Engineer Turned Data Geek
From Kaggle to H2O - The True Story of a Civil Engineer Turned Data GeekFrom Kaggle to H2O - The True Story of a Civil Engineer Turned Data Geek
From Kaggle to H2O - The True Story of a Civil Engineer Turned Data GeekJo-fai Chow
 
Introduction to Machine Learning with H2O - Jo-Fai (Joe) Chow, H2O
Introduction to Machine Learning with H2O - Jo-Fai (Joe) Chow, H2OIntroduction to Machine Learning with H2O - Jo-Fai (Joe) Chow, H2O
Introduction to Machine Learning with H2O - Jo-Fai (Joe) Chow, H2OData Science Milan
 
Beginner android
Beginner androidBeginner android
Beginner androidSmriti Das
 
What you did last summer?
What you did last summer?What you did last summer?
What you did last summer?DoThinger
 
Testing In Production (TiP) Advances with Big Data and the Cloud
Testing In Production (TiP) Advances with Big Data and the CloudTesting In Production (TiP) Advances with Big Data and the Cloud
Testing In Production (TiP) Advances with Big Data and the CloudSOASTA
 
Predict the oscars with data science
Predict the oscars with data sciencePredict the oscars with data science
Predict the oscars with data scienceJustin Ezor
 
Digital Fabrication Studio 0.3 Introduction
Digital Fabrication Studio 0.3 IntroductionDigital Fabrication Studio 0.3 Introduction
Digital Fabrication Studio 0.3 IntroductionMassimo Menichinelli
 
Feedback Loops in Practice
Feedback Loops in PracticeFeedback Loops in Practice
Feedback Loops in PracticeMarcus Denker
 
E2D3 introduction
E2D3 introductionE2D3 introduction
E2D3 introductionE2D3
 
Digital Fabrication Studio v.0.2: Introduction
Digital Fabrication Studio v.0.2: IntroductionDigital Fabrication Studio v.0.2: Introduction
Digital Fabrication Studio v.0.2: IntroductionMassimo Menichinelli
 
DataOps - Lean principles and lean practices
DataOps - Lean principles and lean practicesDataOps - Lean principles and lean practices
DataOps - Lean principles and lean practicesLars Albertsson
 
2018 23-03 purdue-unal kick-off-final-final
2018 23-03 purdue-unal kick-off-final-final2018 23-03 purdue-unal kick-off-final-final
2018 23-03 purdue-unal kick-off-final-finalPurdue RCODI
 
Staying Competitive in Data Analytics: Analyze Boulder 20140903
Staying Competitive in Data Analytics: Analyze Boulder 20140903Staying Competitive in Data Analytics: Analyze Boulder 20140903
Staying Competitive in Data Analytics: Analyze Boulder 20140903Richard Hackathorn
 
How to get into Kaggle? by Philipp Singer and Dmitry Gordeev
How to get into Kaggle? by Philipp Singer and Dmitry GordeevHow to get into Kaggle? by Philipp Singer and Dmitry Gordeev
How to get into Kaggle? by Philipp Singer and Dmitry GordeevVienna Data Science Group
 
Summer 2014 Use of media learn360 in the classroom
Summer 2014 Use of media learn360 in the classroomSummer 2014 Use of media learn360 in the classroom
Summer 2014 Use of media learn360 in the classroomClairvoy
 
How to jump into Data Science
How to jump into Data ScienceHow to jump into Data Science
How to jump into Data ScienceYury Kashnitsky
 
Games Industry Analytics Forum 2 - Plumbee
Games Industry Analytics Forum 2 - PlumbeeGames Industry Analytics Forum 2 - Plumbee
Games Industry Analytics Forum 2 - PlumbeeGIAF
 
Data Driven: The Ancestry.com Journey to Self-Service Analytics
Data Driven: The Ancestry.com Journey to Self-Service AnalyticsData Driven: The Ancestry.com Journey to Self-Service Analytics
Data Driven: The Ancestry.com Journey to Self-Service AnalyticsWilliam Yetman
 

Ähnlich wie Kaggle Competitions, New Friends, New Skills and New Opportunities (20)

From Kaggle to H2O - The True Story of a Civil Engineer Turned Data Geek
From Kaggle to H2O - The True Story of a Civil Engineer Turned Data GeekFrom Kaggle to H2O - The True Story of a Civil Engineer Turned Data Geek
From Kaggle to H2O - The True Story of a Civil Engineer Turned Data Geek
 
Introduction to Machine Learning with H2O - Jo-Fai (Joe) Chow, H2O
Introduction to Machine Learning with H2O - Jo-Fai (Joe) Chow, H2OIntroduction to Machine Learning with H2O - Jo-Fai (Joe) Chow, H2O
Introduction to Machine Learning with H2O - Jo-Fai (Joe) Chow, H2O
 
Agile Data Science
Agile Data ScienceAgile Data Science
Agile Data Science
 
Beginner android
Beginner androidBeginner android
Beginner android
 
What you did last summer?
What you did last summer?What you did last summer?
What you did last summer?
 
Testing In Production (TiP) Advances with Big Data and the Cloud
Testing In Production (TiP) Advances with Big Data and the CloudTesting In Production (TiP) Advances with Big Data and the Cloud
Testing In Production (TiP) Advances with Big Data and the Cloud
 
Predict the oscars with data science
Predict the oscars with data sciencePredict the oscars with data science
Predict the oscars with data science
 
Digital Fabrication Studio 0.3 Introduction
Digital Fabrication Studio 0.3 IntroductionDigital Fabrication Studio 0.3 Introduction
Digital Fabrication Studio 0.3 Introduction
 
Feedback Loops in Practice
Feedback Loops in PracticeFeedback Loops in Practice
Feedback Loops in Practice
 
E2D3 introduction
E2D3 introductionE2D3 introduction
E2D3 introduction
 
Digital Fabrication Studio v.0.2: Introduction
Digital Fabrication Studio v.0.2: IntroductionDigital Fabrication Studio v.0.2: Introduction
Digital Fabrication Studio v.0.2: Introduction
 
DataOps - Lean principles and lean practices
DataOps - Lean principles and lean practicesDataOps - Lean principles and lean practices
DataOps - Lean principles and lean practices
 
2018 23-03 purdue-unal kick-off-final-final
2018 23-03 purdue-unal kick-off-final-final2018 23-03 purdue-unal kick-off-final-final
2018 23-03 purdue-unal kick-off-final-final
 
Impact of Open Source
Impact of Open SourceImpact of Open Source
Impact of Open Source
 
Staying Competitive in Data Analytics: Analyze Boulder 20140903
Staying Competitive in Data Analytics: Analyze Boulder 20140903Staying Competitive in Data Analytics: Analyze Boulder 20140903
Staying Competitive in Data Analytics: Analyze Boulder 20140903
 
How to get into Kaggle? by Philipp Singer and Dmitry Gordeev
How to get into Kaggle? by Philipp Singer and Dmitry GordeevHow to get into Kaggle? by Philipp Singer and Dmitry Gordeev
How to get into Kaggle? by Philipp Singer and Dmitry Gordeev
 
Summer 2014 Use of media learn360 in the classroom
Summer 2014 Use of media learn360 in the classroomSummer 2014 Use of media learn360 in the classroom
Summer 2014 Use of media learn360 in the classroom
 
How to jump into Data Science
How to jump into Data ScienceHow to jump into Data Science
How to jump into Data Science
 
Games Industry Analytics Forum 2 - Plumbee
Games Industry Analytics Forum 2 - PlumbeeGames Industry Analytics Forum 2 - Plumbee
Games Industry Analytics Forum 2 - Plumbee
 
Data Driven: The Ancestry.com Journey to Self-Service Analytics
Data Driven: The Ancestry.com Journey to Self-Service AnalyticsData Driven: The Ancestry.com Journey to Self-Service Analytics
Data Driven: The Ancestry.com Journey to Self-Service Analytics
 

Mehr von Jo-fai Chow

Making Multimillion-Dollar Baseball Decisions with H2O AutoML, LIME and Shiny
Making Multimillion-Dollar Baseball Decisions with H2O AutoML, LIME and ShinyMaking Multimillion-Dollar Baseball Decisions with H2O AutoML, LIME and Shiny
Making Multimillion-Dollar Baseball Decisions with H2O AutoML, LIME and ShinyJo-fai Chow
 
Automatic and Interpretable Machine Learning in R with H2O and LIME
Automatic and Interpretable Machine Learning in R with H2O and LIMEAutomatic and Interpretable Machine Learning in R with H2O and LIME
Automatic and Interpretable Machine Learning in R with H2O and LIMEJo-fai Chow
 
H2O at Berlin R Meetup
H2O at Berlin R MeetupH2O at Berlin R Meetup
H2O at Berlin R MeetupJo-fai Chow
 
H2O at BelgradeR Meetup
H2O at BelgradeR MeetupH2O at BelgradeR Meetup
H2O at BelgradeR MeetupJo-fai Chow
 
Introduction to H2O and Model Stacking Use Cases
Introduction to H2O and Model Stacking Use CasesIntroduction to H2O and Model Stacking Use Cases
Introduction to H2O and Model Stacking Use CasesJo-fai Chow
 
H2O at Poznan R Meetup
H2O at Poznan R MeetupH2O at Poznan R Meetup
H2O at Poznan R MeetupJo-fai Chow
 
Introduction to Machine Learning with H2O and Python
Introduction to Machine Learning with H2O and PythonIntroduction to Machine Learning with H2O and Python
Introduction to Machine Learning with H2O and PythonJo-fai Chow
 
Introduction to Machine Learning with H2O and Python
Introduction to Machine Learning with H2O and PythonIntroduction to Machine Learning with H2O and Python
Introduction to Machine Learning with H2O and PythonJo-fai Chow
 
A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...
A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...
A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...Jo-fai Chow
 
Designing Sustainable Drainage Systems
Designing Sustainable Drainage SystemsDesigning Sustainable Drainage Systems
Designing Sustainable Drainage SystemsJo-fai Chow
 
Developing a New Decision Support System for SuDS
Developing a New Decision Support System for SuDSDeveloping a New Decision Support System for SuDS
Developing a New Decision Support System for SuDSJo-fai Chow
 
Udacity Statement (Introduction to Statistics, August 2012)
Udacity Statement (Introduction to Statistics, August 2012)Udacity Statement (Introduction to Statistics, August 2012)
Udacity Statement (Introduction to Statistics, August 2012)Jo-fai Chow
 
Coursera Statement (Computational Investing, Part I,
Coursera Statement (Computational Investing, Part I, Coursera Statement (Computational Investing, Part I,
Coursera Statement (Computational Investing, Part I, Jo-fai Chow
 
Coursera Statement (Computing for Data Analysis, Oct 2013)
Coursera Statement (Computing for Data Analysis, Oct 2013)Coursera Statement (Computing for Data Analysis, Oct 2013)
Coursera Statement (Computing for Data Analysis, Oct 2013)Jo-fai Chow
 
Coursera Statement (Data Analysis, Mar 2013)
Coursera Statement (Data Analysis, Mar 2013)Coursera Statement (Data Analysis, Mar 2013)
Coursera Statement (Data Analysis, Mar 2013)Jo-fai Chow
 
A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...
A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...
A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...Jo-fai Chow
 

Mehr von Jo-fai Chow (16)

Making Multimillion-Dollar Baseball Decisions with H2O AutoML, LIME and Shiny
Making Multimillion-Dollar Baseball Decisions with H2O AutoML, LIME and ShinyMaking Multimillion-Dollar Baseball Decisions with H2O AutoML, LIME and Shiny
Making Multimillion-Dollar Baseball Decisions with H2O AutoML, LIME and Shiny
 
Automatic and Interpretable Machine Learning in R with H2O and LIME
Automatic and Interpretable Machine Learning in R with H2O and LIMEAutomatic and Interpretable Machine Learning in R with H2O and LIME
Automatic and Interpretable Machine Learning in R with H2O and LIME
 
H2O at Berlin R Meetup
H2O at Berlin R MeetupH2O at Berlin R Meetup
H2O at Berlin R Meetup
 
H2O at BelgradeR Meetup
H2O at BelgradeR MeetupH2O at BelgradeR Meetup
H2O at BelgradeR Meetup
 
Introduction to H2O and Model Stacking Use Cases
Introduction to H2O and Model Stacking Use CasesIntroduction to H2O and Model Stacking Use Cases
Introduction to H2O and Model Stacking Use Cases
 
H2O at Poznan R Meetup
H2O at Poznan R MeetupH2O at Poznan R Meetup
H2O at Poznan R Meetup
 
Introduction to Machine Learning with H2O and Python
Introduction to Machine Learning with H2O and PythonIntroduction to Machine Learning with H2O and Python
Introduction to Machine Learning with H2O and Python
 
Introduction to Machine Learning with H2O and Python
Introduction to Machine Learning with H2O and PythonIntroduction to Machine Learning with H2O and Python
Introduction to Machine Learning with H2O and Python
 
A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...
A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...
A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...
 
Designing Sustainable Drainage Systems
Designing Sustainable Drainage SystemsDesigning Sustainable Drainage Systems
Designing Sustainable Drainage Systems
 
Developing a New Decision Support System for SuDS
Developing a New Decision Support System for SuDSDeveloping a New Decision Support System for SuDS
Developing a New Decision Support System for SuDS
 
Udacity Statement (Introduction to Statistics, August 2012)
Udacity Statement (Introduction to Statistics, August 2012)Udacity Statement (Introduction to Statistics, August 2012)
Udacity Statement (Introduction to Statistics, August 2012)
 
Coursera Statement (Computational Investing, Part I,
Coursera Statement (Computational Investing, Part I, Coursera Statement (Computational Investing, Part I,
Coursera Statement (Computational Investing, Part I,
 
Coursera Statement (Computing for Data Analysis, Oct 2013)
Coursera Statement (Computing for Data Analysis, Oct 2013)Coursera Statement (Computing for Data Analysis, Oct 2013)
Coursera Statement (Computing for Data Analysis, Oct 2013)
 
Coursera Statement (Data Analysis, Mar 2013)
Coursera Statement (Data Analysis, Mar 2013)Coursera Statement (Data Analysis, Mar 2013)
Coursera Statement (Data Analysis, Mar 2013)
 
A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...
A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...
A Systematic, Multi-Criteria Decision Support Framework for Sustainable Drain...
 

Kürzlich hochgeladen

BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 

Kürzlich hochgeladen (20)

Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 

Kaggle Competitions, New Friends, New Skills and New Opportunities

  • 1. Kaggle Competitions, New Friends, New Skills and New Opportunities Jo-fai (Joe) Chow Data Scientist joe@h2o.ai @matlabulus Version 2 – Data Science Exeter Meetup
  • 2. Civil Engineer → Data Scientist • 2005 - 2015 • Water Engineer o Consultant for Utilities • SEAMS (Sheffield) o EngD Research • University of Exeter • XP Solutions (Newbury) • 2015 - Present • Data Scientist o UK Telecom • Virgin Media o Silicon Valley • Domino Data Lab • H2O.ai 2
  • 4. About This Talk • What happened o Things I did since I started participating in Kaggle competitions. o New opportunities – results of new skills and friends. 4
  • 5. First MOOC Experience • One of the first Massive Open Online Courses. o Met some new friends. o Decided to collaborate for fun. o “How about Kaggle?” o “What is Kaggle?” 5
  • 6. About Kaggle • World’s biggest predictive modelling competition platform • 560k members • Competition types: o Featured (prize) o Recruitment o Playground o 101 6
  • 7. First Kaggle Experience • First time in my life o Supervised learning • Random Forest • Support Vector Machine • Neural Networks o Train, Validate & Predict. o “Is it black magic?” 7
  • 8. First Kaggle Experience • Problems o “Hey Joe, you are a nice guy but we can’t work together.” o “You love MATLAB so much. You even call yourself @matlabulous on twitter!” o “We prefer R/Python.” • Results o I kept using MATLAB o Lone wolf o No collaboration  8
  • 9. Identifying Skills Gap • Obvious skills gap: o Open-source programming langauges o Machine learning techniques o Collaboration • Kind of related o Data visualisation o Handling large datasets o Explaining results • That competition was a good wake up call. 9
  • 10. From MATLAB to R/Python MATLAB Python R Neural Networks ✔️ ✔️ ✔️ Random Forest ✔️ ✔️ ✔️ SVM ✔️ ✔️ ✔️ Other Machine Learning Libraries Toolboxes (commercial + open source) Scikit-learn and many more CRAN, GitHub (A LOT!) Data Visualisation I wasn’t good at it anyway … Matplotlib (plus a lot more since then) ggplot2 (WOW!) (plus a lot more since then) 10
  • 11. What can people do with R? 11 James Cheshire, UCL Link Paul Butler, Facebook Link
  • 12. Filling the Skills Gap • More MOOC o Machine Learning • Andrew Ng (Coursera) o Data Analysis • Jeff Leek (Coursera) • R o Intro to Programming • Dave Evans (Udacity) • Python • Things I also picked up: o Linux (Ubuntu) o Git o Cloud computing o HTML / CSS 12
  • 13. Learning from other Kagglers • Continuous learning o Kaggle’s forums and blogs. o New tools and tricks. o Many things you cannot learn from school. o I am standing on the shoulders of many Kagglers. 13
  • 14. Side Project 1 – Crime Data Viz shiny::runGitHub("rApps", "woobe", subdir = "crimemap") 14 http://insidebigdata.com/2013/11/30/ visualization-week-crimemap/ Before I knew it …Using R + crime data from data.gov.uk
  • 15. Side Project 2 – Data Viz Contest 15 https://github.com/woobe/rugsmaps While I was obsessed with making maps … http://blog.revolutionanalytics.com/2014/08/ winner-for-revolution-analytics-user-group-map-contest.html
  • 16. Side Project 3 – Colour Palette 16 I am also obsessed with colours … https://github.com/woobe/rPlotter http://blog.revolutionanalytics.com/ 2015/03/color-extraction-with-r.html #TheDress
  • 17. Side Project 4 – World Cup 2014 • World Cup 2014 Correct Score Prediction o ML vs. my friends o 10 out of 64 (15.6%) o Friends’ avg. = 4 (6.3%) o github.com/ woobe/wc2014 • Euro 2016 o Collecting data right now o github.com/woobe/ euro2016 17
  • 18. Open Up Myself • Before Kaggle/MOOC o I was drawing a circle around myself. o Fear of change. o Domain-specific problem solving. • After Kaggle/MOOC o Data-driven approach. o Not a subject matter expert? No worries  o Free to try new tools, to learn and to create. 18
  • 19. New Opportunities • LondonR o First presentation outside water industry / academia. o Very positive feedback. o Led to other projects. o bit.ly /londonr_crimemap 19
  • 20. New Opportunities • useR! 2014 (UCLA) o Presented a poster. o Met new friends. o Life-changing event. o github.com/ woobe/useR_2014 20
  • 21. New Friends 21 Ramnath Vaidyanathan htmlwidgets DataRobot Nick @ DominoData Lab H2O.ai & John Chambers! rOpenSci RStudio Matt Dowle data.table (also at H2O.ai)
  • 22. More Opportunities • First blog post about H2O o Things to try after useR! – Part1: Deep Learning with H2O 22
  • 23. More Opportunities • Blog post about Domino and H2O o I did it for fun. I did not have any expectation. o It helped attract customers to both Domino and H2O. 23
  • 24. Becoming a Data Scientist 24 The leap of faith …
  • 25. London Kagglers Assemble • London Kaggle Meetup o Sep 2015 o I met my Kaggle buddy Mickael Le Gal o He is a product data scientist at Tictrac 25 Mickael Joe
  • 26. London Kagglers Assemble • Rossmann Store Sales o We got stuck at top 10% for a long period. o Mickael had a breakthrough in feature engineering with 48 hours to go. o I re-trained all models and completed model stacking just a few hours before the deadline (thanks to Domino Data Lab). o Top 2% finish (our best result so far). 26
  • 27. Joining H2O.ai 27 A call from Sri (CEO) just before Christmas 2015 …
  • 28. More Opportunities 28 • bit.ly/joe_h2o_talk1 • bit.ly/joe_h2o_talk2 • bit.ly/joe_h2o_talk3 • bit.ly/joe_h2o_talk4 • … LondonR PyData Amsterdam London Kaggle
  • 29. Summary of Benefits • Direct o Identify data science skills gap. o Learn quickly from the community. o Expand your network. o Prepare yourself for real- life data challenges. • Indirect o You also learn non-ML skills along the way. o You learn to build small data products (e.g. graph, web app, REST API) and help others gain insight. 29
  • 30. Big Thank You! • University of Exeter o Prof. Dragan Savic • Mango Solutions • RStudio • Domino Data Lab • H2O.ai • London Kaggle Meetup Organisers 30 1st LondonR Talk Crime Map Shiny App bit.ly/londonr_crimemap 2nd LondonR Talk Domino API Endpoint bit.ly/1cYbZbF
  • 31. Any Questions? • Contact o joe@h2o.ai o @matlabulous o github.com/woobe • Links (All Slides) o github.com/h2oai/h2o- meetups • H2O in London o Coming soon! • Meetups • Office o We’re hiring! o www.h2o.ai/careers 31

Hinweis der Redaktion

  1. s
  2. All slides and code available online – sit back and relax, remember you’re here today for a good cause, care about shelter animals