SlideShare ist ein Scribd-Unternehmen logo
1 von 21
Downloaden Sie, um offline zu lesen
Calling	
  Your	
  Shots	
  with	
  Data
How	
  to	
  Ask	
  Smarter	
  Questions	
  to	
  Make	
  
Better	
  Business	
  Decisions	
  
November	
  9,	
  2015
Jessica	
  LanfordChen	
  Huang
Need	
  a	
  handy	
  conference	
  guide?
Download	
  our	
  app,	
  “H2O	
  World	
  2015”
What	
  do	
  these	
  stickers	
  mean?
I have H2O
Installed
I have Python
installed
I have R
installed
I have the H2O
World data
sets
Pick	
  up	
  stickers	
  or	
  get	
  install	
  help	
  at	
  the	
  
information	
  booth
Agenda
• Introduction	
  
• Business	
  Decision	
  Process	
  	
  
• Tools	
  &	
  Resources	
  
• Bridging	
  the	
  Communications	
  Gap	
  
• Q&A
Business	
  Decision	
  Process
Make	
  data-­‐
informed	
  
business	
  
decisions
Ask	
  business	
  
questions
Define	
  
business	
  
problems
Analysis	
  
Process
Motivators	
  that	
  Influence	
  Business	
  Decisions
Business	
  decisions	
  will	
  
impact:	
  
Customer	
  product	
  
interaction	
  
Customer	
  and	
  company	
  
engagement	
  
Product	
  development	
  
???	
  
1
2
3
4
Your%
customers%
Your%products%
/%services%/%
offerings
Your%business
1
2 3
4
Asking	
  the	
  “Right”	
  Questions
The	
  answers	
  to	
  the	
  business	
  questions	
  will	
  ultimately	
  
provide	
  business	
  context	
  for	
  the	
  “Analysis	
  Process”.	
  
Make	
  data-­‐
informed	
  
business	
  
decisions
Ask	
  business	
  
questions
Define	
  
business	
  
problems
Analysis	
  
Process
Data$
Scientist
Analysis	
  Process
Analysis(Process
Analyze(Data(&(Find(Insights
1.(Frame(the(
question
2.(Collect(raw(data
3.(Prepare(and(
explore(data
4.(Develop(model
5.(Evaluate,(
validate,(and(
interpret(results
6.(Communicate(
and(visualize(
results
Discussion(to(reach(
agreement(in(problem(
statement
Translate(business(questions(
and(context(into(a(problem(
statement
Complete	
  Business	
  Decision	
  Process
Analysis(Process
Analyze(Data(&(Find(Insights
Define(business(problems Ask(business(questions
Make(data?Informed(
business(decisions
1.(Frame(the(
question
2.(Collect(raw(data
3.(Prepare(and(
explore(data
4.(Develop(model
5.(Evaluate,(
validate,(and(
interpret(results
6.(Communicate(
and(visualize(
results
Business(
decision(
maker
Data(
Scientist
Discussion(to(reach(
agreement(in(problem(
statement
Translate(business(questions(
and(context(into(a(problem(
statement
Tools	
  and	
  Resources
Data$Science$Team
R,$Python,$Scala,$
Java,$CoffeeScript /$
JavaScript,$SQL,$Julia
Languages
Jupyter$(IPython),$
H2O$Flow,$…$
Notebooks$+$IDEs
Skillsets:
• Domain$knowledge
• Math$and$statistics
• Programming$ skills
• Databases
• Machine$learning
• Communication$and$visualization
Bridging	
  the	
  Communication	
  Gap
What	
  is	
  Machine	
  Learning?	
  
• Machine	
  reads	
  the	
  data,	
  learns	
  from	
  the	
  data,	
  
uses	
  it	
  to	
  make	
  predictions	
  
• Can	
  show	
  you	
  correlation	
  but	
  not	
  necessarily	
  
causation	
  
• Can	
  find	
  relationships	
  and	
  patterns	
  within	
  
volumes	
  of	
  data	
  that	
  the	
  human	
  mind	
  is	
  
incapable	
  of	
  processing
Note:	
  There	
  is	
  no	
  “right”	
  or	
  “best”	
  model	
  that	
  a	
  data	
  
scientist	
  can	
  use.	
  The	
  model	
  used	
  is	
  dependent	
  on	
  the	
  data,	
  
problem,	
  and	
  the	
  data	
  scientist.
Supervised	
  Learning
Business	
  Applications:
• Classification	
  
• Twitter	
  sentiments:	
  Rant	
  -­‐>	
  
negative,	
  Rave	
  -­‐>	
  positive	
  
• Coffee	
  vs.	
  tea	
  vs.	
  soda	
  drinker	
  
• Recommender	
  systems	
  
• Netflix’s	
  “More	
  Like	
  This”	
  
• Amazon’s	
  “Customers	
  Who	
  
Bought	
  This	
  Item	
  Also	
  
Bought”	
  
• Fraud	
  detection	
  
• Authorizing	
  transactions
• Known	
  right	
  answer,	
  using	
  
model	
  to	
  verify	
  
• Algorithm	
  tries	
  to	
  predict	
  
results	
  
• Based	
  on	
  its	
  training	
  data,	
  the	
  
program	
  can	
  make	
  accurate	
  
decisions	
  when	
  given	
  new	
  data	
  
• Examples	
  of	
  algorithms	
  and	
  
models:	
  GLM,	
  DRF,	
  GBM,	
  Deep	
  
Learning
Data	
  Science	
  Concept:
Unsupervised	
  Learning
Business	
  Applications:
• Anomaly	
  detection	
  
• outliers:	
  detecting	
  
irregular	
  heartbeats	
  
• computer	
  security	
  with	
  
unauthorized	
  access	
  	
  
• Clustering	
  	
  
• Grouping	
  users	
  by	
  salary	
  
• Grouping	
  users	
  by	
  
behavior
• No	
  “known”	
  answer,	
  using	
  
algorithms	
  to	
  determine	
  
answer	
  
• Algorithm	
  tries	
  to	
  identify	
  
patterns	
  in	
  the	
  data	
  
• General	
  understanding	
  of	
  
input	
  data	
  where	
  no	
  
prediction	
  is	
  needed	
  
• Examples	
  of	
  algorithms	
  and	
  
models:	
  K-­‐means,	
  PCA
Data	
  Science	
  Concept:
Classification	
  (Supervised)
Business	
  Applications:
• Will	
  customers	
  upgrade	
  to	
  new	
  
software?	
  	
  
• What	
  age	
  groups	
  tested	
  well	
  for	
  
this	
  new	
  TV	
  show?	
  (marketing	
  
campaigns)	
  
• Nigerian	
  419	
  (spam	
  
classification)	
  
• Will	
  the	
  real	
  Barack	
  Obama	
  
please	
  stand	
  up?	
  (fraud	
  
detection)
• Classification	
  is	
  the	
  process	
  of	
  
taking	
  an	
  input	
  and	
  assigning	
  a	
  
label	
  to	
  it.	
  
• The	
  labels	
  could	
  be	
  binomial	
  
(Yes,	
  No)	
  or	
  multinomial	
  (High,	
  
Medium,	
  Low).	
  	
  
• Examples	
  of	
  algorithms	
  and	
  
models:	
  Random	
  Forest
Data	
  Science	
  Concept:
Regression	
  (Supervised)
Business	
  Applications:
• How	
  much	
  money	
  would	
  a	
  user	
  
who	
  has	
  reached	
  level	
  200	
  in	
  
CandyCrush	
  spend	
  on	
  in-­‐app	
  
purchases?	
  (forecasting)	
  
• How	
  much	
  would	
  a	
  customer	
  
expect	
  to	
  pay	
  for	
  car	
  insurance	
  
based	
  on	
  age,	
  gender,	
  and	
  car	
  
type?	
  (prediction)	
  
• How	
  many	
  registered	
  meetup.com	
  
attendees	
  will	
  actually	
  show	
  up	
  
based	
  on	
  past	
  event	
  registration	
  
and	
  attendance?	
  (prediction)
• Regression	
  predict	
  a	
  continuous	
  
numerical	
  value	
  output	
  	
  
• Examples	
  of	
  algorithms	
  and	
  
models:	
  Linear	
  Regression,	
  
Random	
  Forest
Data	
  Science	
  Concept:
Deep	
  Learning	
  (Supervised	
  and	
  Unsupervised)
Business	
  Applications:
• Scanning	
  mug	
  shots	
  of	
  suspects	
  
against	
  FBI	
  database	
  (scanning	
  
image	
  classification)	
  
• Siri	
  (language	
  processing)	
  
• Early	
  detection	
  of	
  frustrated	
  
customers	
  who	
  call	
  into	
  call	
  
centers	
  (audio	
  processing)
• Uses	
  “features”	
  (multiple	
  
variables	
  impacting	
  a	
  result)	
  to	
  
identify	
  patterns	
  
• Uses	
  results	
  to	
  iteratively	
  
improve	
  predictions	
  for	
  new	
  
data
Data	
  Science	
  Concept:
Clustering	
  (Unsupervised)
Business	
  Applications:
• Identify	
  different	
  types	
  of	
  
shoppers	
  based	
  on	
  purchasing	
  
history	
  to	
  create	
  exclusive	
  
promotions	
  (market	
  
segmentation)	
  
• Identifying	
  groups	
  of	
  products	
  
people	
  like	
  to	
  buy	
  online	
  
• Identify	
  geographic	
  locations	
  
where	
  a	
  national	
  mobile	
  carrier	
  
should	
  install	
  its	
  next	
  cellular	
  
tower	
  to	
  optimize	
  for	
  its	
  user	
  
base	
  
• Grouping	
  a	
  set	
  of	
  objects	
  in	
  the	
  
same	
  group	
  that	
  are	
  more	
  
similar	
  to	
  each	
  other	
  than	
  other	
  
groups	
  	
  
• Examples	
  of	
  algorithms	
  and	
  
models:	
  K-­‐means	
  clustering,	
  
hierarchical	
  clustering,	
  DBSCAN
Data	
  Science	
  Concept:
Business	
  	
  
Examples:
Types	
  of	
  	
  
Machine	
  
Learning:
Machine	
  Learning	
  Summary
Supervised
• Calculating	
  estimated	
  
lifetime	
  value	
  
• Forecasting	
  and	
  prediction	
  
• Recommendation	
  engine	
  
• Fraud	
  detection
Unsupervised
Data	
  Science	
  	
  
Concepts:	
  
• Anomaly	
  detection	
  
• Determining	
  customer	
  
behavior	
  
• Imagine,	
  text,	
  and	
  audio	
  
processing	
  
• Classification	
  
• Regression	
  
• Deep	
  Learning
• Deep	
  Learning	
  
• Clustering
If	
  you	
  Want	
  to	
  Learn	
  More…	
  
• StackExchange:	
  stats.stackexchange.com	
  	
  
• Quora:	
  quora.com/Machine-­‐Learning	
  	
  
• Data	
  Science	
  in	
  H2O:	
  http://docs.h2o.ai/
h2oclassic/datascience/top.html	
  	
  
• Visualization	
  Introduction	
  to	
  Machine	
  Learning:	
  
r2d3.us/visual-­‐intro-­‐to-­‐machine-­‐learning-­‐part-­‐1	
  
• Machine	
  Learning	
  Map:	
  	
  http://scikit-­‐learn.org/
stable/tutorial/machine_learning_map/	
  
Questions	
  and	
  Answers

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

H2O World - Quora: Machine Learning Algorithms to Grow the World's Knowledge ...
H2O World - Quora: Machine Learning Algorithms to Grow the World's Knowledge ...H2O World - Quora: Machine Learning Algorithms to Grow the World's Knowledge ...
H2O World - Quora: Machine Learning Algorithms to Grow the World's Knowledge ...
 
Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...
Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...
Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...
 
H2O World - Data Science in Action @ 6sense - Viral Bajaria
H2O World - Data Science in Action @ 6sense - Viral BajariaH2O World - Data Science in Action @ 6sense - Viral Bajaria
H2O World - Data Science in Action @ 6sense - Viral Bajaria
 
H2O World - NCS Continuous Media Optimization w/H2O - Satya Satyamoorthy
H2O World - NCS Continuous Media Optimization w/H2O - Satya SatyamoorthyH2O World - NCS Continuous Media Optimization w/H2O - Satya Satyamoorthy
H2O World - NCS Continuous Media Optimization w/H2O - Satya Satyamoorthy
 
Advanced Analytics and Data Science Expertise
Advanced Analytics and Data Science ExpertiseAdvanced Analytics and Data Science Expertise
Advanced Analytics and Data Science Expertise
 
Leveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive IndustryLeveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive Industry
 
Data science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi PeriasamyData science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi Periasamy
 
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
Data Science Salon: Introduction to Machine Learning - Marketing Use CaseData Science Salon: Introduction to Machine Learning - Marketing Use Case
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
 
Agile data science
Agile data scienceAgile data science
Agile data science
 
1645 track 1 bress_using his laptop
1645 track 1 bress_using his laptop1645 track 1 bress_using his laptop
1645 track 1 bress_using his laptop
 
Back to Square One: Building a Data Science Team from Scratch
Back to Square One: Building a Data Science Team from ScratchBack to Square One: Building a Data Science Team from Scratch
Back to Square One: Building a Data Science Team from Scratch
 
Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Agile Analytics: Delivering on Promises by Atif Abdul Rahman
Agile Analytics: Delivering on Promises by Atif Abdul RahmanAgile Analytics: Delivering on Promises by Atif Abdul Rahman
Agile Analytics: Delivering on Promises by Atif Abdul Rahman
 
H2O World - Learning How Humans and Non-Humans Interact with Digital Ads
H2O World - Learning How Humans and Non-Humans Interact with Digital AdsH2O World - Learning How Humans and Non-Humans Interact with Digital Ads
H2O World - Learning How Humans and Non-Humans Interact with Digital Ads
 
Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)
 
Analyzing Unstructured Data in Hadoop Webinar
Analyzing Unstructured Data in Hadoop WebinarAnalyzing Unstructured Data in Hadoop Webinar
Analyzing Unstructured Data in Hadoop Webinar
 
Idiots guide to setting up a data science team
Idiots guide to setting up a data science teamIdiots guide to setting up a data science team
Idiots guide to setting up a data science team
 
1215 dataikulunchlearn sanford
1215 dataikulunchlearn sanford1215 dataikulunchlearn sanford
1215 dataikulunchlearn sanford
 
8 minute intro to data science
8 minute intro to data science 8 minute intro to data science
8 minute intro to data science
 

Andere mochten auch

Andere mochten auch (10)

H2O World - Welcome to H2O World with Arno Candel
H2O World - Welcome to H2O World with Arno CandelH2O World - Welcome to H2O World with Arno Candel
H2O World - Welcome to H2O World with Arno Candel
 
H2O World - Building a Smarter Application - Tom Kraljevic
H2O World - Building a Smarter Application - Tom KraljevicH2O World - Building a Smarter Application - Tom Kraljevic
H2O World - Building a Smarter Application - Tom Kraljevic
 
H2O World - H2O Deep Learning with Arno Candel
H2O World - H2O Deep Learning with Arno CandelH2O World - H2O Deep Learning with Arno Candel
H2O World - H2O Deep Learning with Arno Candel
 
H20: A platform for big math
H20: A platform for big math H20: A platform for big math
H20: A platform for big math
 
H2O World - Top 10 Deep Learning Tips & Tricks - Arno Candel
H2O World - Top 10 Deep Learning Tips & Tricks - Arno CandelH2O World - Top 10 Deep Learning Tips & Tricks - Arno Candel
H2O World - Top 10 Deep Learning Tips & Tricks - Arno Candel
 
Data Science, Machine Learning, and H2O
Data Science, Machine Learning, and H2OData Science, Machine Learning, and H2O
Data Science, Machine Learning, and H2O
 
Intro to Machine Learning with H2O and AWS
Intro to Machine Learning with H2O and AWSIntro to Machine Learning with H2O and AWS
Intro to Machine Learning with H2O and AWS
 
Jan vitek distributedrandomforest_5-2-2013
Jan vitek distributedrandomforest_5-2-2013Jan vitek distributedrandomforest_5-2-2013
Jan vitek distributedrandomforest_5-2-2013
 
Spring Boot Microservices vs Akka Actor Cluster
Spring Boot Microservices vs Akka Actor Cluster Spring Boot Microservices vs Akka Actor Cluster
Spring Boot Microservices vs Akka Actor Cluster
 
A Scalable Hierarchical Clustering Algorithm Using Spark: Spark Summit East t...
A Scalable Hierarchical Clustering Algorithm Using Spark: Spark Summit East t...A Scalable Hierarchical Clustering Algorithm Using Spark: Spark Summit East t...
A Scalable Hierarchical Clustering Algorithm Using Spark: Spark Summit East t...
 

Ähnlich wie H2O World - Machine Learning for non-data scientists

Ähnlich wie H2O World - Machine Learning for non-data scientists (20)

Scaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsScaling Training Data for AI Applications
Scaling Training Data for AI Applications
 
Analytics in Online Retail
Analytics in Online RetailAnalytics in Online Retail
Analytics in Online Retail
 
Intro to machine learning
Intro to machine learningIntro to machine learning
Intro to machine learning
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
 
Predictive Analytics for Non-programmers
Predictive Analytics for Non-programmersPredictive Analytics for Non-programmers
Predictive Analytics for Non-programmers
 
A picture is worth a thousand words
A picture is worth a thousand wordsA picture is worth a thousand words
A picture is worth a thousand words
 
Huntel global webinar aligning data talent with your analytics needs
Huntel global webinar aligning data talent with your analytics needsHuntel global webinar aligning data talent with your analytics needs
Huntel global webinar aligning data talent with your analytics needs
 
predictive analysis and usage in procurement ppt 2017
predictive analysis and usage in procurement  ppt 2017predictive analysis and usage in procurement  ppt 2017
predictive analysis and usage in procurement ppt 2017
 
Data science 101
Data science 101Data science 101
Data science 101
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
 
Liberating data power of APIs
Liberating data power of APIsLiberating data power of APIs
Liberating data power of APIs
 
Getting Started with Product Analytics - A 101 Implementation Guide for Begin...
Getting Started with Product Analytics - A 101 Implementation Guide for Begin...Getting Started with Product Analytics - A 101 Implementation Guide for Begin...
Getting Started with Product Analytics - A 101 Implementation Guide for Begin...
 
Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise
 
Your AI and ML Projects Are Failing – Key Steps to Get Them Back on Track
Your AI and ML Projects Are Failing – Key Steps to Get Them Back on TrackYour AI and ML Projects Are Failing – Key Steps to Get Them Back on Track
Your AI and ML Projects Are Failing – Key Steps to Get Them Back on Track
 
Altron presentation on Emerging Technologies: Data Science and Artificial Int...
Altron presentation on Emerging Technologies: Data Science and Artificial Int...Altron presentation on Emerging Technologies: Data Science and Artificial Int...
Altron presentation on Emerging Technologies: Data Science and Artificial Int...
 
Know Your Market - Know Your Customer: What Web Data Reveals if You Know Wher...
Know Your Market - Know Your Customer: What Web Data Reveals if You Know Wher...Know Your Market - Know Your Customer: What Web Data Reveals if You Know Wher...
Know Your Market - Know Your Customer: What Web Data Reveals if You Know Wher...
 
D2C Insider Mumbai summit - Leverage AI for your D2C business by Rahul Pangam...
D2C Insider Mumbai summit - Leverage AI for your D2C business by Rahul Pangam...D2C Insider Mumbai summit - Leverage AI for your D2C business by Rahul Pangam...
D2C Insider Mumbai summit - Leverage AI for your D2C business by Rahul Pangam...
 
Tools and techniques for predictive analytics
Tools and techniques for predictive analyticsTools and techniques for predictive analytics
Tools and techniques for predictive analytics
 
Introduction to Data mining
Introduction to Data miningIntroduction to Data mining
Introduction to Data mining
 
Image Analytics for Retail
Image Analytics for RetailImage Analytics for Retail
Image Analytics for Retail
 

Mehr von Sri Ambati

Mehr von Sri Ambati (20)

H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Generative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxGenerative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptx
 
AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek
 
LLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5thLLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5th
 
Building, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionBuilding, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for Production
 
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
 
Risk Management for LLMs
Risk Management for LLMsRisk Management for LLMs
Risk Management for LLMs
 
Open-Source AI: Community is the Way
Open-Source AI: Community is the WayOpen-Source AI: Community is the Way
Open-Source AI: Community is the Way
 
Building Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2OBuilding Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2O
 
Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical
 
Cutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM PapersCutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM Papers
 
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
 
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
 
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
 
LLM Interpretability
LLM Interpretability LLM Interpretability
LLM Interpretability
 
Never Reply to an Email Again
Never Reply to an Email AgainNever Reply to an Email Again
Never Reply to an Email Again
 
Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)
 
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
 
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
 
AI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation Journey
 

Kürzlich hochgeladen

CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
mohitmore19
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
VictorSzoltysek
 

Kürzlich hochgeladen (20)

Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 

H2O World - Machine Learning for non-data scientists

  • 1. Calling  Your  Shots  with  Data How  to  Ask  Smarter  Questions  to  Make   Better  Business  Decisions   November  9,  2015 Jessica  LanfordChen  Huang
  • 2. Need  a  handy  conference  guide? Download  our  app,  “H2O  World  2015”
  • 3. What  do  these  stickers  mean? I have H2O Installed I have Python installed I have R installed I have the H2O World data sets Pick  up  stickers  or  get  install  help  at  the   information  booth
  • 4. Agenda • Introduction   • Business  Decision  Process     • Tools  &  Resources   • Bridging  the  Communications  Gap   • Q&A
  • 5. Business  Decision  Process Make  data-­‐ informed   business   decisions Ask  business   questions Define   business   problems Analysis   Process
  • 6. Motivators  that  Influence  Business  Decisions Business  decisions  will   impact:   Customer  product   interaction   Customer  and  company   engagement   Product  development   ???   1 2 3 4 Your% customers% Your%products% /%services%/% offerings Your%business 1 2 3 4
  • 7. Asking  the  “Right”  Questions The  answers  to  the  business  questions  will  ultimately   provide  business  context  for  the  “Analysis  Process”.   Make  data-­‐ informed   business   decisions Ask  business   questions Define   business   problems Analysis   Process
  • 9. Complete  Business  Decision  Process Analysis(Process Analyze(Data(&(Find(Insights Define(business(problems Ask(business(questions Make(data?Informed( business(decisions 1.(Frame(the( question 2.(Collect(raw(data 3.(Prepare(and( explore(data 4.(Develop(model 5.(Evaluate,( validate,(and( interpret(results 6.(Communicate( and(visualize( results Business( decision( maker Data( Scientist Discussion(to(reach( agreement(in(problem( statement Translate(business(questions( and(context(into(a(problem( statement
  • 10. Tools  and  Resources Data$Science$Team R,$Python,$Scala,$ Java,$CoffeeScript /$ JavaScript,$SQL,$Julia Languages Jupyter$(IPython),$ H2O$Flow,$…$ Notebooks$+$IDEs Skillsets: • Domain$knowledge • Math$and$statistics • Programming$ skills • Databases • Machine$learning • Communication$and$visualization
  • 12. What  is  Machine  Learning?   • Machine  reads  the  data,  learns  from  the  data,   uses  it  to  make  predictions   • Can  show  you  correlation  but  not  necessarily   causation   • Can  find  relationships  and  patterns  within   volumes  of  data  that  the  human  mind  is   incapable  of  processing Note:  There  is  no  “right”  or  “best”  model  that  a  data   scientist  can  use.  The  model  used  is  dependent  on  the  data,   problem,  and  the  data  scientist.
  • 13. Supervised  Learning Business  Applications: • Classification   • Twitter  sentiments:  Rant  -­‐>   negative,  Rave  -­‐>  positive   • Coffee  vs.  tea  vs.  soda  drinker   • Recommender  systems   • Netflix’s  “More  Like  This”   • Amazon’s  “Customers  Who   Bought  This  Item  Also   Bought”   • Fraud  detection   • Authorizing  transactions • Known  right  answer,  using   model  to  verify   • Algorithm  tries  to  predict   results   • Based  on  its  training  data,  the   program  can  make  accurate   decisions  when  given  new  data   • Examples  of  algorithms  and   models:  GLM,  DRF,  GBM,  Deep   Learning Data  Science  Concept:
  • 14. Unsupervised  Learning Business  Applications: • Anomaly  detection   • outliers:  detecting   irregular  heartbeats   • computer  security  with   unauthorized  access     • Clustering     • Grouping  users  by  salary   • Grouping  users  by   behavior • No  “known”  answer,  using   algorithms  to  determine   answer   • Algorithm  tries  to  identify   patterns  in  the  data   • General  understanding  of   input  data  where  no   prediction  is  needed   • Examples  of  algorithms  and   models:  K-­‐means,  PCA Data  Science  Concept:
  • 15. Classification  (Supervised) Business  Applications: • Will  customers  upgrade  to  new   software?     • What  age  groups  tested  well  for   this  new  TV  show?  (marketing   campaigns)   • Nigerian  419  (spam   classification)   • Will  the  real  Barack  Obama   please  stand  up?  (fraud   detection) • Classification  is  the  process  of   taking  an  input  and  assigning  a   label  to  it.   • The  labels  could  be  binomial   (Yes,  No)  or  multinomial  (High,   Medium,  Low).     • Examples  of  algorithms  and   models:  Random  Forest Data  Science  Concept:
  • 16. Regression  (Supervised) Business  Applications: • How  much  money  would  a  user   who  has  reached  level  200  in   CandyCrush  spend  on  in-­‐app   purchases?  (forecasting)   • How  much  would  a  customer   expect  to  pay  for  car  insurance   based  on  age,  gender,  and  car   type?  (prediction)   • How  many  registered  meetup.com   attendees  will  actually  show  up   based  on  past  event  registration   and  attendance?  (prediction) • Regression  predict  a  continuous   numerical  value  output     • Examples  of  algorithms  and   models:  Linear  Regression,   Random  Forest Data  Science  Concept:
  • 17. Deep  Learning  (Supervised  and  Unsupervised) Business  Applications: • Scanning  mug  shots  of  suspects   against  FBI  database  (scanning   image  classification)   • Siri  (language  processing)   • Early  detection  of  frustrated   customers  who  call  into  call   centers  (audio  processing) • Uses  “features”  (multiple   variables  impacting  a  result)  to   identify  patterns   • Uses  results  to  iteratively   improve  predictions  for  new   data Data  Science  Concept:
  • 18. Clustering  (Unsupervised) Business  Applications: • Identify  different  types  of   shoppers  based  on  purchasing   history  to  create  exclusive   promotions  (market   segmentation)   • Identifying  groups  of  products   people  like  to  buy  online   • Identify  geographic  locations   where  a  national  mobile  carrier   should  install  its  next  cellular   tower  to  optimize  for  its  user   base   • Grouping  a  set  of  objects  in  the   same  group  that  are  more   similar  to  each  other  than  other   groups     • Examples  of  algorithms  and   models:  K-­‐means  clustering,   hierarchical  clustering,  DBSCAN Data  Science  Concept:
  • 19. Business     Examples: Types  of     Machine   Learning: Machine  Learning  Summary Supervised • Calculating  estimated   lifetime  value   • Forecasting  and  prediction   • Recommendation  engine   • Fraud  detection Unsupervised Data  Science     Concepts:   • Anomaly  detection   • Determining  customer   behavior   • Imagine,  text,  and  audio   processing   • Classification   • Regression   • Deep  Learning • Deep  Learning   • Clustering
  • 20. If  you  Want  to  Learn  More…   • StackExchange:  stats.stackexchange.com     • Quora:  quora.com/Machine-­‐Learning     • Data  Science  in  H2O:  http://docs.h2o.ai/ h2oclassic/datascience/top.html     • Visualization  Introduction  to  Machine  Learning:   r2d3.us/visual-­‐intro-­‐to-­‐machine-­‐learning-­‐part-­‐1   • Machine  Learning  Map:    http://scikit-­‐learn.org/ stable/tutorial/machine_learning_map/