SlideShare ist ein Scribd-Unternehmen logo
1 von 17
MACHINE LEARNING
Evren Korpeoglu, Data Science
Aarthi Srinivasan, Product Management
/Productschool @ProdSchool /ProductmanagementSV
What to Expect?
2
• What is machine learning ?
• Why is it important ?
• How do we use it ?
• Technical Concepts
• Examples
What is Machine Learning?
3
1. Science of getting computers to learn or recognize something without being explicitly
programmed – Andrew Ng
• Branch of Artificial Intelligence which is a branch of Computer Science
• Give lots of data to the computer so that it can figure it out
• One of the first examples is the computer checkers program by Arthur Samuel
* - ref: Andew Ng Courses, Big data: A revolution
2. Distinguish big data & machine learning: Big data is the data seed for creating
machine learning forests
• Big data collects information based on our digital exhaust (crumbs we leave in
the digital world) , demographics, preferences, health etc.
• Machine learning will mine this data and model behaviors with interactive
responses based on this data
Why do we need this?
4
1. Tons of applications impacting human health, utility and future simplification
Health & Wellness Utilitarian Future
• DNA sampling &
diagnosis
• Health reminders
& prevention
through AI tools
• Correlation studies
• Customizable
tablets
• Real time optimized
path maps
• Search Ranking
• Spam filter on email
• News aggregators
• Shopping
Recommendation
• Facebook face
recognition
• Age recognition
• Voice recognition –
Siri, Alexa
• Driverless cars
• Home decoration
Key Terms
5
• A set of data used to predict relationships.
• E.g. A diamond’s size, cut and clarity helps predicts the price. Data and
answers for each sample.
Training Set
• Uses training set to make a prediction.
• E.g. Model predicts diamond prices based on past prices.
Supervised Learning
• Provide data without suggesting anything so computer can identify patterns
or groupings.
• E.g. Customer segmentation, DNA groupings.
Unsupervised Learning
• Each distinct measurable data value you select in the training data set.
• E.g. A diamonds’ size is one of the feature’s for predicting price.
Features/ Variables /
Attributes
• Using the features provided in the training set make a prediction. Fit a curve
using the data provided.
• E.g. Price of diamond = X*Cut + Y*Clarity + Z*Size + features…
Supervised: Regression
• A defined set of categories for placing new data (observations)
• E.g. Presence of absence of cancer; Types of diabetes
Supervised: Classification
• Process of assigning observations into subsets
• E.g. Customer segment creations
Unsupervised: Clustering
Learning Steps
6
Collect /
Update User
Data
1
Create /
Update
Training Set
data
2
Create /
Update
algorithm for
training data
Update
Algorithm
Validate
Algorithm
3
Create
predictive
model
4
New real-time
observations
A/B Test &
Launch on
production
5
Data Wrangling and Feature Extraction
7
Spam Email
Detection
Title
Sender
Domain
# of
Recipients
Email
content
Country of
Origin
Non-
dictionary
Words
Hyperlinks
Address
Book
Length of
email
• Structured Data (Best)
– RDBMS, columnar data
– Strict Schema
– SQL
• Semi-Structured Data (Better)
– JSON, XML
– Enforce minimum schema
– JSON, XML Parser
• Unstructured Data
– Text, Image, Raw email
– No Schema
– Batch processing
– Regular expressions
– Map Reduce
GARBAGE IN GARBAGE OUT
Model Training
8
Feature
Extraction
(Feature
vector)
New
Text documents
User Activity
Images
Transaction history
Feature
Extraction
(Feature
vector)
Labels
Machine
Learning
Algorithm
Training / Testing
Text documents
User Activity
Images
Transaction history
Predictive
Model
Expected
Label
Model
Evaluation
Supervised learning techniques
9
• Linear classifier (numerical functions)
• Parametric (Probabilistic functions)
– Naïve Bayes, Hidden Markov models (HMM), Probabilistic
graphical models
• Non-parametric (Instance-based functions)
– K-nearest neighbors
• Non-metric (Symbolic functions)
– Classification and regression tree (CART)
• Aggregation
– Bagging (bootstrap + aggregation), Adaboost, Random
forest, Ensemble models
Linear Classifiers
10
• Logistic regression
– )
– w with minimum loss
– Solve iteratively using gradient descent
• Support vector machine (SVM)
– Maximum margin classifier
• Artificial Neural Networks
– Inspired from how neurons work
– Activation function (Sigmoid, ReLU etc.)
– Deep Learning
KNN / CART
11
• K-Nearest Neighbors
– Find K nearest training examples
– Majority vote
– Easy to implement
– Not scalable for real time predictions
• Classification and Regression Trees
– Easy to interpret for small trees
• Random Forests
– Ensemble of decision trees
– Usually performs very good
Unsupervised Learning
12
• Clustering
– K-means clustering
– Spectral clustering
• Dimensionality reduction
– Principal component analysis (PCA)
– Factor analysis
• Product Recommendations
– Collaborative Filtering
• Association Rules
– Market Basket Analysis
Model Evaluation
13
• Measure model performance
• Optimize model to improve prediction
quality
– Feature selection
– Hyperparameter tuning
• A/B Testing
• Explore/Exploit
• http://en.wikipedia.org/wiki/Precision_and_recall
Sample Architecture
14
-HADOOP
- SPARK
PREDICTION ENGINE
REAL TIME
DATA
SQL / NO SQL
Data Base
CLIENT MACHINE LEARNING
SYSTEM
Health & Wellness Sen.se Mother (iOT)
15
Amazon Echo & Personalization
16
Houzz Visual Match Deep Learning
17

Weitere ähnliche Inhalte

Was ist angesagt?

[2018 台灣人工智慧學校校友年會] Practical experience in mining and evaluating information...
[2018 台灣人工智慧學校校友年會] Practical experience in mining and evaluating information...[2018 台灣人工智慧學校校友年會] Practical experience in mining and evaluating information...
[2018 台灣人工智慧學校校友年會] Practical experience in mining and evaluating information...台灣資料科學年會
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2Roger Barga
 
How to get on the AI journey?
How to get on the AI journey? How to get on the AI journey?
How to get on the AI journey? Aarthi Srinivasan
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data scienceKoo Ping Shung
 
Barga Data Science lecture 1
Barga Data Science lecture 1Barga Data Science lecture 1
Barga Data Science lecture 1Roger Barga
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Roger Barga
 
Machine Learning for Sales & Marketing
Machine Learning for Sales & MarketingMachine Learning for Sales & Marketing
Machine Learning for Sales & MarketingPiyush Saggi
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceLivePerson
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceNiko Vuokko
 
Barga Data Science lecture 4
Barga Data Science lecture 4Barga Data Science lecture 4
Barga Data Science lecture 4Roger Barga
 
BigMLSchool: Customer Segmentation
BigMLSchool: Customer SegmentationBigMLSchool: Customer Segmentation
BigMLSchool: Customer SegmentationBigML, Inc
 
Agile Analytics: Delivering on Promises by Atif Abdul Rahman
Agile Analytics: Delivering on Promises by Atif Abdul RahmanAgile Analytics: Delivering on Promises by Atif Abdul Rahman
Agile Analytics: Delivering on Promises by Atif Abdul RahmanAgile ME
 
8 minute intro to data science
8 minute intro to data science 8 minute intro to data science
8 minute intro to data science Mahesh Kumar CV
 
BigMLSchool: Trustworthy AI
BigMLSchool: Trustworthy AIBigMLSchool: Trustworthy AI
BigMLSchool: Trustworthy AIBigML, Inc
 
BigMLSchool: Dealing with Text
BigMLSchool: Dealing with TextBigMLSchool: Dealing with Text
BigMLSchool: Dealing with TextBigML, Inc
 
Data Science 101
Data Science 101Data Science 101
Data Science 101odsc
 
Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI Webina...
Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI  Webina...Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI  Webina...
Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI Webina...Pistoia Alliance
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Edureka!
 
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Thoughtworks
 
Causal Inference, Reinforcement Learning, and Continuous Optimization
Causal Inference, Reinforcement Learning, and Continuous OptimizationCausal Inference, Reinforcement Learning, and Continuous Optimization
Causal Inference, Reinforcement Learning, and Continuous OptimizationScientificRevenue
 

Was ist angesagt? (20)

[2018 台灣人工智慧學校校友年會] Practical experience in mining and evaluating information...
[2018 台灣人工智慧學校校友年會] Practical experience in mining and evaluating information...[2018 台灣人工智慧學校校友年會] Practical experience in mining and evaluating information...
[2018 台灣人工智慧學校校友年會] Practical experience in mining and evaluating information...
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
 
How to get on the AI journey?
How to get on the AI journey? How to get on the AI journey?
How to get on the AI journey?
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Barga Data Science lecture 1
Barga Data Science lecture 1Barga Data Science lecture 1
Barga Data Science lecture 1
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015
 
Machine Learning for Sales & Marketing
Machine Learning for Sales & MarketingMachine Learning for Sales & Marketing
Machine Learning for Sales & Marketing
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Barga Data Science lecture 4
Barga Data Science lecture 4Barga Data Science lecture 4
Barga Data Science lecture 4
 
BigMLSchool: Customer Segmentation
BigMLSchool: Customer SegmentationBigMLSchool: Customer Segmentation
BigMLSchool: Customer Segmentation
 
Agile Analytics: Delivering on Promises by Atif Abdul Rahman
Agile Analytics: Delivering on Promises by Atif Abdul RahmanAgile Analytics: Delivering on Promises by Atif Abdul Rahman
Agile Analytics: Delivering on Promises by Atif Abdul Rahman
 
8 minute intro to data science
8 minute intro to data science 8 minute intro to data science
8 minute intro to data science
 
BigMLSchool: Trustworthy AI
BigMLSchool: Trustworthy AIBigMLSchool: Trustworthy AI
BigMLSchool: Trustworthy AI
 
BigMLSchool: Dealing with Text
BigMLSchool: Dealing with TextBigMLSchool: Dealing with Text
BigMLSchool: Dealing with Text
 
Data Science 101
Data Science 101Data Science 101
Data Science 101
 
Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI Webina...
Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI  Webina...Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI  Webina...
Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI Webina...
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
 
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
 
Causal Inference, Reinforcement Learning, and Continuous Optimization
Causal Inference, Reinforcement Learning, and Continuous OptimizationCausal Inference, Reinforcement Learning, and Continuous Optimization
Causal Inference, Reinforcement Learning, and Continuous Optimization
 

Ähnlich wie Machine learning

How Machine Learning Can Transform The Customer Experience
How Machine Learning Can Transform The Customer ExperienceHow Machine Learning Can Transform The Customer Experience
How Machine Learning Can Transform The Customer ExperienceProduct School
 
Data analytcis-first-steps
Data analytcis-first-stepsData analytcis-first-steps
Data analytcis-first-stepsShesha R
 
Choosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your needChoosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your needGibDevs
 
Machinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdfMachinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdfSaketBansal9
 
Data Mining - The Big Picture!
Data Mining - The Big Picture!Data Mining - The Big Picture!
Data Mining - The Big Picture!Khalid Salama
 
Data mining Basics and complete description onword
Data mining Basics and complete description onwordData mining Basics and complete description onword
Data mining Basics and complete description onwordSulman Ahmed
 
Altron presentation on Emerging Technologies: Data Science and Artificial Int...
Altron presentation on Emerging Technologies: Data Science and Artificial Int...Altron presentation on Emerging Technologies: Data Science and Artificial Int...
Altron presentation on Emerging Technologies: Data Science and Artificial Int...Robert Williams
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningSSSSSS354882
 
Protecting Artificial Intelligence/Machine Learning Inventions in the United ...
Protecting Artificial Intelligence/Machine Learning Inventions in the United ...Protecting Artificial Intelligence/Machine Learning Inventions in the United ...
Protecting Artificial Intelligence/Machine Learning Inventions in the United ...Knobbe Martens - Intellectual Property Law
 
Azure Machine Learning
Azure Machine LearningAzure Machine Learning
Azure Machine LearningMostafa
 
Lesson 1 - Overview of Machine Learning and Data Analysis.pptx
Lesson 1 - Overview of Machine Learning and Data Analysis.pptxLesson 1 - Overview of Machine Learning and Data Analysis.pptx
Lesson 1 - Overview of Machine Learning and Data Analysis.pptxcloudserviceuit
 
Machine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An IntroMachine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An IntroSi Krishan
 
Data Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAData Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAjaved75
 
Chapter 4 Classification in data sience .pdf
Chapter 4 Classification in data sience .pdfChapter 4 Classification in data sience .pdf
Chapter 4 Classification in data sience .pdfAschalewAyele2
 
unit 1.2 supervised learning.pptx
unit 1.2 supervised learning.pptxunit 1.2 supervised learning.pptx
unit 1.2 supervised learning.pptxDr.Shweta
 
Unit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxUnit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxChitrachitrap
 
Introduction to Machine Learning - An overview and first step for candidate d...
Introduction to Machine Learning - An overview and first step for candidate d...Introduction to Machine Learning - An overview and first step for candidate d...
Introduction to Machine Learning - An overview and first step for candidate d...Lucas Jellema
 
It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!To Sum It Up
 

Ähnlich wie Machine learning (20)

How Machine Learning Can Transform The Customer Experience
How Machine Learning Can Transform The Customer ExperienceHow Machine Learning Can Transform The Customer Experience
How Machine Learning Can Transform The Customer Experience
 
machine learning
machine learningmachine learning
machine learning
 
Data analytcis-first-steps
Data analytcis-first-stepsData analytcis-first-steps
Data analytcis-first-steps
 
Choosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your needChoosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your need
 
Machinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdfMachinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdf
 
Data Mining - The Big Picture!
Data Mining - The Big Picture!Data Mining - The Big Picture!
Data Mining - The Big Picture!
 
Data mining Basics and complete description onword
Data mining Basics and complete description onwordData mining Basics and complete description onword
Data mining Basics and complete description onword
 
PPT s09-machine vision-s2
PPT s09-machine vision-s2PPT s09-machine vision-s2
PPT s09-machine vision-s2
 
Altron presentation on Emerging Technologies: Data Science and Artificial Int...
Altron presentation on Emerging Technologies: Data Science and Artificial Int...Altron presentation on Emerging Technologies: Data Science and Artificial Int...
Altron presentation on Emerging Technologies: Data Science and Artificial Int...
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Protecting Artificial Intelligence/Machine Learning Inventions in the United ...
Protecting Artificial Intelligence/Machine Learning Inventions in the United ...Protecting Artificial Intelligence/Machine Learning Inventions in the United ...
Protecting Artificial Intelligence/Machine Learning Inventions in the United ...
 
Azure Machine Learning
Azure Machine LearningAzure Machine Learning
Azure Machine Learning
 
Lesson 1 - Overview of Machine Learning and Data Analysis.pptx
Lesson 1 - Overview of Machine Learning and Data Analysis.pptxLesson 1 - Overview of Machine Learning and Data Analysis.pptx
Lesson 1 - Overview of Machine Learning and Data Analysis.pptx
 
Machine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An IntroMachine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An Intro
 
Data Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAData Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATA
 
Chapter 4 Classification in data sience .pdf
Chapter 4 Classification in data sience .pdfChapter 4 Classification in data sience .pdf
Chapter 4 Classification in data sience .pdf
 
unit 1.2 supervised learning.pptx
unit 1.2 supervised learning.pptxunit 1.2 supervised learning.pptx
unit 1.2 supervised learning.pptx
 
Unit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxUnit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptx
 
Introduction to Machine Learning - An overview and first step for candidate d...
Introduction to Machine Learning - An overview and first step for candidate d...Introduction to Machine Learning - An overview and first step for candidate d...
Introduction to Machine Learning - An overview and first step for candidate d...
 
It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!
 

Mehr von Aarthi Srinivasan

Design Thinking & Lean Product Process
Design Thinking & Lean Product ProcessDesign Thinking & Lean Product Process
Design Thinking & Lean Product ProcessAarthi Srinivasan
 
Blockchain Future & Investments 2018 - Women in Product
Blockchain Future & Investments 2018 - Women in Product Blockchain Future & Investments 2018 - Women in Product
Blockchain Future & Investments 2018 - Women in Product Aarthi Srinivasan
 
Accelerate: AI Trends in 2018
Accelerate: AI Trends in 2018Accelerate: AI Trends in 2018
Accelerate: AI Trends in 2018Aarthi Srinivasan
 
Types of Blockchain, AI and its future
Types of Blockchain, AI and its futureTypes of Blockchain, AI and its future
Types of Blockchain, AI and its futureAarthi Srinivasan
 
Blockchain, Bitcoin, Mining - My Product School Presentation
Blockchain, Bitcoin, Mining - My Product School Presentation Blockchain, Bitcoin, Mining - My Product School Presentation
Blockchain, Bitcoin, Mining - My Product School Presentation Aarthi Srinivasan
 

Mehr von Aarthi Srinivasan (8)

Design Thinking & Lean Product Process
Design Thinking & Lean Product ProcessDesign Thinking & Lean Product Process
Design Thinking & Lean Product Process
 
Savdlife Stanford Pitch
Savdlife Stanford Pitch Savdlife Stanford Pitch
Savdlife Stanford Pitch
 
AI as a platform
AI as a platformAI as a platform
AI as a platform
 
Blockchain Future & Investments 2018 - Women in Product
Blockchain Future & Investments 2018 - Women in Product Blockchain Future & Investments 2018 - Women in Product
Blockchain Future & Investments 2018 - Women in Product
 
Dual track-process-Aarthi
Dual track-process-AarthiDual track-process-Aarthi
Dual track-process-Aarthi
 
Accelerate: AI Trends in 2018
Accelerate: AI Trends in 2018Accelerate: AI Trends in 2018
Accelerate: AI Trends in 2018
 
Types of Blockchain, AI and its future
Types of Blockchain, AI and its futureTypes of Blockchain, AI and its future
Types of Blockchain, AI and its future
 
Blockchain, Bitcoin, Mining - My Product School Presentation
Blockchain, Bitcoin, Mining - My Product School Presentation Blockchain, Bitcoin, Mining - My Product School Presentation
Blockchain, Bitcoin, Mining - My Product School Presentation
 

Kürzlich hochgeladen

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 

Kürzlich hochgeladen (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

Machine learning

  • 1. MACHINE LEARNING Evren Korpeoglu, Data Science Aarthi Srinivasan, Product Management /Productschool @ProdSchool /ProductmanagementSV
  • 2. What to Expect? 2 • What is machine learning ? • Why is it important ? • How do we use it ? • Technical Concepts • Examples
  • 3. What is Machine Learning? 3 1. Science of getting computers to learn or recognize something without being explicitly programmed – Andrew Ng • Branch of Artificial Intelligence which is a branch of Computer Science • Give lots of data to the computer so that it can figure it out • One of the first examples is the computer checkers program by Arthur Samuel * - ref: Andew Ng Courses, Big data: A revolution 2. Distinguish big data & machine learning: Big data is the data seed for creating machine learning forests • Big data collects information based on our digital exhaust (crumbs we leave in the digital world) , demographics, preferences, health etc. • Machine learning will mine this data and model behaviors with interactive responses based on this data
  • 4. Why do we need this? 4 1. Tons of applications impacting human health, utility and future simplification Health & Wellness Utilitarian Future • DNA sampling & diagnosis • Health reminders & prevention through AI tools • Correlation studies • Customizable tablets • Real time optimized path maps • Search Ranking • Spam filter on email • News aggregators • Shopping Recommendation • Facebook face recognition • Age recognition • Voice recognition – Siri, Alexa • Driverless cars • Home decoration
  • 5. Key Terms 5 • A set of data used to predict relationships. • E.g. A diamond’s size, cut and clarity helps predicts the price. Data and answers for each sample. Training Set • Uses training set to make a prediction. • E.g. Model predicts diamond prices based on past prices. Supervised Learning • Provide data without suggesting anything so computer can identify patterns or groupings. • E.g. Customer segmentation, DNA groupings. Unsupervised Learning • Each distinct measurable data value you select in the training data set. • E.g. A diamonds’ size is one of the feature’s for predicting price. Features/ Variables / Attributes • Using the features provided in the training set make a prediction. Fit a curve using the data provided. • E.g. Price of diamond = X*Cut + Y*Clarity + Z*Size + features… Supervised: Regression • A defined set of categories for placing new data (observations) • E.g. Presence of absence of cancer; Types of diabetes Supervised: Classification • Process of assigning observations into subsets • E.g. Customer segment creations Unsupervised: Clustering
  • 6. Learning Steps 6 Collect / Update User Data 1 Create / Update Training Set data 2 Create / Update algorithm for training data Update Algorithm Validate Algorithm 3 Create predictive model 4 New real-time observations A/B Test & Launch on production 5
  • 7. Data Wrangling and Feature Extraction 7 Spam Email Detection Title Sender Domain # of Recipients Email content Country of Origin Non- dictionary Words Hyperlinks Address Book Length of email • Structured Data (Best) – RDBMS, columnar data – Strict Schema – SQL • Semi-Structured Data (Better) – JSON, XML – Enforce minimum schema – JSON, XML Parser • Unstructured Data – Text, Image, Raw email – No Schema – Batch processing – Regular expressions – Map Reduce GARBAGE IN GARBAGE OUT
  • 8. Model Training 8 Feature Extraction (Feature vector) New Text documents User Activity Images Transaction history Feature Extraction (Feature vector) Labels Machine Learning Algorithm Training / Testing Text documents User Activity Images Transaction history Predictive Model Expected Label Model Evaluation
  • 9. Supervised learning techniques 9 • Linear classifier (numerical functions) • Parametric (Probabilistic functions) – Naïve Bayes, Hidden Markov models (HMM), Probabilistic graphical models • Non-parametric (Instance-based functions) – K-nearest neighbors • Non-metric (Symbolic functions) – Classification and regression tree (CART) • Aggregation – Bagging (bootstrap + aggregation), Adaboost, Random forest, Ensemble models
  • 10. Linear Classifiers 10 • Logistic regression – ) – w with minimum loss – Solve iteratively using gradient descent • Support vector machine (SVM) – Maximum margin classifier • Artificial Neural Networks – Inspired from how neurons work – Activation function (Sigmoid, ReLU etc.) – Deep Learning
  • 11. KNN / CART 11 • K-Nearest Neighbors – Find K nearest training examples – Majority vote – Easy to implement – Not scalable for real time predictions • Classification and Regression Trees – Easy to interpret for small trees • Random Forests – Ensemble of decision trees – Usually performs very good
  • 12. Unsupervised Learning 12 • Clustering – K-means clustering – Spectral clustering • Dimensionality reduction – Principal component analysis (PCA) – Factor analysis • Product Recommendations – Collaborative Filtering • Association Rules – Market Basket Analysis
  • 13. Model Evaluation 13 • Measure model performance • Optimize model to improve prediction quality – Feature selection – Hyperparameter tuning • A/B Testing • Explore/Exploit • http://en.wikipedia.org/wiki/Precision_and_recall
  • 14. Sample Architecture 14 -HADOOP - SPARK PREDICTION ENGINE REAL TIME DATA SQL / NO SQL Data Base CLIENT MACHINE LEARNING SYSTEM
  • 15. Health & Wellness Sen.se Mother (iOT) 15
  • 16. Amazon Echo & Personalization 16
  • 17. Houzz Visual Match Deep Learning 17