Machine Learning in Python using Scikit-Learn
Submitted by Amol Agrawal as a proposal for
a workshop at PyCon India 2016
Machine Learning
“Statistics on Steroids”
Exploring and recognizing patterns using data
analysis through computer science and statistics.
Lots of computer science! Even more statistics!
Scikit-Learn
“The Black Box”
A Python module for all your Machine Learning
needs.
Easy to understand, easier to implement.
0. Beginning -
What? Where? Why? How?
➔ What is Machine Learning?
➔ What are the aims of this
workshop?
➔ What this workshop is not!
➔ Setup and basic config.
1. Scikit-Learn -
The module, our savior.
➔ About the module.
➔ But why Scikit-Learn?
➔ Dependencies.
➔ Some basic terminologies.
➔ Let’s begin!
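For the setup and dependencies items above, a quick environment check can be sketched as follows. It assumes scikit-learn is already installed together with NumPy and SciPy, which it depends on; the `versions` dictionary is only an illustrative name, not part of the workshop material.

```python
# Minimal environment check: confirm the core packages import and report versions.
import numpy
import scipy
import sklearn

versions = {
    "numpy": numpy.__version__,
    "scipy": scipy.__version__,
    "scikit-learn": sklearn.__version__,
}

for name, version in versions.items():
    print(f"{name}: {version}")
```

If any import fails, the package is missing from the active Python environment.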
2. Machine Learning -
Or How I Learned To Stop
Worrying and Love the DATA
➔ Introduction
➔ Core Terminologies
➔ Definitions.
➔ Usage.
Machine Learning can be
broadly categorized into
TWO
CATEGORIES.
Let’s look at them.
3. Supervised Learning -
Classification and Regression
Definition
Here we have labelled data, i.e., for each X
we know what the value of Y is.
So for new unlabelled data, we can use
a model trained on the labelled data
to make predictions.
3.1 Classification -
Nearest Neighbors and
Support Vector Machines
Definition
The labels take discrete values.
E.g.:
Is a mail spam or not?
Is a report a bug or not?
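The two classifiers named on this slide can be tried out with a few lines of Scikit-Learn. The sketch below is only an illustration, not the workshop's own notebook code: it assumes the bundled iris dataset as a stand-in for a spam or bug dataset, and the hyperparameters (`n_neighbors=5`, an RBF kernel) are arbitrary starting values.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC

# Labelled data: X holds the features, y holds discrete class labels.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Both estimators share the same fit / score interface.
models = {
    "knn": KNeighborsClassifier(n_neighbors=5),
    "svm": SVC(kernel="rbf"),
}

scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)                  # learn from labelled data
    scores[name] = model.score(X_test, y_test)   # accuracy on held-out data
    print(f"{name}: accuracy = {scores[name]:.3f}")
```

Because every estimator exposes `fit`, `predict` and `score`, swapping one classifier for another is a one-line change.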
3.2 Regression -
Linear Regression and
Random Forests
Definition
The labels lie in a
continuous range.
E.g.:
Stock Market Prediction
Housing Price Prediction
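A regression counterpart can be sketched in the same way. This example assumes synthetic data from `make_regression` in place of real stock or housing prices, so the scores say nothing about those domains; the sample size, noise level and number of trees are illustrative choices.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Synthetic data with a continuous target (a stand-in for prices).
X, y = make_regression(n_samples=300, n_features=5, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

models = {
    "linear": LinearRegression(),
    "forest": RandomForestRegressor(n_estimators=100, random_state=0),
}

r2 = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    r2[name] = model.score(X_test, y_test)  # R^2 on held-out data
    print(f"{name}: R^2 = {r2[name]:.3f}")
```

For regressors, `score` returns the coefficient of determination (R^2) rather than accuracy.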
4. Unsupervised Learning -
Dimensionality Reduction
and Clustering
Definition
We have NO LABELS!
The algorithm must find patterns
and derive groupings from the
data on its own.
Time to freak out? NO!
4.1 Dimensionality Reduction -
Principal Component Analysis
Definition
When the dataset has a
large number of features,
some of which may be redundant,
reduce the dimensions to save
computational power.
E.g.: Working on images
or text
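As a rough sketch of the idea, the snippet below applies PCA to the bundled handwritten-digits images, where each 8x8 image is a row of 64 pixel features. Keeping 95% of the variance is an assumed threshold chosen for illustration, not a recommendation from the slides.

```python
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

# Each row is a flattened 8x8 image, i.e. 64 features per sample.
X, _ = load_digits(return_X_y=True)

# A float n_components asks PCA to keep as many components as are
# needed to explain at least that fraction of the variance.
pca = PCA(n_components=0.95)
X_reduced = pca.fit_transform(X)

print("original shape:", X.shape)
print("reduced shape: ", X_reduced.shape)
print("variance kept:  %.3f" % pca.explained_variance_ratio_.sum())
```

The reduced matrix can then be passed to any downstream estimator in place of the original features.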
4.2 Clustering -
KMeans
Definition
Divide the dataset into
clusters based on the
feature set.
E.g.:
Find prominent colors in
an image.
Cluster documents.
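The "prominent colors" example can be sketched with KMeans on pixel values. To stay self-contained, the code below assumes a synthetic "image" made of noisy pixels around three hypothetical RGB colors instead of loading a real picture; with a real image, the pixel array would be reshaped to `(n_pixels, 3)` first.

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)

# Hypothetical dominant colors (RGB) used to generate the fake image.
true_colors = np.array(
    [[220, 30, 30], [30, 200, 60], [40, 60, 210]], dtype=float
)

# 300 noisy pixels per color, stacked into an (n_pixels, 3) array.
pixels = np.vstack(
    [color + rng.normal(0, 8, size=(300, 3)) for color in true_colors]
)
pixels = np.clip(pixels, 0, 255)

# Cluster the pixels; each cluster center is one prominent color.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(pixels)
prominent = kmeans.cluster_centers_.round().astype(int)

print("prominent colors (RGB):")
print(prominent)
```

The number of clusters has to be chosen up front; here it is known because the data was generated from three colors.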
5. Model Selection and
Validation -
What model to use
and Hyperparameter Tuning
Definition
Which model and which
algorithm to use when?
How to tune the
parameters?
How to get the best
results?
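One common way to approach these questions in Scikit-Learn is cross-validation combined with a grid search over hyperparameters. The sketch below assumes the iris dataset and a small, arbitrary grid for an SVM; the grid values are illustrative only.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, cross_val_score
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# Baseline: 5-fold cross-validated accuracy with default parameters.
baseline = cross_val_score(SVC(), X, y, cv=5).mean()

# Candidate hyperparameters; every combination is cross-validated.
param_grid = {
    "C": [0.1, 1, 10],
    "gamma": ["scale", 0.01, 0.1],
}
search = GridSearchCV(SVC(), param_grid, cv=5)
search.fit(X, y)

print(f"baseline accuracy: {baseline:.3f}")
print(f"best accuracy:     {search.best_score_:.3f}")
print("best parameters:  ", search.best_params_)
```

After fitting, `search.best_estimator_` holds a model refit on all the data with the best parameters found.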
6. Real World Examples -
Showcase of working
examples of how
ML is being used.
Definition
Some real-world
examples of ML,
implemented and
showcased.
7. Where to go from here? -
How to incorporate ML in
your work
Words of Caution
Definition
How to integrate ML into
your work.
Do’s and Don’ts
8. Workflow
How the workshop will be conducted:
➔ IPython Notebooks
Code for each of the sections above
will be made available as IPython
Notebooks, which make it easy to
follow along, tweak the code and
run it quickly.
The Scikit-Learn library will be used
throughout to show how its built-in
algorithms can be used.
➔ Exercises
The notebooks will contain
incomplete code blocks for the
participants to attempt and solve.
9. Requirements
➔ Basic knowledge of Python
10. Takeaway
➔ Machine Learning Concepts
➔ How to use Scikit-Learn
➔ How to create ML models
quickly using Scikit-Learn.
Thank You.
