SlideShare a Scribd company logo
1 of 33
What is Data Science
• Data science, also known
as data-driven science, is
an interdisciplinary field
of scientific methods,
processes, algorithms and
systems to extract
knowledge or insights
from data in various forms,
either structured or
unstructured, similar
to datamining.
Need for DataScientists & Job opportunities
• Data volume is increasing in enterprises because of transactional data,
internet and mobile apps
• Decision making will have to be fast and accurate and should be
available at the point of need
• Without analytics its impossible to run large enterprises like Amazon,
Flipkart, Reliance Jio, Airtel, Citi Bank, Unilever, P&G, Google, IBM,
Microsoft, Alibaba, eBay, Tesco, Metro cash, Walmart…
• There is insufficient number of personnel skilled in analytics where as
demand is more
• Opportunities in Startups, IoT, Consumer Goods, eCommerce, KPOs,
BPOs, Telecom, B&F, Logistics, Utilities…
• Just browse through Naukri, Shine, Monster etc
What is Machine Learning?
• Machine learning teaches computers to do what comes naturally to
humans and animals: learn from experience.
• Machine learning algorithms use computational methods to “learn”
information directly from data without relying on a predetermined
equation as a model.
• The algorithms adaptively improve their performance as the number of
samples available for learning increases.
Real-World Applications
 With the rise in big data, machine learning has become particularly important
for solving problems in areas like these:
 Computational finance, for credit scoring and algorithmic trading
 Image processing and computer vision, for face recognition, motion
detection, and object detection
 Computational biology, for tumor detection, drug discovery, and DNA
sequencing
 Energy production, for price and load forecasting
 Automotive, aerospace, and manufacturing, for predictive maintenance
 Natural language processing
How Machine Learning Works
• Machine learning uses
two types of
techniques: supervised
learning, which trains a
model on known input
and output data so
that it can predict
future outputs, and
unsupervised learning,
which finds hidden
patterns or intrinsic
structures in input
data.
How Do You Decide Which Algorithm to Use?
• Algorithm
selection also
depends on
the size and
type of data
you’re
working with,
the insights
you want to
get from the
data, and how
those insights
will be used
When Should You Use Machine Learning?
• you have a complex task or problem involving a large
amount of data and lots of variables, but no existing
formula or equation
• Hand-written rules and equations are too complex—as in
face recognition and speech recognition
• The rules of a task are constantly changing—as in fraud
detection from transaction records.
• The nature of the data keeps changing, and the program
needs to adapt—as in automated trading,energy demand
forecasting, and predicting shopping trends.
Supervised Learning
• The aim of supervised machine learning is to build a model that makes
predictions based on evidence in the presence of uncertainty. A
supervised learning algorithm takes a known set of input data and known
responses to the data (output) and trains a model to generate reasonable
predictions for the response to new data.
• Supervised learning uses classification and regression techniques to
develop predictive models.
• Regression techniques predict continuous responses— for example,
changes in temperature or fluctuations in power demand. Typical
applications include electricity load forecasting and algorithmic trading.
• Classification techniques predict discrete responses—for example,
whether an email is genuine or spam, or whether a tumor is cancerous
or benign. Classification models
• classify input data into categories. Typical applications include medical
imaging, speech recognition, and credit scoring.
Unsupervised Learning
• Unsupervised learning finds hidden patterns or intrinsic structures in
data. It is used to draw inferences from datasets consisting of input
data without labeled responses.
• Clustering is the most common unsupervised learning technique. It is
used for exploratory data analysis to find hidden patterns or groupings
in data. Applications for clustering include gene sequence analysis,
market research, and object recognition.
Common Dimensionality Reduction Techniques
Why R?
• Free Software
• Versatile and crowd sourced for development
• Handle multiple platform
• End to End service in Data Science
• Functionality is divided into a number of packages
• Variety of analytical techniques 7000+ algorithms
• No restriction in length of column
• Integrates with other software
29
Data Types in R
• Vectors
• Matrices
• Arrays
• List
• DataFrame
30
Objects
• character
• numeric (real numbers)
• Integer
• Complex
• logical (True/False)
31
Data Operators
• Arithmetic+-*/%^
• Relational >=,<=,==,!=
• Logical ! and &
• Model Formula D ~ I
• Assignment = or <-
• List Index $
• Sequence :
32
Case Study
• Multiple Linear Regression Model
• Methods: All in, Step by Step, (Forward,Backward, Bi-directional),
Score comparison
• Independent Variables: R&D Spend, Administration, Marketing
Spend
• Dependent Variable: Profit
• Training Data 80% & Test Data 20%
33

More Related Content

Similar to What is Machine Learning.pptx

BIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNINGBIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNINGUmair Shafique
 
Choosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your needChoosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your needGibDevs
 
Operations Research and ICT A Keynote Address
Operations Research and ICT A Keynote AddressOperations Research and ICT A Keynote Address
Operations Research and ICT A Keynote AddressElvis Muyanja
 
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...Harbinger Systems - HRTech Builder of Choice
 
In-Depth Data Analytics
In-Depth Data AnalyticsIn-Depth Data Analytics
In-Depth Data AnalyticsYASH GAIKWAD
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer AnalyticsCourse5i
 
Machine learning is the new BI
Machine learning is the new BIMachine learning is the new BI
Machine learning is the new BICycloides
 
Big Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research ActivityBig Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research ActivityAndry Alamsyah
 
Machine Learning course in Chandigarh Join
Machine Learning course in Chandigarh JoinMachine Learning course in Chandigarh Join
Machine Learning course in Chandigarh Joinasmeerana605
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & ApplicationsFazle Rabbi Ador
 
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptxINTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptxMadhumitha N
 
unit 1.2 supervised learning.pptx
unit 1.2 supervised learning.pptxunit 1.2 supervised learning.pptx
unit 1.2 supervised learning.pptxDr.Shweta
 
Unit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxUnit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxChitrachitrap
 
Introduction to Machine Learning.pptx
Introduction to Machine Learning.pptxIntroduction to Machine Learning.pptx
Introduction to Machine Learning.pptxDr. Amanpreet Kaur
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecasesSreenatha Reddy K R
 

Similar to What is Machine Learning.pptx (20)

BIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNINGBIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNING
 
Choosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your needChoosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your need
 
DOWLD SLIDES.pptx
DOWLD SLIDES.pptxDOWLD SLIDES.pptx
DOWLD SLIDES.pptx
 
NCCT.pptx
NCCT.pptxNCCT.pptx
NCCT.pptx
 
Operations Research and ICT A Keynote Address
Operations Research and ICT A Keynote AddressOperations Research and ICT A Keynote Address
Operations Research and ICT A Keynote Address
 
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
 
In-Depth Data Analytics
In-Depth Data AnalyticsIn-Depth Data Analytics
In-Depth Data Analytics
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer Analytics
 
Machine learning is the new BI
Machine learning is the new BIMachine learning is the new BI
Machine learning is the new BI
 
Big Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research ActivityBig Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research Activity
 
Machine Learning course in Chandigarh Join
Machine Learning course in Chandigarh JoinMachine Learning course in Chandigarh Join
Machine Learning course in Chandigarh Join
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & Applications
 
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptxINTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
 
unit 1.2 supervised learning.pptx
unit 1.2 supervised learning.pptxunit 1.2 supervised learning.pptx
unit 1.2 supervised learning.pptx
 
Unit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxUnit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptx
 
Introduction to Machine Learning.pptx
Introduction to Machine Learning.pptxIntroduction to Machine Learning.pptx
Introduction to Machine Learning.pptx
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
 

Recently uploaded

Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...gajnagarg
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制vexqp
 
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制vexqp
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...nirzagarg
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样wsppdmt
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...Health
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制vexqp
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRajesh Mondal
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangeThinkInnovation
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制vexqp
 
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxThe-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxVivek487417
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareGraham Ware
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowgargpaaro
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...gajnagarg
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.pptibrahimabdi22
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schscnajjemba
 

Recently uploaded (20)

Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
 
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
 
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxThe-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit RiyadhCytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
 
PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schs
 

What is Machine Learning.pptx

  • 1. What is Data Science • Data science, also known as data-driven science, is an interdisciplinary field of scientific methods, processes, algorithms and systems to extract knowledge or insights from data in various forms, either structured or unstructured, similar to datamining.
  • 2. Need for DataScientists & Job opportunities • Data volume is increasing in enterprises because of transactional data, internet and mobile apps • Decision making will have to be fast and accurate and should be available at the point of need • Without analytics its impossible to run large enterprises like Amazon, Flipkart, Reliance Jio, Airtel, Citi Bank, Unilever, P&G, Google, IBM, Microsoft, Alibaba, eBay, Tesco, Metro cash, Walmart… • There is insufficient number of personnel skilled in analytics where as demand is more • Opportunities in Startups, IoT, Consumer Goods, eCommerce, KPOs, BPOs, Telecom, B&F, Logistics, Utilities… • Just browse through Naukri, Shine, Monster etc
  • 3.
  • 4.
  • 5.
  • 6.
  • 7. What is Machine Learning? • Machine learning teaches computers to do what comes naturally to humans and animals: learn from experience. • Machine learning algorithms use computational methods to “learn” information directly from data without relying on a predetermined equation as a model. • The algorithms adaptively improve their performance as the number of samples available for learning increases.
  • 8. Real-World Applications  With the rise in big data, machine learning has become particularly important for solving problems in areas like these:  Computational finance, for credit scoring and algorithmic trading  Image processing and computer vision, for face recognition, motion detection, and object detection  Computational biology, for tumor detection, drug discovery, and DNA sequencing  Energy production, for price and load forecasting  Automotive, aerospace, and manufacturing, for predictive maintenance  Natural language processing
  • 9. How Machine Learning Works • Machine learning uses two types of techniques: supervised learning, which trains a model on known input and output data so that it can predict future outputs, and unsupervised learning, which finds hidden patterns or intrinsic structures in input data.
  • 10. How Do You Decide Which Algorithm to Use? • Algorithm selection also depends on the size and type of data you’re working with, the insights you want to get from the data, and how those insights will be used
  • 11. When Should You Use Machine Learning? • you have a complex task or problem involving a large amount of data and lots of variables, but no existing formula or equation • Hand-written rules and equations are too complex—as in face recognition and speech recognition • The rules of a task are constantly changing—as in fraud detection from transaction records. • The nature of the data keeps changing, and the program needs to adapt—as in automated trading,energy demand forecasting, and predicting shopping trends.
  • 12. Supervised Learning • The aim of supervised machine learning is to build a model that makes predictions based on evidence in the presence of uncertainty. A supervised learning algorithm takes a known set of input data and known responses to the data (output) and trains a model to generate reasonable predictions for the response to new data. • Supervised learning uses classification and regression techniques to develop predictive models. • Regression techniques predict continuous responses— for example, changes in temperature or fluctuations in power demand. Typical applications include electricity load forecasting and algorithmic trading. • Classification techniques predict discrete responses—for example, whether an email is genuine or spam, or whether a tumor is cancerous or benign. Classification models • classify input data into categories. Typical applications include medical imaging, speech recognition, and credit scoring.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22. Unsupervised Learning • Unsupervised learning finds hidden patterns or intrinsic structures in data. It is used to draw inferences from datasets consisting of input data without labeled responses. • Clustering is the most common unsupervised learning technique. It is used for exploratory data analysis to find hidden patterns or groupings in data. Applications for clustering include gene sequence analysis, market research, and object recognition.
  • 23.
  • 24.
  • 25.
  • 26.
  • 28.
  • 29. Why R? • Free Software • Versatile and crowd sourced for development • Handle multiple platform • End to End service in Data Science • Functionality is divided into a number of packages • Variety of analytical techniques 7000+ algorithms • No restriction in length of column • Integrates with other software 29
  • 30. Data Types in R • Vectors • Matrices • Arrays • List • DataFrame 30
  • 31. Objects • character • numeric (real numbers) • Integer • Complex • logical (True/False) 31
  • 32. Data Operators • Arithmetic+-*/%^ • Relational >=,<=,==,!= • Logical ! and & • Model Formula D ~ I • Assignment = or <- • List Index $ • Sequence : 32
  • 33. Case Study • Multiple Linear Regression Model • Methods: All in, Step by Step, (Forward,Backward, Bi-directional), Score comparison • Independent Variables: R&D Spend, Administration, Marketing Spend • Dependent Variable: Profit • Training Data 80% & Test Data 20% 33