SlideShare ist ein Scribd-Unternehmen logo
1 von 30
ICT 3202 - INTRODUCTION
TO DATA SCIENCE
BY
ENGR. JOHNSON C. UBAH
B.ENG, M.ENG, HCNA, ASM
Machine Learning and Statistics
Machine learning is the practice of programming computers to learn from
data.
Machine learning is a subfield of artificial intelligence (AI). The goal of
machine learning generally is to understand the structure of data and fit
that data into models that can be understood and utilized by people.
In machine learning, data is referred to as called training sets or
examples.
Intro. To Machine Learning
Machine learning differs from traditional computational approaches because;
Traditional computing algorithms are sets of steps followed by computers to
solve problems.
Machine learning algorithms allows computers to train on data inputs and use
statistical analysis in order to generate output values that falls within specific
range.
Why Machine Learning?
Lets assume you’d like to write a filter program without using machine learning
methods. The steps would be;
You’d take a look at what spam e-mails looks like
You’d write an algorithm to detect the patterns that you’ve seen and the
software would then flag the e-mails as spam
Finally, you’d test the program, and redo the first two steps again until the results
are good enough.
Why Machine learning?
This program contains very long list of rules and hence
difficult to maintain. But if done with machine learning, you
will be able to maintain it properly.
Programs that uses ML techniques will automatically detect
changes by users, and update their definition automatically.
Why Machine Learning?
Machine Learning algorithm with automatic update when users change preference
When to use machine learning
When you have a problem that requires many rules to find the
solution.
Very complex problems for which there is no solution with
traditional approach.
Non-stable environments: machine learning software can adapt to
new data.
Classification of ML
There are types of machine learning systems. We can divide them into
categories, depending on whether;
1. They have been trained with humans or not
◦ Supervised
◦ Unsupervised
◦ Semi-supervised
◦ Reinforcement learning
2. If they can learn incrementally
3. If they work simply by comparing new data points to find data points or can
detect new patterns in the data, and then will build a model.
Supervised and unsupervised learning
We can classify machine learning systems according to the type
and amount of human supervision during the training. They are;
◦ Supervised learning
◦ Unsupervised learning
◦ Semi-supervised learning
◦ Reinforced learning.
Supervised learning
When an algorithm learns from example data and associated target
responses that can consist of numeric values or string labels, such as
classes or tags, in order to later predict the correct response when
posed with new examples comes under the category of Supervised
learning.
This approach is indeed similar to human learning under the
supervision of a teacher.
Tasks carried out by supervised learning
Supervised learning groups together a task of
classification. The program is a good example of this
because it’s been trained with many emails at the same
time as their class.
Another example is to predict a numeric value like the
price of a flat, given a set of features (location, number
of rooms, facilities) called predictors; this task is called
regression.
Supervised learning algorithms
You should keep in mind that some regression algorithms can be
used for classification as well, and vise versa.
Some important supervised algorithms
◦ K-nearest neighbors
◦ Linear regression
◦ Neural network
◦ Support vector machines
◦ Logistic regression
◦ Decision trees and random forest
Unsupervised learning
Unsupervised learning occurs when an algorithm learns from plain examples
without any associated response, leaving to the algorithm to determine the data
patterns on its own.
This type of algorithm tends to restructure the data into something else, such as
new features that may represent a class or a new series of un-correlated values.
They are quite useful in providing humans with insights into the meaning of data
and new useful inputs to supervised machine learning algorithms.
Unsupervised learning
As a kind of learning, it resembles the methods humans use to figure
out that certain objects or events are from the same class, such as by
observing the degree of similarity between objects. Some
recommendation systems that you find on the web in the form of
marketing automation are based on this type of learning.
In this type of learning the data is unlabeled.
Unsupervised learning algorithms
Some unsupervised learning algorithms includes;
◦Clustering: k-means, hierarchical cluster analysis
◦Association rule learning: Eclat, apriori
◦Visualization and dimensionality reduction: kernel PCA, t-
distribution, PCA
Examples of unsupervised learning
suppose you’ve got many data on visitor, you can use one
algorithm to detect groups with similar visitors. 65% of your
visitors might be males who love watching movie in the
evening, while 30% watch plays in the evening: Using
clustering algorithm, we have the smaller groups.
Secondly, for visualization algorithms, you will need to give
them many data and unlabeled data as input, and then you
will get 2D or 3D visualization as an output. Feature
extraction takes place here.
Reinforcement learning
An Agent “AI system” will observe the
environment, performs given actions, and
then receive rewards in return.
Here, the agent must learn by itself.
You can find this type of learning in many
robotics applications that learns how to
walk.
Semi-supervised learning
where an incomplete training signal is given: a training set
with some (often many) of the target outputs missing.
There is a special case of this principle known as
Transduction where the entire set of problem instances is
known at learning time, except that part of the targets are
missing.
Bad and Insufficient quantity of Training
Data
Machine learning systems are not like children,
who can distinguish apples and oranges in all
sorts of colors and shapes, but they require lot of
data to work effectively, whether you’re working
with very simple programs and problems, or
complex applications like image processing and
speech recognition.
Poor Quality Data
If you are working with training data that is full of errors and
outliers, this will make it very hard for the system to detect
patterns, so it won’t work properly.
So, if you want your program to work well, you must spend
more time cleaning up your training data.
Irrelevant features
The system will only be able to learn if the training data contains enough features
and data that aren’t too irrelevant. The most important part of any ML project is to
develop good features. “feature engineering”
Feature engineering follows this process:
◦ Feature selection: selecting the most useful features
◦ Feature extraction: combining existing features to provide more useful features.
◦ Creation of new features: creation of new features, based on data.
Testing
To ensure your model is working well and that models can generalize
with new cases, you can try out new cases with it by putting the
model in the environment and then monitoring how it will perform.
This is good practice.
You should divide your data into two set, one for training and the
second for testing.
Testing
The generalization error is the rate of error by evaluation of your model on the
test set. The value you get will tell you if your model is good enough, and if it will
work properly.
If the error rate is low, the model is good and will perform properly and vice
versa.
It is advisable to use 80% of your data for training and 20% for testing
Overfitting the data
Overgeneralization in machine learning is called “overfitting”.
Overfitting occurs when the model is very complex for the amount of
training data given.
Solution
Gather more data for “training data”
Reduce the noise level
Select one with fewer parameters
Under-fitting the data
This the opposite of overfitting. You will encounter this when the model is very
simple to learn.
For example, using the example of quality of life, real life is more complex than
your model, so the predictions won’t yield the same, even in the training
examples.
Solution:
◦ Select the most powerful model, which has many parameters
◦ Feed the best features into your algorithms. Here, I’m referring to feature
engineering
◦ Reduce the constraints on your model
Underfitting
Software for this course
Python’s popularity may be due to the increased development of deep learning
frameworks available for this language recently, including TensorFlow, PyTorch,
and Keras. As a language that has readable syntax and the ability to be used as a
scripting language, Python proves to be powerful and straightforward both for
preprocessing data and working with data directly. The scikit-learn machine
learning library is built on top of several existing Python packages that Python
developers may already be familiar with, namely NumPy, SciPy, and Matplotlib.
Software for this course
MATLAB makes machine learning easy. With tools and functions for
handling big data, as well as apps to make machine learning accessible,
MATLAB is an ideal environment for applying machine learning to your
data analytics.
With MATLAB, engineers and data scientists have immediate access to
prebuilt functions, extensive toolboxes, and specialized apps
for classification, regression, and clustering.

Weitere ähnliche Inhalte

Was ist angesagt?

The fundamentals of Machine Learning
The fundamentals of Machine LearningThe fundamentals of Machine Learning
The fundamentals of Machine LearningHichem Felouat
 
Machine Learning Algorithms
Machine Learning AlgorithmsMachine Learning Algorithms
Machine Learning AlgorithmsDezyreAcademy
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningKmPooja4
 
Machine Learning Final presentation
Machine Learning Final presentation Machine Learning Final presentation
Machine Learning Final presentation AyanaRukasar
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningEng Teong Cheah
 
Machine Learning
Machine LearningMachine Learning
Machine LearningVivek Garg
 
Machine learning seminar ppt
Machine learning seminar pptMachine learning seminar ppt
Machine learning seminar pptRAHUL DANGWAL
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningSujith Jayaprakash
 
2.17Mb ppt
2.17Mb ppt2.17Mb ppt
2.17Mb pptbutest
 
Machine Learning
Machine LearningMachine Learning
Machine LearningRahul Kumar
 
Machine Learning
Machine LearningMachine Learning
Machine LearningKumar P
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Marina Santini
 
Forms of learning in ai
Forms of learning in aiForms of learning in ai
Forms of learning in aiRobert Antony
 

Was ist angesagt? (20)

The fundamentals of Machine Learning
The fundamentals of Machine LearningThe fundamentals of Machine Learning
The fundamentals of Machine Learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine Learning Algorithms
Machine Learning AlgorithmsMachine Learning Algorithms
Machine Learning Algorithms
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Machine Learning Final presentation
Machine Learning Final presentation Machine Learning Final presentation
Machine Learning Final presentation
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Machine learning seminar ppt
Machine learning seminar pptMachine learning seminar ppt
Machine learning seminar ppt
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
2.17Mb ppt
2.17Mb ppt2.17Mb ppt
2.17Mb ppt
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?
 
Machine Learning ppt
Machine Learning pptMachine Learning ppt
Machine Learning ppt
 
Forms of learning in ai
Forms of learning in aiForms of learning in ai
Forms of learning in ai
 

Ähnlich wie introduction to machine learning

Machine Learning Contents.pptx
Machine Learning Contents.pptxMachine Learning Contents.pptx
Machine Learning Contents.pptxNaveenkushwaha18
 
Machine Learning Landscape
Machine Learning LandscapeMachine Learning Landscape
Machine Learning LandscapeEng Teong Cheah
 
Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationAnkit Gupta
 
An Introduction to Machine Learning
An Introduction to Machine LearningAn Introduction to Machine Learning
An Introduction to Machine LearningVedaj Padman
 
machine learning.docx
machine learning.docxmachine learning.docx
machine learning.docxJadhavArjun2
 
what-is-machine-learning-and-its-importance-in-todays-world.pdf
what-is-machine-learning-and-its-importance-in-todays-world.pdfwhat-is-machine-learning-and-its-importance-in-todays-world.pdf
what-is-machine-learning-and-its-importance-in-todays-world.pdfTemok IT Services
 
Supervised learning techniques and applications
Supervised learning techniques and applicationsSupervised learning techniques and applications
Supervised learning techniques and applicationsBenjaminlapid1
 
Intro/Overview on Machine Learning Presentation -2
Intro/Overview on Machine Learning Presentation -2Intro/Overview on Machine Learning Presentation -2
Intro/Overview on Machine Learning Presentation -2Ankit Gupta
 
INTERNSHIP ON MAcHINE LEARNING.pptx
INTERNSHIP ON MAcHINE LEARNING.pptxINTERNSHIP ON MAcHINE LEARNING.pptx
INTERNSHIP ON MAcHINE LEARNING.pptxsrikanthkallem1
 
Machine Learning with Python- Methods for Machine Learning.pptx
Machine Learning with Python- Methods for Machine Learning.pptxMachine Learning with Python- Methods for Machine Learning.pptx
Machine Learning with Python- Methods for Machine Learning.pptxiaeronlineexm
 
machine learning
machine learningmachine learning
machine learningMounisha A
 
Machine Learning Ch 1.ppt
Machine Learning Ch 1.pptMachine Learning Ch 1.ppt
Machine Learning Ch 1.pptARVIND SARDAR
 
Machine Learning Tutorial for Beginners
Machine Learning Tutorial for BeginnersMachine Learning Tutorial for Beginners
Machine Learning Tutorial for Beginnersgrinu
 
detailed Presentation on supervised learning
 detailed Presentation on supervised learning detailed Presentation on supervised learning
detailed Presentation on supervised learningZAMANCHBWN
 
Introduction To Machine Learning
Introduction To Machine LearningIntroduction To Machine Learning
Introduction To Machine LearningKnoldus Inc.
 
What is Machine Learning4-converted.pptx
What is Machine Learning4-converted.pptxWhat is Machine Learning4-converted.pptx
What is Machine Learning4-converted.pptxvinod756504
 

Ähnlich wie introduction to machine learning (20)

Machine Learning Contents.pptx
Machine Learning Contents.pptxMachine Learning Contents.pptx
Machine Learning Contents.pptx
 
Machine Learning Landscape
Machine Learning LandscapeMachine Learning Landscape
Machine Learning Landscape
 
Machine Learning by Rj
Machine Learning by RjMachine Learning by Rj
Machine Learning by Rj
 
Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning Presentation
 
An Introduction to Machine Learning
An Introduction to Machine LearningAn Introduction to Machine Learning
An Introduction to Machine Learning
 
machine learning.docx
machine learning.docxmachine learning.docx
machine learning.docx
 
what-is-machine-learning-and-its-importance-in-todays-world.pdf
what-is-machine-learning-and-its-importance-in-todays-world.pdfwhat-is-machine-learning-and-its-importance-in-todays-world.pdf
what-is-machine-learning-and-its-importance-in-todays-world.pdf
 
Supervised learning techniques and applications
Supervised learning techniques and applicationsSupervised learning techniques and applications
Supervised learning techniques and applications
 
Machine Learning_Unit 2_Full.ppt.pdf
Machine Learning_Unit 2_Full.ppt.pdfMachine Learning_Unit 2_Full.ppt.pdf
Machine Learning_Unit 2_Full.ppt.pdf
 
Intro/Overview on Machine Learning Presentation -2
Intro/Overview on Machine Learning Presentation -2Intro/Overview on Machine Learning Presentation -2
Intro/Overview on Machine Learning Presentation -2
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
INTERNSHIP ON MAcHINE LEARNING.pptx
INTERNSHIP ON MAcHINE LEARNING.pptxINTERNSHIP ON MAcHINE LEARNING.pptx
INTERNSHIP ON MAcHINE LEARNING.pptx
 
Machine Learning with Python- Methods for Machine Learning.pptx
Machine Learning with Python- Methods for Machine Learning.pptxMachine Learning with Python- Methods for Machine Learning.pptx
Machine Learning with Python- Methods for Machine Learning.pptx
 
machine learning
machine learningmachine learning
machine learning
 
Machine Learning Ch 1.ppt
Machine Learning Ch 1.pptMachine Learning Ch 1.ppt
Machine Learning Ch 1.ppt
 
Machine Learning Tutorial for Beginners
Machine Learning Tutorial for BeginnersMachine Learning Tutorial for Beginners
Machine Learning Tutorial for Beginners
 
detailed Presentation on supervised learning
 detailed Presentation on supervised learning detailed Presentation on supervised learning
detailed Presentation on supervised learning
 
Introduction To Machine Learning
Introduction To Machine LearningIntroduction To Machine Learning
Introduction To Machine Learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
What is Machine Learning4-converted.pptx
What is Machine Learning4-converted.pptxWhat is Machine Learning4-converted.pptx
What is Machine Learning4-converted.pptx
 

Mehr von Johnson Ubah

Supervised learning
Supervised learningSupervised learning
Supervised learningJohnson Ubah
 
Statistical inference with Python
Statistical inference with PythonStatistical inference with Python
Statistical inference with PythonJohnson Ubah
 
Lecture 3 intro2data
Lecture 3 intro2dataLecture 3 intro2data
Lecture 3 intro2dataJohnson Ubah
 
OSI reference Model
OSI reference ModelOSI reference Model
OSI reference ModelJohnson Ubah
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data scienceJohnson Ubah
 
Network and computer forensics
Network and computer forensicsNetwork and computer forensics
Network and computer forensicsJohnson Ubah
 

Mehr von Johnson Ubah (7)

Supervised learning
Supervised learningSupervised learning
Supervised learning
 
Statistical inference with Python
Statistical inference with PythonStatistical inference with Python
Statistical inference with Python
 
Lecture 3 intro2data
Lecture 3 intro2dataLecture 3 intro2data
Lecture 3 intro2data
 
IP Addressing
IP AddressingIP Addressing
IP Addressing
 
OSI reference Model
OSI reference ModelOSI reference Model
OSI reference Model
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Network and computer forensics
Network and computer forensicsNetwork and computer forensics
Network and computer forensics
 

Kürzlich hochgeladen

Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themeitharjee
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...SOFTTECHHUB
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制vexqp
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...nirzagarg
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...Health
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...HyderabadDolls
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubaikojalkojal131
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...HyderabadDolls
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...gajnagarg
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowgargpaaro
 
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...kumargunjan9515
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfSayantanBiswas37
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangeThinkInnovation
 
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...gragchanchal546
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdfkhraisr
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 

Kürzlich hochgeladen (20)

Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about them
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 

introduction to machine learning

  • 1. ICT 3202 - INTRODUCTION TO DATA SCIENCE BY ENGR. JOHNSON C. UBAH B.ENG, M.ENG, HCNA, ASM
  • 2. Machine Learning and Statistics
  • 3. Machine learning is the practice of programming computers to learn from data. Machine learning is a subfield of artificial intelligence (AI). The goal of machine learning generally is to understand the structure of data and fit that data into models that can be understood and utilized by people. In machine learning, data is referred to as called training sets or examples.
  • 4. Intro. To Machine Learning Machine learning differs from traditional computational approaches because; Traditional computing algorithms are sets of steps followed by computers to solve problems. Machine learning algorithms allows computers to train on data inputs and use statistical analysis in order to generate output values that falls within specific range.
  • 5. Why Machine Learning? Lets assume you’d like to write a filter program without using machine learning methods. The steps would be; You’d take a look at what spam e-mails looks like You’d write an algorithm to detect the patterns that you’ve seen and the software would then flag the e-mails as spam Finally, you’d test the program, and redo the first two steps again until the results are good enough.
  • 6. Why Machine learning? This program contains very long list of rules and hence difficult to maintain. But if done with machine learning, you will be able to maintain it properly. Programs that uses ML techniques will automatically detect changes by users, and update their definition automatically.
  • 7. Why Machine Learning? Machine Learning algorithm with automatic update when users change preference
  • 8. When to use machine learning When you have a problem that requires many rules to find the solution. Very complex problems for which there is no solution with traditional approach. Non-stable environments: machine learning software can adapt to new data.
  • 9. Classification of ML There are types of machine learning systems. We can divide them into categories, depending on whether; 1. They have been trained with humans or not ◦ Supervised ◦ Unsupervised ◦ Semi-supervised ◦ Reinforcement learning 2. If they can learn incrementally 3. If they work simply by comparing new data points to find data points or can detect new patterns in the data, and then will build a model.
  • 10.
  • 11. Supervised and unsupervised learning We can classify machine learning systems according to the type and amount of human supervision during the training. They are; ◦ Supervised learning ◦ Unsupervised learning ◦ Semi-supervised learning ◦ Reinforced learning.
  • 12. Supervised learning When an algorithm learns from example data and associated target responses that can consist of numeric values or string labels, such as classes or tags, in order to later predict the correct response when posed with new examples comes under the category of Supervised learning. This approach is indeed similar to human learning under the supervision of a teacher.
  • 13. Tasks carried out by supervised learning Supervised learning groups together a task of classification. The program is a good example of this because it’s been trained with many emails at the same time as their class. Another example is to predict a numeric value like the price of a flat, given a set of features (location, number of rooms, facilities) called predictors; this task is called regression.
  • 14. Supervised learning algorithms You should keep in mind that some regression algorithms can be used for classification as well, and vise versa. Some important supervised algorithms ◦ K-nearest neighbors ◦ Linear regression ◦ Neural network ◦ Support vector machines ◦ Logistic regression ◦ Decision trees and random forest
  • 15. Unsupervised learning Unsupervised learning occurs when an algorithm learns from plain examples without any associated response, leaving to the algorithm to determine the data patterns on its own. This type of algorithm tends to restructure the data into something else, such as new features that may represent a class or a new series of un-correlated values. They are quite useful in providing humans with insights into the meaning of data and new useful inputs to supervised machine learning algorithms.
  • 16. Unsupervised learning As a kind of learning, it resembles the methods humans use to figure out that certain objects or events are from the same class, such as by observing the degree of similarity between objects. Some recommendation systems that you find on the web in the form of marketing automation are based on this type of learning. In this type of learning the data is unlabeled.
  • 17. Unsupervised learning algorithms Some unsupervised learning algorithms includes; ◦Clustering: k-means, hierarchical cluster analysis ◦Association rule learning: Eclat, apriori ◦Visualization and dimensionality reduction: kernel PCA, t- distribution, PCA
  • 18. Examples of unsupervised learning suppose you’ve got many data on visitor, you can use one algorithm to detect groups with similar visitors. 65% of your visitors might be males who love watching movie in the evening, while 30% watch plays in the evening: Using clustering algorithm, we have the smaller groups. Secondly, for visualization algorithms, you will need to give them many data and unlabeled data as input, and then you will get 2D or 3D visualization as an output. Feature extraction takes place here.
  • 19. Reinforcement learning An Agent “AI system” will observe the environment, performs given actions, and then receive rewards in return. Here, the agent must learn by itself. You can find this type of learning in many robotics applications that learns how to walk.
  • 20. Semi-supervised learning where an incomplete training signal is given: a training set with some (often many) of the target outputs missing. There is a special case of this principle known as Transduction where the entire set of problem instances is known at learning time, except that part of the targets are missing.
  • 21. Bad and Insufficient quantity of Training Data Machine learning systems are not like children, who can distinguish apples and oranges in all sorts of colors and shapes, but they require lot of data to work effectively, whether you’re working with very simple programs and problems, or complex applications like image processing and speech recognition.
  • 22. Poor Quality Data If you are working with training data that is full of errors and outliers, this will make it very hard for the system to detect patterns, so it won’t work properly. So, if you want your program to work well, you must spend more time cleaning up your training data.
  • 23. Irrelevant features The system will only be able to learn if the training data contains enough features and data that aren’t too irrelevant. The most important part of any ML project is to develop good features. “feature engineering” Feature engineering follows this process: ◦ Feature selection: selecting the most useful features ◦ Feature extraction: combining existing features to provide more useful features. ◦ Creation of new features: creation of new features, based on data.
  • 24. Testing To ensure your model is working well and that models can generalize with new cases, you can try out new cases with it by putting the model in the environment and then monitoring how it will perform. This is good practice. You should divide your data into two set, one for training and the second for testing.
  • 25. Testing The generalization error is the rate of error by evaluation of your model on the test set. The value you get will tell you if your model is good enough, and if it will work properly. If the error rate is low, the model is good and will perform properly and vice versa. It is advisable to use 80% of your data for training and 20% for testing
  • 26. Overfitting the data Overgeneralization in machine learning is called “overfitting”. Overfitting occurs when the model is very complex for the amount of training data given. Solution Gather more data for “training data” Reduce the noise level Select one with fewer parameters
  • 27. Under-fitting the data This the opposite of overfitting. You will encounter this when the model is very simple to learn. For example, using the example of quality of life, real life is more complex than your model, so the predictions won’t yield the same, even in the training examples. Solution: ◦ Select the most powerful model, which has many parameters ◦ Feed the best features into your algorithms. Here, I’m referring to feature engineering ◦ Reduce the constraints on your model
  • 29. Software for this course Python’s popularity may be due to the increased development of deep learning frameworks available for this language recently, including TensorFlow, PyTorch, and Keras. As a language that has readable syntax and the ability to be used as a scripting language, Python proves to be powerful and straightforward both for preprocessing data and working with data directly. The scikit-learn machine learning library is built on top of several existing Python packages that Python developers may already be familiar with, namely NumPy, SciPy, and Matplotlib.
  • 30. Software for this course MATLAB makes machine learning easy. With tools and functions for handling big data, as well as apps to make machine learning accessible, MATLAB is an ideal environment for applying machine learning to your data analytics. With MATLAB, engineers and data scientists have immediate access to prebuilt functions, extensive toolboxes, and specialized apps for classification, regression, and clustering.