SlideShare a Scribd company logo
1 of 55
What’s in it for you?
What is Data Science?
Basics of Python for Data Analysis
Why learn Python?
How to Install Python?
Python Libraries for Data Analysis
Exploratory analysis using Pandas
Introduction to series and data frame
Loan Prediction Problem
Data Wrangling using Pandas
Building a Predictive Model using Scikit-Learn
Logistic Regression
What is Data Science?
Example
Restaurants can predict how many
customers will visit on a weekend
and plan their food inventory to
handle the demand
Service Planning
System can be trained based on
customer behavior pattern to
predict the likelihood of a
customer buying a product
Customer Prediction
Data Science is about finding and exploring data in real world, and then using that knowledge to solve
business problems
Why Python?
Let’s first understand, why we
want to use Python?
Why Python?
The usage statistics based on google trends depict that Python is currently more popular than R or SAS
for Data Science!
Why Python?
SPEED PACKAGES DESIGN GOAL
But, there are various factors you should consider before deciding which language is best for
your Data Analysis:
Why Python?
SPEED PACKAGES DESIGN GOAL
But, there are various factors you should consider before deciding which language is best for
your Data Analysis:
Why Python?
SPEED PACKAGES DESIGN GOAL
But, there are various factors you should consider before deciding which language is best for
your Data Analysis:
Why Python?
For instructor
Design Goal:
Syntax rules in python helps in building application with concise and readable code base
Packages:
There are numerous packages in Python to choose from like pandas to aggregate & manipulate data, Seaborn or
matplotlib to visualize relational data to mention a few
Speed:
Studies suggest that Python is faster than several widely used languages. Also, we can further speed up python
using algorithms and tools
Installing Python
Now, let’s install Python to
begin the fun
Installing Python
• Go to: http://continuum io/downloads
• Scroll down to download the graphical installer
suitable for your operating system
After successful installation, you can launch Jupyter notebook from Anaconda Navigator
Anaconda comes with pre-installed libraries
In this tutorial, we will be working on Jupyter notebook using Python 3
Python libraries for Data Analysis
Let’s get to know some
important Python libraries for
Data Analysis
Python libraries for Data Analysis
There are many interesting libraries that have made Python popular with Data Scientists:
Python libraries for Data Analysis
Most useful library for variety of high level science and engineering modules like discrete Fourier
transform, Linear Algebra, Optimization and Sparse matrices
Pandas for structured data operations and manipulations It is extensively
used for data munging and preparation
The most powerful feature of NumPy is n-dimensional array This library also contains basic linear algebra
functions, Fourier transforms, advanced random number capabilities
Matplotlib for plotting vast variety of graphs, starting from histograms to line plots to heat plots
contains a lot of efficient tools for machine learning and statistical modeling including classification,
regression, clustering and dimensional reduction
For instructor
Python libraries for Data Analysis
Additional libraries, you might need:
Networkx & I graph
Tensorflow
BeautifulSoup
OS
Python libraries for Data Analysis
os for Operating system and file operations
networkx and igraph for graph based data manipulations
TensorFlow
BeautifulSoup for scrapping web
For instructor
What is SciPy?
SciPy is a set of scientific and numerical tools for Python
• It currently supports special functions, integration, ordinary
differential equation (ODE) solvers, gradient optimization, and
others
• It has fully-featured versions of the linear algebra modules
• It is built on top of NumPy
What is NumPy?
NumPy is the fundamental package for scientific computing with
Python. It contains:
• Powerful N-dimensional array object
• Tools for integrating C/C++ and Fortran code
• It has useful linear algebra, Fourier transform, and random number
capabilities
What is Pandas?
• The most useful Data Analysis library in Python
• Instrumental in increasing the use of Python in Data Science
community
• It is extensively used for data munging and preparation
Pandas is used for structured data operations & manipulations
Exploratory analysis using Pandas
Let’s understand the two most common terms used in Pandas:
Series Dataframe
Exploratory analysis using Pandas
A Series is a one-dimensional object that can
hold any data type such as integers, floats
and strings
Series
A DataFrame is a two dimensional object
that can have columns with potential
different data types
DataFrame
Pandas
Exploratory analysis using Pandas
Default column
names
Default index
Default index
Series DataFrame
Exploratory analysis using Pandas
Default column
names
Default index
Default index
Series DataFrame
Exploratory analysis using Pandas
Problem Statement: Based on customer data, predict whether a particular customer’s loan
will be approved or not
LOAN
Exploratory analysis using Pandas
Now, let’s explore our data using Pandas!
Exploratory analysis using Pandas
Import the necessary libraries and read the dataset using read_csv() function:
Exploratory analysis using Pandas
You can call describe() function to describe all the columns:
Exploratory analysis using Pandas
Let’s see numercial values’ distribution
1 Loan Amount
Exploratory analysis using Pandas
2 Applicant Income
Exploratory analysis using Pandas
Categorical values’ distribution using matplotlib library:
Credit History
Exploratory analysis using Pandas
Hence, ‘loanAmount’ and ‘ApplicantIncome’ needs
Data Wrangling as some extreme values are observed!
Data Wrangling using Pandas
Before proceeding further,
let’s understand what is
Data Wrangling and why we
need it?
Data Wrangling: Process of cleaning and unifying messy
and complex data sets
It reveals more information about your data
Enables decision-making skills in the organization
Helps to gather meaningful and precise data for the business
Data Wrangling using Pandas
Data Wrangling using Pandas
You can see if your data has missing values:
Data Wrangling using Pandas
And then you can replace the missing values:
Data Wrangling using Pandas
You can access the data types of each column in a DataFrame:
Data Wrangling using Pandas
You can perform basic math operations to know more about your data:
Data Wrangling using Pandas
You can combine your DataFrames:
Combining DataFrame objects can be done using simple concatenation (provided they have the same columns):
Creates an array of
specified shape and fills it
with random values using
numpy
Data Wrangling using Pandas
Data Wrangling using Pandas
Also, if your DataFrame do not have an identical structure:
Data Wrangling using Pandas
You can create a merged dataframe using the merge() function based on the key:
Model Building using Scikit-learn
Now, that we have done data
wrangling, let’s build a
predictive model
Model Building using Scikit-learn
We will use Scikit-learn
module as it provides a range
of supervised and
unsupervised learning
algorithms
Model Building using Scikit-learn
Importing the required scikit-learn module:
Model Building using Scikit-learn
Extracting the variables and then splitting the data into train and test:
Model Building using Scikit-learn
In this case, we will use Logistic
Regression model
Logistic Regression is appropriate
when the dependent variable is
binary
Model Building using Scikit-learn
Fitting the data into Logistic Regression model:
Model Building using Scikit-learn
Predicting the test results:
Model Building using Scikit-learn
To describe the performance of the model let’s build the confusion matrix on test data:
Model Building using Scikit-learn
Let’s calculate ACCURACY and PRECISION from confusion matrix:
False Positive
True Positive
False Negative
True Negative
Model Building using Scikit-learn
Let’s calculate ACCURACY and PRECISION from confusion matrix:
• Accuracy
Overall, how often is the classifier correct?
(TP+TN)/total = (103+18)/150 = 0.80
• Precision
When it predicts yes, how often is it correct?
TP/predicted yes = 103/130 = 0.79
Model Building using Scikit-learn
We can also find the accuracy through Python module:
Model Building using Scikit-learn
So , we have built a model with 80% accuracy
Summary
Data Science & its popularity with python Data Analysis Libraries in python Series and dataframe in pandas
Logistic Regression using scikitData wranglingExploratory analysis
Data Science With Python | Python For Data Science | Python Data Science Course | Simplilearn

More Related Content

What's hot

PPT on Data Science Using Python
PPT on Data Science Using PythonPPT on Data Science Using Python
PPT on Data Science Using PythonNishantKumar1179
 
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...Edureka!
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Edureka!
 
Introduction of data science
Introduction of data scienceIntroduction of data science
Introduction of data scienceTanujaSomvanshi1
 
Data Science Introduction
Data Science IntroductionData Science Introduction
Data Science IntroductionGang Tao
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceEdureka!
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data ScienceSpotle.ai
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceNiko Vuokko
 
Data visualization in Python
Data visualization in PythonData visualization in Python
Data visualization in PythonMarc Garcia
 
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Edureka!
 
Introduction to Data Science.pptx
Introduction to Data Science.pptxIntroduction to Data Science.pptx
Introduction to Data Science.pptxVrishit Saraswat
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...Edureka!
 
Data science presentation
Data science presentationData science presentation
Data science presentationMSDEVMTL
 
Top 5 Python Libraries For Data Science | Python Libraries Explained | Python...
Top 5 Python Libraries For Data Science | Python Libraries Explained | Python...Top 5 Python Libraries For Data Science | Python Libraries Explained | Python...
Top 5 Python Libraries For Data Science | Python Libraries Explained | Python...Simplilearn
 

What's hot (20)

Data science
Data scienceData science
Data science
 
Data science
Data scienceData science
Data science
 
Data Analysis in Python
Data Analysis in PythonData Analysis in Python
Data Analysis in Python
 
PPT on Data Science Using Python
PPT on Data Science Using PythonPPT on Data Science Using Python
PPT on Data Science Using Python
 
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
 
Python for Data Science
Python for Data SciencePython for Data Science
Python for Data Science
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
 
Introduction of data science
Introduction of data scienceIntroduction of data science
Introduction of data science
 
Data Science Introduction
Data Science IntroductionData Science Introduction
Data Science Introduction
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data Science
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data visualization in Python
Data visualization in PythonData visualization in Python
Data visualization in Python
 
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
 
Introduction to Data Science.pptx
Introduction to Data Science.pptxIntroduction to Data Science.pptx
Introduction to Data Science.pptx
 
Data science Big Data
Data science Big DataData science Big Data
Data science Big Data
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
Top 5 Python Libraries For Data Science | Python Libraries Explained | Python...
Top 5 Python Libraries For Data Science | Python Libraries Explained | Python...Top 5 Python Libraries For Data Science | Python Libraries Explained | Python...
Top 5 Python Libraries For Data Science | Python Libraries Explained | Python...
 

Similar to Data Science With Python | Python For Data Science | Python Data Science Course | Simplilearn

Basic of python for data analysis
Basic of python for data analysisBasic of python for data analysis
Basic of python for data analysisPramod Toraskar
 
Adarsh_Masekar(2GP19CS003).pptx
Adarsh_Masekar(2GP19CS003).pptxAdarsh_Masekar(2GP19CS003).pptx
Adarsh_Masekar(2GP19CS003).pptxhkabir55
 
Abhishek Training PPT.pptx
Abhishek Training PPT.pptxAbhishek Training PPT.pptx
Abhishek Training PPT.pptxKashishKashish22
 
Certified Python Business Analyst
Certified Python Business AnalystCertified Python Business Analyst
Certified Python Business AnalystAnkitSingh2134
 
employee turnover prediction document.docx
employee turnover prediction document.docxemployee turnover prediction document.docx
employee turnover prediction document.docxrohithprabhas1
 
Scipy Libraries to Work with Various Datasets.pptx
Scipy Libraries to Work with Various Datasets.pptxScipy Libraries to Work with Various Datasets.pptx
Scipy Libraries to Work with Various Datasets.pptxpooja chavan
 
Python for Data Science: A Comprehensive Guide
Python for Data Science: A Comprehensive GuidePython for Data Science: A Comprehensive Guide
Python for Data Science: A Comprehensive Guidepriyanka rajput
 
Data Science with Python course Outline.pptx
Data Science with Python course Outline.pptxData Science with Python course Outline.pptx
Data Science with Python course Outline.pptxFerdsilinks
 
London atlassian meetup 31 jan 2016 jira metrics-extract slides
London atlassian meetup 31 jan 2016 jira metrics-extract slidesLondon atlassian meetup 31 jan 2016 jira metrics-extract slides
London atlassian meetup 31 jan 2016 jira metrics-extract slidesRudiger Wolf
 
Data Engineer's Lunch 90: Migrating SQL Data with Arcion
Data Engineer's Lunch 90: Migrating SQL Data with ArcionData Engineer's Lunch 90: Migrating SQL Data with Arcion
Data Engineer's Lunch 90: Migrating SQL Data with ArcionAnant Corporation
 
PyDataStructs Tech Share at Quansight
PyDataStructs Tech Share at QuansightPyDataStructs Tech Share at Quansight
PyDataStructs Tech Share at QuansightGagandeep Singh
 

Similar to Data Science With Python | Python For Data Science | Python Data Science Course | Simplilearn (20)

Python ml
Python mlPython ml
Python ml
 
Basic of python for data analysis
Basic of python for data analysisBasic of python for data analysis
Basic of python for data analysis
 
Adarsh_Masekar(2GP19CS003).pptx
Adarsh_Masekar(2GP19CS003).pptxAdarsh_Masekar(2GP19CS003).pptx
Adarsh_Masekar(2GP19CS003).pptx
 
Python and data analytics
Python and data analyticsPython and data analytics
Python and data analytics
 
Abhishek Training PPT.pptx
Abhishek Training PPT.pptxAbhishek Training PPT.pptx
Abhishek Training PPT.pptx
 
Certified Python Business Analyst
Certified Python Business AnalystCertified Python Business Analyst
Certified Python Business Analyst
 
Session 2
Session 2Session 2
Session 2
 
employee turnover prediction document.docx
employee turnover prediction document.docxemployee turnover prediction document.docx
employee turnover prediction document.docx
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Scipy Libraries to Work with Various Datasets.pptx
Scipy Libraries to Work with Various Datasets.pptxScipy Libraries to Work with Various Datasets.pptx
Scipy Libraries to Work with Various Datasets.pptx
 
Python for Data Science: A Comprehensive Guide
Python for Data Science: A Comprehensive GuidePython for Data Science: A Comprehensive Guide
Python for Data Science: A Comprehensive Guide
 
Pa2 session 4
Pa2 session 4Pa2 session 4
Pa2 session 4
 
Data Science with Python course Outline.pptx
Data Science with Python course Outline.pptxData Science with Python course Outline.pptx
Data Science with Python course Outline.pptx
 
London atlassian meetup 31 jan 2016 jira metrics-extract slides
London atlassian meetup 31 jan 2016 jira metrics-extract slidesLondon atlassian meetup 31 jan 2016 jira metrics-extract slides
London atlassian meetup 31 jan 2016 jira metrics-extract slides
 
Pa1 session 6
Pa1 session 6Pa1 session 6
Pa1 session 6
 
Toolboxes for data scientists
Toolboxes for data scientistsToolboxes for data scientists
Toolboxes for data scientists
 
Data Engineer's Lunch 90: Migrating SQL Data with Arcion
Data Engineer's Lunch 90: Migrating SQL Data with ArcionData Engineer's Lunch 90: Migrating SQL Data with Arcion
Data Engineer's Lunch 90: Migrating SQL Data with Arcion
 
PyDataStructs Tech Share at Quansight
PyDataStructs Tech Share at QuansightPyDataStructs Tech Share at Quansight
PyDataStructs Tech Share at Quansight
 
DevOps for DataScience
DevOps for DataScienceDevOps for DataScience
DevOps for DataScience
 
Solved Big Data and Data Science Projects pdf.pdf
Solved Big Data and Data Science Projects pdf.pdfSolved Big Data and Data Science Projects pdf.pdf
Solved Big Data and Data Science Projects pdf.pdf
 

More from Simplilearn

ChatGPT in Cybersecurity
ChatGPT in CybersecurityChatGPT in Cybersecurity
ChatGPT in CybersecuritySimplilearn
 
Whatis SQL Injection.pptx
Whatis SQL Injection.pptxWhatis SQL Injection.pptx
Whatis SQL Injection.pptxSimplilearn
 
Top 5 High Paying Cloud Computing Jobs in 2023
 Top 5 High Paying Cloud Computing Jobs in 2023  Top 5 High Paying Cloud Computing Jobs in 2023
Top 5 High Paying Cloud Computing Jobs in 2023 Simplilearn
 
Types Of Cloud Jobs In 2024
Types Of Cloud Jobs In 2024Types Of Cloud Jobs In 2024
Types Of Cloud Jobs In 2024Simplilearn
 
Top 12 AI Technologies To Learn 2024 | Top AI Technologies in 2024 | AI Trend...
Top 12 AI Technologies To Learn 2024 | Top AI Technologies in 2024 | AI Trend...Top 12 AI Technologies To Learn 2024 | Top AI Technologies in 2024 | AI Trend...
Top 12 AI Technologies To Learn 2024 | Top AI Technologies in 2024 | AI Trend...Simplilearn
 
What is LSTM ?| Long Short Term Memory Explained with Example | Deep Learning...
What is LSTM ?| Long Short Term Memory Explained with Example | Deep Learning...What is LSTM ?| Long Short Term Memory Explained with Example | Deep Learning...
What is LSTM ?| Long Short Term Memory Explained with Example | Deep Learning...Simplilearn
 
Top 10 Chat GPT Use Cases | ChatGPT Applications | ChatGPT Tutorial For Begin...
Top 10 Chat GPT Use Cases | ChatGPT Applications | ChatGPT Tutorial For Begin...Top 10 Chat GPT Use Cases | ChatGPT Applications | ChatGPT Tutorial For Begin...
Top 10 Chat GPT Use Cases | ChatGPT Applications | ChatGPT Tutorial For Begin...Simplilearn
 
React JS Vs Next JS - What's The Difference | Next JS Tutorial For Beginners ...
React JS Vs Next JS - What's The Difference | Next JS Tutorial For Beginners ...React JS Vs Next JS - What's The Difference | Next JS Tutorial For Beginners ...
React JS Vs Next JS - What's The Difference | Next JS Tutorial For Beginners ...Simplilearn
 
Backpropagation in Neural Networks | Back Propagation Algorithm with Examples...
Backpropagation in Neural Networks | Back Propagation Algorithm with Examples...Backpropagation in Neural Networks | Back Propagation Algorithm with Examples...
Backpropagation in Neural Networks | Back Propagation Algorithm with Examples...Simplilearn
 
How to Become a Business Analyst ?| Roadmap to Become Business Analyst | Simp...
How to Become a Business Analyst ?| Roadmap to Become Business Analyst | Simp...How to Become a Business Analyst ?| Roadmap to Become Business Analyst | Simp...
How to Become a Business Analyst ?| Roadmap to Become Business Analyst | Simp...Simplilearn
 
Career Opportunities In Artificial Intelligence 2023 | AI Job Opportunities |...
Career Opportunities In Artificial Intelligence 2023 | AI Job Opportunities |...Career Opportunities In Artificial Intelligence 2023 | AI Job Opportunities |...
Career Opportunities In Artificial Intelligence 2023 | AI Job Opportunities |...Simplilearn
 
Programming for Beginners | How to Start Coding in 2023? | Introduction to Pr...
Programming for Beginners | How to Start Coding in 2023? | Introduction to Pr...Programming for Beginners | How to Start Coding in 2023? | Introduction to Pr...
Programming for Beginners | How to Start Coding in 2023? | Introduction to Pr...Simplilearn
 
Best IDE for Programming in 2023 | Top 8 Programming IDE You Should Know | Si...
Best IDE for Programming in 2023 | Top 8 Programming IDE You Should Know | Si...Best IDE for Programming in 2023 | Top 8 Programming IDE You Should Know | Si...
Best IDE for Programming in 2023 | Top 8 Programming IDE You Should Know | Si...Simplilearn
 
React 18 Overview | React 18 New Features and Changes | React 18 Tutorial 202...
React 18 Overview | React 18 New Features and Changes | React 18 Tutorial 202...React 18 Overview | React 18 New Features and Changes | React 18 Tutorial 202...
React 18 Overview | React 18 New Features and Changes | React 18 Tutorial 202...Simplilearn
 
What Is Next JS ? | Introduction to Next JS | Basics of Next JS | Next JS Tut...
What Is Next JS ? | Introduction to Next JS | Basics of Next JS | Next JS Tut...What Is Next JS ? | Introduction to Next JS | Basics of Next JS | Next JS Tut...
What Is Next JS ? | Introduction to Next JS | Basics of Next JS | Next JS Tut...Simplilearn
 
How To Become an SEO Expert In 2023 | SEO Expert Tutorial | SEO For Beginners...
How To Become an SEO Expert In 2023 | SEO Expert Tutorial | SEO For Beginners...How To Become an SEO Expert In 2023 | SEO Expert Tutorial | SEO For Beginners...
How To Become an SEO Expert In 2023 | SEO Expert Tutorial | SEO For Beginners...Simplilearn
 
WordPress Tutorial for Beginners 2023 | What Is WordPress and How Does It Wor...
WordPress Tutorial for Beginners 2023 | What Is WordPress and How Does It Wor...WordPress Tutorial for Beginners 2023 | What Is WordPress and How Does It Wor...
WordPress Tutorial for Beginners 2023 | What Is WordPress and How Does It Wor...Simplilearn
 
Blogging For Beginners 2023 | How To Create A Blog | Blogging Tutorial | Simp...
Blogging For Beginners 2023 | How To Create A Blog | Blogging Tutorial | Simp...Blogging For Beginners 2023 | How To Create A Blog | Blogging Tutorial | Simp...
Blogging For Beginners 2023 | How To Create A Blog | Blogging Tutorial | Simp...Simplilearn
 
How To Start A Blog In 2023 | Pros And Cons Of Blogging | Blogging Tutorial |...
How To Start A Blog In 2023 | Pros And Cons Of Blogging | Blogging Tutorial |...How To Start A Blog In 2023 | Pros And Cons Of Blogging | Blogging Tutorial |...
How To Start A Blog In 2023 | Pros And Cons Of Blogging | Blogging Tutorial |...Simplilearn
 
How to Increase Website Traffic ? | 10 Ways To Increase Website Traffic in 20...
How to Increase Website Traffic ? | 10 Ways To Increase Website Traffic in 20...How to Increase Website Traffic ? | 10 Ways To Increase Website Traffic in 20...
How to Increase Website Traffic ? | 10 Ways To Increase Website Traffic in 20...Simplilearn
 

More from Simplilearn (20)

ChatGPT in Cybersecurity
ChatGPT in CybersecurityChatGPT in Cybersecurity
ChatGPT in Cybersecurity
 
Whatis SQL Injection.pptx
Whatis SQL Injection.pptxWhatis SQL Injection.pptx
Whatis SQL Injection.pptx
 
Top 5 High Paying Cloud Computing Jobs in 2023
 Top 5 High Paying Cloud Computing Jobs in 2023  Top 5 High Paying Cloud Computing Jobs in 2023
Top 5 High Paying Cloud Computing Jobs in 2023
 
Types Of Cloud Jobs In 2024
Types Of Cloud Jobs In 2024Types Of Cloud Jobs In 2024
Types Of Cloud Jobs In 2024
 
Top 12 AI Technologies To Learn 2024 | Top AI Technologies in 2024 | AI Trend...
Top 12 AI Technologies To Learn 2024 | Top AI Technologies in 2024 | AI Trend...Top 12 AI Technologies To Learn 2024 | Top AI Technologies in 2024 | AI Trend...
Top 12 AI Technologies To Learn 2024 | Top AI Technologies in 2024 | AI Trend...
 
What is LSTM ?| Long Short Term Memory Explained with Example | Deep Learning...
What is LSTM ?| Long Short Term Memory Explained with Example | Deep Learning...What is LSTM ?| Long Short Term Memory Explained with Example | Deep Learning...
What is LSTM ?| Long Short Term Memory Explained with Example | Deep Learning...
 
Top 10 Chat GPT Use Cases | ChatGPT Applications | ChatGPT Tutorial For Begin...
Top 10 Chat GPT Use Cases | ChatGPT Applications | ChatGPT Tutorial For Begin...Top 10 Chat GPT Use Cases | ChatGPT Applications | ChatGPT Tutorial For Begin...
Top 10 Chat GPT Use Cases | ChatGPT Applications | ChatGPT Tutorial For Begin...
 
React JS Vs Next JS - What's The Difference | Next JS Tutorial For Beginners ...
React JS Vs Next JS - What's The Difference | Next JS Tutorial For Beginners ...React JS Vs Next JS - What's The Difference | Next JS Tutorial For Beginners ...
React JS Vs Next JS - What's The Difference | Next JS Tutorial For Beginners ...
 
Backpropagation in Neural Networks | Back Propagation Algorithm with Examples...
Backpropagation in Neural Networks | Back Propagation Algorithm with Examples...Backpropagation in Neural Networks | Back Propagation Algorithm with Examples...
Backpropagation in Neural Networks | Back Propagation Algorithm with Examples...
 
How to Become a Business Analyst ?| Roadmap to Become Business Analyst | Simp...
How to Become a Business Analyst ?| Roadmap to Become Business Analyst | Simp...How to Become a Business Analyst ?| Roadmap to Become Business Analyst | Simp...
How to Become a Business Analyst ?| Roadmap to Become Business Analyst | Simp...
 
Career Opportunities In Artificial Intelligence 2023 | AI Job Opportunities |...
Career Opportunities In Artificial Intelligence 2023 | AI Job Opportunities |...Career Opportunities In Artificial Intelligence 2023 | AI Job Opportunities |...
Career Opportunities In Artificial Intelligence 2023 | AI Job Opportunities |...
 
Programming for Beginners | How to Start Coding in 2023? | Introduction to Pr...
Programming for Beginners | How to Start Coding in 2023? | Introduction to Pr...Programming for Beginners | How to Start Coding in 2023? | Introduction to Pr...
Programming for Beginners | How to Start Coding in 2023? | Introduction to Pr...
 
Best IDE for Programming in 2023 | Top 8 Programming IDE You Should Know | Si...
Best IDE for Programming in 2023 | Top 8 Programming IDE You Should Know | Si...Best IDE for Programming in 2023 | Top 8 Programming IDE You Should Know | Si...
Best IDE for Programming in 2023 | Top 8 Programming IDE You Should Know | Si...
 
React 18 Overview | React 18 New Features and Changes | React 18 Tutorial 202...
React 18 Overview | React 18 New Features and Changes | React 18 Tutorial 202...React 18 Overview | React 18 New Features and Changes | React 18 Tutorial 202...
React 18 Overview | React 18 New Features and Changes | React 18 Tutorial 202...
 
What Is Next JS ? | Introduction to Next JS | Basics of Next JS | Next JS Tut...
What Is Next JS ? | Introduction to Next JS | Basics of Next JS | Next JS Tut...What Is Next JS ? | Introduction to Next JS | Basics of Next JS | Next JS Tut...
What Is Next JS ? | Introduction to Next JS | Basics of Next JS | Next JS Tut...
 
How To Become an SEO Expert In 2023 | SEO Expert Tutorial | SEO For Beginners...
How To Become an SEO Expert In 2023 | SEO Expert Tutorial | SEO For Beginners...How To Become an SEO Expert In 2023 | SEO Expert Tutorial | SEO For Beginners...
How To Become an SEO Expert In 2023 | SEO Expert Tutorial | SEO For Beginners...
 
WordPress Tutorial for Beginners 2023 | What Is WordPress and How Does It Wor...
WordPress Tutorial for Beginners 2023 | What Is WordPress and How Does It Wor...WordPress Tutorial for Beginners 2023 | What Is WordPress and How Does It Wor...
WordPress Tutorial for Beginners 2023 | What Is WordPress and How Does It Wor...
 
Blogging For Beginners 2023 | How To Create A Blog | Blogging Tutorial | Simp...
Blogging For Beginners 2023 | How To Create A Blog | Blogging Tutorial | Simp...Blogging For Beginners 2023 | How To Create A Blog | Blogging Tutorial | Simp...
Blogging For Beginners 2023 | How To Create A Blog | Blogging Tutorial | Simp...
 
How To Start A Blog In 2023 | Pros And Cons Of Blogging | Blogging Tutorial |...
How To Start A Blog In 2023 | Pros And Cons Of Blogging | Blogging Tutorial |...How To Start A Blog In 2023 | Pros And Cons Of Blogging | Blogging Tutorial |...
How To Start A Blog In 2023 | Pros And Cons Of Blogging | Blogging Tutorial |...
 
How to Increase Website Traffic ? | 10 Ways To Increase Website Traffic in 20...
How to Increase Website Traffic ? | 10 Ways To Increase Website Traffic in 20...How to Increase Website Traffic ? | 10 Ways To Increase Website Traffic in 20...
How to Increase Website Traffic ? | 10 Ways To Increase Website Traffic in 20...
 

Recently uploaded

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 

Recently uploaded (20)

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 

Data Science With Python | Python For Data Science | Python Data Science Course | Simplilearn

  • 1.
  • 2. What’s in it for you? What is Data Science? Basics of Python for Data Analysis Why learn Python? How to Install Python? Python Libraries for Data Analysis Exploratory analysis using Pandas Introduction to series and data frame Loan Prediction Problem Data Wrangling using Pandas Building a Predictive Model using Scikit-Learn Logistic Regression
  • 3. What is Data Science? Example Restaurants can predict how many customers will visit on a weekend and plan their food inventory to handle the demand Service Planning System can be trained based on customer behavior pattern to predict the likelihood of a customer buying a product Customer Prediction Data Science is about finding and exploring data in real world, and then using that knowledge to solve business problems
  • 4. Why Python? Let’s first understand, why we want to use Python?
  • 5. Why Python? The usage statistics based on google trends depict that Python is currently more popular than R or SAS for Data Science!
  • 6. Why Python? SPEED PACKAGES DESIGN GOAL But, there are various factors you should consider before deciding which language is best for your Data Analysis:
  • 7. Why Python? SPEED PACKAGES DESIGN GOAL But, there are various factors you should consider before deciding which language is best for your Data Analysis:
  • 8. Why Python? SPEED PACKAGES DESIGN GOAL But, there are various factors you should consider before deciding which language is best for your Data Analysis:
  • 9. Why Python? For instructor Design Goal: Syntax rules in python helps in building application with concise and readable code base Packages: There are numerous packages in Python to choose from like pandas to aggregate & manipulate data, Seaborn or matplotlib to visualize relational data to mention a few Speed: Studies suggest that Python is faster than several widely used languages. Also, we can further speed up python using algorithms and tools
  • 10. Installing Python Now, let’s install Python to begin the fun
  • 11. Installing Python • Go to: http://continuum io/downloads • Scroll down to download the graphical installer suitable for your operating system After successful installation, you can launch Jupyter notebook from Anaconda Navigator Anaconda comes with pre-installed libraries In this tutorial, we will be working on Jupyter notebook using Python 3
  • 12. Python libraries for Data Analysis Let’s get to know some important Python libraries for Data Analysis
  • 13. Python libraries for Data Analysis There are many interesting libraries that have made Python popular with Data Scientists:
  • 14. Python libraries for Data Analysis Most useful library for variety of high level science and engineering modules like discrete Fourier transform, Linear Algebra, Optimization and Sparse matrices Pandas for structured data operations and manipulations It is extensively used for data munging and preparation The most powerful feature of NumPy is n-dimensional array This library also contains basic linear algebra functions, Fourier transforms, advanced random number capabilities Matplotlib for plotting vast variety of graphs, starting from histograms to line plots to heat plots contains a lot of efficient tools for machine learning and statistical modeling including classification, regression, clustering and dimensional reduction For instructor
  • 15. Python libraries for Data Analysis Additional libraries, you might need: Networkx & I graph Tensorflow BeautifulSoup OS
  • 16. Python libraries for Data Analysis os for Operating system and file operations networkx and igraph for graph based data manipulations TensorFlow BeautifulSoup for scrapping web For instructor
  • 17. What is SciPy? SciPy is a set of scientific and numerical tools for Python • It currently supports special functions, integration, ordinary differential equation (ODE) solvers, gradient optimization, and others • It has fully-featured versions of the linear algebra modules • It is built on top of NumPy
  • 18. What is NumPy? NumPy is the fundamental package for scientific computing with Python. It contains: • Powerful N-dimensional array object • Tools for integrating C/C++ and Fortran code • It has useful linear algebra, Fourier transform, and random number capabilities
  • 19. What is Pandas? • The most useful Data Analysis library in Python • Instrumental in increasing the use of Python in Data Science community • It is extensively used for data munging and preparation Pandas is used for structured data operations & manipulations
  • 20. Exploratory analysis using Pandas Let’s understand the two most common terms used in Pandas: Series Dataframe
  • 21. Exploratory analysis using Pandas A Series is a one-dimensional object that can hold any data type such as integers, floats and strings Series A DataFrame is a two dimensional object that can have columns with potential different data types DataFrame Pandas
  • 22. Exploratory analysis using Pandas Default column names Default index Default index Series DataFrame
  • 23. Exploratory analysis using Pandas Default column names Default index Default index Series DataFrame
  • 24. Exploratory analysis using Pandas Problem Statement: Based on customer data, predict whether a particular customer’s loan will be approved or not LOAN
  • 25. Exploratory analysis using Pandas Now, let’s explore our data using Pandas!
  • 26. Exploratory analysis using Pandas Import the necessary libraries and read the dataset using read_csv() function:
  • 27. Exploratory analysis using Pandas You can call describe() function to describe all the columns:
  • 28. Exploratory analysis using Pandas Let’s see numercial values’ distribution 1 Loan Amount
  • 29. Exploratory analysis using Pandas 2 Applicant Income
  • 30. Exploratory analysis using Pandas Categorical values’ distribution using matplotlib library: Credit History
  • 31. Exploratory analysis using Pandas Hence, ‘loanAmount’ and ‘ApplicantIncome’ needs Data Wrangling as some extreme values are observed!
  • 32. Data Wrangling using Pandas Before proceeding further, let’s understand what is Data Wrangling and why we need it?
  • 33. Data Wrangling: Process of cleaning and unifying messy and complex data sets It reveals more information about your data Enables decision-making skills in the organization Helps to gather meaningful and precise data for the business Data Wrangling using Pandas
  • 34. Data Wrangling using Pandas You can see if your data has missing values:
  • 35. Data Wrangling using Pandas And then you can replace the missing values:
  • 36. Data Wrangling using Pandas You can access the data types of each column in a DataFrame:
  • 37. Data Wrangling using Pandas You can perform basic math operations to know more about your data:
  • 38. Data Wrangling using Pandas You can combine your DataFrames: Combining DataFrame objects can be done using simple concatenation (provided they have the same columns): Creates an array of specified shape and fills it with random values using numpy
  • 40. Data Wrangling using Pandas Also, if your DataFrame do not have an identical structure:
  • 41. Data Wrangling using Pandas You can create a merged dataframe using the merge() function based on the key:
  • 42. Model Building using Scikit-learn Now, that we have done data wrangling, let’s build a predictive model
  • 43. Model Building using Scikit-learn We will use Scikit-learn module as it provides a range of supervised and unsupervised learning algorithms
  • 44. Model Building using Scikit-learn Importing the required scikit-learn module:
  • 45. Model Building using Scikit-learn Extracting the variables and then splitting the data into train and test:
  • 46. Model Building using Scikit-learn In this case, we will use Logistic Regression model Logistic Regression is appropriate when the dependent variable is binary
  • 47. Model Building using Scikit-learn Fitting the data into Logistic Regression model:
  • 48. Model Building using Scikit-learn Predicting the test results:
  • 49. Model Building using Scikit-learn To describe the performance of the model let’s build the confusion matrix on test data:
  • 50. Model Building using Scikit-learn Let’s calculate ACCURACY and PRECISION from confusion matrix: False Positive True Positive False Negative True Negative
  • 51. Model Building using Scikit-learn Let’s calculate ACCURACY and PRECISION from confusion matrix: • Accuracy Overall, how often is the classifier correct? (TP+TN)/total = (103+18)/150 = 0.80 • Precision When it predicts yes, how often is it correct? TP/predicted yes = 103/130 = 0.79
  • 52. Model Building using Scikit-learn We can also find the accuracy through Python module:
  • 53. Model Building using Scikit-learn So , we have built a model with 80% accuracy
  • 54. Summary Data Science & its popularity with python Data Analysis Libraries in python Series and dataframe in pandas Logistic Regression using scikitData wranglingExploratory analysis

Editor's Notes

  1. Remove title case