SlideShare ist ein Scribd-Unternehmen logo
1 von 38
Agenda
Why Data Science?
What is Data Science?
Who is a Data Scientist?
What does a Data Scientist do?
How to solve a problem in Data Science?
Data Science Tools
Demo
Agenda
Why Data Science?
What is Data Science?
Who is a Data Scientist?
What does a Data Scientist do?
How to solve a problem in Data Science?
Data Science Tools
Demo
Why Data Science?
www.edureka.co/data-scienceData Science Certification Course using R
Why Data Science?
You can make better decisions, you can reduce your production costs by coming out with efficient ways, and give your
customers what they actually want!
Cost Reduction Faster & Better
Decision Making
Improved Services
and Products
Risk Detection
www.edureka.co/data-scienceData Science Certification Course using R
Why Data Science?
Data Science can help prevent Fraudulent transactions using advanced Machine Learning algorithms and prevent great
monetary losses.
What is Data Science?
www.edureka.co/data-scienceData Science Certification Course using R
What is Data Science?
Data Science is a blend of various tools, algorithms, and machine learning principles with the goal to discover hidden patterns
from the raw data.
DATA SCIENCE
Analysis Structure Algorithm Process Programming Insight
www.edureka.co/data-scienceData Science Certification Course using R
What is Data Science?
It is an inter-disciplinary field deploying scientific methods, processes and systems to gain insight from data in various forms.
Tell us something we don’t know already.
Statistics Code
Business
www.edureka.co/data-scienceData Science Certification Course using R
What is Data Science?
How is this different from what statisticians have been doing for years?
Business Administration
Exploratory Data Analysis
Machine Learning &
Advanced Algorithms
Data Product Engineering
Business Analyst
Data Scientist
Who is Data Scientist?
www.edureka.co/data-scienceData Science Certification Course using R
Who is a Data Scientist?
www.edureka.co/data-scienceData Science Certification Course using R
Who is a Data Scientist?
Statistics
Discrete Theory
Combinatorics
Decision Theory
Machine Learning
www.edureka.co/data-scienceData Science Certification Course using R
Who is a Data Scientist?
www.edureka.co/data-scienceData Science Certification Course using R
Who is a Data Scientist?
Economics
Finance
Operations
Management
Business
Intelligence
www.edureka.co/data-scienceData Science Certification Course using R
Who is a Data Scientist?
www.edureka.co/data-scienceData Science Certification Course using R
Who is a Data Scientist?
Computer Science
Software
Engineering
Systems
Development
What does a Data Scientist do?
www.edureka.co/data-scienceData Science Certification Course using R
Processing &
Cleansing Data
What does a Data Scientist do?
Data Mining
Building
Prediction
Models
Extending
Data
Optimizing and
building classifiers
using
Machine Learning
www.edureka.co/data-scienceData Science Certification Course using R
Processing &
Cleansing Data
What does a Data Scientist do?
Data Mining
Building
Prediction
Models
Extending
Data
Optimizing and
building classifiers
using
Machine Learning
www.edureka.co/data-scienceData Science Certification Course using R
What does a Data Scientist do?
Data Mining
Processing &
Cleansing Data
Building
Prediction
Models
Extending
Data
Optimizing and
building classifiers
using
Machine Learning
www.edureka.co/data-scienceData Science Certification Course using R
What does a Data Scientist do?
Data Mining
Processing &
Cleansing Data
Building
Prediction
Models
Extending
Data
Optimizing and
building classifiers
using
Machine Learning
www.edureka.co/data-scienceData Science Certification Course using R
What does a Data Scientist do?
Data Mining
Processing &
Cleansing Data
Building
Prediction
Models
Extending
Data
Optimizing and
building classifiers
using
Machine Learning
www.edureka.co/data-scienceData Science Certification Course using R
What does a Data Scientist do?
Data Mining
Processing &
Cleansing Data
Building
Prediction
Models
Extending
Data
Optimizing and
building classifiers
using
Machine Learning
How to solve a problem in Data Science?
www.edureka.co/data-scienceData Science Certification Course using R
How to solve a problem in Data Science?
3 62 41 5
Discovery
Data
Preparation
Model
Planning
Model
Building
Operationalize
Communicating
Results
www.edureka.co/data-scienceData Science Certification Course using R
How to solve a problem in Data Science?
1
3
2
4
Discovery
Data Preparation
Model Planning
Model Building
5
6
Operationalize
Communicate
➢ Discovery involves acquiring data from all identifies internal and
external resources that can help with a business solution.
➢ You assess if you have the required resources present in terms of
people, technology, time and data to support the project.
www.edureka.co/data-scienceData Science Certification Course using R
How to solve a problem in Data Science?
1
3
2
4
Discovery
Data Preparation
Model Planning
Model Building
5
6
Operationalize
Communicate
➢ In this phase, you require analytical sandbox in which you can
perform analytics for the entire duration of the project.
➢ This is what a Sandbox is supposed to look like;
➢ ETLT means to Extract, Transform, Load and Transform.
Preparing the
Analytics Sandbox
Performing ETLT Data Conditioning Survey & Visualize
www.edureka.co/data-scienceData Science Certification Course using R
How to solve a problem in Data Science?
1
3
2
4
Discovery
Data Preparation
Model Planning
Model Building
5
6
Operationalize
Communicate
➢ You will apply Exploratory Data Analytics (EDA) using various
statistical formulas and visualization tools.
Common Tools for Model Planning
R SAS/ ACCESS
SQL Service
Analysis Services
www.edureka.co/data-scienceData Science Certification Course using R
How to solve a problem in Data Science?
1
3
2
4
Discovery
Data Preparation
Model Planning
Model Building
5
6
Operationalize
Communicate
➢ In this phase, you will develop datasets for training and testing
purposes.
Common Tools for Model Building
SAS
Miner
WEKA SPCS MATLAB
Alpine
Miner
Statistica
www.edureka.co/data-scienceData Science Certification Course using R
How to solve a problem in Data Science?
1
3
2
4
Discovery
Data Preparation
Model Planning
Model Building
5
6
Operationalize
Communicate
➢ In this phase, you deliver final reports, briefings, code and technical
documents.
➢ In addition, sometimes a pilot project is also implemented in a real-
time production environment.
➢ This will provide you a clear picture of the performance and other
related constraints on a small scale before full deployment.
www.edureka.co/data-scienceData Science Certification Course using R
How to solve a problem in Data Science?
1
3
2
4
Discovery
Data Preparation
Model Planning
Model Building
5
6
Operationalize
Communicate
➢ You do the following things in this phase;
1. You identify all the key findings
2. communicate to the stakeholders
3. Look for performance constraints, if any
4. determine if the results of the project are a success or a failure
www.edureka.co/data-scienceData Science Certification Course using R
How to Choose an Algorithm in Data Science?
Is it A or B? Classification Algorithm
Is this weird? Anomaly Detection Algorithm
How much / How many? Regression Algorithm
How is this organised? Clustering Algorithm
What should I do next? Reinforcement Learning
www.edureka.co/data-scienceData Science Certification Course using R
What is machine Learning?
It is a type of Artificial Intelligence that makes the computers capable of learning on their own i.e without explicitly being
programmed. With machine learning, machines can update their own code, whenever they come across a new situation.
www.edureka.co/data-scienceData Science Certification Course using R
Categories of Algorithm
Supervised
Learning
1
Supervised Learning
is a type of machine
learning algorithm
that uses a known
dataset to make
predictions.
Unsupervised
Learning
2
Unsupervised
Learning is a type of
machine learning
algorithm that uses a
input datasets
without labelled
responses to draw
inference.
Reinforcement
Learning
3
Reinforcement
Learning is a type of
algorithm inspired by
behaviourist
psychology,
concerned with
taking actions to
maximise reward.
Data Science Tools
www.edureka.co/data-scienceData Science Certification Course using R
Data Science Tools
1.
Datasets Hadoop
4
Big Data
3
R programming
2
Spark
55
Demo
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial Using R | Edureka

Weitere ähnliche Inhalte

Was ist angesagt?

Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...
Simplilearn
 
Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...
Simplilearn
 

Was ist angesagt? (20)

Data science
Data science Data science
Data science
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
 
Data Science Introduction
Data Science IntroductionData Science Introduction
Data Science Introduction
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data Science
 
Data Science Full Course | Edureka
Data Science Full Course | EdurekaData Science Full Course | Edureka
Data Science Full Course | Edureka
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data science
Data scienceData science
Data science
 
Data Science
Data ScienceData Science
Data Science
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI day
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
 
Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptx
 
Introduction to data science club
Introduction to data science clubIntroduction to data science club
Introduction to data science club
 
Data science
Data scienceData science
Data science
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Data science Big Data
Data science Big DataData science Big Data
Data science Big Data
 
Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...
 
Career in Data Science
Career in Data ScienceCareer in Data Science
Career in Data Science
 
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
 
Ppt on data science
Ppt on data science Ppt on data science
Ppt on data science
 
Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...
 

Ähnlich wie Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial Using R | Edureka

Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Simplilearn
 
Challenges of Executing AI
Challenges of Executing AIChallenges of Executing AI
Challenges of Executing AI
Dr. Umesh Rao.Hodeghatta
 

Ähnlich wie Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial Using R | Edureka (20)

Learnmystuff - Training Catalog
Learnmystuff - Training CatalogLearnmystuff - Training Catalog
Learnmystuff - Training Catalog
 
Building successful data science teams
Building successful data science teamsBuilding successful data science teams
Building successful data science teams
 
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
 
Challenges of Executing AI
Challenges of Executing AIChallenges of Executing AI
Challenges of Executing AI
 
Data Science Training Course in Gurgaon.pptx
Data Science Training Course in Gurgaon.pptxData Science Training Course in Gurgaon.pptx
Data Science Training Course in Gurgaon.pptx
 
Course 8 : How to start your big data project by Eric Rodriguez
Course 8 : How to start your big data project by Eric Rodriguez Course 8 : How to start your big data project by Eric Rodriguez
Course 8 : How to start your big data project by Eric Rodriguez
 
The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products
 
Data Science.pptx
Data Science.pptxData Science.pptx
Data Science.pptx
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptx
 
Data Science Highlights
Data Science Highlights Data Science Highlights
Data Science Highlights
 
Become a successful Data Scientist. Start Now!
Become a successful Data Scientist. Start Now!Become a successful Data Scientist. Start Now!
Become a successful Data Scientist. Start Now!
 
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
 
How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)
 
Data Science for Beginners: A Step-by-Step Introduction
Data Science for Beginners: A Step-by-Step IntroductionData Science for Beginners: A Step-by-Step Introduction
Data Science for Beginners: A Step-by-Step Introduction
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
 
Data Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdfData Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdf
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
Brochure data science learning path board-infinity (1)
Brochure   data science learning path board-infinity (1)Brochure   data science learning path board-infinity (1)
Brochure data science learning path board-infinity (1)
 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification course
 

Mehr von Edureka!

Mehr von Edureka! (20)

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Kürzlich hochgeladen (20)

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 

Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial Using R | Edureka

  • 1. Agenda Why Data Science? What is Data Science? Who is a Data Scientist? What does a Data Scientist do? How to solve a problem in Data Science? Data Science Tools Demo
  • 2. Agenda Why Data Science? What is Data Science? Who is a Data Scientist? What does a Data Scientist do? How to solve a problem in Data Science? Data Science Tools Demo
  • 4. www.edureka.co/data-scienceData Science Certification Course using R Why Data Science? You can make better decisions, you can reduce your production costs by coming out with efficient ways, and give your customers what they actually want! Cost Reduction Faster & Better Decision Making Improved Services and Products Risk Detection
  • 5. www.edureka.co/data-scienceData Science Certification Course using R Why Data Science? Data Science can help prevent Fraudulent transactions using advanced Machine Learning algorithms and prevent great monetary losses.
  • 6. What is Data Science?
  • 7. www.edureka.co/data-scienceData Science Certification Course using R What is Data Science? Data Science is a blend of various tools, algorithms, and machine learning principles with the goal to discover hidden patterns from the raw data. DATA SCIENCE Analysis Structure Algorithm Process Programming Insight
  • 8. www.edureka.co/data-scienceData Science Certification Course using R What is Data Science? It is an inter-disciplinary field deploying scientific methods, processes and systems to gain insight from data in various forms. Tell us something we don’t know already. Statistics Code Business
  • 9. www.edureka.co/data-scienceData Science Certification Course using R What is Data Science? How is this different from what statisticians have been doing for years? Business Administration Exploratory Data Analysis Machine Learning & Advanced Algorithms Data Product Engineering Business Analyst Data Scientist
  • 10. Who is Data Scientist?
  • 11. www.edureka.co/data-scienceData Science Certification Course using R Who is a Data Scientist?
  • 12. www.edureka.co/data-scienceData Science Certification Course using R Who is a Data Scientist? Statistics Discrete Theory Combinatorics Decision Theory Machine Learning
  • 13. www.edureka.co/data-scienceData Science Certification Course using R Who is a Data Scientist?
  • 14. www.edureka.co/data-scienceData Science Certification Course using R Who is a Data Scientist? Economics Finance Operations Management Business Intelligence
  • 15. www.edureka.co/data-scienceData Science Certification Course using R Who is a Data Scientist?
  • 16. www.edureka.co/data-scienceData Science Certification Course using R Who is a Data Scientist? Computer Science Software Engineering Systems Development
  • 17. What does a Data Scientist do?
  • 18. www.edureka.co/data-scienceData Science Certification Course using R Processing & Cleansing Data What does a Data Scientist do? Data Mining Building Prediction Models Extending Data Optimizing and building classifiers using Machine Learning
  • 19. www.edureka.co/data-scienceData Science Certification Course using R Processing & Cleansing Data What does a Data Scientist do? Data Mining Building Prediction Models Extending Data Optimizing and building classifiers using Machine Learning
  • 20. www.edureka.co/data-scienceData Science Certification Course using R What does a Data Scientist do? Data Mining Processing & Cleansing Data Building Prediction Models Extending Data Optimizing and building classifiers using Machine Learning
  • 21. www.edureka.co/data-scienceData Science Certification Course using R What does a Data Scientist do? Data Mining Processing & Cleansing Data Building Prediction Models Extending Data Optimizing and building classifiers using Machine Learning
  • 22. www.edureka.co/data-scienceData Science Certification Course using R What does a Data Scientist do? Data Mining Processing & Cleansing Data Building Prediction Models Extending Data Optimizing and building classifiers using Machine Learning
  • 23. www.edureka.co/data-scienceData Science Certification Course using R What does a Data Scientist do? Data Mining Processing & Cleansing Data Building Prediction Models Extending Data Optimizing and building classifiers using Machine Learning
  • 24. How to solve a problem in Data Science?
  • 25. www.edureka.co/data-scienceData Science Certification Course using R How to solve a problem in Data Science? 3 62 41 5 Discovery Data Preparation Model Planning Model Building Operationalize Communicating Results
  • 26. www.edureka.co/data-scienceData Science Certification Course using R How to solve a problem in Data Science? 1 3 2 4 Discovery Data Preparation Model Planning Model Building 5 6 Operationalize Communicate ➢ Discovery involves acquiring data from all identifies internal and external resources that can help with a business solution. ➢ You assess if you have the required resources present in terms of people, technology, time and data to support the project.
  • 27. www.edureka.co/data-scienceData Science Certification Course using R How to solve a problem in Data Science? 1 3 2 4 Discovery Data Preparation Model Planning Model Building 5 6 Operationalize Communicate ➢ In this phase, you require analytical sandbox in which you can perform analytics for the entire duration of the project. ➢ This is what a Sandbox is supposed to look like; ➢ ETLT means to Extract, Transform, Load and Transform. Preparing the Analytics Sandbox Performing ETLT Data Conditioning Survey & Visualize
  • 28. www.edureka.co/data-scienceData Science Certification Course using R How to solve a problem in Data Science? 1 3 2 4 Discovery Data Preparation Model Planning Model Building 5 6 Operationalize Communicate ➢ You will apply Exploratory Data Analytics (EDA) using various statistical formulas and visualization tools. Common Tools for Model Planning R SAS/ ACCESS SQL Service Analysis Services
  • 29. www.edureka.co/data-scienceData Science Certification Course using R How to solve a problem in Data Science? 1 3 2 4 Discovery Data Preparation Model Planning Model Building 5 6 Operationalize Communicate ➢ In this phase, you will develop datasets for training and testing purposes. Common Tools for Model Building SAS Miner WEKA SPCS MATLAB Alpine Miner Statistica
  • 30. www.edureka.co/data-scienceData Science Certification Course using R How to solve a problem in Data Science? 1 3 2 4 Discovery Data Preparation Model Planning Model Building 5 6 Operationalize Communicate ➢ In this phase, you deliver final reports, briefings, code and technical documents. ➢ In addition, sometimes a pilot project is also implemented in a real- time production environment. ➢ This will provide you a clear picture of the performance and other related constraints on a small scale before full deployment.
  • 31. www.edureka.co/data-scienceData Science Certification Course using R How to solve a problem in Data Science? 1 3 2 4 Discovery Data Preparation Model Planning Model Building 5 6 Operationalize Communicate ➢ You do the following things in this phase; 1. You identify all the key findings 2. communicate to the stakeholders 3. Look for performance constraints, if any 4. determine if the results of the project are a success or a failure
  • 32. www.edureka.co/data-scienceData Science Certification Course using R How to Choose an Algorithm in Data Science? Is it A or B? Classification Algorithm Is this weird? Anomaly Detection Algorithm How much / How many? Regression Algorithm How is this organised? Clustering Algorithm What should I do next? Reinforcement Learning
  • 33. www.edureka.co/data-scienceData Science Certification Course using R What is machine Learning? It is a type of Artificial Intelligence that makes the computers capable of learning on their own i.e without explicitly being programmed. With machine learning, machines can update their own code, whenever they come across a new situation.
  • 34. www.edureka.co/data-scienceData Science Certification Course using R Categories of Algorithm Supervised Learning 1 Supervised Learning is a type of machine learning algorithm that uses a known dataset to make predictions. Unsupervised Learning 2 Unsupervised Learning is a type of machine learning algorithm that uses a input datasets without labelled responses to draw inference. Reinforcement Learning 3 Reinforcement Learning is a type of algorithm inspired by behaviourist psychology, concerned with taking actions to maximise reward.
  • 36. www.edureka.co/data-scienceData Science Certification Course using R Data Science Tools 1. Datasets Hadoop 4 Big Data 3 R programming 2 Spark 55
  • 37. Demo