SlideShare ist ein Scribd-Unternehmen logo
1 von 24
Downloaden Sie, um offline zu lesen
CIJ is Sponsored By:
Career of Future
10/13/2015 1
About ME
Mohammed K. Barakat
• Industrial Engineer, The University of Jordan
• Business Excellence Manager-FINE Hygienic Paper Company
• Professional Engineer in Industrial Engineering (PE), (JCPQA-JEA)
• Project Management Professional (PMP), (PMI)
• Risk Management Professional (PMI-RMP), (PMI)
• Certified Six Sigma Black Belt (CSSBB), (ASQ)
• Certified Six Sigma Green Belt (CSSGB), (ASQ)
• Microsoft Certified Technology Specialist (MCTS), (Microsoft)
• Microsoft Certified Trainer (MCT), (Microsoft)
mohammedbarakat
MohdBarakat
MohdKBarakat
10/13/2015
2
Data Science: Career of the Future
10/13/2015
3
http://www.wired.com/insights/2014/06/tell-kids-data-scientists-doctors/
…Did you hear that? Data scientists earning more than
doctors…
…But salary is not the only reason…
…data scientists will have a measurable impact on the
future of healthcare.
Why Data Science?
10/13/2015
4
http://www.economist.com/node/15579717
…the quantity of information in the world is soaring
…150 exabytes (billion gigabytes) of data in 2005. This year,
it will create 1,200 exabytes…
…keeping up with this flood, and storing the bits that might
be useful, is difficult enough…
…Analyzing it, to spot patterns and extract useful
information, is harder..
…Even so, the data deluge is already starting to transform
business…
Why “Data Scientist” is a hugely important
profession in the next decade?
10/13/2015
5
“I keep saying that the sexy job in the next
10 years will be statisticians,” said Hal
Varian, chief economist at Google. “And I’m
not kidding.”
https://www.youtube.com/watch?v=pi472Mi3VLw
Why “Data Scientist” is a hugely important
profession in the next decade?
• …ability to take the data
10/13/2015
6
• …extract value from it
• …understand the process
• …visualize it
• …Not only at the professional level
• …communicate it
• …Ubiquitous data…but
• …Statisticians are just part of it
• …Scarcity in ability to understand data
and extract value from it
• …Managers need to access and
understand the data themselves
• …No army behind the scenes to
digest the information for you
What is Data Science?
10/13/2015
7
“Data Science is the extraction of knowledge from
large volumes of data that are structured or
unstructured”
often requires sorting through a great amount of
information and writing algorithms to extract insights
from this data.
What is Big Data?
10/13/2015
8
Big Data is high volume, high velocity, and/or high variety
information assets that require new forms of processing
to enable enhanced decision making, insight discovery
and process optimization."
The 3V’s of Big Data:
Volume: amount of data
Velocity: speed of data in and out
Variety: range of data type and sources
The Data Science Process
10/13/2015
9
The Data Scientist Toolbox
10/13/2015
10
R Software
a software environment for statistical
computing and graphics
The Data Scientist Toolbox
10/13/2015
11
RStudio
An open source software to make it easy for
anyone to analyze data with R
The Data Scientist Toolbox
10/13/2015
12
You’ve got to do a lot of
coding!
The Data Scientist Toolbox
10/13/2015
13
You’ve got to work out
a lot of statistics!
The Data Scientist Toolbox
10/13/2015
14
Github.com RPubs.com
Share your results and code
Publish your full report and build a personal Brand
The Data Scientist Toolbox
10/13/2015
15
RPubs.com
You’d be a Data Scientist…
…..evidence-based results
…..reproducible research
The Data Science process explained
10/13/2015
16
STEP 1: Getting and Cleaning Data
 Downloading files
 Reading data
 Raw vs. Tidy data
 Merging data
 Reshaping data
 Summarizing data
 Data ‘Housekeeping’
The Data Science process explained
10/13/2015
17
STEP 2: Exploratory Data Analysis
 understand data properties
 find patterns in data
 communicate results
 It is made quickly
 Many are made
 The goal is for personal understanding
The Data Science process explained
10/13/2015
18
STEP 3: Perform Statistical Inference
“Statistical inference is the process of drawing formal
conclusions from data”.
Some techniques and concepts:
 Sampling
 Randomization
 Hypothesis Testing
 Confidence Intervals (uncertainty)
 Experimental Design
The Data Science process explained
10/13/2015
19
STEP 4: Perform Regression Modelling
“a statistical process for estimating the
relationships among variables”
 understand how the value of the dependent
variable changes when any one of the
independent variables is varied.
 widely used for prediction (next step)
The Data Science process explained
10/13/2015
20
STEP 5: Perform Machine Learning
“is a computer's way of learning from examples
by using algorithms that take in data and
improve themselves to predict on new data”
Example:
The spam filter working in the background to
block your junk email.
The Data Science process explained
10/13/2015
21
STEP 6: Make your research Reproducible
“Make analytic data and code available so that
others may reproduce findings”
Why?!
To provide scientific evidence of your findings.
http://www.rpubs.com/mohammedkb/TransMPGAnalysis
What it takes you to be a good Data Scientist
10/13/2015
22
Business
skills Communications
skills
Analytical
skills
Computer
science
Statistics
Creativity
Scientific
Mindset
Passion &
Perseverance
What to do next?
10/13/2015
23
 Start learning about Data Science
 Go to the Massive Open Online Course (MOOC)
o Coursera/Data Science
o DataCamp
10/13/2015
24

Weitere ähnliche Inhalte

Was ist angesagt?

Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceSrishti44
 
Career in Data Science
Career in Data ScienceCareer in Data Science
Career in Data ScienceActonRoy
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecasesSreenatha Reddy K R
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceANOOP V S
 
Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptxSadhanaParameswaran
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceEdureka!
 
Introduction to-data-science
Introduction to-data-scienceIntroduction to-data-science
Introduction to-data-scienceAhmad karawash
 
Data Science Introduction
Data Science IntroductionData Science Introduction
Data Science IntroductionGang Tao
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...Simplilearn
 
Big Data Analytics Powerpoint Presentation Slide
Big Data Analytics Powerpoint Presentation SlideBig Data Analytics Powerpoint Presentation Slide
Big Data Analytics Powerpoint Presentation SlideSlideTeam
 
Presentation on Big Data Analytics
Presentation on Big Data AnalyticsPresentation on Big Data Analytics
Presentation on Big Data AnalyticsS P Sajjan
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Edureka!
 

Was ist angesagt? (20)

Data science
Data scienceData science
Data science
 
Data Science
Data ScienceData Science
Data Science
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Career in Data Science
Career in Data ScienceCareer in Data Science
Career in Data Science
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptx
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data science
Data science Data science
Data science
 
Data science
Data scienceData science
Data science
 
Introduction to-data-science
Introduction to-data-scienceIntroduction to-data-science
Introduction to-data-science
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Data science
Data scienceData science
Data science
 
Data Science Introduction
Data Science IntroductionData Science Introduction
Data Science Introduction
 
Data Science
Data ScienceData Science
Data Science
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
 
Big Data Analytics Powerpoint Presentation Slide
Big Data Analytics Powerpoint Presentation SlideBig Data Analytics Powerpoint Presentation Slide
Big Data Analytics Powerpoint Presentation Slide
 
Presentation on Big Data Analytics
Presentation on Big Data AnalyticsPresentation on Big Data Analytics
Presentation on Big Data Analytics
 
Data analytics
Data analyticsData analytics
Data analytics
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
 

Ähnlich wie Data science presentation 2nd CI day

1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptxarpit206900
 
Who is a data scientist
Who is a data scientist  Who is a data scientist
Who is a data scientist prateek kumar
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
Ch1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxCh1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxAbderrahmanABID2
 
Analytic Transformation | 2013 Loras College Business Analytics Symposium
Analytic Transformation | 2013 Loras College Business Analytics SymposiumAnalytic Transformation | 2013 Loras College Business Analytics Symposium
Analytic Transformation | 2013 Loras College Business Analytics SymposiumCartegraph
 
How to Prepare for a Career in Data Science
How to Prepare for a Career in Data ScienceHow to Prepare for a Career in Data Science
How to Prepare for a Career in Data ScienceJuuso Parkkinen
 
L2 DS Tools and Application.pptx
L2 DS Tools and Application.pptxL2 DS Tools and Application.pptx
L2 DS Tools and Application.pptxShambhavi Vats
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020Joanne Luciano
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
 
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdfA New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdfArmyTrilidiaDevegaSK
 
Sameer Kumar Das International Conference Paper 53
Sameer Kumar Das International Conference Paper 53Sameer Kumar Das International Conference Paper 53
Sameer Kumar Das International Conference Paper 53Mr.Sameer Kumar Das
 
التنقيب في البيانات - Data Mining
التنقيب في البيانات -  Data Miningالتنقيب في البيانات -  Data Mining
التنقيب في البيانات - Data Miningnabil_alsharafi
 
Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1Aravindharamanan S
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data scienceVipul Kalamkar
 
A Survey on Big Data Analytics: Challenges
A Survey on Big Data Analytics: ChallengesA Survey on Big Data Analytics: Challenges
A Survey on Big Data Analytics: ChallengesDr. Amarjeet Singh
 
A study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesA study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesBhanu Prakash
 

Ähnlich wie Data science presentation 2nd CI day (20)

1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
 
Who is a data scientist
Who is a data scientist  Who is a data scientist
Who is a data scientist
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Semantic Data Management
Semantic Data ManagementSemantic Data Management
Semantic Data Management
 
Ch1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxCh1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptx
 
Analytic Transformation | 2013 Loras College Business Analytics Symposium
Analytic Transformation | 2013 Loras College Business Analytics SymposiumAnalytic Transformation | 2013 Loras College Business Analytics Symposium
Analytic Transformation | 2013 Loras College Business Analytics Symposium
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
How to Prepare for a Career in Data Science
How to Prepare for a Career in Data ScienceHow to Prepare for a Career in Data Science
How to Prepare for a Career in Data Science
 
L2 DS Tools and Application.pptx
L2 DS Tools and Application.pptxL2 DS Tools and Application.pptx
L2 DS Tools and Application.pptx
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdfA New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
 
Sameer Kumar Das International Conference Paper 53
Sameer Kumar Das International Conference Paper 53Sameer Kumar Das International Conference Paper 53
Sameer Kumar Das International Conference Paper 53
 
التنقيب في البيانات - Data Mining
التنقيب في البيانات -  Data Miningالتنقيب في البيانات -  Data Mining
التنقيب في البيانات - Data Mining
 
Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data science
 
A Survey on Big Data Analytics: Challenges
A Survey on Big Data Analytics: ChallengesA Survey on Big Data Analytics: Challenges
A Survey on Big Data Analytics: Challenges
 
A study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesA study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websites
 

Kürzlich hochgeladen

Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...amitlee9823
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 

Kürzlich hochgeladen (20)

Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 

Data science presentation 2nd CI day

  • 1. CIJ is Sponsored By: Career of Future 10/13/2015 1
  • 2. About ME Mohammed K. Barakat • Industrial Engineer, The University of Jordan • Business Excellence Manager-FINE Hygienic Paper Company • Professional Engineer in Industrial Engineering (PE), (JCPQA-JEA) • Project Management Professional (PMP), (PMI) • Risk Management Professional (PMI-RMP), (PMI) • Certified Six Sigma Black Belt (CSSBB), (ASQ) • Certified Six Sigma Green Belt (CSSGB), (ASQ) • Microsoft Certified Technology Specialist (MCTS), (Microsoft) • Microsoft Certified Trainer (MCT), (Microsoft) mohammedbarakat MohdBarakat MohdKBarakat 10/13/2015 2
  • 3. Data Science: Career of the Future 10/13/2015 3 http://www.wired.com/insights/2014/06/tell-kids-data-scientists-doctors/ …Did you hear that? Data scientists earning more than doctors… …But salary is not the only reason… …data scientists will have a measurable impact on the future of healthcare.
  • 4. Why Data Science? 10/13/2015 4 http://www.economist.com/node/15579717 …the quantity of information in the world is soaring …150 exabytes (billion gigabytes) of data in 2005. This year, it will create 1,200 exabytes… …keeping up with this flood, and storing the bits that might be useful, is difficult enough… …Analyzing it, to spot patterns and extract useful information, is harder.. …Even so, the data deluge is already starting to transform business…
  • 5. Why “Data Scientist” is a hugely important profession in the next decade? 10/13/2015 5 “I keep saying that the sexy job in the next 10 years will be statisticians,” said Hal Varian, chief economist at Google. “And I’m not kidding.” https://www.youtube.com/watch?v=pi472Mi3VLw
  • 6. Why “Data Scientist” is a hugely important profession in the next decade? • …ability to take the data 10/13/2015 6 • …extract value from it • …understand the process • …visualize it • …Not only at the professional level • …communicate it • …Ubiquitous data…but • …Statisticians are just part of it • …Scarcity in ability to understand data and extract value from it • …Managers need to access and understand the data themselves • …No army behind the scenes to digest the information for you
  • 7. What is Data Science? 10/13/2015 7 “Data Science is the extraction of knowledge from large volumes of data that are structured or unstructured” often requires sorting through a great amount of information and writing algorithms to extract insights from this data.
  • 8. What is Big Data? 10/13/2015 8 Big Data is high volume, high velocity, and/or high variety information assets that require new forms of processing to enable enhanced decision making, insight discovery and process optimization." The 3V’s of Big Data: Volume: amount of data Velocity: speed of data in and out Variety: range of data type and sources
  • 9. The Data Science Process 10/13/2015 9
  • 10. The Data Scientist Toolbox 10/13/2015 10 R Software a software environment for statistical computing and graphics
  • 11. The Data Scientist Toolbox 10/13/2015 11 RStudio An open source software to make it easy for anyone to analyze data with R
  • 12. The Data Scientist Toolbox 10/13/2015 12 You’ve got to do a lot of coding!
  • 13. The Data Scientist Toolbox 10/13/2015 13 You’ve got to work out a lot of statistics!
  • 14. The Data Scientist Toolbox 10/13/2015 14 Github.com RPubs.com Share your results and code Publish your full report and build a personal Brand
  • 15. The Data Scientist Toolbox 10/13/2015 15 RPubs.com You’d be a Data Scientist… …..evidence-based results …..reproducible research
  • 16. The Data Science process explained 10/13/2015 16 STEP 1: Getting and Cleaning Data  Downloading files  Reading data  Raw vs. Tidy data  Merging data  Reshaping data  Summarizing data  Data ‘Housekeeping’
  • 17. The Data Science process explained 10/13/2015 17 STEP 2: Exploratory Data Analysis  understand data properties  find patterns in data  communicate results  It is made quickly  Many are made  The goal is for personal understanding
  • 18. The Data Science process explained 10/13/2015 18 STEP 3: Perform Statistical Inference “Statistical inference is the process of drawing formal conclusions from data”. Some techniques and concepts:  Sampling  Randomization  Hypothesis Testing  Confidence Intervals (uncertainty)  Experimental Design
  • 19. The Data Science process explained 10/13/2015 19 STEP 4: Perform Regression Modelling “a statistical process for estimating the relationships among variables”  understand how the value of the dependent variable changes when any one of the independent variables is varied.  widely used for prediction (next step)
  • 20. The Data Science process explained 10/13/2015 20 STEP 5: Perform Machine Learning “is a computer's way of learning from examples by using algorithms that take in data and improve themselves to predict on new data” Example: The spam filter working in the background to block your junk email.
  • 21. The Data Science process explained 10/13/2015 21 STEP 6: Make your research Reproducible “Make analytic data and code available so that others may reproduce findings” Why?! To provide scientific evidence of your findings. http://www.rpubs.com/mohammedkb/TransMPGAnalysis
  • 22. What it takes you to be a good Data Scientist 10/13/2015 22 Business skills Communications skills Analytical skills Computer science Statistics Creativity Scientific Mindset Passion & Perseverance
  • 23. What to do next? 10/13/2015 23  Start learning about Data Science  Go to the Massive Open Online Course (MOOC) o Coursera/Data Science o DataCamp