SlideShare ist ein Scribd-Unternehmen logo
1 von 61
Downloaden Sie, um offline zu lesen
How to become a Data
Scientist?
Manjunath Sindagi
Pune Developer’s Community Annual Event
20.01.2018
Co-Founder @Hyperdata.io
and
Data Science Advisor
Focus on:
Apply AI to Business Problems.
Agenda
●
● Why Data Scientist?
● Artificial Intelligence
● Being a Data Scientist
● Broad Skill Sets
● Skill Sets based on your Profile
● Typical Team in Data Science
● Conclusion & Final Remarks
● References to start
Why data scientist?
Why data
scientist?
● Buzzword
● Fancy, Cool
● Demand
● Data Growth
● Passion
No. 1 Job in Glassdoor for 2016 and 2017
Average Salary is Higher
Growth of Data Scientist - IBM
Any industry that has digitized data, people are
needed to support the ecosystem and find
insights from the data.
Challenges of
Data
4 Vs of Data
➢ Volume
➢ Velocity
➢ Variety
➢ Veracity
Data Science and Analytics (DSA) jobs remain
open an average of 45 days, five days longer
than the market average.
Big Data and Data Science Skills are
most challenging to recruit for and
potentially can create the greatest
disruption.
Some 59% of all DSA job demand
currently is in finance and insurance,
professional services and IT sector.
(according to IBM)
Job Trends from Indeed.com
Google Trends
Artificial Intelligence
Artificial
Intelligence
Artificial Intelligence is defined as
the science of making computers
do things that require intelligence
when done by Humans
Making sense out of data .
AI And Related Fields
Being a Data Scientist
Who is Data Scientist?
Who is Data
Scientist?
One who has wide breadth of
abilities:
● Academic curiosity
● Storytelling
● Product sense
● Engineering experience
● Cleverness
And above all
● deep domain expertise in
Mathematics, Statistical and
Machine Learning
T - Shaped Skill Set
What is Data Science?
Data science, also known as data-driven science, is an
interdisciplinary field about scientific methods, processes,
and systems to extract knowledge or insights from data in
various forms, either structured or unstructured, similar to
data mining.
- Wikipedia : https://en.wikipedia.org/wiki/Data_science
Data Science
Interdisciplinary Field
Being a Data Scientist is Jack of
All, Master of Everything!
Broad Skills -
Knowledge Prerequisites
Programming
● Strong Programming Skills
● Strong with Python, R or Java
● Fundamentally Strong Data
Structure Knowledge
● Debugging Skills
● Exceptional Problem Solving
Skills
Mathematics
● Algebra
● Statistics
● Differentiation
● Calculus
Core Areas
Pick One
● Information Retrieval
● Natural Language Processing
● Linguistics
● Machine Learning
● Image Processing
● Video Processing
● Speech Processing
● Then pick Neural Networks and
Deep Learning
Tools
and
Technology
If not all at least 3-4
● Excel
● R, Python
● Spark
● Hadoop
● Scala
● AWS
● Solr, Elastic Search
● New ML Libraries - Tensorflow,
Caffe
● Queueing System
Top Tools used in 2015-17 : Kdnuggets
Application of
Algorithms
● Practical Implementations
● Follow Kaggle, KDNuggets and
solve problems
● Ability to quickly suggest
algorithms to apply and also to
implement the same
● Working with a Mentor will help
Data Savvy
● Data Oriented Mindset
● Quickly understand the
problem and give solutions in
short span of time
● Ability to think how data can
add value to business and what
insights can be driven.
As a data scientist, if you know nothing
else, you need to know how to take
some data, munge it, clean it, filter it ,
mine it, visualize it and then validate.
It’s a very long process
Learning
PathWays
● MS/MTech/PhD
● Self Study
● Boot Camps
● Online Courses
Skill Set based on Current Profile
Skill Sets Focus
● Exceptionally Strong
Programming Skills
● Strong Data Structure
Knowledge
● Master Python, R, Java
● Github Profile
● Then, Work in
Companies to Solve
Problems
Freshers
Skill Sets Focus
● Mathematics
● Course - Take Up a
Course Online.
● Pick up a Area - ML, NLP,
Linguistics etc
● Apply and Solve
Problems in Kaggle
Programmers
(> 2 Years Experience)
Skill Sets Focus
● Strong AWS Knowledge
● Knowledge of ML/DL
Libraries and Tools
● Photographer’s Mind
● Be a Data Engineer than a
Scientist
● Practice, Practice,
Practice
Programmers
(> 2 Years Experience)
Skill Set Focus
Programmers with over 10
Years Experience
● Their curiosity helps to find the
problem on their own and they
solve it themselves.
● Can take a course and Talk to
people with experience in these
areas.
● Inability to admit the lack of
knowledge
● Understand Scale Challenges
with Data
Skill Set Focus
Business Analyst/Managers
● Take a Course
○ Understand how things are built
○ Not necessary to know
mathematics or programming
● Understand the steps in ML
○ Data Collection
○ Data Preparation
○ Model Selection
○ Training
○ Evaluation
○ Parameter Tuning
○ Prediction
Skill Set Focus
Business Analyst/Managers
● Incremental Systems
● Accuracy Models
● View AI Videos applied to
Business.
● Log the data properly in your
applications.
● Ability to convey problems to
Solutios Architects
Skill Set Focus
Database Admins
● Understand different types of Data
○ Text,Images,Numbers,Files etc
● Learn all about storage
mechanisms, advantages,
disadvantages of different
databases
○ NoSQL - Mongo, Cassandra, GraphDB
(Neo4J), CouchDB
○ SQL
Skill Set Focus
Database Admins
● Ability to convey what
database is optimal to what
type of data.
● Design and Build Models for
various kinds of data on paper
● Practice Modeling of Data
Extensively.
Skill Set Focus
Domain Experts/CxOs
● Same as Business
Analysts/Managers.
● Formulating the Business
around Data
● AI is used to solve business
problem
● AI is used for Automation
Typical Team
So, is it possible to be Jack of All,
and Master of Everything?
Typical Team
● Domain Expert (Business
Analyst/Product Manager)
● Solutions Architect
● Developer - Data
Collection/Preparation
● Data Scientist/NLP
● Data Engineer.
● Application Developer (Expert in
Visualization domain)
Data Scientist vs Data Engineers
Data Scientist
● Mathematics, Statistics etc.
● Expert in ML, NLP etc in at
least one of these areas..
● Knows to Apply Different ML
Models and Algorithms
Data Engineers
● Takes inputs from Data
Scientists once the problem is
solved
● Exceptionally good at
Programming and different
tools
● Solves problems at Scale
● Productionize the solutions.
Concluding
Remarks
● Programming
● Engineering Problems
● Data Organization and
Modeling Problems
● Data collection and Preparation
Major Challenges in Data
Data Science -
Task
Allocation
Data Acquisition and
Preparation - Major time
consuming task
Concluding
Remarks
● Everything is on cloud, let’s use it.
● Unaware of Business Value
● Clueless About Data Science and
Related Technologies
● Solutions Architect and Domain
Experts are critical to know before
you join.
● Vision of the Company
Major Challenges in Companies
Final Remarks Work with Mentor
Have a Photographer’s Mind.
Choose your career wisely
References to Start
Online Course
● Coursera : Andrew NG Machine
Learning Course https://goo.gl/fDTwSE
● Youtube : Prof. Sengupta
https://goo.gl/JGG6th
People and
Books
● People to follow.
○ Andrew NG
○ Bernard Marr - AI Journalist.
○ Geoffrey Hinton
○ Roman Trusov
○ Many people :
https://www.quora.com/Who-are-some-
notable-machine-learning-researchers
● Books
○ Programming Collective Intelligence
Q & A
Thank You :)

Weitere ähnliche Inhalte

Was ist angesagt?

Introduction to Python for Data Science
Introduction to Python for Data ScienceIntroduction to Python for Data Science
Introduction to Python for Data ScienceArc & Codementor
 
How to become a Data Scientist?
How to become a Data Scientist? How to become a Data Scientist?
How to become a Data Scientist? HackerEarth
 
Data+Science : A First Course
Data+Science : A First CourseData+Science : A First Course
Data+Science : A First CourseArnab Majumdar
 
Data science e machine learning
Data science e machine learningData science e machine learning
Data science e machine learningGiuseppe Manco
 
Claudia Gold: Learning Data Science Online
Claudia Gold: Learning Data Science OnlineClaudia Gold: Learning Data Science Online
Claudia Gold: Learning Data Science Onlinesfdatascience
 
The path to be a data scientist
The path to be a data scientistThe path to be a data scientist
The path to be a data scientistPoo Kuan Hoong
 
Life of a data scientist (pub)
Life of a data scientist (pub)Life of a data scientist (pub)
Life of a data scientist (pub)Buhwan Jeong
 
Introduction to Data Science and Large-scale Machine Learning
Introduction to Data Science and Large-scale Machine LearningIntroduction to Data Science and Large-scale Machine Learning
Introduction to Data Science and Large-scale Machine LearningNik Spirin
 
Myths and Mathemagical Superpowers of Data Scientists
Myths and Mathemagical Superpowers of Data ScientistsMyths and Mathemagical Superpowers of Data Scientists
Myths and Mathemagical Superpowers of Data ScientistsDavid Pittman
 
Data Science presentation for elementary school students
Data Science presentation for elementary school studentsData Science presentation for elementary school students
Data Science presentation for elementary school studentsMelanie Manning, CFA
 
From Lab to Factory: Or how to turn data into value
From Lab to Factory: Or how to turn data into valueFrom Lab to Factory: Or how to turn data into value
From Lab to Factory: Or how to turn data into valuePeadar Coyle
 
Clare Corthell: Learning Data Science Online
Clare Corthell: Learning Data Science OnlineClare Corthell: Learning Data Science Online
Clare Corthell: Learning Data Science Onlinesfdatascience
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceNiko Vuokko
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Edureka!
 
How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace Mohamadreza Mohtat
 
From SQL to Python - A Beginner's Guide to Making the Switch
From SQL to Python - A Beginner's Guide to Making the SwitchFrom SQL to Python - A Beginner's Guide to Making the Switch
From SQL to Python - A Beginner's Guide to Making the SwitchRachel Berryman
 

Was ist angesagt? (20)

Introduction to Python for Data Science
Introduction to Python for Data ScienceIntroduction to Python for Data Science
Introduction to Python for Data Science
 
How to become a Data Scientist?
How to become a Data Scientist? How to become a Data Scientist?
How to become a Data Scientist?
 
Data+Science : A First Course
Data+Science : A First CourseData+Science : A First Course
Data+Science : A First Course
 
Data science e machine learning
Data science e machine learningData science e machine learning
Data science e machine learning
 
Claudia Gold: Learning Data Science Online
Claudia Gold: Learning Data Science OnlineClaudia Gold: Learning Data Science Online
Claudia Gold: Learning Data Science Online
 
The path to be a data scientist
The path to be a data scientistThe path to be a data scientist
The path to be a data scientist
 
Life of a data scientist (pub)
Life of a data scientist (pub)Life of a data scientist (pub)
Life of a data scientist (pub)
 
Introduction to Data Science and Large-scale Machine Learning
Introduction to Data Science and Large-scale Machine LearningIntroduction to Data Science and Large-scale Machine Learning
Introduction to Data Science and Large-scale Machine Learning
 
Myths and Mathemagical Superpowers of Data Scientists
Myths and Mathemagical Superpowers of Data ScientistsMyths and Mathemagical Superpowers of Data Scientists
Myths and Mathemagical Superpowers of Data Scientists
 
Data Science presentation for elementary school students
Data Science presentation for elementary school studentsData Science presentation for elementary school students
Data Science presentation for elementary school students
 
From Lab to Factory: Or how to turn data into value
From Lab to Factory: Or how to turn data into valueFrom Lab to Factory: Or how to turn data into value
From Lab to Factory: Or how to turn data into value
 
Clare Corthell: Learning Data Science Online
Clare Corthell: Learning Data Science OnlineClare Corthell: Learning Data Science Online
Clare Corthell: Learning Data Science Online
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data Science: Past, Present, and Future
Data Science: Past, Present, and FutureData Science: Past, Present, and Future
Data Science: Past, Present, and Future
 
First Steps on Big Data
First Steps on Big DataFirst Steps on Big Data
First Steps on Big Data
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
 
How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace
 
From SQL to Python - A Beginner's Guide to Making the Switch
From SQL to Python - A Beginner's Guide to Making the SwitchFrom SQL to Python - A Beginner's Guide to Making the Switch
From SQL to Python - A Beginner's Guide to Making the Switch
 
Data science
Data scienceData science
Data science
 
Data Science using Python
Data Science using PythonData Science using Python
Data Science using Python
 

Ähnlich wie How to become a data scientist

Job Profiles in Big Data - StackDataLabs
Job Profiles in Big Data - StackDataLabsJob Profiles in Big Data - StackDataLabs
Job Profiles in Big Data - StackDataLabsStack Data Labs
 
Building successful data science teams
Building successful data science teamsBuilding successful data science teams
Building successful data science teamsVenkatesh Umaashankar
 
Key Roles In Data-Driven Organisation
Key Roles In Data-Driven OrganisationKey Roles In Data-Driven Organisation
Key Roles In Data-Driven OrganisationKnoldus Inc.
 
Key Roles In Data-Driven Organisation
Key Roles In Data-Driven OrganisationKey Roles In Data-Driven Organisation
Key Roles In Data-Driven OrganisationKnoldus Inc.
 
General introduction to AI ML DL DS
General introduction to AI ML DL DSGeneral introduction to AI ML DL DS
General introduction to AI ML DL DSRoopesh Kohad
 
Working with data.pdf
Working with data.pdfWorking with data.pdf
Working with data.pdfBJZarate
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data scienceSampath Kumar
 
Careers in Data Science _ Navigating the Digital Frontier (1).pptx
Careers in Data Science _  Navigating the Digital Frontier (1).pptxCareers in Data Science _  Navigating the Digital Frontier (1).pptx
Careers in Data Science _ Navigating the Digital Frontier (1).pptx2075AAGEPRATIK
 
AI hype or reality
AI  hype or realityAI  hype or reality
AI hype or realityAwantik Das
 
Hiring tips for data roles - Nikunj Verma (C.E.O & Co-founder at CutShort.io)
Hiring tips for data roles - Nikunj Verma (C.E.O & Co-founder at CutShort.io)Hiring tips for data roles - Nikunj Verma (C.E.O & Co-founder at CutShort.io)
Hiring tips for data roles - Nikunj Verma (C.E.O & Co-founder at CutShort.io)Anupran Trivedi
 
Big Data for Data Scientists - Info Session
Big Data for Data Scientists - Info SessionBig Data for Data Scientists - Info Session
Big Data for Data Scientists - Info SessionWeCloudData
 
Data Con LA 2019 - The challenges of data science for veteran media organizat...
Data Con LA 2019 - The challenges of data science for veteran media organizat...Data Con LA 2019 - The challenges of data science for veteran media organizat...
Data Con LA 2019 - The challenges of data science for veteran media organizat...Data Con LA
 
Become a successful Data Scientist. Start Now!
Become a successful Data Scientist. Start Now!Become a successful Data Scientist. Start Now!
Become a successful Data Scientist. Start Now!Edology
 
Data Science Environment with R on openSUSE Leap 15.1
Data Science Environment with R on openSUSE Leap 15.1Data Science Environment with R on openSUSE Leap 15.1
Data Science Environment with R on openSUSE Leap 15.1Sabar Suwarsono
 
Data Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxData Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxCarolineRebeccaD
 
Data science a practitioner's perspective
Data science  a practitioner's perspectiveData science  a practitioner's perspective
Data science a practitioner's perspectiveAmir Ziai
 
Choosing the Right Database - Facebook DevC Malang Hackdays 2017
Choosing the Right Database - Facebook DevC Malang Hackdays 2017Choosing the Right Database - Facebook DevC Malang Hackdays 2017
Choosing the Right Database - Facebook DevC Malang Hackdays 2017Rendy Bambang Junior
 

Ähnlich wie How to become a data scientist (20)

Job Profiles in Big Data - StackDataLabs
Job Profiles in Big Data - StackDataLabsJob Profiles in Big Data - StackDataLabs
Job Profiles in Big Data - StackDataLabs
 
Building successful data science teams
Building successful data science teamsBuilding successful data science teams
Building successful data science teams
 
Key Roles In Data-Driven Organisation
Key Roles In Data-Driven OrganisationKey Roles In Data-Driven Organisation
Key Roles In Data-Driven Organisation
 
Key Roles In Data-Driven Organisation
Key Roles In Data-Driven OrganisationKey Roles In Data-Driven Organisation
Key Roles In Data-Driven Organisation
 
Brochure
BrochureBrochure
Brochure
 
General introduction to AI ML DL DS
General introduction to AI ML DL DSGeneral introduction to AI ML DL DS
General introduction to AI ML DL DS
 
Working with data.pdf
Working with data.pdfWorking with data.pdf
Working with data.pdf
 
L15.pptx
L15.pptxL15.pptx
L15.pptx
 
Data science as career
Data science as careerData science as career
Data science as career
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Careers in Data Science _ Navigating the Digital Frontier (1).pptx
Careers in Data Science _  Navigating the Digital Frontier (1).pptxCareers in Data Science _  Navigating the Digital Frontier (1).pptx
Careers in Data Science _ Navigating the Digital Frontier (1).pptx
 
AI hype or reality
AI  hype or realityAI  hype or reality
AI hype or reality
 
Hiring tips for data roles - Nikunj Verma (C.E.O & Co-founder at CutShort.io)
Hiring tips for data roles - Nikunj Verma (C.E.O & Co-founder at CutShort.io)Hiring tips for data roles - Nikunj Verma (C.E.O & Co-founder at CutShort.io)
Hiring tips for data roles - Nikunj Verma (C.E.O & Co-founder at CutShort.io)
 
Big Data for Data Scientists - Info Session
Big Data for Data Scientists - Info SessionBig Data for Data Scientists - Info Session
Big Data for Data Scientists - Info Session
 
Data Con LA 2019 - The challenges of data science for veteran media organizat...
Data Con LA 2019 - The challenges of data science for veteran media organizat...Data Con LA 2019 - The challenges of data science for veteran media organizat...
Data Con LA 2019 - The challenges of data science for veteran media organizat...
 
Become a successful Data Scientist. Start Now!
Become a successful Data Scientist. Start Now!Become a successful Data Scientist. Start Now!
Become a successful Data Scientist. Start Now!
 
Data Science Environment with R on openSUSE Leap 15.1
Data Science Environment with R on openSUSE Leap 15.1Data Science Environment with R on openSUSE Leap 15.1
Data Science Environment with R on openSUSE Leap 15.1
 
Data Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxData Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptx
 
Data science a practitioner's perspective
Data science  a practitioner's perspectiveData science  a practitioner's perspective
Data science a practitioner's perspective
 
Choosing the Right Database - Facebook DevC Malang Hackdays 2017
Choosing the Right Database - Facebook DevC Malang Hackdays 2017Choosing the Right Database - Facebook DevC Malang Hackdays 2017
Choosing the Right Database - Facebook DevC Malang Hackdays 2017
 

Kürzlich hochgeladen

GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 

Kürzlich hochgeladen (20)

Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 

How to become a data scientist

  • 1. How to become a Data Scientist? Manjunath Sindagi Pune Developer’s Community Annual Event 20.01.2018
  • 3. Focus on: Apply AI to Business Problems.
  • 4. Agenda ● ● Why Data Scientist? ● Artificial Intelligence ● Being a Data Scientist ● Broad Skill Sets ● Skill Sets based on your Profile ● Typical Team in Data Science ● Conclusion & Final Remarks ● References to start
  • 6. Why data scientist? ● Buzzword ● Fancy, Cool ● Demand ● Data Growth ● Passion
  • 7. No. 1 Job in Glassdoor for 2016 and 2017
  • 9. Growth of Data Scientist - IBM
  • 10. Any industry that has digitized data, people are needed to support the ecosystem and find insights from the data.
  • 11. Challenges of Data 4 Vs of Data ➢ Volume ➢ Velocity ➢ Variety ➢ Veracity
  • 12. Data Science and Analytics (DSA) jobs remain open an average of 45 days, five days longer than the market average.
  • 13. Big Data and Data Science Skills are most challenging to recruit for and potentially can create the greatest disruption.
  • 14. Some 59% of all DSA job demand currently is in finance and insurance, professional services and IT sector. (according to IBM)
  • 15. Job Trends from Indeed.com
  • 18. Artificial Intelligence Artificial Intelligence is defined as the science of making computers do things that require intelligence when done by Humans Making sense out of data .
  • 19. AI And Related Fields
  • 20. Being a Data Scientist
  • 21. Who is Data Scientist?
  • 22. Who is Data Scientist? One who has wide breadth of abilities: ● Academic curiosity ● Storytelling ● Product sense ● Engineering experience ● Cleverness And above all ● deep domain expertise in Mathematics, Statistical and Machine Learning
  • 23. T - Shaped Skill Set
  • 24. What is Data Science? Data science, also known as data-driven science, is an interdisciplinary field about scientific methods, processes, and systems to extract knowledge or insights from data in various forms, either structured or unstructured, similar to data mining. - Wikipedia : https://en.wikipedia.org/wiki/Data_science
  • 26. Being a Data Scientist is Jack of All, Master of Everything!
  • 27. Broad Skills - Knowledge Prerequisites
  • 28. Programming ● Strong Programming Skills ● Strong with Python, R or Java ● Fundamentally Strong Data Structure Knowledge ● Debugging Skills ● Exceptional Problem Solving Skills
  • 29. Mathematics ● Algebra ● Statistics ● Differentiation ● Calculus
  • 30. Core Areas Pick One ● Information Retrieval ● Natural Language Processing ● Linguistics ● Machine Learning ● Image Processing ● Video Processing ● Speech Processing ● Then pick Neural Networks and Deep Learning
  • 31. Tools and Technology If not all at least 3-4 ● Excel ● R, Python ● Spark ● Hadoop ● Scala ● AWS ● Solr, Elastic Search ● New ML Libraries - Tensorflow, Caffe ● Queueing System
  • 32. Top Tools used in 2015-17 : Kdnuggets
  • 33. Application of Algorithms ● Practical Implementations ● Follow Kaggle, KDNuggets and solve problems ● Ability to quickly suggest algorithms to apply and also to implement the same ● Working with a Mentor will help
  • 34. Data Savvy ● Data Oriented Mindset ● Quickly understand the problem and give solutions in short span of time ● Ability to think how data can add value to business and what insights can be driven.
  • 35. As a data scientist, if you know nothing else, you need to know how to take some data, munge it, clean it, filter it , mine it, visualize it and then validate. It’s a very long process
  • 36. Learning PathWays ● MS/MTech/PhD ● Self Study ● Boot Camps ● Online Courses
  • 37. Skill Set based on Current Profile
  • 38. Skill Sets Focus ● Exceptionally Strong Programming Skills ● Strong Data Structure Knowledge ● Master Python, R, Java ● Github Profile ● Then, Work in Companies to Solve Problems Freshers
  • 39. Skill Sets Focus ● Mathematics ● Course - Take Up a Course Online. ● Pick up a Area - ML, NLP, Linguistics etc ● Apply and Solve Problems in Kaggle Programmers (> 2 Years Experience)
  • 40. Skill Sets Focus ● Strong AWS Knowledge ● Knowledge of ML/DL Libraries and Tools ● Photographer’s Mind ● Be a Data Engineer than a Scientist ● Practice, Practice, Practice Programmers (> 2 Years Experience)
  • 41. Skill Set Focus Programmers with over 10 Years Experience ● Their curiosity helps to find the problem on their own and they solve it themselves. ● Can take a course and Talk to people with experience in these areas. ● Inability to admit the lack of knowledge ● Understand Scale Challenges with Data
  • 42. Skill Set Focus Business Analyst/Managers ● Take a Course ○ Understand how things are built ○ Not necessary to know mathematics or programming ● Understand the steps in ML ○ Data Collection ○ Data Preparation ○ Model Selection ○ Training ○ Evaluation ○ Parameter Tuning ○ Prediction
  • 43. Skill Set Focus Business Analyst/Managers ● Incremental Systems ● Accuracy Models ● View AI Videos applied to Business. ● Log the data properly in your applications. ● Ability to convey problems to Solutios Architects
  • 44. Skill Set Focus Database Admins ● Understand different types of Data ○ Text,Images,Numbers,Files etc ● Learn all about storage mechanisms, advantages, disadvantages of different databases ○ NoSQL - Mongo, Cassandra, GraphDB (Neo4J), CouchDB ○ SQL
  • 45. Skill Set Focus Database Admins ● Ability to convey what database is optimal to what type of data. ● Design and Build Models for various kinds of data on paper ● Practice Modeling of Data Extensively.
  • 46. Skill Set Focus Domain Experts/CxOs ● Same as Business Analysts/Managers. ● Formulating the Business around Data ● AI is used to solve business problem ● AI is used for Automation
  • 48. So, is it possible to be Jack of All, and Master of Everything?
  • 49. Typical Team ● Domain Expert (Business Analyst/Product Manager) ● Solutions Architect ● Developer - Data Collection/Preparation ● Data Scientist/NLP ● Data Engineer. ● Application Developer (Expert in Visualization domain)
  • 50. Data Scientist vs Data Engineers
  • 51. Data Scientist ● Mathematics, Statistics etc. ● Expert in ML, NLP etc in at least one of these areas.. ● Knows to Apply Different ML Models and Algorithms
  • 52. Data Engineers ● Takes inputs from Data Scientists once the problem is solved ● Exceptionally good at Programming and different tools ● Solves problems at Scale ● Productionize the solutions.
  • 53. Concluding Remarks ● Programming ● Engineering Problems ● Data Organization and Modeling Problems ● Data collection and Preparation Major Challenges in Data
  • 54. Data Science - Task Allocation Data Acquisition and Preparation - Major time consuming task
  • 55. Concluding Remarks ● Everything is on cloud, let’s use it. ● Unaware of Business Value ● Clueless About Data Science and Related Technologies ● Solutions Architect and Domain Experts are critical to know before you join. ● Vision of the Company Major Challenges in Companies
  • 56. Final Remarks Work with Mentor Have a Photographer’s Mind. Choose your career wisely
  • 58. Online Course ● Coursera : Andrew NG Machine Learning Course https://goo.gl/fDTwSE ● Youtube : Prof. Sengupta https://goo.gl/JGG6th
  • 59. People and Books ● People to follow. ○ Andrew NG ○ Bernard Marr - AI Journalist. ○ Geoffrey Hinton ○ Roman Trusov ○ Many people : https://www.quora.com/Who-are-some- notable-machine-learning-researchers ● Books ○ Programming Collective Intelligence
  • 60. Q & A