SlideShare ist ein Scribd-Unternehmen logo
1 von 30
Current and Future Challenges in Data Science
Nathaniel Shimoni
First a bit about myself…
• Married to Yonit & father of Yahav & Arbel
• Lecturer & researcher @ Ben Gurion University
• Worked as a data science researcher @
• Co-founded the non-profit “Deep-Learning-Boot-Camp”
• Co-founder of Dudes (still in stealth)
• Ranked (at peak) #175 of over 80,000 active Kaggle competitors
• Self taught and very(!) curious
Data is everywhere
Nature monitoring
Automatic x-ray analysis
Seizure prediction
Fashion simulations
Video analysis
Speech 2 Text 2 Speech
But what is data science?
Well… the answer largely depends on who do we ask…
We will use some loose/partial definition most can agree upon:
Data science is:
Both academic and practical research field that is aimed to
programmatically extract insights and knowledge
from data in a reproduceable & generalizable manner
Training
data
New unseen
data
The enablers!
or
“These are the things that
made such abilities possible”
Current state – enablers
Hugh increase in compute ability mainly GPU compute and RAM
up to 48GB RAM on a single card & 11GB for most consumer cards
Highly available and
scalable through
cloud services
Current state – enablers
Large open and labeled datasets
“BookCorpus” dataset
(11,038 books)
MSCOCOcelebsAImageNet
Yelp Open Datasets
Current state – enablers
Open source code and large model zoo
UNet
Efficient Net
Faster RCNN
Current state – enablers
Good ability to perform transfer-learning and fine tuning between datasets
transfer-learning
Current tasks
and common applications
Current state tasks and applications – visual domain (partial list)
visual
domain
Classification
Detection
Segmentation
Pose
Estimation
Style transfer
Generation
(GANs)Document
Authentication
Deep Fake &
Face-swap
Adversarial
attacks
Depth
estimation
Tracking
Search Within
Images
Object & Crowd
Counting
Regression
Similarity &
metric learning
Image based
search
Severity
Estimation
Object price
Estimation
Super
resolution
Image
colorization
Image
captioning
Visual
question
answering
OCR
Image-Text
cross learning
Disease
classification
Activity
classification
Medical
imagery
segmentation
Risky objects
segmentation
Route
segmentation
auto. vehicle
Object Relation
Inference
Current tasks and applications – language domain (partial list)
language
domain
Classification
Machine
Translation
Free text to
structured
Sentiment
analysis
Question
Answering
summarization
Inappropriate
content
Semantic Text Similarity
(e.g. for search relevance)
Research
papers
legal
medical
Call centers
Automatic
Rating
Stress
detection
Named Entity
Recognition
Part of Speech
Tagging
Semantic Role
Labeling
Conversational
Bots
Free Text
querying
Online
Translation
Auto
Language
Classification
Current tasks and applications - other domains (extremely partial list)
Other
domains
Classification
Time series
sound Tabular Data
Graph based
learning
Speech 2 Text
Text 2 Speech
Speaker
Separation
Background
Noise removal
classification forecasting
Signal
disaggregation
segmentation
Anomaly
detection
regression
clustering
Content
recommendation
Node / Graph
Embeddings
Graph Relation
Inference
Partial Graph
Completion
Current challenges
Current challenges
• Dirty data (garbage in garbage out problem)
Current challenges
• Industry talent gap
Job postings at top companies
Increase in job postings since 2012
Current challenges
• Accelerating rate of change
Current challenges
• Limited or lack of monitoring in production systems
In many production systems monitoring is either lacking or limited
Training data
Production data
On deployment
Production data
2 month later
Production data
4 month later
Future challenges
Future challenges
• Bias detection and bias correction
professional hairstyle for work:
unprofessional hairstyle for work:
Gender bias in word embeddings:
Man -> doctor woman -> nurse
Future challenges
• Learning from very few samples
So, how many
data samples do
you have???
Future challenges
• Explainability of results
We need to
operate. urgently!
But why???
Future challenges
• Cross modality learning
The dog is about
to chase a cat
Voice
Image / video
text
Complementary inputs to the same model
Future challenges
• Function learning
A chair is an object
you can seat on
A parking spot is a place to
leave the car when not driving
Future challenges
• Wide adoption of content generation
Great potential in:
• Film making
• Transcribing
• Content candidate generation
• Art
• Simulation
Future challenges
• Authenticity of documents, content and digital records
Future challenges
• AutoML and End2End applications
data
Task (?)
Desired
results
Future challenges
• Continuous monitoring and validation
With great power comes great responsibility
https://github.com/daviddao/awful-ai
Well… these are powerful tools – we must use them carefully
Future challenges
Data science involves many
other positions
Communication with
business is crucial for success
Management must become
data-literate
• Metrics
• Tasks
• Validation
• Findings to actions
• Engineering
• Analysts
• Business
• IT
• Results & goals
• Assumptions
• Domain knowledge
• Capabilities
• Needs (both sides)

Weitere ähnliche Inhalte

Was ist angesagt?

In Quest of Requirements Engineering Research that Industry Needs
In Quest of Requirements Engineering Research that Industry NeedsIn Quest of Requirements Engineering Research that Industry Needs
In Quest of Requirements Engineering Research that Industry NeedsDaniel Mendez
 
From Labelling Open data images to building a private recommender system
From Labelling Open data images to building a private recommender systemFrom Labelling Open data images to building a private recommender system
From Labelling Open data images to building a private recommender systemPierre Gutierrez
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceNiko Vuokko
 
8 minute intro to data science
8 minute intro to data science 8 minute intro to data science
8 minute intro to data science Mahesh Kumar CV
 
Data Science 101
Data Science 101Data Science 101
Data Science 101ideatoipo
 
Data Science 101
Data Science 101Data Science 101
Data Science 101odsc
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2Roger Barga
 
Data Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLData Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLPaco Nathan
 
A survey of 2013 data science salary survey”
A survey of   2013 data science salary survey”A survey of   2013 data science salary survey”
A survey of 2013 data science salary survey”show you
 
Cracking the Coding Interview (Oct 2012)
Cracking the Coding Interview (Oct 2012)Cracking the Coding Interview (Oct 2012)
Cracking the Coding Interview (Oct 2012)careercup
 
SUNG PARK PREDICT 422 Group Project Presentation
SUNG PARK PREDICT 422 Group Project PresentationSUNG PARK PREDICT 422 Group Project Presentation
SUNG PARK PREDICT 422 Group Project PresentationSung Park
 
How to Identify, Train or Become a Data Scientist
How to Identify, Train or Become a Data ScientistHow to Identify, Train or Become a Data Scientist
How to Identify, Train or Become a Data ScientistInside Analysis
 
Software Visualization Today - Systematic Literature Review
Software Visualization Today - Systematic Literature ReviewSoftware Visualization Today - Systematic Literature Review
Software Visualization Today - Systematic Literature ReviewMindtrek
 
How to become a data scientist
How to become a data scientist How to become a data scientist
How to become a data scientist Manjunath Sindagi
 
Agile data science
Agile data scienceAgile data science
Agile data scienceJoel Horwitz
 
Machine Learning for Domain Experts
Machine Learning for Domain ExpertsMachine Learning for Domain Experts
Machine Learning for Domain ExpertsMehmet Alican Noyan
 
What I Learned from Four Years of Science-ing the Crap Out of DevOps
What I Learned from Four Years of Science-ing the Crap Out of DevOpsWhat I Learned from Four Years of Science-ing the Crap Out of DevOps
What I Learned from Four Years of Science-ing the Crap Out of DevOpsVMware Tanzu
 

Was ist angesagt? (17)

In Quest of Requirements Engineering Research that Industry Needs
In Quest of Requirements Engineering Research that Industry NeedsIn Quest of Requirements Engineering Research that Industry Needs
In Quest of Requirements Engineering Research that Industry Needs
 
From Labelling Open data images to building a private recommender system
From Labelling Open data images to building a private recommender systemFrom Labelling Open data images to building a private recommender system
From Labelling Open data images to building a private recommender system
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
8 minute intro to data science
8 minute intro to data science 8 minute intro to data science
8 minute intro to data science
 
Data Science 101
Data Science 101Data Science 101
Data Science 101
 
Data Science 101
Data Science 101Data Science 101
Data Science 101
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
 
Data Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLData Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAML
 
A survey of 2013 data science salary survey”
A survey of   2013 data science salary survey”A survey of   2013 data science salary survey”
A survey of 2013 data science salary survey”
 
Cracking the Coding Interview (Oct 2012)
Cracking the Coding Interview (Oct 2012)Cracking the Coding Interview (Oct 2012)
Cracking the Coding Interview (Oct 2012)
 
SUNG PARK PREDICT 422 Group Project Presentation
SUNG PARK PREDICT 422 Group Project PresentationSUNG PARK PREDICT 422 Group Project Presentation
SUNG PARK PREDICT 422 Group Project Presentation
 
How to Identify, Train or Become a Data Scientist
How to Identify, Train or Become a Data ScientistHow to Identify, Train or Become a Data Scientist
How to Identify, Train or Become a Data Scientist
 
Software Visualization Today - Systematic Literature Review
Software Visualization Today - Systematic Literature ReviewSoftware Visualization Today - Systematic Literature Review
Software Visualization Today - Systematic Literature Review
 
How to become a data scientist
How to become a data scientist How to become a data scientist
How to become a data scientist
 
Agile data science
Agile data scienceAgile data science
Agile data science
 
Machine Learning for Domain Experts
Machine Learning for Domain ExpertsMachine Learning for Domain Experts
Machine Learning for Domain Experts
 
What I Learned from Four Years of Science-ing the Crap Out of DevOps
What I Learned from Four Years of Science-ing the Crap Out of DevOpsWhat I Learned from Four Years of Science-ing the Crap Out of DevOps
What I Learned from Four Years of Science-ing the Crap Out of DevOps
 

Ähnlich wie Current and future challenges in data science

Using Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and developmentUsing Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and developmentEleanor Howe
 
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkNYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkVivian S. Zhang
 
Chapter1 introduction
Chapter1 introductionChapter1 introduction
Chapter1 introductionDinesh K
 
Data Science Master Specialisation
Data Science Master SpecialisationData Science Master Specialisation
Data Science Master SpecialisationArjen de Vries
 
Data Science Consulting at ThoughtWorks -- NYC Open Data Meetup
Data Science Consulting at ThoughtWorks -- NYC Open Data MeetupData Science Consulting at ThoughtWorks -- NYC Open Data Meetup
Data Science Consulting at ThoughtWorks -- NYC Open Data MeetupDavid Johnston
 
Investigating Performance: Design & Outcomes with xAPI | LSCon 2017
Investigating Performance: Design & Outcomes with xAPI | LSCon 2017Investigating Performance: Design & Outcomes with xAPI | LSCon 2017
Investigating Performance: Design & Outcomes with xAPI | LSCon 2017HT2 Labs
 
2016 davis-biotech
2016 davis-biotech2016 davis-biotech
2016 davis-biotechc.titus.brown
 
Artificial Intelligence for Medicine
Artificial Intelligence for MedicineArtificial Intelligence for Medicine
Artificial Intelligence for MedicineTassilo Klein
 
How to Enhance Your Career with AI
How to Enhance Your Career with AIHow to Enhance Your Career with AI
How to Enhance Your Career with AIKeita Broadwater
 
20240104 HICSS Panel on AI and Legal Ethical 20240103 v7.pptx
20240104 HICSS  Panel on AI and Legal Ethical 20240103 v7.pptx20240104 HICSS  Panel on AI and Legal Ethical 20240103 v7.pptx
20240104 HICSS Panel on AI and Legal Ethical 20240103 v7.pptxISSIP
 
Semantic Solutions from Information Exploration.pptx
Semantic Solutions from Information Exploration.pptxSemantic Solutions from Information Exploration.pptx
Semantic Solutions from Information Exploration.pptxInformation Exploration
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data ScienceSpotle.ai
 
116 Machine learning for Product Managers
116   Machine learning for Product Managers116   Machine learning for Product Managers
116 Machine learning for Product ManagersProductCamp Boston
 
Machine learning for product managers. Presented at Boston ProductCamp (June...
Machine learning for product  managers. Presented at Boston ProductCamp (June...Machine learning for product  managers. Presented at Boston ProductCamp (June...
Machine learning for product managers. Presented at Boston ProductCamp (June...Mukund Seshadri
 
Introduction to machine learning and deep learning
Introduction to machine learning and deep learningIntroduction to machine learning and deep learning
Introduction to machine learning and deep learningShishir Choudhary
 
AI for information management: why and how
AI for information management: why and howAI for information management: why and how
AI for information management: why and howAnna Divoli
 
Geoff what is_medical_informatics_oct2012
Geoff what is_medical_informatics_oct2012Geoff what is_medical_informatics_oct2012
Geoff what is_medical_informatics_oct2012Geoffrey Rutledge
 
Assessment Project Management in the Real World - Hour Three
Assessment Project Management in the Real World - Hour ThreeAssessment Project Management in the Real World - Hour Three
Assessment Project Management in the Real World - Hour ThreeJen Rutner
 

Ähnlich wie Current and future challenges in data science (20)

Using Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and developmentUsing Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and development
 
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkNYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
 
Chapter1 introduction
Chapter1 introductionChapter1 introduction
Chapter1 introduction
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
 
Data Science Master Specialisation
Data Science Master SpecialisationData Science Master Specialisation
Data Science Master Specialisation
 
Data Science Consulting at ThoughtWorks -- NYC Open Data Meetup
Data Science Consulting at ThoughtWorks -- NYC Open Data MeetupData Science Consulting at ThoughtWorks -- NYC Open Data Meetup
Data Science Consulting at ThoughtWorks -- NYC Open Data Meetup
 
Investigating Performance: Design & Outcomes with xAPI | LSCon 2017
Investigating Performance: Design & Outcomes with xAPI | LSCon 2017Investigating Performance: Design & Outcomes with xAPI | LSCon 2017
Investigating Performance: Design & Outcomes with xAPI | LSCon 2017
 
2016 davis-biotech
2016 davis-biotech2016 davis-biotech
2016 davis-biotech
 
Artificial Intelligence for Medicine
Artificial Intelligence for MedicineArtificial Intelligence for Medicine
Artificial Intelligence for Medicine
 
How to Enhance Your Career with AI
How to Enhance Your Career with AIHow to Enhance Your Career with AI
How to Enhance Your Career with AI
 
Deep learning for NLP
Deep learning for NLPDeep learning for NLP
Deep learning for NLP
 
20240104 HICSS Panel on AI and Legal Ethical 20240103 v7.pptx
20240104 HICSS  Panel on AI and Legal Ethical 20240103 v7.pptx20240104 HICSS  Panel on AI and Legal Ethical 20240103 v7.pptx
20240104 HICSS Panel on AI and Legal Ethical 20240103 v7.pptx
 
Semantic Solutions from Information Exploration.pptx
Semantic Solutions from Information Exploration.pptxSemantic Solutions from Information Exploration.pptx
Semantic Solutions from Information Exploration.pptx
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data Science
 
116 Machine learning for Product Managers
116   Machine learning for Product Managers116   Machine learning for Product Managers
116 Machine learning for Product Managers
 
Machine learning for product managers. Presented at Boston ProductCamp (June...
Machine learning for product  managers. Presented at Boston ProductCamp (June...Machine learning for product  managers. Presented at Boston ProductCamp (June...
Machine learning for product managers. Presented at Boston ProductCamp (June...
 
Introduction to machine learning and deep learning
Introduction to machine learning and deep learningIntroduction to machine learning and deep learning
Introduction to machine learning and deep learning
 
AI for information management: why and how
AI for information management: why and howAI for information management: why and how
AI for information management: why and how
 
Geoff what is_medical_informatics_oct2012
Geoff what is_medical_informatics_oct2012Geoff what is_medical_informatics_oct2012
Geoff what is_medical_informatics_oct2012
 
Assessment Project Management in the Real World - Hour Three
Assessment Project Management in the Real World - Hour ThreeAssessment Project Management in the Real World - Hour Three
Assessment Project Management in the Real World - Hour Three
 

KĂźrzlich hochgeladen

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...amitlee9823
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachBoston Institute of Analytics
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...amitlee9823
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...karishmasinghjnh
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectBoston Institute of Analytics
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsJoseMangaJr1
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
hybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptxhybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptx9to5mart
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...amitlee9823
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Pooja Nehwal
 

KĂźrzlich hochgeladen (20)

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning Approach
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
hybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptxhybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptx
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 

Current and future challenges in data science

  • 1. Current and Future Challenges in Data Science Nathaniel Shimoni
  • 2. First a bit about myself… • Married to Yonit & father of Yahav & Arbel • Lecturer & researcher @ Ben Gurion University • Worked as a data science researcher @ • Co-founded the non-profit “Deep-Learning-Boot-Camp” • Co-founder of Dudes (still in stealth) • Ranked (at peak) #175 of over 80,000 active Kaggle competitors • Self taught and very(!) curious
  • 3. Data is everywhere Nature monitoring Automatic x-ray analysis Seizure prediction Fashion simulations Video analysis Speech 2 Text 2 Speech
  • 4. But what is data science? Well… the answer largely depends on who do we ask… We will use some loose/partial definition most can agree upon: Data science is: Both academic and practical research field that is aimed to programmatically extract insights and knowledge from data in a reproduceable & generalizable manner Training data New unseen data
  • 5. The enablers! or “These are the things that made such abilities possible”
  • 6. Current state – enablers Hugh increase in compute ability mainly GPU compute and RAM up to 48GB RAM on a single card & 11GB for most consumer cards Highly available and scalable through cloud services
  • 7. Current state – enablers Large open and labeled datasets “BookCorpus” dataset (11,038 books) MSCOCOcelebsAImageNet Yelp Open Datasets
  • 8. Current state – enablers Open source code and large model zoo UNet Efficient Net Faster RCNN
  • 9. Current state – enablers Good ability to perform transfer-learning and fine tuning between datasets transfer-learning
  • 10. Current tasks and common applications
  • 11. Current state tasks and applications – visual domain (partial list) visual domain Classification Detection Segmentation Pose Estimation Style transfer Generation (GANs)Document Authentication Deep Fake & Face-swap Adversarial attacks Depth estimation Tracking Search Within Images Object & Crowd Counting Regression Similarity & metric learning Image based search Severity Estimation Object price Estimation Super resolution Image colorization Image captioning Visual question answering OCR Image-Text cross learning Disease classification Activity classification Medical imagery segmentation Risky objects segmentation Route segmentation auto. vehicle Object Relation Inference
  • 12. Current tasks and applications – language domain (partial list) language domain Classification Machine Translation Free text to structured Sentiment analysis Question Answering summarization Inappropriate content Semantic Text Similarity (e.g. for search relevance) Research papers legal medical Call centers Automatic Rating Stress detection Named Entity Recognition Part of Speech Tagging Semantic Role Labeling Conversational Bots Free Text querying Online Translation Auto Language Classification
  • 13. Current tasks and applications - other domains (extremely partial list) Other domains Classification Time series sound Tabular Data Graph based learning Speech 2 Text Text 2 Speech Speaker Separation Background Noise removal classification forecasting Signal disaggregation segmentation Anomaly detection regression clustering Content recommendation Node / Graph Embeddings Graph Relation Inference Partial Graph Completion
  • 15. Current challenges • Dirty data (garbage in garbage out problem)
  • 16. Current challenges • Industry talent gap Job postings at top companies Increase in job postings since 2012
  • 18. Current challenges • Limited or lack of monitoring in production systems In many production systems monitoring is either lacking or limited Training data Production data On deployment Production data 2 month later Production data 4 month later
  • 20. Future challenges • Bias detection and bias correction professional hairstyle for work: unprofessional hairstyle for work: Gender bias in word embeddings: Man -> doctor woman -> nurse
  • 21. Future challenges • Learning from very few samples So, how many data samples do you have???
  • 22. Future challenges • Explainability of results We need to operate. urgently! But why???
  • 23. Future challenges • Cross modality learning The dog is about to chase a cat Voice Image / video text Complementary inputs to the same model
  • 24. Future challenges • Function learning A chair is an object you can seat on A parking spot is a place to leave the car when not driving
  • 25. Future challenges • Wide adoption of content generation Great potential in: • Film making • Transcribing • Content candidate generation • Art • Simulation
  • 26. Future challenges • Authenticity of documents, content and digital records
  • 27. Future challenges • AutoML and End2End applications data Task (?) Desired results
  • 28. Future challenges • Continuous monitoring and validation
  • 29. With great power comes great responsibility https://github.com/daviddao/awful-ai Well… these are powerful tools – we must use them carefully
  • 30. Future challenges Data science involves many other positions Communication with business is crucial for success Management must become data-literate • Metrics • Tasks • Validation • Findings to actions • Engineering • Analysts • Business • IT • Results & goals • Assumptions • Domain knowledge • Capabilities • Needs (both sides)