SlideShare ist ein Scribd-Unternehmen logo
1 von 15
Downloaden Sie, um offline zu lesen
Geert Verstraeten
Managing Partner - Python Predictions
Data Innovation Summit
27th June 2018
#DISUMMIT
Seeing the forest through the trees
Data science explained
in 10 concrete applications
Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT
Why this workshop?
My cocktail problem
in Data Science
In the beginning of my
career (2001), when I
mentioned to people I was
predicting human behavior
using data, I received a lot
of blank stares
Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT
Nowadays (2018) my
friend tells me that the
camera on his smartphone
has AI – and I’m doing the
blank staring -
“What does a camera with
AI do? And why would I
need that?”
AI, machine learning and
data science are hyped
today – I decided to focus
on some cool yet concrete
applications of data
science here
AIRPORT USE CASE IDEAS
Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT
SEEING THE FOREST THROUGH THE TREES
I’ve aimed to provide an idea of
what data science is, based on
10 concrete examples of 10
different and important data
science applications
Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT
AUDIO & TEXT ANALYTICS
Some towns in Belgium are experimenting with
a system to detect loneliness among older
people – most will reply they are not, but the
audio information (such as intonation) reveals
much more than the text (link - in Dutch)
Extracting useful information out of text and audio inputs
Other examples: call centers may use
audio signals to detect whether
customers are likely to leave – or
whether clients call them out of
loneliness versus when they have
issues with their product or service,
but also chatbots are cool examples
of improved text analytics
Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT
RECOMMENDER SYSTEMS
In his PhD for VDAB, Michael Reusens used
recommender systems to match jobs to
candidates – in that case it’s not only important
that the candidate finds the job interesting, the
idea is that jobs are recommended where the
recruiter would find the candidate interesting -
so it should be a match from both sides (link)
Recommending the right offers or content to the right audience
Other examples: movie recommendations by Netflix are
probably the most known application in Data Science,
but news recommendations work in similar ways, and
dating apps might face similar challenges as VDAB.
Recommending the right politician to the right voter could
work in the same way but is obviously more sensitive ☺
Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT
COMPUTER VISION
SAS is contributing the a project
named WildTrack, an organization
counting the number of animals in
the wild for certain species. While
such studies depended on expert
track finders before, everyone
with a smartphone is now
encouraged to take photographs
of footprints, and using image
recognition, they are classified
into the right species, which leads
to better information (link)
Extracting useful information out of images and video
Many other examples: two high school students predict
forest fires by examining the soil – Octinion produced a
prototype robot for picking strawberries at the right time
– diagnosis of medical images surpassed human expert
level (for example for detecting melanoma) – robotic
weed control – detection of plastic waste in the ocean
PREDICTIVE ANALYTICS
Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT
Predicting behavior in a business context, with interpretation
One of the most interesting
projects we’ve worked on, is a
project that started out as
employee burnout prediction
with sdworx, as presented on
the Data Innovation Summit
2017. In this case, we learned
a lot about absenteeism,
constructed a model with good
quality on aggregated level,
and we taught sdworx how
they can do similar projects for
their clients (link).
Some other examples: Companies are typically
predicting who will buy a product, who will become a
valuable client in the future, who will leave the
relationship, who will repay their debts, who will show
fraudulent behavior, who will leave their employer, …
In most of these cases, the interpretation of the model
is equally important as the predictive quality. A great
summary can be found in this book
Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT
EXPERIMENTAL DESIGN
Testing different options and approaches to maximize impact
Prediction by itself does
not create value, but
the resulting actions do.
Yet to be sure value is
created, these actions
should be tested
through carefully
designed experiments.
Experimentation is common in
online environments, but also
useful offline. It may range
from simple A / B tests to
complex tests – for example
some debt collectors have
optimized the way they collect
through offline experimentation
of collection strategies
Few companies understand this
better than Booking.com, who created
a platform allowing hundreds of
employees to run hundreds of small
experiments at any time - resulting in
conversion levels 2-3x the industry
average (link)
ANOMALY DETECTION
Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT
Data Science is in many cases concerned by
understanding the patterns. Yet sometimes, it
may be interesting to look at what deviates. For
example, volcanic eruptions can be detected
based on anomalies in thermal features such as
geysers, hot springs and lava flows (link)
Some other examples: anomaly
detection is most commonly used
to detect fraud (see for example
how HSBC screens card fraud),
crime in general (including
cybersecurity) – such type of
behavior is very volatile, but
consistent in the fact that it differs
from average, normal behavior.
Another remarkable example:
Chinese researchers are also
using anomalies in the genepool
to identify potential athletes.
It may be interesting to detect what deviates from the pattern
Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT
SEGMENTATION
Grouping objects based on their similarity
Perhaps one of the
oldest applications in the
book, but certainly
incredibly valuable for
exploratory work. One of
the coolest projects
we’ve done is predicting
needs of Private Banking
clients – not all wealthy
clients of a bank expect
the same benefits –
some like VIP events, but
others want immediate
financial information, etc
(no link or details due to
confidentiality).
Some other examples: Due to the
strategic nature of segmentation
projects, it proved difficult to find
concrete cases online. But in this
context, it may be fun to look at the
different segments of participants in
the Belgian Data Science community
(article and dashboard).
Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT
IOT ANALYTICS
Extracting information of connected devices
It is easy to find analytics of sensor
data, but more difficult to find cases
where the connectivity between
devices is key. Google Nest is a great
example – in the sense that it learns
from past behavior (e.g. when
someone is absent) and can
communicate to other devices (e.g. to
turn off the oven).
Some other examples:
The Belgian Red Devils
are training using
sensors that measure
fatigue to prevent
injuries, Volvo Trucks
claims to be able to
reduce standstill of it’s
fleet by 80% by
predicting the need for
maintenance, and
sensors are crucial for
many applications in
healthcare
NETWORK ANALYSIS
Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT
Sometimes the most important information is the network itself
In specific cases, the understanding
of a network of people and things
can help solve a case. For example,
in an analysis of the 9-11 attacks, It
has been estimated that the 9-11
network could have been
dismantled if just three central
nodes had been eliminated (link)
Some other examples: Network analysis
is a common technique for fraud and
crime detection. It can also serve to
detect churn (wich customer will leave a
company), to understand which
companies are connected through
shareholders and which authors are
connected through publications, etc
Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT
PROCESS MINING
Understand and reduce inefficient processes
one of the largest academic hospitals
in the Netherlands uses process
mining to reduce waiting times,
increase efficiency and optimize
treatments (link)
Processes are everywhere:
from job applications to
complaint handling, road
works, and new client
onboarding. Increasing their
efficiency often leads to
reduced costs and improved
(customer) experience.
Another concrete example of
using process mining to
improve the Customer
Journey: ING Belgium
About half the time was spend
explaining the examples I had
found in preparing the
session, with help from many
friends on LinkedIn
In the other half of the time,
the audience interacted with
their often colorful examples,
which resulted in a fun and
interactive session that
seemed to be appreciated
Looking back at this, an exciting
journey into exploring what Data
Science is today, and how it
leads to concrete value. Thanks
to all who participated and
provided input!

Weitere ähnliche Inhalte

Was ist angesagt?

Exploratory data analysis data visualization
Exploratory data analysis data visualizationExploratory data analysis data visualization
Exploratory data analysis data visualizationDr. Hamdan Al-Sabri
 
Introduction to statistics 2013
Introduction to statistics 2013Introduction to statistics 2013
Introduction to statistics 2013Mohammad Ihmeidan
 
Inter quartile range
Inter quartile rangeInter quartile range
Inter quartile rangeKen Plummer
 
Prescriptive Analytics
Prescriptive AnalyticsPrescriptive Analytics
Prescriptive AnalyticsŁukasz Grala
 
Cramer row inequality
Cramer row inequality Cramer row inequality
Cramer row inequality VashuGupta8
 
Dichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianDichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianLaure Wynants
 
Predictive Analytics - An Overview
Predictive Analytics - An OverviewPredictive Analytics - An Overview
Predictive Analytics - An OverviewMachinePulse
 
Logistic regression
Logistic regressionLogistic regression
Logistic regressionAkhash Kumar
 
Telecom Churn Prediction Presentation
Telecom Churn Prediction PresentationTelecom Churn Prediction Presentation
Telecom Churn Prediction PresentationPinintiHarishReddy
 
Exploratory data analysis with Python
Exploratory data analysis with PythonExploratory data analysis with Python
Exploratory data analysis with PythonDavis David
 
Big data analytics in banking sector
Big data analytics in banking sectorBig data analytics in banking sector
Big data analytics in banking sectorAnil Rana
 
The Advantages and Disadvantages of Big Data
The Advantages and Disadvantages of Big DataThe Advantages and Disadvantages of Big Data
The Advantages and Disadvantages of Big DataNicha Tatsaneeyapan
 
Introduction to statistics & data analysis
Introduction to statistics & data analysisIntroduction to statistics & data analysis
Introduction to statistics & data analysisAsmaUmar4
 
Attribute oriented analysis
Attribute oriented analysisAttribute oriented analysis
Attribute oriented analysisHirra Sultan
 
What is a two sample z test?
What is a two sample z test?What is a two sample z test?
What is a two sample z test?Ken Plummer
 

Was ist angesagt? (20)

Exploratory data analysis data visualization
Exploratory data analysis data visualizationExploratory data analysis data visualization
Exploratory data analysis data visualization
 
Introduction to statistics 2013
Introduction to statistics 2013Introduction to statistics 2013
Introduction to statistics 2013
 
Statistics for data science
Statistics for data science Statistics for data science
Statistics for data science
 
Inter quartile range
Inter quartile rangeInter quartile range
Inter quartile range
 
Prescriptive Analytics
Prescriptive AnalyticsPrescriptive Analytics
Prescriptive Analytics
 
Cramer row inequality
Cramer row inequality Cramer row inequality
Cramer row inequality
 
Missing data handling
Missing data handlingMissing data handling
Missing data handling
 
Dichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianDichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatistician
 
Predictive Analytics - An Overview
Predictive Analytics - An OverviewPredictive Analytics - An Overview
Predictive Analytics - An Overview
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Telecom Churn Prediction Presentation
Telecom Churn Prediction PresentationTelecom Churn Prediction Presentation
Telecom Churn Prediction Presentation
 
Data Science
Data ScienceData Science
Data Science
 
Exploratory data analysis with Python
Exploratory data analysis with PythonExploratory data analysis with Python
Exploratory data analysis with Python
 
Big data analytics in banking sector
Big data analytics in banking sectorBig data analytics in banking sector
Big data analytics in banking sector
 
The Advantages and Disadvantages of Big Data
The Advantages and Disadvantages of Big DataThe Advantages and Disadvantages of Big Data
The Advantages and Disadvantages of Big Data
 
Chapter 1 what is statistics
Chapter 1 what is statisticsChapter 1 what is statistics
Chapter 1 what is statistics
 
Introduction to statistics & data analysis
Introduction to statistics & data analysisIntroduction to statistics & data analysis
Introduction to statistics & data analysis
 
Attribute oriented analysis
Attribute oriented analysisAttribute oriented analysis
Attribute oriented analysis
 
FDS_dept_ppt.pptx
FDS_dept_ppt.pptxFDS_dept_ppt.pptx
FDS_dept_ppt.pptx
 
What is a two sample z test?
What is a two sample z test?What is a two sample z test?
What is a two sample z test?
 

Ähnlich wie Data science explained in 10 applications

Putting data science into perspective
Putting data science into perspectivePutting data science into perspective
Putting data science into perspectiveSravan Ankaraju
 
Is big data just a buzzword -Big data simply explained
Is big data just a buzzword -Big data simply explainedIs big data just a buzzword -Big data simply explained
Is big data just a buzzword -Big data simply explainedVivek Srivastava
 
Fontys Eric van Tol
Fontys Eric van TolFontys Eric van Tol
Fontys Eric van TolTalentEvent
 
BigData Analytics_1.7
BigData Analytics_1.7BigData Analytics_1.7
BigData Analytics_1.7Rohit Mittal
 
Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...
Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...
Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...Dana Gardner
 
Age Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big DataAge Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big DataAgeFriendlyEconomy
 
Activity Streaming as Information X-Docking
Activity Streaming as Information X-DockingActivity Streaming as Information X-Docking
Activity Streaming as Information X-DockingKai Riemer
 
The data science revolution in insurance
The data science revolution in insuranceThe data science revolution in insurance
The data science revolution in insuranceStefano Perfetti
 
A study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesA study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesBhanu Prakash
 
Global Data Management: Governance, Security and Usefulness in a Hybrid World
Global Data Management: Governance, Security and Usefulness in a Hybrid WorldGlobal Data Management: Governance, Security and Usefulness in a Hybrid World
Global Data Management: Governance, Security and Usefulness in a Hybrid WorldNeil Raden
 
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015Edgar Alejandro Villegas
 
Research brief observability and modern enterprise
Research brief   observability and modern enterpriseResearch brief   observability and modern enterprise
Research brief observability and modern enterpriseRishidot Research
 
Big Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraBig Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraVin Malhotra
 
Small data vs. Big data : back to the basics
Small data vs. Big data : back to the basicsSmall data vs. Big data : back to the basics
Small data vs. Big data : back to the basicsAhmed Banafa
 
MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.stelligence
 
MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAI
MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAIMAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAI
MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAIBig Data Week
 
IRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET Journal
 

Ähnlich wie Data science explained in 10 applications (20)

Big data upload
Big data uploadBig data upload
Big data upload
 
Putting data science into perspective
Putting data science into perspectivePutting data science into perspective
Putting data science into perspective
 
Is big data just a buzzword -Big data simply explained
Is big data just a buzzword -Big data simply explainedIs big data just a buzzword -Big data simply explained
Is big data just a buzzword -Big data simply explained
 
Fontys Eric van Tol
Fontys Eric van TolFontys Eric van Tol
Fontys Eric van Tol
 
BigData Analytics_1.7
BigData Analytics_1.7BigData Analytics_1.7
BigData Analytics_1.7
 
1 UNIT-DSP.pptx
1 UNIT-DSP.pptx1 UNIT-DSP.pptx
1 UNIT-DSP.pptx
 
What is business analytics
What is business analyticsWhat is business analytics
What is business analytics
 
Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...
Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...
Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...
 
Age Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big DataAge Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big Data
 
Activity Streaming as Information X-Docking
Activity Streaming as Information X-DockingActivity Streaming as Information X-Docking
Activity Streaming as Information X-Docking
 
The data science revolution in insurance
The data science revolution in insuranceThe data science revolution in insurance
The data science revolution in insurance
 
A study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesA study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websites
 
Global Data Management: Governance, Security and Usefulness in a Hybrid World
Global Data Management: Governance, Security and Usefulness in a Hybrid WorldGlobal Data Management: Governance, Security and Usefulness in a Hybrid World
Global Data Management: Governance, Security and Usefulness in a Hybrid World
 
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
 
Research brief observability and modern enterprise
Research brief   observability and modern enterpriseResearch brief   observability and modern enterprise
Research brief observability and modern enterprise
 
Big Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraBig Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin Malhotra
 
Small data vs. Big data : back to the basics
Small data vs. Big data : back to the basicsSmall data vs. Big data : back to the basics
Small data vs. Big data : back to the basics
 
MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.
 
MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAI
MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAIMAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAI
MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAI
 
IRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth Enhancement
 

Mehr von Python Predictions

Long live customer segmentation at Delhaize
Long live customer segmentation at DelhaizeLong live customer segmentation at Delhaize
Long live customer segmentation at DelhaizePython Predictions
 
Can algorithms help to reduce absenteeism
Can algorithms help to reduce absenteeismCan algorithms help to reduce absenteeism
Can algorithms help to reduce absenteeismPython Predictions
 
Project definition workshop #disummit 2019
Project definition workshop #disummit 2019Project definition workshop #disummit 2019
Project definition workshop #disummit 2019Python Predictions
 
From a needs-based segmentation to customer relevancy
From a needs-based segmentation to customer relevancyFrom a needs-based segmentation to customer relevancy
From a needs-based segmentation to customer relevancyPython Predictions
 
The pivotal power of data science - Influencing the business model of a media...
The pivotal power of data science - Influencing the business model of a media...The pivotal power of data science - Influencing the business model of a media...
The pivotal power of data science - Influencing the business model of a media...Python Predictions
 
Teaser Lingerie Recommendations with Van de Velde - Data Innovation Summit 20...
Teaser Lingerie Recommendations with Van de Velde - Data Innovation Summit 20...Teaser Lingerie Recommendations with Van de Velde - Data Innovation Summit 20...
Teaser Lingerie Recommendations with Van de Velde - Data Innovation Summit 20...Python Predictions
 
Python Predictions: Core applications and services
Python Predictions: Core applications and servicesPython Predictions: Core applications and services
Python Predictions: Core applications and servicesPython Predictions
 
Python Predictions: Innovation
Python Predictions: InnovationPython Predictions: Innovation
Python Predictions: InnovationPython Predictions
 
Python Predictions: Training & Coaching offer
Python Predictions: Training & Coaching offerPython Predictions: Training & Coaching offer
Python Predictions: Training & Coaching offerPython Predictions
 
Managing Projects in Predictive Analytics
Managing Projects in Predictive AnalyticsManaging Projects in Predictive Analytics
Managing Projects in Predictive AnalyticsPython Predictions
 
Process mining - a case by ING Belgium and Python Predictions (update Oct 2016)
Process mining - a case by ING Belgium and Python Predictions (update Oct 2016)Process mining - a case by ING Belgium and Python Predictions (update Oct 2016)
Process mining - a case by ING Belgium and Python Predictions (update Oct 2016)Python Predictions
 
Predictive Analytics in Fundraising - Success Story DSC
Predictive Analytics in Fundraising - Success Story DSCPredictive Analytics in Fundraising - Success Story DSC
Predictive Analytics in Fundraising - Success Story DSCPython Predictions
 
Predicting Customer Long Term Value at Eni Belgium
Predicting Customer Long Term Value at Eni BelgiumPredicting Customer Long Term Value at Eni Belgium
Predicting Customer Long Term Value at Eni BelgiumPython Predictions
 
Process mining - a case by ING Belgium and Python Predictions
Process mining - a case by ING Belgium and Python PredictionsProcess mining - a case by ING Belgium and Python Predictions
Process mining - a case by ING Belgium and Python PredictionsPython Predictions
 
Predictive analytics and data science at ING Belgium
Predictive analytics and data science at ING BelgiumPredictive analytics and data science at ING Belgium
Predictive analytics and data science at ING BelgiumPython Predictions
 
Winning with analytics - Data Innovation Summit 2015 - Made in Belgium
Winning with analytics - Data Innovation Summit 2015 - Made in BelgiumWinning with analytics - Data Innovation Summit 2015 - Made in Belgium
Winning with analytics - Data Innovation Summit 2015 - Made in BelgiumPython Predictions
 
Combining Logit and Ensemble Modeling for Increased Customer Churn Detection
Combining Logit and Ensemble Modeling  for Increased Customer Churn DetectionCombining Logit and Ensemble Modeling  for Increased Customer Churn Detection
Combining Logit and Ensemble Modeling for Increased Customer Churn DetectionPython Predictions
 

Mehr von Python Predictions (19)

AI inspiration session in HR
AI inspiration session in HRAI inspiration session in HR
AI inspiration session in HR
 
Long live customer segmentation at Delhaize
Long live customer segmentation at DelhaizeLong live customer segmentation at Delhaize
Long live customer segmentation at Delhaize
 
Can algorithms help to reduce absenteeism
Can algorithms help to reduce absenteeismCan algorithms help to reduce absenteeism
Can algorithms help to reduce absenteeism
 
Project definition workshop #disummit 2019
Project definition workshop #disummit 2019Project definition workshop #disummit 2019
Project definition workshop #disummit 2019
 
From a needs-based segmentation to customer relevancy
From a needs-based segmentation to customer relevancyFrom a needs-based segmentation to customer relevancy
From a needs-based segmentation to customer relevancy
 
The pivotal power of data science - Influencing the business model of a media...
The pivotal power of data science - Influencing the business model of a media...The pivotal power of data science - Influencing the business model of a media...
The pivotal power of data science - Influencing the business model of a media...
 
Teaser Lingerie Recommendations with Van de Velde - Data Innovation Summit 20...
Teaser Lingerie Recommendations with Van de Velde - Data Innovation Summit 20...Teaser Lingerie Recommendations with Van de Velde - Data Innovation Summit 20...
Teaser Lingerie Recommendations with Van de Velde - Data Innovation Summit 20...
 
Python Predictions: Core applications and services
Python Predictions: Core applications and servicesPython Predictions: Core applications and services
Python Predictions: Core applications and services
 
Python Predictions: Innovation
Python Predictions: InnovationPython Predictions: Innovation
Python Predictions: Innovation
 
Python Predictions: Training & Coaching offer
Python Predictions: Training & Coaching offerPython Predictions: Training & Coaching offer
Python Predictions: Training & Coaching offer
 
Managing Projects in Predictive Analytics
Managing Projects in Predictive AnalyticsManaging Projects in Predictive Analytics
Managing Projects in Predictive Analytics
 
Predicting employee burnout
Predicting employee burnoutPredicting employee burnout
Predicting employee burnout
 
Process mining - a case by ING Belgium and Python Predictions (update Oct 2016)
Process mining - a case by ING Belgium and Python Predictions (update Oct 2016)Process mining - a case by ING Belgium and Python Predictions (update Oct 2016)
Process mining - a case by ING Belgium and Python Predictions (update Oct 2016)
 
Predictive Analytics in Fundraising - Success Story DSC
Predictive Analytics in Fundraising - Success Story DSCPredictive Analytics in Fundraising - Success Story DSC
Predictive Analytics in Fundraising - Success Story DSC
 
Predicting Customer Long Term Value at Eni Belgium
Predicting Customer Long Term Value at Eni BelgiumPredicting Customer Long Term Value at Eni Belgium
Predicting Customer Long Term Value at Eni Belgium
 
Process mining - a case by ING Belgium and Python Predictions
Process mining - a case by ING Belgium and Python PredictionsProcess mining - a case by ING Belgium and Python Predictions
Process mining - a case by ING Belgium and Python Predictions
 
Predictive analytics and data science at ING Belgium
Predictive analytics and data science at ING BelgiumPredictive analytics and data science at ING Belgium
Predictive analytics and data science at ING Belgium
 
Winning with analytics - Data Innovation Summit 2015 - Made in Belgium
Winning with analytics - Data Innovation Summit 2015 - Made in BelgiumWinning with analytics - Data Innovation Summit 2015 - Made in Belgium
Winning with analytics - Data Innovation Summit 2015 - Made in Belgium
 
Combining Logit and Ensemble Modeling for Increased Customer Churn Detection
Combining Logit and Ensemble Modeling  for Increased Customer Churn DetectionCombining Logit and Ensemble Modeling  for Increased Customer Churn Detection
Combining Logit and Ensemble Modeling for Increased Customer Churn Detection
 

Kürzlich hochgeladen

Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Dave Litwiller
 
Call Girls in Gomti Nagar - 7388211116 - With room Service
Call Girls in Gomti Nagar - 7388211116  - With room ServiceCall Girls in Gomti Nagar - 7388211116  - With room Service
Call Girls in Gomti Nagar - 7388211116 - With room Servicediscovermytutordmt
 
GD Birla and his contribution in management
GD Birla and his contribution in managementGD Birla and his contribution in management
GD Birla and his contribution in managementchhavia330
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Dipal Arora
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Lviv Startup Club
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Neil Kimberley
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessAggregage
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst SummitHolger Mueller
 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth MarketingShawn Pang
 
Understanding the Pakistan Budgeting Process: Basics and Key Insights
Understanding the Pakistan Budgeting Process: Basics and Key InsightsUnderstanding the Pakistan Budgeting Process: Basics and Key Insights
Understanding the Pakistan Budgeting Process: Basics and Key Insightsseribangash
 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...Paul Menig
 
HONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael HawkinsHONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael HawkinsMichael W. Hawkins
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation SlidesKeppelCorporation
 
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 DelhiCall Girls in Delhi
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...anilsa9823
 
Best Basmati Rice Manufacturers in India
Best Basmati Rice Manufacturers in IndiaBest Basmati Rice Manufacturers in India
Best Basmati Rice Manufacturers in IndiaShree Krishna Exports
 
A DAY IN THE LIFE OF A SALESMAN / WOMAN
A DAY IN THE LIFE OF A  SALESMAN / WOMANA DAY IN THE LIFE OF A  SALESMAN / WOMAN
A DAY IN THE LIFE OF A SALESMAN / WOMANIlamathiKannappan
 
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLMONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLSeo
 
It will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayIt will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayNZSG
 

Kürzlich hochgeladen (20)

Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
 
Call Girls in Gomti Nagar - 7388211116 - With room Service
Call Girls in Gomti Nagar - 7388211116  - With room ServiceCall Girls in Gomti Nagar - 7388211116  - With room Service
Call Girls in Gomti Nagar - 7388211116 - With room Service
 
GD Birla and his contribution in management
GD Birla and his contribution in managementGD Birla and his contribution in management
GD Birla and his contribution in management
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for Success
 
VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst Summit
 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
 
Understanding the Pakistan Budgeting Process: Basics and Key Insights
Understanding the Pakistan Budgeting Process: Basics and Key InsightsUnderstanding the Pakistan Budgeting Process: Basics and Key Insights
Understanding the Pakistan Budgeting Process: Basics and Key Insights
 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...
 
HONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael HawkinsHONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael Hawkins
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
 
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
 
Best Basmati Rice Manufacturers in India
Best Basmati Rice Manufacturers in IndiaBest Basmati Rice Manufacturers in India
Best Basmati Rice Manufacturers in India
 
A DAY IN THE LIFE OF A SALESMAN / WOMAN
A DAY IN THE LIFE OF A  SALESMAN / WOMANA DAY IN THE LIFE OF A  SALESMAN / WOMAN
A DAY IN THE LIFE OF A SALESMAN / WOMAN
 
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLMONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
 
It will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayIt will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 May
 

Data science explained in 10 applications

  • 1. Geert Verstraeten Managing Partner - Python Predictions Data Innovation Summit 27th June 2018 #DISUMMIT Seeing the forest through the trees Data science explained in 10 concrete applications
  • 2. Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT Why this workshop? My cocktail problem in Data Science In the beginning of my career (2001), when I mentioned to people I was predicting human behavior using data, I received a lot of blank stares
  • 3. Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT Nowadays (2018) my friend tells me that the camera on his smartphone has AI – and I’m doing the blank staring - “What does a camera with AI do? And why would I need that?” AI, machine learning and data science are hyped today – I decided to focus on some cool yet concrete applications of data science here
  • 4. AIRPORT USE CASE IDEAS Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT SEEING THE FOREST THROUGH THE TREES I’ve aimed to provide an idea of what data science is, based on 10 concrete examples of 10 different and important data science applications
  • 5. Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT AUDIO & TEXT ANALYTICS Some towns in Belgium are experimenting with a system to detect loneliness among older people – most will reply they are not, but the audio information (such as intonation) reveals much more than the text (link - in Dutch) Extracting useful information out of text and audio inputs Other examples: call centers may use audio signals to detect whether customers are likely to leave – or whether clients call them out of loneliness versus when they have issues with their product or service, but also chatbots are cool examples of improved text analytics
  • 6. Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT RECOMMENDER SYSTEMS In his PhD for VDAB, Michael Reusens used recommender systems to match jobs to candidates – in that case it’s not only important that the candidate finds the job interesting, the idea is that jobs are recommended where the recruiter would find the candidate interesting - so it should be a match from both sides (link) Recommending the right offers or content to the right audience Other examples: movie recommendations by Netflix are probably the most known application in Data Science, but news recommendations work in similar ways, and dating apps might face similar challenges as VDAB. Recommending the right politician to the right voter could work in the same way but is obviously more sensitive ☺
  • 7. Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT COMPUTER VISION SAS is contributing the a project named WildTrack, an organization counting the number of animals in the wild for certain species. While such studies depended on expert track finders before, everyone with a smartphone is now encouraged to take photographs of footprints, and using image recognition, they are classified into the right species, which leads to better information (link) Extracting useful information out of images and video Many other examples: two high school students predict forest fires by examining the soil – Octinion produced a prototype robot for picking strawberries at the right time – diagnosis of medical images surpassed human expert level (for example for detecting melanoma) – robotic weed control – detection of plastic waste in the ocean
  • 8. PREDICTIVE ANALYTICS Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT Predicting behavior in a business context, with interpretation One of the most interesting projects we’ve worked on, is a project that started out as employee burnout prediction with sdworx, as presented on the Data Innovation Summit 2017. In this case, we learned a lot about absenteeism, constructed a model with good quality on aggregated level, and we taught sdworx how they can do similar projects for their clients (link). Some other examples: Companies are typically predicting who will buy a product, who will become a valuable client in the future, who will leave the relationship, who will repay their debts, who will show fraudulent behavior, who will leave their employer, … In most of these cases, the interpretation of the model is equally important as the predictive quality. A great summary can be found in this book
  • 9. Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT EXPERIMENTAL DESIGN Testing different options and approaches to maximize impact Prediction by itself does not create value, but the resulting actions do. Yet to be sure value is created, these actions should be tested through carefully designed experiments. Experimentation is common in online environments, but also useful offline. It may range from simple A / B tests to complex tests – for example some debt collectors have optimized the way they collect through offline experimentation of collection strategies Few companies understand this better than Booking.com, who created a platform allowing hundreds of employees to run hundreds of small experiments at any time - resulting in conversion levels 2-3x the industry average (link)
  • 10. ANOMALY DETECTION Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT Data Science is in many cases concerned by understanding the patterns. Yet sometimes, it may be interesting to look at what deviates. For example, volcanic eruptions can be detected based on anomalies in thermal features such as geysers, hot springs and lava flows (link) Some other examples: anomaly detection is most commonly used to detect fraud (see for example how HSBC screens card fraud), crime in general (including cybersecurity) – such type of behavior is very volatile, but consistent in the fact that it differs from average, normal behavior. Another remarkable example: Chinese researchers are also using anomalies in the genepool to identify potential athletes. It may be interesting to detect what deviates from the pattern
  • 11. Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT SEGMENTATION Grouping objects based on their similarity Perhaps one of the oldest applications in the book, but certainly incredibly valuable for exploratory work. One of the coolest projects we’ve done is predicting needs of Private Banking clients – not all wealthy clients of a bank expect the same benefits – some like VIP events, but others want immediate financial information, etc (no link or details due to confidentiality). Some other examples: Due to the strategic nature of segmentation projects, it proved difficult to find concrete cases online. But in this context, it may be fun to look at the different segments of participants in the Belgian Data Science community (article and dashboard).
  • 12. Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT IOT ANALYTICS Extracting information of connected devices It is easy to find analytics of sensor data, but more difficult to find cases where the connectivity between devices is key. Google Nest is a great example – in the sense that it learns from past behavior (e.g. when someone is absent) and can communicate to other devices (e.g. to turn off the oven). Some other examples: The Belgian Red Devils are training using sensors that measure fatigue to prevent injuries, Volvo Trucks claims to be able to reduce standstill of it’s fleet by 80% by predicting the need for maintenance, and sensors are crucial for many applications in healthcare
  • 13. NETWORK ANALYSIS Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT Sometimes the most important information is the network itself In specific cases, the understanding of a network of people and things can help solve a case. For example, in an analysis of the 9-11 attacks, It has been estimated that the 9-11 network could have been dismantled if just three central nodes had been eliminated (link) Some other examples: Network analysis is a common technique for fraud and crime detection. It can also serve to detect churn (wich customer will leave a company), to understand which companies are connected through shareholders and which authors are connected through publications, etc
  • 14. Geert Verstraeten – Seeing the forest through the trees – @pythongeert – #DISUMMIT PROCESS MINING Understand and reduce inefficient processes one of the largest academic hospitals in the Netherlands uses process mining to reduce waiting times, increase efficiency and optimize treatments (link) Processes are everywhere: from job applications to complaint handling, road works, and new client onboarding. Increasing their efficiency often leads to reduced costs and improved (customer) experience. Another concrete example of using process mining to improve the Customer Journey: ING Belgium
  • 15. About half the time was spend explaining the examples I had found in preparing the session, with help from many friends on LinkedIn In the other half of the time, the audience interacted with their often colorful examples, which resulted in a fun and interactive session that seemed to be appreciated Looking back at this, an exciting journey into exploring what Data Science is today, and how it leads to concrete value. Thanks to all who participated and provided input!