SlideShare a Scribd company logo
1 of 82
© OCTO 2015
Tél : +41 (0) 21 312 94 15
www.octo.com
Avenue du théatre, 7
1005 Lausanne SUISSEData Science & Machine Learning
Alexandre Masselot
amasselot@octo.com
@alex_mass
Catherine Zwahlen
czwahlen@octo.com
2016 is the Year
of Big Data
@OCTO Switzerland
Big Data Romandie
OCTO PUBLICATIONS
OCTO TECHNOLOGY > THERE IS A BETTER WAY 4
WE ARE CONSUMING DATA SCIENCE EVERY DAY!
Facial recognition
Spam detection
WE ARE CONSUMING DATA SCIENCE EVERY DAY!
Voice
recognition
WE ARE CONSUMING DATA SCIENCE EVERY DAY!
Movie
recommendation
WE ARE CONSUMING DATA SCIENCE EVERY DAY!
9
DATA SCIENCE, A DOMAIN DRIVEN BY COMPETITION
To solve your business problems!
Problem Data Crowd
Knowledge
& Tools
Model for
Prediction
OCTO Folks Work Hard, Play Hard
◉ Caisse de dépôts - score de délivrance d'un brevet européen
◉ Argus - prédiction du prix de vente de véhicules d'occasion
◉ SNCF - prédiction de la fréquentation des gares en Ile de France
◉ Imperial College London - Loan Default Prediction
◉ Allstate – purchase prediction challenge
◉ Tradeshift – Text classification
◉ Microsoft - Malware classification
OCTO, there is a better way to learn, recruit and have fun!
1st
2&4
3rd
6th
13th
2nd
5th
DATA SCIENCE TONIGHT
OCTO TECHNOLOGY > THERE IS A BETTER WAY 11
Visualization
1
2
3
4
Why the buzz about data science?
Demystifying machine learning
Data science in your business
12
“Data science is an interdisciplinary field about
processes and systems to extract knowledge
or insights from data”
OCTO TECHNOLOGY > THERE IS A BETTER WAY
https://en.wikipedia.org/wiki/Data_science
13OCTO TECHNOLOGY > THERE IS A BETTER WAY
Cray 2 iPhone 4=1 1
15OCTO TECHNOLOGY > THERE IS A BETTER WAY
16
AGILE DATA SCIENCE
OCTO TECHNOLOGY > THERE IS A BETTER WAY
DATA SCIENCE TONIGHT
OCTO TECHNOLOGY > THERE IS A BETTER WAY 17
Visualization
1
2
3
4
Why the buzz about data science?
Demystifying machine learning
Data science in your business
18
“Machine learning explores the study and
construction of algorithms that can learn
from and make predictions on data”
OCTO TECHNOLOGY > THERE IS A BETTER WAY
https://en.wikipedia.org/wiki/Machine_learning
19
MACHINE LEARNING
Conditions
OCTO TECHNOLOGY > THERE IS A BETTER WAY
1
2
3
A pattern exists
The problem cannot be described
analytically by a mathematical formula
Data, data, data
Machine learning algorithms exists
for many years
In general, model performances
improve with more data
20OCTO TECHNOLOGY > THERE IS A BETTER WAY
21
FLIGHT CHARACTERISTICS
OCTO TECHNOLOGY > THERE IS A BETTER WAY 21
Flight #
Dep
Airport
Dep
Hour
Dep
Week Day
Aircraft
Model
…
1 SYD 8:10 1 A330
2 SYD 14:15 2 B777
3 MEL 18:10 1 B777
4 PER 6:50 4 A320
5 SYD 9:50 3 A320
6 PER 12:10 1 A320
7 TZN 14:50 1 B777
8 MEL 14:15 4 A320
9 SYD 8:30 3 A320
10 MEL 16:40 1 A320
11 MEL 9:30 3 B747
12 TZN 9:30 1 A320
13 PER 9:50 3 A320
14 SYD 13:10 1 A320
22
EVENTS
OCTO TECHNOLOGY > THERE IS A BETTER WAY 22
Flight #
Dep
Airport
Dep
Hour
Dep
Week Day
Aircraft
Model
…
Actual
Delay
1 SYD 8:10 1 A330 0
2 SYD 14:15 2 B777 3
3 MEL 18:10 1 B777 0
4 PER 6:50 4 A320 17
5 SYD 9:50 3 A320 0
6 PER 12:10 1 A320 23
7 TZN 14:50 1 B777 0
8 MEL 14:15 4 A320 0
9 SYD 8:30 3 A320 0
10 MEL 16:40 1 A320 12
11 MEL 9:30 3 B747 32
12 TZN 9:30 1 A320 20
13 PER 9:50 3 A320 0
14 SYD 13:10 1 A320 9
23
EVENTS
OCTO TECHNOLOGY > THERE IS A BETTER WAY 23
Flight #
Dep
Airport
Dep
Hour
Dep
Week Day
Aircraft
Model
…
Actual
Delay
1 SYD 8:10 1 A330 0
2 SYD 14:15 2 B777 3
3 MEL 18:10 1 B777 0
4 PER 6:50 4 A320 17
5 SYD 9:50 3 A320 0
6 PER 12:10 1 A320 23
7 TZN 14:50 1 B777 0
8 MEL 14:15 4 A320 0
9 SYD 8:30 3 A320 0
10 MEL 16:40 1 A320 12
11 MEL 9:30 3 B747 32
12 TZN 9:30 1 A320 20
13 PER 9:50 3 A320 0
14 SYD 13:10 1 A320 9
A flight is labeled “delayed”
if actual delay >= 15min
24
LABEL
OCTO TECHNOLOGY > THERE IS A BETTER WAY 24
Flight #
Dep
Airport
Dep
Hour
Dep
Week Day
Aircraft
Model
…
Actual
Delay
1 SYD 8:10 1 A330 0
2 SYD 14:15 2 B777 3
3 MEL 18:10 1 B777 0
4 PER 6:50 4 A320 17
5 SYD 9:50 3 A320 0
6 PER 12:10 1 A320 23
7 TZN 14:50 1 B777 0
8 MEL 14:15 4 A320 0
9 SYD 8:30 3 A320 0
10 MEL 16:40 1 A320 12
11 MEL 9:30 3 B747 32
12 TZN 9:30 1 A320 20
13 PER 9:50 3 A320 0
14 SYD 13:10 1 A320 9
Class
0
0
0
1
0
1
0
0
0
0
1
1
0
0
25
BUILD A MODEL
OCTO TECHNOLOGY > THERE IS A BETTER WAY
1 SYD 8:10 1 A330 0
2 SYD 14:15 2 B777 0
3 MEL 18:10 1 B777 0
4 PER 6:50 4 A320 1
5 SYD 9:50 3 A320 0
6 PER 12:10 1 A320 1
7 TZN 14:50 1 B777 0
8 MEL 14:15 4 A320 0
9 SYD 8:30 3 A320 0
10 MEL 16:40 1 A320 0
… … … … … …
11 MEL 9:30 3 B747 1
12 TZN 9:30 1 A320 1
13 PER 9:50 3 A320 0
14 SYD 13:10 1 A320 0
Flight
#
Dep
Airport
Dep
Hour
Dep
Week Day
Aircraft
Model Delay
θ1
θ2
θ3
…
θn
X Y
26
LOGISTIC REGRESSION
Classification algorithm
OCTO TECHNOLOGY > THERE IS A BETTER WAY
27
DECISION TREE
Classification algorithm
OCTO TECHNOLOGY > THERE IS A BETTER WAY
DoW
>5
Month
>5
PAX
>35%
AoD
=“SYD”
no
no
no
yes
yes
yes
yesno
+-
-+-
28
RANDOM FOREST
Classification algorithm
OCTO TECHNOLOGY > THERE IS A BETTER WAY 28
29
TEST CLASSIFIER
OCTO TECHNOLOGY > THERE IS A BETTER WAY 29
Flight #
Dep
Airport
Dep
Hour
Dep
Week Day
Aircraft
Model
…
9 SYD 8:30 3 A320
1 positive (delayed)
0 negative (on time)
30
A PERFECT CLASSIFIER
OCTO TECHNOLOGY > THERE IS A BETTER WAY 30
Flight #
Dep
Airport
Dep
Hour
Dep
Week Day
Aircraft
Model
1 SYD 8:10 1 A330
2 SYD 14:15 2 B777
3 MEL 18:10 1 B777
4 PER 6:50 4 A320
5 SYD 9:50 3 A320
6 PER 12:10 1 A320
7 TZN 14:50 1 B777
8 MEL 14:15 4 A320
9 SYD 8:30 3 A320
10 MEL 16:40 1 A320
11 MEL 9:30 3 B747
12 TZN 9:30 1 A320
13 PER 9:50 3 A320
14 SYD 13:10 1 A320
1
1
1
1
0
0
0
0
0
0
0
0
0
0
31
1
1
0
1
0
1
0
0
0
0
0
0
1
0
A MORE REALISTIC CLASSIFIER
OCTO TECHNOLOGY > THERE IS A BETTER WAY 31
Flight #
Dep
Airport
Dep
Hour
Dep
Week Day
Aircraft
Model
1 SYD 8:10 1 A330
2 SYD 14:15 2 B777
3 MEL 18:10 1 B777
4 PER 6:50 4 A320
5 SYD 9:50 3 A320
6 PER 12:10 1 A320
7 TZN 14:50 1 B777
8 MEL 14:15 4 A320
9 SYD 8:30 3 A320
10 MEL 16:40 1 A320
11 MEL 9:30 3 B747
12 TZN 9:30 1 A320
13 PER 9:50 3 A320
14 SYD 13:10 1 A320
Wrongly
classified
32
CONFUSION MATRIX
The summary to optimize
OCTO TECHNOLOGY > THERE IS A BETTER WAY
32
Actually
delayed on time
Predicted
+
(delayed)
3 2
-
(on time)
1 8
True Positive
False Negative
False Positive
True Negative
33
PERFORMANCE INDICATORS
OCTO TECHNOLOGY > THERE IS A BETTER WAY
33
Actually
delayed on time
Predicted
+
(delayed)
3 2
-
(on time)
1 8
TP
FN
FP
TN
False Positive Rate =
True Positive Rate =
TP
TP + FN
FP
FP + TN(1 – Specificity)
(Sensitivity)
Precision =
TP
TP + FP
Recall =
TP
TP + FN
34
0.9
0.8
0.8
0.3
0.2
0.1
0.5
0.4
0.5
0.4
0.3
0.7
0.8
0.5
CLASSIFIER
Assigning a continuous score of being delayed
OCTO TECHNOLOGY > THERE IS A BETTER WAY 34
0 1
+-
35
PREDICTOR SCORE DISTRIBUTION
OCTO TECHNOLOGY > THERE IS A BETTER WAY 35
Score
Delayed flights
On time
flights
Eventscount
A perfect
score cutoff
36
PREDICTOR SCORE DISTRIBUTION
Fixing a score cutoff leads to false positive and negative
OCTO TECHNOLOGY > THERE IS A BETTER WAY 36
Score
False Positive
False Negative
Eventscount
37
ROC CURVES TO COMPARE CLASSIFIERS
Fixing score cutoffs lead to different false positive and negative rates
OCTO TECHNOLOGY > THERE IS A BETTER WAY 37
False Positive Rate
TruePositiveRate
0
1
0 1
38
ROC AND ROLL
 ROC allow to compare different models
 Area Under the Curve (AUC) is only a projection of the overall
performance
 Significantly different models can have close ROC
 Other comparisons methods exists (and are intimately related to ROC):
> Precision/Recall
> LIFT
A few comments about ROC curves
OCTO TECHNOLOGY > THERE IS A BETTER WAY 38
AUC
39
MODELS & DATA
Precision score for the TOP 20%
Traditional models Advanced models Advanced models
with more data
Advanced models
with more data
and more features
Precision
40
MODELS & DATA
Traditional models Advanced models Advanced models
with more data
Advanced models
with more data
and more features
Precision
Precision score for the TOP 20%
MODELS & DATA
Traditional models Advanced models Advanced models
with more data
Advanced models
with more data
and more features
Precision
Precision score for the TOP 20%
42
FIGHT DELAY PREDICTION: RESULTS
All reasons for delays
 Overall improvement by a factor 3
Focus on air traffic
 Overall improvement by a factor 6
Delay caused by passengers
 No improvement
10% LIFT score
OCTO TECHNOLOGY > THERE IS A BETTER WAY
43
PREDICT NUMBER OF PASSENGERS ON A PLANE
Optimize catering
OCTO TECHNOLOGY > THERE IS A BETTER WAY 43
t0 - 4 hours t0
Flight
Number
Booked Departure
port
… Departure
hour
0777 152 PER … 14
1116 201 SYD … 9
0961 92 BNE … 6
0538 189 MEL … 12
1078 136 SYD … 23
Final Number
of passengers
164
186
125
189
87
t
?
~ 50 explanatory variables
X y
t0 - 1 hour
44
RESULTS
OCTO TECHNOLOGY > THERE IS A BETTER WAY
Passenger
difference
No model Model
< 5 55% 69%
< 10 80% 89%
$1-2M per year
45
UNSTRUCTURED DATA
OCTO TECHNOLOGY > THERE IS A BETTER WAY
47
1
48
WHAT ARE THE FEATURES?
mimagesfortraining
n features
X
…
6
…
Y
49
WHAT ARE THE FEATURES?
5
4
3
2
1
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0
0
4
5
5
5
5
4
1
4
4
1
0
1
5
4
5
1
0
0
0
1
5
1
5
0
0
0
0
0
5
4
4
0
0
0
0
0
2
5
2
0
0
0
0
0
0
0
0
50
WHAT ARE THE FEATURES?
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0
0
4
5
5
5
5
4
1
4
4
1
0
1
5
4
5
1
0
0
0
1
5
1
5
0
0
0
0
0
5
4
4
0
0
0
0
0
2
5
2
0
0
0
0
0
0
0
0
= 6
(…)
6
6
3
…
0
7
n features
mimagesfortraining
X Y
51
NEURAL NETWORK
OCTO TECHNOLOGY > THERE IS A BETTER WAY
CAN COMPUTER VISION SPOT DISTRACTED DRIVERS?
 24 Juin 2016 – Julien Krywyk
OCTO TECHNOLOGY > THERE IS A BETTER WAY 52
Phone right Safe Text right Phone left Text left
Speaking Makeup Behind Drink Radio
OCTO TECHNOLOGY > THERE IS A BETTER WAY 53
Build classifier
Train 22K images Test 80K images
Predicted
classes
X Y
Make predictions
?
DEEP LEARNING
OCTO TECHNOLOGY > THERE IS A BETTER WAY 54
Identify pixels
Identify edges and
simple shape
Identify complex
shapes and object
Identify which shape to
be used to define a
human face
DEEP LEARNING
Transfer learning
OCTO TECHNOLOGY > THERE IS A BETTER WAY 55
n features
X Y
Features
extractions
pre-trained CNN
DATA SCIENCE TONIGHT
OCTO TECHNOLOGY > THERE IS A BETTER WAY 56
Visualization
1
2
3
4
Why the buzz about data science?
Demystifying machine learning
Data science in your business
57
VISUALIZATION
OCTO TECHNOLOGY > THERE IS A BETTER WAY
Understand
Communicate
results & analysis
58
1880: TEXTILE PRODUCTION IN ENGLAND (OTTO NEURATH, ~1920)
Changing the world by educating people about the world around them
OCTO TECHNOLOGY > THERE IS A BETTER WAY
59
NAPOLEON 1812 CAMPAIGN (CHARLES MINARD, 1869)
OCTO TECHNOLOGY > THERE IS A BETTER WAY
60
HOW TRUMP PUSHED THE ELECTION MAP TO THE RIGHT (NEW YORK TIMES)
OCTO TECHNOLOGY > THERE IS A BETTER WAY
61
VISUALIZATION TO GET ACQUAINTED WITH DATA
OCTO TECHNOLOGY > THERE IS A BETTER WAY
EXPLORATION: FLIGHT DELAY PER MONTH AND DAY OF WEEK
63
DATA VISUALIZATION
Correlation between ‘Departure Hour’ and passenger delta
OCTO TECHNOLOGY > THERE IS A BETTER WAY 63
64
NOTEBOOKS
Interactive data analysis
OCTO TECHNOLOGY > THERE IS A BETTER WAY
65
VISUALIZATION AS A GAME CHANGER
OCTO TECHNOLOGY > THERE IS A BETTER WAY
66
VALIDATION
OCTO TECHNOLOGY > THERE IS A BETTER WAY
https://github.com/genentech/fishtones-js
DATA SCIENCE TONIGHT
OCTO TECHNOLOGY > THERE IS A BETTER WAY 69
Visualization
1
2
3
4
Why the buzz about data science?
Demystifying machine learning
Data science in your business
70
I WANT A DATA SCIENTIST!
OCTO TECHNOLOGY > THERE IS A BETTER WAY
71OCTO TECHNOLOGY > THERE IS A BETTER WAY
72
AGILE DATA SCIENCE
OCTO TECHNOLOGY > THERE IS A BETTER WAY
Agile Data science
Feature
Team
Operations
Business
analyst
Developper
tech expertProject
Manager
Data
scientist
Architect
Individuals and interactions over processes and tools
Working software over comprehensive documentation
Customer collaboration over contract negotiation
Responding to change over following a plan
That is, while there is value in the items on the right, we value the items on the left more
OCTO TECHNOLOGY > THERE IS A BETTER WAY
BUILDING A DATALAB
OCTO TECHNOLOGY > THERE IS A BETTER WAY 75
Source System Collect, storage et data preparation Analysis delivery
External
sources
Datalab
Existing infrastructure
(multiples sources)
ETL
Extract
cleanup,
transfor
m
load
Staging area Datawarehouse
technical layer
(referential/
Operation)
Datamart
technique
(zone de collecte)
Datamart
(management,
marketing, sales
User access
(Reporting, Analytics)
Batch
• Analyses
• Indicators
• Statistics
Online
• Dashboards
• Reporting
• Requests
Administratio
n
• Admini
• Validation
DEVOPS – EMBRACING NEW KNOW HOW
And new collaborations…
Data Scientist
• Innovates
• With new technologies
“What !? A unit test on my
neural network???
OPS
• Look after rationalization
“What!? Your piece of Scala
calls a Python library embedding C ???”
OCTO TECHNOLOGY > THERE IS A BETTER WAY
78
DEMOCRATIZATION
 cours
OCTO TECHNOLOGY > THERE IS A BETTER WAY
1 million
enrollments
OCTO TECHNOLOGY > THERE IS A BETTER WAY
81
Business must be aware of opportunities to use
algorithms
BUSINESS & DATA SCIENCE
OCTO TECHNOLOGY > THERE IS A BETTER WAY
Data must be easily accessible
Focus on lowest time to market possible
USE CASES CLASSES AND THEIR BUSINESS VALUE
OCTO TECHNOLOGY > THERE IS A BETTER WAY 82
The prediction is a
support for decision
Analyses support
data understanding
The prediction is the
decision
Business value
OCTO TECHNOLOGY > THERE IS A BETTER WAY 83
???
???

More Related Content

Viewers also liked

формы предпринимательской деятельности
формы предпринимательской деятельностиформы предпринимательской деятельности
формы предпринимательской деятельностиIvan Adonov
 
GRIEVANCE MECHANISM - Total
GRIEVANCE MECHANISM - TotalGRIEVANCE MECHANISM - Total
GRIEVANCE MECHANISM - TotalEthical Sector
 
Exame de fisica de 2005
Exame de fisica de 2005Exame de fisica de 2005
Exame de fisica de 2005Ezio VLK
 
Pathway to Gratitude: How to Create Thankful Habits
Pathway to Gratitude: How to Create Thankful Habits Pathway to Gratitude: How to Create Thankful Habits
Pathway to Gratitude: How to Create Thankful Habits Brittany Ritcher
 
нов Microsoft office power point presentation
нов Microsoft office power point presentationнов Microsoft office power point presentation
нов Microsoft office power point presentationPlamenchetoo
 
PA proposta di Diego Zardini
PA proposta di Diego ZardiniPA proposta di Diego Zardini
PA proposta di Diego ZardiniAntonino Leone
 
O&G SWIA Consultation Presentation (April 2014) - Myanmar Centre for Responsi...
O&G SWIA Consultation Presentation (April 2014) - Myanmar Centre for Responsi...O&G SWIA Consultation Presentation (April 2014) - Myanmar Centre for Responsi...
O&G SWIA Consultation Presentation (April 2014) - Myanmar Centre for Responsi...Ethical Sector
 
SHELL / BG OFFSHORE SEISMIC SURVEY CAMPAIGN IEE PROCESS Lessons learned
SHELL / BG OFFSHORE SEISMIC SURVEY CAMPAIGN IEE PROCESS Lessons learnedSHELL / BG OFFSHORE SEISMIC SURVEY CAMPAIGN IEE PROCESS Lessons learned
SHELL / BG OFFSHORE SEISMIC SURVEY CAMPAIGN IEE PROCESS Lessons learnedEthical Sector
 
farman Ali Khan paper
farman Ali Khan paperfarman Ali Khan paper
farman Ali Khan paperFarman Khan
 

Viewers also liked (13)

формы предпринимательской деятельности
формы предпринимательской деятельностиформы предпринимательской деятельности
формы предпринимательской деятельности
 
Occupational Identity
Occupational IdentityOccupational Identity
Occupational Identity
 
Chude02 nhom02
Chude02 nhom02Chude02 nhom02
Chude02 nhom02
 
GRIEVANCE MECHANISM - Total
GRIEVANCE MECHANISM - TotalGRIEVANCE MECHANISM - Total
GRIEVANCE MECHANISM - Total
 
Exame de fisica de 2005
Exame de fisica de 2005Exame de fisica de 2005
Exame de fisica de 2005
 
Pathway to Gratitude: How to Create Thankful Habits
Pathway to Gratitude: How to Create Thankful Habits Pathway to Gratitude: How to Create Thankful Habits
Pathway to Gratitude: How to Create Thankful Habits
 
нов Microsoft office power point presentation
нов Microsoft office power point presentationнов Microsoft office power point presentation
нов Microsoft office power point presentation
 
Prayer
PrayerPrayer
Prayer
 
PA proposta di Diego Zardini
PA proposta di Diego ZardiniPA proposta di Diego Zardini
PA proposta di Diego Zardini
 
O&G SWIA Consultation Presentation (April 2014) - Myanmar Centre for Responsi...
O&G SWIA Consultation Presentation (April 2014) - Myanmar Centre for Responsi...O&G SWIA Consultation Presentation (April 2014) - Myanmar Centre for Responsi...
O&G SWIA Consultation Presentation (April 2014) - Myanmar Centre for Responsi...
 
SHELL / BG OFFSHORE SEISMIC SURVEY CAMPAIGN IEE PROCESS Lessons learned
SHELL / BG OFFSHORE SEISMIC SURVEY CAMPAIGN IEE PROCESS Lessons learnedSHELL / BG OFFSHORE SEISMIC SURVEY CAMPAIGN IEE PROCESS Lessons learned
SHELL / BG OFFSHORE SEISMIC SURVEY CAMPAIGN IEE PROCESS Lessons learned
 
500 033 android
500 033 android500 033 android
500 033 android
 
farman Ali Khan paper
farman Ali Khan paperfarman Ali Khan paper
farman Ali Khan paper
 

Similar to Afterwork Big Data - Data Science & Machine Learning : explorer, comprendre et prédire

CASSIS Controlled Time of Arrival Flight Trials: Results and Analysis Present...
CASSIS Controlled Time of Arrival Flight Trials: Results and Analysis Present...CASSIS Controlled Time of Arrival Flight Trials: Results and Analysis Present...
CASSIS Controlled Time of Arrival Flight Trials: Results and Analysis Present...Bruce Ware, PMI Certified PM Professional
 
KNN and regression Tree
KNN and regression TreeKNN and regression Tree
KNN and regression TreeAsmar Farooq
 
informs_poster_v9_sg
informs_poster_v9_sginforms_poster_v9_sg
informs_poster_v9_sgShreya Gupta
 
India Aviation ICT Forum 2013 - Manish Sinha, Deputy COO, Hyderabad Internati...
India Aviation ICT Forum 2013 - Manish Sinha, Deputy COO, Hyderabad Internati...India Aviation ICT Forum 2013 - Manish Sinha, Deputy COO, Hyderabad Internati...
India Aviation ICT Forum 2013 - Manish Sinha, Deputy COO, Hyderabad Internati...SITA
 
Senior Aviation Limited Upload
Senior Aviation Limited UploadSenior Aviation Limited Upload
Senior Aviation Limited UploadJeremy Mitchell
 
Queuing (Transportation Engineering)
Queuing (Transportation Engineering)Queuing (Transportation Engineering)
Queuing (Transportation Engineering)Hossam Shafiq I
 
Course project for CEE 4674
Course project for CEE 4674Course project for CEE 4674
Course project for CEE 4674Junqi Hu
 
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdfDell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdfAhmedMaher181036
 
Overview ITD \'09
Overview ITD \'09Overview ITD \'09
Overview ITD \'09k_scarpe
 
Presentation AME conference "Aircraft Engineers' Power to Innovate"
Presentation AME conference "Aircraft Engineers' Power to Innovate"Presentation AME conference "Aircraft Engineers' Power to Innovate"
Presentation AME conference "Aircraft Engineers' Power to Innovate"MROair
 
Technical Debt Must Die: Communicating Code to Business Stakeholders
Technical Debt Must Die: Communicating Code to Business StakeholdersTechnical Debt Must Die: Communicating Code to Business Stakeholders
Technical Debt Must Die: Communicating Code to Business StakeholdersMatt Eland
 
EffectiveAirportManagement-Diffs-Sols-191213.ppt
EffectiveAirportManagement-Diffs-Sols-191213.pptEffectiveAirportManagement-Diffs-Sols-191213.ppt
EffectiveAirportManagement-Diffs-Sols-191213.pptSURAJ SINGH SAINI
 
Data-driven model-based restructuring of enterprise transaction operations
Data-driven model-based restructuring of enterprise transaction operationsData-driven model-based restructuring of enterprise transaction operations
Data-driven model-based restructuring of enterprise transaction operationsSudhendu Rai
 
Corporate Presentation
Corporate Presentation Corporate Presentation
Corporate Presentation Embraer RI
 
Arq12610161 091209104850-phpapp01
Arq12610161 091209104850-phpapp01Arq12610161 091209104850-phpapp01
Arq12610161 091209104850-phpapp01cirobondesan
 
computer science project on movie booking system
computer science project on movie booking systemcomputer science project on movie booking system
computer science project on movie booking systemAnurag Yadav
 
Olympuskpres20111118
Olympuskpres20111118Olympuskpres20111118
Olympuskpres20111118cargouav
 
회사소개서(항공) 낙우산업(주) 영문
회사소개서(항공) 낙우산업(주) 영문회사소개서(항공) 낙우산업(주) 영문
회사소개서(항공) 낙우산업(주) 영문ssuser8d2a8a
 
Data driven modeling of systemic delay propagation under severe meteorologica...
Data driven modeling of systemic delay propagation under severe meteorologica...Data driven modeling of systemic delay propagation under severe meteorologica...
Data driven modeling of systemic delay propagation under severe meteorologica...Innaxis Foundation and Research Institute
 

Similar to Afterwork Big Data - Data Science & Machine Learning : explorer, comprendre et prédire (20)

CASSIS Controlled Time of Arrival Flight Trials: Results and Analysis Present...
CASSIS Controlled Time of Arrival Flight Trials: Results and Analysis Present...CASSIS Controlled Time of Arrival Flight Trials: Results and Analysis Present...
CASSIS Controlled Time of Arrival Flight Trials: Results and Analysis Present...
 
KNN and regression Tree
KNN and regression TreeKNN and regression Tree
KNN and regression Tree
 
informs_poster_v9_sg
informs_poster_v9_sginforms_poster_v9_sg
informs_poster_v9_sg
 
India Aviation ICT Forum 2013 - Manish Sinha, Deputy COO, Hyderabad Internati...
India Aviation ICT Forum 2013 - Manish Sinha, Deputy COO, Hyderabad Internati...India Aviation ICT Forum 2013 - Manish Sinha, Deputy COO, Hyderabad Internati...
India Aviation ICT Forum 2013 - Manish Sinha, Deputy COO, Hyderabad Internati...
 
Sergio Martins - OPERATIONS AND PASSENGERS WORKSHOP - PANEL 2
Sergio Martins - OPERATIONS AND PASSENGERS WORKSHOP - PANEL 2Sergio Martins - OPERATIONS AND PASSENGERS WORKSHOP - PANEL 2
Sergio Martins - OPERATIONS AND PASSENGERS WORKSHOP - PANEL 2
 
Senior Aviation Limited Upload
Senior Aviation Limited UploadSenior Aviation Limited Upload
Senior Aviation Limited Upload
 
Queuing (Transportation Engineering)
Queuing (Transportation Engineering)Queuing (Transportation Engineering)
Queuing (Transportation Engineering)
 
Course project for CEE 4674
Course project for CEE 4674Course project for CEE 4674
Course project for CEE 4674
 
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdfDell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
 
Overview ITD \'09
Overview ITD \'09Overview ITD \'09
Overview ITD \'09
 
Presentation AME conference "Aircraft Engineers' Power to Innovate"
Presentation AME conference "Aircraft Engineers' Power to Innovate"Presentation AME conference "Aircraft Engineers' Power to Innovate"
Presentation AME conference "Aircraft Engineers' Power to Innovate"
 
Technical Debt Must Die: Communicating Code to Business Stakeholders
Technical Debt Must Die: Communicating Code to Business StakeholdersTechnical Debt Must Die: Communicating Code to Business Stakeholders
Technical Debt Must Die: Communicating Code to Business Stakeholders
 
EffectiveAirportManagement-Diffs-Sols-191213.ppt
EffectiveAirportManagement-Diffs-Sols-191213.pptEffectiveAirportManagement-Diffs-Sols-191213.ppt
EffectiveAirportManagement-Diffs-Sols-191213.ppt
 
Data-driven model-based restructuring of enterprise transaction operations
Data-driven model-based restructuring of enterprise transaction operationsData-driven model-based restructuring of enterprise transaction operations
Data-driven model-based restructuring of enterprise transaction operations
 
Corporate Presentation
Corporate Presentation Corporate Presentation
Corporate Presentation
 
Arq12610161 091209104850-phpapp01
Arq12610161 091209104850-phpapp01Arq12610161 091209104850-phpapp01
Arq12610161 091209104850-phpapp01
 
computer science project on movie booking system
computer science project on movie booking systemcomputer science project on movie booking system
computer science project on movie booking system
 
Olympuskpres20111118
Olympuskpres20111118Olympuskpres20111118
Olympuskpres20111118
 
회사소개서(항공) 낙우산업(주) 영문
회사소개서(항공) 낙우산업(주) 영문회사소개서(항공) 낙우산업(주) 영문
회사소개서(항공) 낙우산업(주) 영문
 
Data driven modeling of systemic delay propagation under severe meteorologica...
Data driven modeling of systemic delay propagation under severe meteorologica...Data driven modeling of systemic delay propagation under severe meteorologica...
Data driven modeling of systemic delay propagation under severe meteorologica...
 

More from OCTO Technology Suisse

An afterwork on Microservices by @OCTO Technology Switzerland
An afterwork on Microservices  by @OCTO Technology SwitzerlandAn afterwork on Microservices  by @OCTO Technology Switzerland
An afterwork on Microservices by @OCTO Technology SwitzerlandOCTO Technology Suisse
 
Afterwork Devops : vision et pratiques
Afterwork Devops : vision et pratiquesAfterwork Devops : vision et pratiques
Afterwork Devops : vision et pratiquesOCTO Technology Suisse
 
Êtes-vous API dans votre organisation ?
Êtes-vous API dans votre organisation ?Êtes-vous API dans votre organisation ?
Êtes-vous API dans votre organisation ?OCTO Technology Suisse
 
big data et data viz - du lac à votre écran - afterwork
big data et data viz - du lac à votre écran - afterwork big data et data viz - du lac à votre écran - afterwork
big data et data viz - du lac à votre écran - afterwork OCTO Technology Suisse
 
Dev wednesday-swiss-transport-realtime
Dev wednesday-swiss-transport-realtimeDev wednesday-swiss-transport-realtime
Dev wednesday-swiss-transport-realtimeOCTO Technology Suisse
 
Réussissez le développement de votre prochaine application web ou mobile
Réussissez le développement de votre prochaine application web ou mobileRéussissez le développement de votre prochaine application web ou mobile
Réussissez le développement de votre prochaine application web ou mobileOCTO Technology Suisse
 
L'ADN d'un développement produit réussi
L'ADN d'un développement produit réussiL'ADN d'un développement produit réussi
L'ADN d'un développement produit réussiOCTO Technology Suisse
 
Fintech : concurrents ou partenaires ?
Fintech : concurrents ou partenaires ?Fintech : concurrents ou partenaires ?
Fintech : concurrents ou partenaires ?OCTO Technology Suisse
 
Fintech demain comment travailler ensemble
Fintech   demain comment travailler ensembleFintech   demain comment travailler ensemble
Fintech demain comment travailler ensembleOCTO Technology Suisse
 
Softshake 2015 - Des small data aux big data - Méthodes et Technologies
Softshake 2015 - Des small data aux big data - Méthodes et TechnologiesSoftshake 2015 - Des small data aux big data - Méthodes et Technologies
Softshake 2015 - Des small data aux big data - Méthodes et TechnologiesOCTO Technology Suisse
 
Qu'est qu'une Data Driven Company à l'heure de la digitalisation ?
Qu'est qu'une Data Driven Company à l'heure de la digitalisation ?Qu'est qu'une Data Driven Company à l'heure de la digitalisation ?
Qu'est qu'une Data Driven Company à l'heure de la digitalisation ?OCTO Technology Suisse
 
OCTO Technology - Data Driven Company - SITB15
OCTO Technology - Data Driven Company - SITB15OCTO Technology - Data Driven Company - SITB15
OCTO Technology - Data Driven Company - SITB15OCTO Technology Suisse
 
De la pensée projet à la pensée produit
De la pensée projet à la pensée produitDe la pensée projet à la pensée produit
De la pensée projet à la pensée produitOCTO Technology Suisse
 
Les Business Analysts face à l'agilité : de nouveaux challenges à relever
Les Business Analysts face à l'agilité : de nouveaux challenges à releverLes Business Analysts face à l'agilité : de nouveaux challenges à relever
Les Business Analysts face à l'agilité : de nouveaux challenges à releverOCTO Technology Suisse
 

More from OCTO Technology Suisse (20)

An afterwork on Microservices by @OCTO Technology Switzerland
An afterwork on Microservices  by @OCTO Technology SwitzerlandAn afterwork on Microservices  by @OCTO Technology Switzerland
An afterwork on Microservices by @OCTO Technology Switzerland
 
Afterwork Devops : vision et pratiques
Afterwork Devops : vision et pratiquesAfterwork Devops : vision et pratiques
Afterwork Devops : vision et pratiques
 
Êtes-vous API dans votre organisation ?
Êtes-vous API dans votre organisation ?Êtes-vous API dans votre organisation ?
Êtes-vous API dans votre organisation ?
 
Afterwork "Décollez vers le Cloud"
Afterwork "Décollez vers le Cloud"Afterwork "Décollez vers le Cloud"
Afterwork "Décollez vers le Cloud"
 
big data et data viz - du lac à votre écran - afterwork
big data et data viz - du lac à votre écran - afterwork big data et data viz - du lac à votre écran - afterwork
big data et data viz - du lac à votre écran - afterwork
 
2017 03-29-elastic-meetup-kibana
2017 03-29-elastic-meetup-kibana2017 03-29-elastic-meetup-kibana
2017 03-29-elastic-meetup-kibana
 
Dev wednesday-swiss-transport-realtime
Dev wednesday-swiss-transport-realtimeDev wednesday-swiss-transport-realtime
Dev wednesday-swiss-transport-realtime
 
Réussissez le développement de votre prochaine application web ou mobile
Réussissez le développement de votre prochaine application web ou mobileRéussissez le développement de votre prochaine application web ou mobile
Réussissez le développement de votre prochaine application web ou mobile
 
L'ADN d'un développement produit réussi
L'ADN d'un développement produit réussiL'ADN d'un développement produit réussi
L'ADN d'un développement produit réussi
 
Fintech : concurrents ou partenaires ?
Fintech : concurrents ou partenaires ?Fintech : concurrents ou partenaires ?
Fintech : concurrents ou partenaires ?
 
Fintech demain comment travailler ensemble
Fintech   demain comment travailler ensembleFintech   demain comment travailler ensemble
Fintech demain comment travailler ensemble
 
Softshake 2015 - Des small data aux big data - Méthodes et Technologies
Softshake 2015 - Des small data aux big data - Méthodes et TechnologiesSoftshake 2015 - Des small data aux big data - Méthodes et Technologies
Softshake 2015 - Des small data aux big data - Méthodes et Technologies
 
Démystifions l'API-culture!
Démystifions l'API-culture!Démystifions l'API-culture!
Démystifions l'API-culture!
 
Qu'est qu'une Data Driven Company à l'heure de la digitalisation ?
Qu'est qu'une Data Driven Company à l'heure de la digitalisation ?Qu'est qu'une Data Driven Company à l'heure de la digitalisation ?
Qu'est qu'une Data Driven Company à l'heure de la digitalisation ?
 
OCTO Technology - Data Driven Company - SITB15
OCTO Technology - Data Driven Company - SITB15OCTO Technology - Data Driven Company - SITB15
OCTO Technology - Data Driven Company - SITB15
 
Afterwork - La Révolution Digitale
Afterwork - La Révolution DigitaleAfterwork - La Révolution Digitale
Afterwork - La Révolution Digitale
 
Brochure Vers l'entreprise Agile
Brochure Vers l'entreprise AgileBrochure Vers l'entreprise Agile
Brochure Vers l'entreprise Agile
 
De la pensée projet à la pensée produit
De la pensée projet à la pensée produitDe la pensée projet à la pensée produit
De la pensée projet à la pensée produit
 
Agile & Top Management
Agile & Top ManagementAgile & Top Management
Agile & Top Management
 
Les Business Analysts face à l'agilité : de nouveaux challenges à relever
Les Business Analysts face à l'agilité : de nouveaux challenges à releverLes Business Analysts face à l'agilité : de nouveaux challenges à relever
Les Business Analysts face à l'agilité : de nouveaux challenges à relever
 

Recently uploaded

100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 

Recently uploaded (20)

100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 

Afterwork Big Data - Data Science & Machine Learning : explorer, comprendre et prédire

  • 1. © OCTO 2015 Tél : +41 (0) 21 312 94 15 www.octo.com Avenue du théatre, 7 1005 Lausanne SUISSEData Science & Machine Learning
  • 3. 2016 is the Year of Big Data @OCTO Switzerland Big Data Romandie
  • 4. OCTO PUBLICATIONS OCTO TECHNOLOGY > THERE IS A BETTER WAY 4
  • 5. WE ARE CONSUMING DATA SCIENCE EVERY DAY! Facial recognition
  • 6. Spam detection WE ARE CONSUMING DATA SCIENCE EVERY DAY!
  • 7. Voice recognition WE ARE CONSUMING DATA SCIENCE EVERY DAY!
  • 8. Movie recommendation WE ARE CONSUMING DATA SCIENCE EVERY DAY!
  • 9. 9 DATA SCIENCE, A DOMAIN DRIVEN BY COMPETITION To solve your business problems! Problem Data Crowd Knowledge & Tools Model for Prediction
  • 10. OCTO Folks Work Hard, Play Hard ◉ Caisse de dépôts - score de délivrance d'un brevet européen ◉ Argus - prédiction du prix de vente de véhicules d'occasion ◉ SNCF - prédiction de la fréquentation des gares en Ile de France ◉ Imperial College London - Loan Default Prediction ◉ Allstate – purchase prediction challenge ◉ Tradeshift – Text classification ◉ Microsoft - Malware classification OCTO, there is a better way to learn, recruit and have fun! 1st 2&4 3rd 6th 13th 2nd 5th
  • 11. DATA SCIENCE TONIGHT OCTO TECHNOLOGY > THERE IS A BETTER WAY 11 Visualization 1 2 3 4 Why the buzz about data science? Demystifying machine learning Data science in your business
  • 12. 12 “Data science is an interdisciplinary field about processes and systems to extract knowledge or insights from data” OCTO TECHNOLOGY > THERE IS A BETTER WAY https://en.wikipedia.org/wiki/Data_science
  • 13. 13OCTO TECHNOLOGY > THERE IS A BETTER WAY Cray 2 iPhone 4=1 1
  • 14.
  • 15. 15OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 16. 16 AGILE DATA SCIENCE OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 17. DATA SCIENCE TONIGHT OCTO TECHNOLOGY > THERE IS A BETTER WAY 17 Visualization 1 2 3 4 Why the buzz about data science? Demystifying machine learning Data science in your business
  • 18. 18 “Machine learning explores the study and construction of algorithms that can learn from and make predictions on data” OCTO TECHNOLOGY > THERE IS A BETTER WAY https://en.wikipedia.org/wiki/Machine_learning
  • 19. 19 MACHINE LEARNING Conditions OCTO TECHNOLOGY > THERE IS A BETTER WAY 1 2 3 A pattern exists The problem cannot be described analytically by a mathematical formula Data, data, data Machine learning algorithms exists for many years In general, model performances improve with more data
  • 20. 20OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 21. 21 FLIGHT CHARACTERISTICS OCTO TECHNOLOGY > THERE IS A BETTER WAY 21 Flight # Dep Airport Dep Hour Dep Week Day Aircraft Model … 1 SYD 8:10 1 A330 2 SYD 14:15 2 B777 3 MEL 18:10 1 B777 4 PER 6:50 4 A320 5 SYD 9:50 3 A320 6 PER 12:10 1 A320 7 TZN 14:50 1 B777 8 MEL 14:15 4 A320 9 SYD 8:30 3 A320 10 MEL 16:40 1 A320 11 MEL 9:30 3 B747 12 TZN 9:30 1 A320 13 PER 9:50 3 A320 14 SYD 13:10 1 A320
  • 22. 22 EVENTS OCTO TECHNOLOGY > THERE IS A BETTER WAY 22 Flight # Dep Airport Dep Hour Dep Week Day Aircraft Model … Actual Delay 1 SYD 8:10 1 A330 0 2 SYD 14:15 2 B777 3 3 MEL 18:10 1 B777 0 4 PER 6:50 4 A320 17 5 SYD 9:50 3 A320 0 6 PER 12:10 1 A320 23 7 TZN 14:50 1 B777 0 8 MEL 14:15 4 A320 0 9 SYD 8:30 3 A320 0 10 MEL 16:40 1 A320 12 11 MEL 9:30 3 B747 32 12 TZN 9:30 1 A320 20 13 PER 9:50 3 A320 0 14 SYD 13:10 1 A320 9
  • 23. 23 EVENTS OCTO TECHNOLOGY > THERE IS A BETTER WAY 23 Flight # Dep Airport Dep Hour Dep Week Day Aircraft Model … Actual Delay 1 SYD 8:10 1 A330 0 2 SYD 14:15 2 B777 3 3 MEL 18:10 1 B777 0 4 PER 6:50 4 A320 17 5 SYD 9:50 3 A320 0 6 PER 12:10 1 A320 23 7 TZN 14:50 1 B777 0 8 MEL 14:15 4 A320 0 9 SYD 8:30 3 A320 0 10 MEL 16:40 1 A320 12 11 MEL 9:30 3 B747 32 12 TZN 9:30 1 A320 20 13 PER 9:50 3 A320 0 14 SYD 13:10 1 A320 9 A flight is labeled “delayed” if actual delay >= 15min
  • 24. 24 LABEL OCTO TECHNOLOGY > THERE IS A BETTER WAY 24 Flight # Dep Airport Dep Hour Dep Week Day Aircraft Model … Actual Delay 1 SYD 8:10 1 A330 0 2 SYD 14:15 2 B777 3 3 MEL 18:10 1 B777 0 4 PER 6:50 4 A320 17 5 SYD 9:50 3 A320 0 6 PER 12:10 1 A320 23 7 TZN 14:50 1 B777 0 8 MEL 14:15 4 A320 0 9 SYD 8:30 3 A320 0 10 MEL 16:40 1 A320 12 11 MEL 9:30 3 B747 32 12 TZN 9:30 1 A320 20 13 PER 9:50 3 A320 0 14 SYD 13:10 1 A320 9 Class 0 0 0 1 0 1 0 0 0 0 1 1 0 0
  • 25. 25 BUILD A MODEL OCTO TECHNOLOGY > THERE IS A BETTER WAY 1 SYD 8:10 1 A330 0 2 SYD 14:15 2 B777 0 3 MEL 18:10 1 B777 0 4 PER 6:50 4 A320 1 5 SYD 9:50 3 A320 0 6 PER 12:10 1 A320 1 7 TZN 14:50 1 B777 0 8 MEL 14:15 4 A320 0 9 SYD 8:30 3 A320 0 10 MEL 16:40 1 A320 0 … … … … … … 11 MEL 9:30 3 B747 1 12 TZN 9:30 1 A320 1 13 PER 9:50 3 A320 0 14 SYD 13:10 1 A320 0 Flight # Dep Airport Dep Hour Dep Week Day Aircraft Model Delay θ1 θ2 θ3 … θn X Y
  • 26. 26 LOGISTIC REGRESSION Classification algorithm OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 27. 27 DECISION TREE Classification algorithm OCTO TECHNOLOGY > THERE IS A BETTER WAY DoW >5 Month >5 PAX >35% AoD =“SYD” no no no yes yes yes yesno +- -+-
  • 28. 28 RANDOM FOREST Classification algorithm OCTO TECHNOLOGY > THERE IS A BETTER WAY 28
  • 29. 29 TEST CLASSIFIER OCTO TECHNOLOGY > THERE IS A BETTER WAY 29 Flight # Dep Airport Dep Hour Dep Week Day Aircraft Model … 9 SYD 8:30 3 A320 1 positive (delayed) 0 negative (on time)
  • 30. 30 A PERFECT CLASSIFIER OCTO TECHNOLOGY > THERE IS A BETTER WAY 30 Flight # Dep Airport Dep Hour Dep Week Day Aircraft Model 1 SYD 8:10 1 A330 2 SYD 14:15 2 B777 3 MEL 18:10 1 B777 4 PER 6:50 4 A320 5 SYD 9:50 3 A320 6 PER 12:10 1 A320 7 TZN 14:50 1 B777 8 MEL 14:15 4 A320 9 SYD 8:30 3 A320 10 MEL 16:40 1 A320 11 MEL 9:30 3 B747 12 TZN 9:30 1 A320 13 PER 9:50 3 A320 14 SYD 13:10 1 A320 1 1 1 1 0 0 0 0 0 0 0 0 0 0
  • 31. 31 1 1 0 1 0 1 0 0 0 0 0 0 1 0 A MORE REALISTIC CLASSIFIER OCTO TECHNOLOGY > THERE IS A BETTER WAY 31 Flight # Dep Airport Dep Hour Dep Week Day Aircraft Model 1 SYD 8:10 1 A330 2 SYD 14:15 2 B777 3 MEL 18:10 1 B777 4 PER 6:50 4 A320 5 SYD 9:50 3 A320 6 PER 12:10 1 A320 7 TZN 14:50 1 B777 8 MEL 14:15 4 A320 9 SYD 8:30 3 A320 10 MEL 16:40 1 A320 11 MEL 9:30 3 B747 12 TZN 9:30 1 A320 13 PER 9:50 3 A320 14 SYD 13:10 1 A320 Wrongly classified
  • 32. 32 CONFUSION MATRIX The summary to optimize OCTO TECHNOLOGY > THERE IS A BETTER WAY 32 Actually delayed on time Predicted + (delayed) 3 2 - (on time) 1 8 True Positive False Negative False Positive True Negative
  • 33. 33 PERFORMANCE INDICATORS OCTO TECHNOLOGY > THERE IS A BETTER WAY 33 Actually delayed on time Predicted + (delayed) 3 2 - (on time) 1 8 TP FN FP TN False Positive Rate = True Positive Rate = TP TP + FN FP FP + TN(1 – Specificity) (Sensitivity) Precision = TP TP + FP Recall = TP TP + FN
  • 34. 34 0.9 0.8 0.8 0.3 0.2 0.1 0.5 0.4 0.5 0.4 0.3 0.7 0.8 0.5 CLASSIFIER Assigning a continuous score of being delayed OCTO TECHNOLOGY > THERE IS A BETTER WAY 34 0 1 +-
  • 35. 35 PREDICTOR SCORE DISTRIBUTION OCTO TECHNOLOGY > THERE IS A BETTER WAY 35 Score Delayed flights On time flights Eventscount A perfect score cutoff
  • 36. 36 PREDICTOR SCORE DISTRIBUTION Fixing a score cutoff leads to false positive and negative OCTO TECHNOLOGY > THERE IS A BETTER WAY 36 Score False Positive False Negative Eventscount
  • 37. 37 ROC CURVES TO COMPARE CLASSIFIERS Fixing score cutoffs lead to different false positive and negative rates OCTO TECHNOLOGY > THERE IS A BETTER WAY 37 False Positive Rate TruePositiveRate 0 1 0 1
  • 38. 38 ROC AND ROLL  ROC allow to compare different models  Area Under the Curve (AUC) is only a projection of the overall performance  Significantly different models can have close ROC  Other comparisons methods exists (and are intimately related to ROC): > Precision/Recall > LIFT A few comments about ROC curves OCTO TECHNOLOGY > THERE IS A BETTER WAY 38 AUC
  • 39. 39 MODELS & DATA Precision score for the TOP 20% Traditional models Advanced models Advanced models with more data Advanced models with more data and more features Precision
  • 40. 40 MODELS & DATA Traditional models Advanced models Advanced models with more data Advanced models with more data and more features Precision Precision score for the TOP 20%
  • 41. MODELS & DATA Traditional models Advanced models Advanced models with more data Advanced models with more data and more features Precision Precision score for the TOP 20%
  • 42. 42 FIGHT DELAY PREDICTION: RESULTS All reasons for delays  Overall improvement by a factor 3 Focus on air traffic  Overall improvement by a factor 6 Delay caused by passengers  No improvement 10% LIFT score OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 43. 43 PREDICT NUMBER OF PASSENGERS ON A PLANE Optimize catering OCTO TECHNOLOGY > THERE IS A BETTER WAY 43 t0 - 4 hours t0 Flight Number Booked Departure port … Departure hour 0777 152 PER … 14 1116 201 SYD … 9 0961 92 BNE … 6 0538 189 MEL … 12 1078 136 SYD … 23 Final Number of passengers 164 186 125 189 87 t ? ~ 50 explanatory variables X y t0 - 1 hour
  • 44. 44 RESULTS OCTO TECHNOLOGY > THERE IS A BETTER WAY Passenger difference No model Model < 5 55% 69% < 10 80% 89% $1-2M per year
  • 45. 45 UNSTRUCTURED DATA OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 46.
  • 47. 47 1
  • 48. 48 WHAT ARE THE FEATURES? mimagesfortraining n features X … 6 … Y
  • 49. 49 WHAT ARE THE FEATURES? 5 4 3 2 1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 0 0 0 4 5 5 5 5 4 1 4 4 1 0 1 5 4 5 1 0 0 0 1 5 1 5 0 0 0 0 0 5 4 4 0 0 0 0 0 2 5 2 0 0 0 0 0 0 0 0
  • 50. 50 WHAT ARE THE FEATURES? 0 0 0 0 0 0 0 0 0 0 0 1 0 1 0 0 0 4 5 5 5 5 4 1 4 4 1 0 1 5 4 5 1 0 0 0 1 5 1 5 0 0 0 0 0 5 4 4 0 0 0 0 0 2 5 2 0 0 0 0 0 0 0 0 = 6 (…) 6 6 3 … 0 7 n features mimagesfortraining X Y
  • 51. 51 NEURAL NETWORK OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 52. CAN COMPUTER VISION SPOT DISTRACTED DRIVERS?  24 Juin 2016 – Julien Krywyk OCTO TECHNOLOGY > THERE IS A BETTER WAY 52 Phone right Safe Text right Phone left Text left Speaking Makeup Behind Drink Radio
  • 53. OCTO TECHNOLOGY > THERE IS A BETTER WAY 53 Build classifier Train 22K images Test 80K images Predicted classes X Y Make predictions ?
  • 54. DEEP LEARNING OCTO TECHNOLOGY > THERE IS A BETTER WAY 54 Identify pixels Identify edges and simple shape Identify complex shapes and object Identify which shape to be used to define a human face
  • 55. DEEP LEARNING Transfer learning OCTO TECHNOLOGY > THERE IS A BETTER WAY 55 n features X Y Features extractions pre-trained CNN
  • 56. DATA SCIENCE TONIGHT OCTO TECHNOLOGY > THERE IS A BETTER WAY 56 Visualization 1 2 3 4 Why the buzz about data science? Demystifying machine learning Data science in your business
  • 57. 57 VISUALIZATION OCTO TECHNOLOGY > THERE IS A BETTER WAY Understand Communicate results & analysis
  • 58. 58 1880: TEXTILE PRODUCTION IN ENGLAND (OTTO NEURATH, ~1920) Changing the world by educating people about the world around them OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 59. 59 NAPOLEON 1812 CAMPAIGN (CHARLES MINARD, 1869) OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 60. 60 HOW TRUMP PUSHED THE ELECTION MAP TO THE RIGHT (NEW YORK TIMES) OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 61. 61 VISUALIZATION TO GET ACQUAINTED WITH DATA OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 62. EXPLORATION: FLIGHT DELAY PER MONTH AND DAY OF WEEK
  • 63. 63 DATA VISUALIZATION Correlation between ‘Departure Hour’ and passenger delta OCTO TECHNOLOGY > THERE IS A BETTER WAY 63
  • 64. 64 NOTEBOOKS Interactive data analysis OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 65. 65 VISUALIZATION AS A GAME CHANGER OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 66. 66 VALIDATION OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 68.
  • 69. DATA SCIENCE TONIGHT OCTO TECHNOLOGY > THERE IS A BETTER WAY 69 Visualization 1 2 3 4 Why the buzz about data science? Demystifying machine learning Data science in your business
  • 70. 70 I WANT A DATA SCIENTIST! OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 71. 71OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 72. 72 AGILE DATA SCIENCE OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 73. Agile Data science Feature Team Operations Business analyst Developper tech expertProject Manager Data scientist Architect Individuals and interactions over processes and tools Working software over comprehensive documentation Customer collaboration over contract negotiation Responding to change over following a plan That is, while there is value in the items on the right, we value the items on the left more
  • 74. OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 75. BUILDING A DATALAB OCTO TECHNOLOGY > THERE IS A BETTER WAY 75 Source System Collect, storage et data preparation Analysis delivery External sources Datalab Existing infrastructure (multiples sources) ETL Extract cleanup, transfor m load Staging area Datawarehouse technical layer (referential/ Operation) Datamart technique (zone de collecte) Datamart (management, marketing, sales User access (Reporting, Analytics) Batch • Analyses • Indicators • Statistics Online • Dashboards • Reporting • Requests Administratio n • Admini • Validation
  • 76. DEVOPS – EMBRACING NEW KNOW HOW And new collaborations… Data Scientist • Innovates • With new technologies “What !? A unit test on my neural network??? OPS • Look after rationalization “What!? Your piece of Scala calls a Python library embedding C ???”
  • 77. OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 78. 78 DEMOCRATIZATION  cours OCTO TECHNOLOGY > THERE IS A BETTER WAY 1 million enrollments
  • 79. OCTO TECHNOLOGY > THERE IS A BETTER WAY
  • 80. 81 Business must be aware of opportunities to use algorithms BUSINESS & DATA SCIENCE OCTO TECHNOLOGY > THERE IS A BETTER WAY Data must be easily accessible Focus on lowest time to market possible
  • 81. USE CASES CLASSES AND THEIR BUSINESS VALUE OCTO TECHNOLOGY > THERE IS A BETTER WAY 82 The prediction is a support for decision Analyses support data understanding The prediction is the decision Business value
  • 82. OCTO TECHNOLOGY > THERE IS A BETTER WAY 83 ??? ???