ARE YOU BETTER THAN A COIN TOSS?
BY JOHN OLIVER AND RICHARD WARBURTON
WHO ARE WE?
Why you should care
The Fundamentals
Practical Problems
Applying the Theory
"EXPERTS" AREN'T VERY GOOD
BIG DATA SOLVES ALL KNOWN PROBLEMS
BIG DATA ... HELPS
VALIDATION = TESTS FOR DATA
PART 1: FUNDAMENTALS
NULL HYPOTHESIS
Until proven otherwise there is no relationship between phenomena
WHEN YOU HEAR "WOLF!" THERE IS A WOLF NEARBY
                        Cry "Wolf!"      Stay Quiet
Wolf Nearby             Ok               False Negative
It's really a chicken!  False Positive   Ok
WHY IS THIS IMPORTANT?
It is better that ten guilty persons escape than
that one innocent suffer
- William Blackstone
STATIC ANALYSIS
COST BENEFIT ANALYSIS
Costs a lot to jail an innocent man
Costs very little to show someone an inappropriate house
Credibility, Liberty, Morality are also costs
CHOOSE THE RIGHT MEASUREMENT
There's more than one concept of accuracy
RECALL
number of true positives / number of actually true values
PRECISION
number of true positives / number of predicted true values
F MEASURE
CASE STUDY: MEMORY LEAKS
About ~10% of our dataset had memory leaks
Predict "never leaks memory" ~= 0.9 accuracy, but F1 = 0
Our algorithm ~= 0.9 accuracy and F1 ~= 0.9
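The memory leak numbers can be reproduced with a small sketch. It assumes a made-up dataset of 100 programs, 10 of which leak, and uses the usual F1 definition (the harmonic mean of precision and recall). The function names and the data are illustrative, not taken from the talk.

```python
def confusion_counts(actual, predicted):
    """Count true/false positives and negatives for boolean labels."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a and p)
    fp = sum(1 for a, p in pairs if not a and p)
    fn = sum(1 for a, p in pairs if a and not p)
    tn = sum(1 for a, p in pairs if not a and not p)
    return tp, fp, fn, tn


def scores(actual, predicted):
    """Return (accuracy, precision, recall, f1); undefined ratios become 0."""
    tp, fp, fn, tn = confusion_counts(actual, predicted)
    accuracy = (tp + tn) / len(actual)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return accuracy, precision, recall, f1


# Illustrative data: 10% of the programs leak memory.
actual = [True] * 10 + [False] * 90
never_leaks = [False] * 100  # the lazy "never leaks memory" predictor

baseline = scores(actual, never_leaks)
print(baseline)  # accuracy is 0.9, yet precision, recall and F1 are all 0.0
```

The lazy predictor never finds a single leak, which accuracy hides and F1 exposes.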
PROBLEM: RELIABILITY OF MEASUREMENT
RULE OF THUMB
If it looks like random noise, it probably is random noise.
SOLUTION: CHECK YOUR DATA
Low Standard Deviation
Coefficient of Variation = Standard Deviation / Mean
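As a rough sketch of the formula above, the coefficient of variation can be computed with Python's statistics module. The sample standard deviation is an assumption here (the slide does not say sample or population), and the two measurement series are invented for illustration.

```python
import statistics


def coefficient_of_variation(samples):
    """Standard deviation divided by the mean (sample standard deviation)."""
    mean = statistics.mean(samples)
    if mean == 0:
        raise ValueError("coefficient of variation is undefined for a zero mean")
    return statistics.stdev(samples) / mean


# Invented measurements: one steady benchmark, one noisy benchmark.
steady = [100.0, 101.0, 99.0, 100.0, 100.0]
noisy = [100.0, 40.0, 180.0, 75.0, 105.0]

print(coefficient_of_variation(steady))
print(coefficient_of_variation(noisy))
```

Because the value is relative to the mean, it allows measurements on different scales to be compared.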
CAVEAT: NON-NORMAL DISTRIBUTIONS
SOLUTION: GO MAD
MEDIAN ABSOLUTE DEVIATION
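A minimal sketch of the median absolute deviation: take the median, then the median of the absolute distances from it. The latency list is invented, with one extreme outlier to show how little the MAD moves compared with the standard deviation.

```python
import statistics


def median_absolute_deviation(samples):
    """Median of the absolute deviations from the median."""
    centre = statistics.median(samples)
    return statistics.median([abs(x - centre) for x in samples])


# Invented latencies with a single extreme outlier.
latencies = [10, 11, 9, 10, 12, 10, 500]

mad = median_absolute_deviation(latencies)
spread = statistics.stdev(latencies)
print(mad, spread)
```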
PROBLEM: EXPERIMENTAL FLUKES
IS YOUR A/B TEST A HEISEN TEST?
SOLUTION: P-VALUE
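One way to get a p-value for an A/B test without distributional assumptions is a permutation test. The slides do not say which test the authors used, so this is a sketch of one common option. The conversion data, the function name and the number of rounds are all illustrative.

```python
import random


def mean(values):
    return sum(values) / len(values)


def permutation_p_value(a, b, rounds=5000, seed=0):
    """Estimate how often a random relabelling of the pooled data gives a
    difference in means at least as large as the observed one."""
    rng = random.Random(seed)
    observed = abs(mean(a) - mean(b))
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(rounds):
        rng.shuffle(pooled)
        shuffled_a, shuffled_b = pooled[:len(a)], pooled[len(a):]
        if abs(mean(shuffled_a) - mean(shuffled_b)) >= observed:
            hits += 1
    # Add one to both counts so the estimate is never exactly zero.
    return (hits + 1) / (rounds + 1)


# Invented conversions (1 = converted, 0 = did not convert).
variant_a = [1] * 50 + [0] * 50
variant_b = [1] * 80 + [0] * 20

p_value = permutation_p_value(variant_a, variant_b)
print(p_value)
```

A small p-value says the observed difference would be unusual if the null hypothesis held, that is, if the variant labels made no difference.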
SCIENCE WORKS - B****ES!
PART 2: PRACTICAL PROBLEMS
PROBLEM: FALSE PROPHETS
I'M AN EXPERT, LISTEN TO ME!
SOLUTION: ESTABLISH GOALS AND HYPOTHESIS THEN TEST
SOLUTIONS
PROBLEM: CODE QUALITY
The math works :-) the code does not :-(
@headinthebox
GROWTH IN A TIME OF DEBT
SOLUTION: SOFTWARE ENGINEERING PRACTICES
Everyone Lies
-House
SOLUTION: UNDERSTAND BIASES AND DESIGN AROUND THEM
Gay couples should have an equal right to get
married, not just to have civil partnerships
Populus: 65% vs 27%
Marriage should continue to be defined as a life-long
exclusive commitment between a man and a woman
Comres + Catholic Voices: 22% vs 70%
ACQUIESCENCE BIAS
Answer yes if there’s a positive connotation
REMOVAL OF PARTICULAR ADVERTISING AND SPONSORSHIP BANS
FOR: 1045
AGAINST: 731
ABSTAIN: 121
Motion Carried
MAINTAINING AN ETHICAL UNION BY REAFFIRMING ADVERTISING AND SPONSORSHIP BANS
FOR: 858
AGAINST: 755
ABSTAIN: 166
Motion Carried
SOLUTION: PHRASE QUESTIONS NEUTRALLY
And only have one question
SOCIAL DESIRABILITY
Poor people overestimate their income, rich people underestimate it.
SOLUTIONS
Anonymisation
Confidentiality
Randomized Response
Bogus Pipeline
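Of the techniques listed, randomized response is the easiest to show in code. The sketch below assumes one common design: each respondent privately flips a coin, answers truthfully on heads, and otherwise gives a random yes/no. No single answer reveals the truth, but the overall rate can still be recovered. The 30% true rate and the function names are invented for the simulation.

```python
import random


def randomized_answer(truth, rng):
    """Answer truthfully half the time, otherwise answer at random."""
    if rng.random() < 0.5:
        return truth
    return rng.random() < 0.5


def estimate_true_rate(answers):
    """Invert P(yes) = 0.5 * true_rate + 0.25 to recover the true rate."""
    observed_yes = sum(answers) / len(answers)
    return 2 * observed_yes - 0.5


# Simulation with an invented true rate of 30%.
rng = random.Random(42)
truths = [rng.random() < 0.3 for _ in range(50_000)]
answers = [randomized_answer(t, rng) for t in truths]

estimate = estimate_true_rate(answers)
print(estimate)
```

The price of the privacy is extra variance, so the design needs a larger sample than a direct question would.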
BIAS TOWARDS THE FIRST ANSWER OF A QUESTION
Make sure to randomise the order of answers
WHAT WILL THE NEXT CRISIS IN WASHINGTON BE?
Fight over the debt ceiling
Difficulty averting automatic cuts to the Pentagon
Failure to pass basic budget bills
All of the above
http://www.foxnews.com/politics/elections/2012/you-decide/what-will-next-crisis-washington-be
PROBLEM: CORRELATION DOESN’T IMPLY CAUSALITY
DATABASE AND NETWORK ACTIVITY CORRELATING
Performance Diagnosis: was actually a GC Problem.
SOLUTION: DOMAIN KNOWLEDGE
SOLUTIONS
Use domain knowledge - ask Pilots
Stratified sample sets
Measure outcomes - are planes surviving more?
BE RIGOROUS
PART 3: APPLYING THE THEORY
CORRELATION
A MEASURE OF THE STRENGTH OF DEPENDENCE BETWEEN TWO VARIABLES
PEARSON CORRELATION
Err... Just look it up
(Assumes linear relationship)
Range   Strength
<0.4    Weak/No Correlation
<0.7    Some Correlation
>0.7    Strong Correlation
CASE STUDY: PERFORMANCE PROBLEM WITH HIGH SYSTEM TIME
Hypothesis: caused by Disk I/O
Correlation Strength: 0.78453
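For readers who would rather not look it up, here is a minimal sketch of the Pearson coefficient together with the strength table from the slides. Two choices are assumptions, not the authors': the table is applied to the absolute value of r, and a value of exactly 0.7 counts as strong. The sample series are invented.

```python
import math


def pearson(xs, ys):
    """Pearson correlation: covariance divided by the product of spreads."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return covariance / (spread_x * spread_y)


def strength(r):
    """Map a coefficient onto the rule-of-thumb table (assumes |r|)."""
    magnitude = abs(r)
    if magnitude < 0.4:
        return "Weak/No Correlation"
    if magnitude < 0.7:
        return "Some Correlation"
    return "Strong Correlation"


# Invented, perfectly linear series.
r = pearson([1, 2, 3, 4], [2, 4, 6, 8])
print(r, strength(r))
print(strength(0.78453))  # the case study value falls in the strong band
```

As the slide notes, the coefficient assumes a linear relationship, so a low value does not rule out a non-linear dependence.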
MACHINE LEARNING
Application of statistics to learn a relationship
HOW MANY CLUSTERS?
WHERE'S THE ELBOW?
FITTING
SOLUTION: CROSS VALIDATION
CHOOSE CROSS VALIDATION DATA WISELY
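A small sketch of k-fold cross validation, using a deliberately overfitting model (1-nearest-neighbour, which memorises its training data) on invented noisy data. The model, the data and the helper names are assumptions chosen to keep the example dependency-free. The point is the gap between training error and held-out error.

```python
import random


def k_fold_splits(n, k, seed=0):
    """Yield (train_indices, held_out_indices) for k shuffled folds."""
    indices = list(range(n))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        train = [j for fold in folds[:i] + folds[i + 1:] for j in fold]
        yield train, folds[i]


def nearest_neighbour_predict(train_x, train_y, x):
    """Return the target of the closest training point."""
    best = min(range(len(train_x)), key=lambda i: abs(train_x[i] - x))
    return train_y[best]


def mean_squared_error(train_x, train_y, test_x, test_y):
    errors = [(nearest_neighbour_predict(train_x, train_y, x) - y) ** 2
              for x, y in zip(test_x, test_y)]
    return sum(errors) / len(errors)


# Invented data: a straight line plus Gaussian noise.
rng = random.Random(1)
xs = [i / 10 for i in range(50)]
ys = [2 * x + rng.gauss(0, 1) for x in xs]

# Scoring on the training data itself: the model has memorised every point.
training_error = mean_squared_error(xs, ys, xs, ys)

fold_errors = []
for train, held_out in k_fold_splits(len(xs), k=5):
    fold_errors.append(mean_squared_error(
        [xs[i] for i in train], [ys[i] for i in train],
        [xs[i] for i in held_out], [ys[i] for i in held_out]))
cv_error = sum(fold_errors) / len(fold_errors)

print(training_error, cv_error)
```

Shuffling before splitting is itself a choice: for time-ordered data a random split can leak future information into the training folds.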
SELF VALIDATING
Ensemble methods - Train lots of weak classifiers and merge
RANDOM FOREST AND BAGGING
Divide the data into bootstrap sets
Use the rest for calculating error
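The two steps above can be sketched as follows: draw a bootstrap sample with replacement, then score on the rows that were never drawn (the "out-of-bag" rows). This is a simplification. It averages each model's out-of-bag error, whereas a real random forest aggregates votes per row. The majority-class learner and the labels are invented stand-ins for a weak classifier.

```python
import random


def bootstrap_split(n, rng):
    """Sample n row indices with replacement; return (in_bag, out_of_bag)."""
    in_bag = [rng.randrange(n) for _ in range(n)]
    chosen = set(in_bag)
    out_of_bag = [i for i in range(n) if i not in chosen]
    return in_bag, out_of_bag


def out_of_bag_error(xs, ys, fit, predict, rounds=50, seed=0):
    """Average misclassification rate on each model's out-of-bag rows."""
    rng = random.Random(seed)
    errors = []
    for _ in range(rounds):
        in_bag, out_of_bag = bootstrap_split(len(xs), rng)
        if not out_of_bag:
            continue
        model = fit([xs[i] for i in in_bag], [ys[i] for i in in_bag])
        wrong = sum(1 for i in out_of_bag if predict(model, xs[i]) != ys[i])
        errors.append(wrong / len(out_of_bag))
    return sum(errors) / len(errors)


# Hypothetical weak learner: always predict the most common training label.
def fit_majority(train_x, train_y):
    return max(set(train_y), key=train_y.count)


def predict_majority(model, x):
    return model


# Invented labels: 70% class 0, 30% class 1.
xs = list(range(100))
ys = [0] * 70 + [1] * 30

oob_error = out_of_bag_error(xs, ys, fit_majority, predict_majority)
print(oob_error)
```

Roughly a third of the rows end up out of bag for each sample, so every model gets a held-out set without a separate split.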
LEARNING CURVES
HOW MUCH IS TOO MUCH?
MONITOR PRODUCTION DATA... IT CHANGES
Does it look like the same data that you learnt with?
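One crude way to ask that question in code is to measure how far the production mean has moved, in units of the training standard deviation. This is only a rough heuristic, not the authors' method. The 0.5 threshold, the function names and the feature values are all hypothetical.

```python
import statistics


def mean_shift_in_std(training, production):
    """Distance between the two means, in training standard deviations."""
    spread = statistics.stdev(training)
    if spread == 0:
        raise ValueError("training data has no spread to compare against")
    shift = abs(statistics.mean(production) - statistics.mean(training))
    return shift / spread


def looks_like_training(training, production, threshold=0.5):
    """Hypothetical alert rule: flag drift beyond `threshold` deviations."""
    return mean_shift_in_std(training, production) <= threshold


# Invented feature values.
training = [10, 12, 11, 9, 10, 11, 10, 12]
production_same = [11, 10, 10, 12, 9]
production_drifted = [20, 22, 19, 21]

print(looks_like_training(training, production_same))
print(looks_like_training(training, production_drifted))
```

A check on the mean alone misses changes in shape or spread, so in practice it would sit alongside plots of the full distributions.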
A/B TEST NEW SYSTEMS
Satisfaction/Profit/Traffic...
COMMON THREADS
Training set errors are misleading
Cross Validation, Production Monitored Values are the ones
that really matter
Visualise and compare these errors
CONCLUSION
Analytics are increasingly important
Wide variety of statistical and practical tips to get them right
Have fun and Best of luck!
@johno_oliver
@RichardWarburto
QUESTIONS?
http://insightfullogic.com
