SlideShare ist ein Scribd-Unternehmen logo
1 von 28
bite-sized lecture
       @neal_lathia
            october 7, 2011
my research:
urban data mining
over half of us live
in cities, by 2050 –
70% will
the oyster card
what tools can we design to help
travellers?
one example:
there is more to urban mobility
than just moving.
who are you?
where do you want to go?
how often?
how?
when?
how are you paying?
what route?
+ how do we travel? //
how do we spend?
+ do travellers make the
correct decisions? (no)
+ can we help them with
recommendations? (yes)
(%)          pay as you go purchases
                                     49.8         < 5 GBP
                                     24.2         5 – 10 GBP
                                     15.5         10 – 20 GBP
                                     (%)          travel card purchases
                                     70.8         7-day travel card
                                     15.8         1-month travel card
                                     11.6         7-day bus/tram pass

                           Purchase Behaviour
              30
                                                          Travel
              25                                          Cards
                                                          PAYG
              20
% Purchases




              15


              10


               5


               0
                   Mon   Tue   Wed    Thu   Fri     Sat      Sun
Purchase Geography                                Mobility Flow
45
                                                                                   Zone 1
40
                                          PAYG                                     Zone 2
                                          Travel Cards                             Zone 3
35
                                                                                   Zone 4
30                                                                                 Zone 5
                                                                                   Zone 6
25
                                                         arrive
20

15

10

 5                                                        depart
 0
     1   2   3       4    5    6      7       8     9
high regularity – in movement,
purchases
small increments, short terms
is this ideal?
luckily,
computers are good at
counting. let them do it.
idea:
compare what you bought to
what you could have bought
(was it cheaper?).
repeat 300,000 times.
results for this data:
£2.5 million overspend
using this sample to estimate the entire
city means we overspend by:

£200 million per year
by making the wrong decisions.
£200 million per year
by making the wrong decision?

not understanding how we will
need public transport (but..)
failing to match fares with our
needs (but...)
pop quiz:
who has bought something on
amazon?
so you know what a
recommender system is?
recommender system:
data + machine learning for
personalised results
we tested
recommender systems for oyster
purchases, which were 74-98% accurate.
                          Accuracy (%)                     Savings (GBP)
              Dataset 1         Dataset 2      Dataset 1          Dataset 2
Baseline           74.99             76.91       326,447.95         306,145.85
Naïve Bayes        77.46             80.71       393,585.81         369,232.24
k-NN (5)           96.74             97.09       465,822.17         426,375.85
C4.5               98.01             98.29       473,918.38         434,082.81
Oracle             100                   100     479,583.91         438,923.30
bite-sized lecture
       @neal_lathia
            october 7, 2011
further reading:

N. Lathia, L. Capra. Mining Mobility Data to Minimise Travellers'
Spending on Public Transport. In ACM KDD 2011, San Diego, USA.

N. Lathia, J. Froehlich, L. Capra. Mining Public Transport Data for
Personalised Intelligent Transport Systems. In IEEE ICDM 2010,
Sydney, Australia.

N. Lathia and L. Capra. How Smart is Your Smart card? Measuring
Travel Behaviours, Perceptions, and Incentives. In ACM UbiComp
2011, Beijing, China.

Weitere ähnliche Inhalte

Andere mochten auch

Overjet/Overbite
Overjet/OverbiteOverjet/Overbite
Overjet/Overbite
Raj Kumar
 
class ii division 2 malocclusion
class ii division 2 malocclusionclass ii division 2 malocclusion
class ii division 2 malocclusion
Rami Aldori
 
Treatment of class ii malocclusions
Treatment of class ii malocclusionsTreatment of class ii malocclusions
Treatment of class ii malocclusions
Sapeedeh Afzal
 

Andere mochten auch (8)

Class 2 division 2 malocclusion /certified fixed orthodontic courses by India...
Class 2 division 2 malocclusion /certified fixed orthodontic courses by India...Class 2 division 2 malocclusion /certified fixed orthodontic courses by India...
Class 2 division 2 malocclusion /certified fixed orthodontic courses by India...
 
Deep bite.. /certified fixed orthodontic courses by Indian dental academy
Deep bite.. /certified fixed orthodontic courses by Indian dental academy Deep bite.. /certified fixed orthodontic courses by Indian dental academy
Deep bite.. /certified fixed orthodontic courses by Indian dental academy
 
Risk factors and risk assessment of periodontal disease.
Risk factors and risk assessment of periodontal disease.Risk factors and risk assessment of periodontal disease.
Risk factors and risk assessment of periodontal disease.
 
Overjet/Overbite
Overjet/OverbiteOverjet/Overbite
Overjet/Overbite
 
Risk factors
Risk factors Risk factors
Risk factors
 
class ii division 2 malocclusion
class ii division 2 malocclusionclass ii division 2 malocclusion
class ii division 2 malocclusion
 
Treatment of class ii malocclusions
Treatment of class ii malocclusionsTreatment of class ii malocclusions
Treatment of class ii malocclusions
 
Management of class ii division 1 malocclusion
Management of class ii division 1 malocclusionManagement of class ii division 1 malocclusion
Management of class ii division 1 malocclusion
 

Mehr von Neal Lathia

Using Smartphones to Research Daily Life
Using Smartphones to Research Daily LifeUsing Smartphones to Research Daily Life
Using Smartphones to Research Daily Life
Neal Lathia
 

Mehr von Neal Lathia (20)

Everything around the NLP (London.AI Feb 2021)
Everything around the NLP (London.AI Feb 2021)Everything around the NLP (London.AI Feb 2021)
Everything around the NLP (London.AI Feb 2021)
 
Using machine learning for customer service (Data Talks Club)
Using machine learning for customer service (Data Talks Club)Using machine learning for customer service (Data Talks Club)
Using machine learning for customer service (Data Talks Club)
 
Using language models to supercharge Monzo’s customer support
 Using language models to supercharge Monzo’s customer support Using language models to supercharge Monzo’s customer support
Using language models to supercharge Monzo’s customer support
 
Making Better Decisions Faster
Making Better Decisions FasterMaking Better Decisions Faster
Making Better Decisions Faster
 
Machine Learning, Faster
Machine Learning, FasterMachine Learning, Faster
Machine Learning, Faster
 
AI & Personalised Experiences
AI & Personalised ExperiencesAI & Personalised Experiences
AI & Personalised Experiences
 
Opportunities & Challenges in Personalised Travel
Opportunities & Challenges in Personalised TravelOpportunities & Challenges in Personalised Travel
Opportunities & Challenges in Personalised Travel
 
Bootstrapping a Destination Recommendation Engine
Bootstrapping a Destination Recommendation EngineBootstrapping a Destination Recommendation Engine
Bootstrapping a Destination Recommendation Engine
 
Machine Learning for Product Managers
Machine Learning for Product ManagersMachine Learning for Product Managers
Machine Learning for Product Managers
 
Mining Smartphone Data (with Python)
Mining Smartphone Data (with Python)Mining Smartphone Data (with Python)
Mining Smartphone Data (with Python)
 
Happier and Healthier with Smartphone Data
Happier and Healthier with Smartphone DataHappier and Healthier with Smartphone Data
Happier and Healthier with Smartphone Data
 
Data Science in Digital Health
Data Science in Digital HealthData Science in Digital Health
Data Science in Digital Health
 
Using Smartphones to Measure (and Intervene in) Daily Life
Using Smartphones to Measure (and Intervene in) Daily LifeUsing Smartphones to Measure (and Intervene in) Daily Life
Using Smartphones to Measure (and Intervene in) Daily Life
 
Analysing Daily Behaviours with Large-Scale Smartphone Data
Analysing Daily Behaviours with Large-Scale Smartphone DataAnalysing Daily Behaviours with Large-Scale Smartphone Data
Analysing Daily Behaviours with Large-Scale Smartphone Data
 
Cambridge Quantified Self Meetup
Cambridge Quantified Self MeetupCambridge Quantified Self Meetup
Cambridge Quantified Self Meetup
 
Data Science in #mHealth
Data Science in #mHealthData Science in #mHealth
Data Science in #mHealth
 
Tube Star: Crowd-Sourced Experiences on Public Transport
Tube Star: Crowd-Sourced Experiences on Public Transport Tube Star: Crowd-Sourced Experiences on Public Transport
Tube Star: Crowd-Sourced Experiences on Public Transport
 
Emotion Sense: From Design to Deployment
Emotion Sense: From Design to DeploymentEmotion Sense: From Design to Deployment
Emotion Sense: From Design to Deployment
 
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
 
Using Smartphones to Research Daily Life
Using Smartphones to Research Daily LifeUsing Smartphones to Research Daily Life
Using Smartphones to Research Daily Life
 

Kürzlich hochgeladen

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Kürzlich hochgeladen (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 

UCL Bite-Sized Lunch Lecture

  • 1. bite-sized lecture @neal_lathia october 7, 2011
  • 3. over half of us live in cities, by 2050 – 70% will
  • 4.
  • 5.
  • 6.
  • 7.
  • 9. what tools can we design to help travellers?
  • 10. one example: there is more to urban mobility than just moving.
  • 11.
  • 12. who are you? where do you want to go? how often? how? when? how are you paying? what route?
  • 13. + how do we travel? // how do we spend? + do travellers make the correct decisions? (no) + can we help them with recommendations? (yes)
  • 14. (%) pay as you go purchases 49.8 < 5 GBP 24.2 5 – 10 GBP 15.5 10 – 20 GBP (%) travel card purchases 70.8 7-day travel card 15.8 1-month travel card 11.6 7-day bus/tram pass Purchase Behaviour 30 Travel 25 Cards PAYG 20 % Purchases 15 10 5 0 Mon Tue Wed Thu Fri Sat Sun
  • 15. Purchase Geography Mobility Flow 45 Zone 1 40 PAYG Zone 2 Travel Cards Zone 3 35 Zone 4 30 Zone 5 Zone 6 25 arrive 20 15 10 5 depart 0 1 2 3 4 5 6 7 8 9
  • 16. high regularity – in movement, purchases small increments, short terms is this ideal?
  • 17. luckily, computers are good at counting. let them do it. idea: compare what you bought to what you could have bought (was it cheaper?). repeat 300,000 times.
  • 18. results for this data: £2.5 million overspend
  • 19. using this sample to estimate the entire city means we overspend by: £200 million per year by making the wrong decisions.
  • 20. £200 million per year by making the wrong decision? not understanding how we will need public transport (but..) failing to match fares with our needs (but...)
  • 21. pop quiz: who has bought something on amazon?
  • 22. so you know what a recommender system is?
  • 23. recommender system: data + machine learning for personalised results
  • 24. we tested recommender systems for oyster purchases, which were 74-98% accurate. Accuracy (%) Savings (GBP) Dataset 1 Dataset 2 Dataset 1 Dataset 2 Baseline 74.99 76.91 326,447.95 306,145.85 Naïve Bayes 77.46 80.71 393,585.81 369,232.24 k-NN (5) 96.74 97.09 465,822.17 426,375.85 C4.5 98.01 98.29 473,918.38 434,082.81 Oracle 100 100 479,583.91 438,923.30
  • 25.
  • 26.
  • 27. bite-sized lecture @neal_lathia october 7, 2011
  • 28. further reading: N. Lathia, L. Capra. Mining Mobility Data to Minimise Travellers' Spending on Public Transport. In ACM KDD 2011, San Diego, USA. N. Lathia, J. Froehlich, L. Capra. Mining Public Transport Data for Personalised Intelligent Transport Systems. In IEEE ICDM 2010, Sydney, Australia. N. Lathia and L. Capra. How Smart is Your Smart card? Measuring Travel Behaviours, Perceptions, and Incentives. In ACM UbiComp 2011, Beijing, China.