SlideShare a Scribd company logo
1 of 36
Download to read offline
1
How to use data science to affect
company change
Dr. Jonathan Roberts
Chief Innovation Officer, Dotdash
First convince people that science can
be applied to their problems
2
First convince people that science can
be applied to their problems
3
Physics is all about
extracting meaning from
messy time series data
First convince people that science can
be applied to their problems
4
Physics is all about
extracting meaning from
messy time series data
Turns out the internet has some
messy time series data too
Provide clear simple answers to
complex problems
5
Spend as much timing getting to the right
question, as getting to the right answer.
6
Spend as much timing getting to the right
question, as getting to the right answer.
Asking the right question is rarer
than being able to answer it
7
Don’t mix up machine learning and
human learning.
8
Don’t mix up machine learning and
human learning.
Interpretability matters. Never use a
neural net if a linear fit will work.
9
Keep results concise.
Provide the problem, the answer, and the
uncertainties.
10
Make every slide title the conclusion.
Your slides will be taken out of context.
11
The result? You get a lot of questions
• What makes people click on a link?
• Is there a ‘best’ length of a piece of content?
• Do twitter followers affect search traction?
• Is the concept of a ‘kitten’ more closely related the concept of a ‘cat’ or the concept
of ‘cuteness’?
12
The result? You get a lot of questions
• What makes people click on a link?
• Is there a ‘best’ length of a piece of content?
• Do twitter followers affect search traction?
• Is the concept of a ‘kitten’ more closely related the concept of a ‘cat’ or the concept
of ‘cuteness’?
What is About.com?
13
A Little History - and a Problem
• About is one year older than Google
14
A Little History - and a Problem
• About is one year older than Google
• It was sold by the New York Times in 2012
15
A Little History - and a Problem
• About is one year older than Google
• It was sold by the New York Times in 2012
• When IAC bought it, there were millions of pieces of content, covering over a thousand
topics, read by millions of people every day.
16
A Little History - and a Problem
• About is one year older than Google
• It was sold by the New York Times in 2012
• When IAC bought it, there were millions of pieces of content, covering over a thousand
topics, read by millions of people every day.
• We had 1000 writers, 200 full time staff, 10 editors, and 1 problem
17
A Little History - and a Problem
• About is one year older than Google
• It was sold by the New York Times in 2012
• When IAC bought it, there were millions of pieces of content, covering over a thousand
topics, read by millions of people every day.
• We had 1000 writers, 200 full time staff, 10 editors, and 1 problem:
What was About.com?
18
Understanding our company was a
big data problem.
1. Categorise all the content
2. Provide one clear simple plot
3. Use our content to understand our audience and tell data driven stories.
19
The interests of the internet
Since Jan 1, 2000
Don’t just provide a data point.
Tell a story
21
Are American’s more interested in
gymnastics during an Olympics?
22
Are American’s more interested in
gymnastics during an Olympics?
Or in football during the Superbowl?
23
Are American’s more interested in
gymnastics during an Olympics?
Or in football during the Superbowl?
It’s Gymnastics. For every Olympics in
the 21st Century.
24
“We don’t have a millennial audience”
25
“We don’t have a millennial audience”
Yes we do – we are representative of the
internet. And let me tell you about them.
26
Millennial women are 3x more interested
in going to Paris than non-millennial
women
27
Millennial women are 3x more interested
in going to Paris than non-millennial
women
Millennial guys are just as un-interested
in going to Paris as non-millennial guys.
28
Build a stable baseline, and look for
exceptions.
• We don’t need to use February 2017
to predict March 2017.
• We can use 20 years of Marches to
predict each day of March 2017.
• Every day we know whether the
country is behaving as expected.
29
On November 9th the world changed
• Health interest dropped 40% in one hour
• It did not return to normal for three days
• We see the same pattern after the Super Bowl, and after
holidays
The election gave the country a three day hangover
30
So when you listen to 20 years of
data, what does it tell you?
• One great recipe is better than 10 slight variations
We focused on deepening our best content
• Updating good content beats writing new articles
Our articles are now updated at least every six months
• Get out of the way of users, and answer the question they have right now
We prioritised page speed, clean simple design, and recommendations off the
user’s intent
• You can’t be all things to all people
About.com is the wrong product for today’s internet
31
We listened
32
In 2016 we broke up About.
33
• With a stable baseline, you can see
the world change:
• Through Q3, traffic in 2016 followed
the daily prediction within a few
percent
• We launched Lifewire on October
15th
• After a short (expected) dip – visits
went through the roof.
And it totally worked.
Lifewire traffic vs. Seasonal Prediction
All five mature launches are now top 10 sites, and by
far the fastest growing in their categories.
34
Conclusions
• Companies (and the internet) are big data problems.
• Executives aren’t used to seeing clear answers on complex systems.
• Spend as much time on the right question, as on the right answer.
• Don’t sacrifice human learning at the expense of machine learning.
• One simple plot is much more valuable than a deep dive into methodology.
• Find a way to tell a story: anyone can retell a story, very few can retell a research
paper.
35
thanks.Dr Jon Roberts | jroberts@dotdash.com
36

More Related Content

Similar to How to Use Data Science to Affect Company Change

Hord Tipton Keynote Address
Hord Tipton Keynote AddressHord Tipton Keynote Address
Hord Tipton Keynote AddressNathaniel Palmer
 
The Future of Information Architecture
The Future of Information ArchitectureThe Future of Information Architecture
The Future of Information ArchitectureChristina Wodtke
 
Teaching and learning in the 21st century
Teaching and learning in the 21st century Teaching and learning in the 21st century
Teaching and learning in the 21st century altamiraedu
 
Content Matters - My View on the Importance of Content Strategy
Content Matters - My View on the Importance of Content StrategyContent Matters - My View on the Importance of Content Strategy
Content Matters - My View on the Importance of Content StrategyCognizant Technology Solutions
 
Did You Know Iii Revised 777
Did You Know Iii Revised 777Did You Know Iii Revised 777
Did You Know Iii Revised 777Roy Zondernaam
 
5 Reasons Our Children Are About To Miss Out On The Greatest Opportunity In T...
5 Reasons Our Children Are About To Miss Out On The Greatest Opportunity In T...5 Reasons Our Children Are About To Miss Out On The Greatest Opportunity In T...
5 Reasons Our Children Are About To Miss Out On The Greatest Opportunity In T...iBridge Hub
 
CIL Stats Workshop April1 2022 Abram Silk.pdf
CIL Stats Workshop April1 2022 Abram Silk.pdfCIL Stats Workshop April1 2022 Abram Silk.pdf
CIL Stats Workshop April1 2022 Abram Silk.pdfStephen Abram
 
OSF x Highway Africa: New forms of storytelling, distribution and revenue
OSF x Highway Africa: New forms of storytelling, distribution and revenueOSF x Highway Africa: New forms of storytelling, distribution and revenue
OSF x Highway Africa: New forms of storytelling, distribution and revenueThe Splice Newsroom
 
Print, pixels & people 10a
Print, pixels & people 10aPrint, pixels & people 10a
Print, pixels & people 10aLogan Aimone
 
The New Future | 2014 Global Leadership Summit
The New Future | 2014 Global Leadership SummitThe New Future | 2014 Global Leadership Summit
The New Future | 2014 Global Leadership SummitLondon Business School
 
Targeting Millennials
Targeting MillennialsTargeting Millennials
Targeting MillennialsKeith Kaplan
 
I Was A Guest Lecturer at Yeditepe University MBA Program in Turkey
I Was A Guest Lecturer at Yeditepe University MBA Program in TurkeyI Was A Guest Lecturer at Yeditepe University MBA Program in Turkey
I Was A Guest Lecturer at Yeditepe University MBA Program in TurkeyFahri Karakas
 
Social Media: Efficient Tool or Wasteful Distraction?
Social Media: Efficient Tool or Wasteful Distraction?Social Media: Efficient Tool or Wasteful Distraction?
Social Media: Efficient Tool or Wasteful Distraction?David Mullings
 
Annual Summit for Digital Innovation in Education 2019
Annual Summit for Digital Innovation in Education 2019Annual Summit for Digital Innovation in Education 2019
Annual Summit for Digital Innovation in Education 2019Rosie Crompton
 
WWHD: What Would Harvey Do? An introduction to social media using the (hypoth...
WWHD: What Would Harvey Do? An introduction to social media using the (hypoth...WWHD: What Would Harvey Do? An introduction to social media using the (hypoth...
WWHD: What Would Harvey Do? An introduction to social media using the (hypoth...CanadaHelps / MyCharityConnects
 
The Week: Wrong questions, Right answers
The Week:  Wrong questions, Right answersThe Week:  Wrong questions, Right answers
The Week: Wrong questions, Right answerssifter3000
 

Similar to How to Use Data Science to Affect Company Change (20)

Hord Tipton Keynote Address
Hord Tipton Keynote AddressHord Tipton Keynote Address
Hord Tipton Keynote Address
 
The Future of Information Architecture
The Future of Information ArchitectureThe Future of Information Architecture
The Future of Information Architecture
 
Teaching and learning in the 21st century
Teaching and learning in the 21st century Teaching and learning in the 21st century
Teaching and learning in the 21st century
 
Content Matters - My View on the Importance of Content Strategy
Content Matters - My View on the Importance of Content StrategyContent Matters - My View on the Importance of Content Strategy
Content Matters - My View on the Importance of Content Strategy
 
Did You Know Iii Revised 777
Did You Know Iii Revised 777Did You Know Iii Revised 777
Did You Know Iii Revised 777
 
5 Reasons Our Children Are About To Miss Out On The Greatest Opportunity In T...
5 Reasons Our Children Are About To Miss Out On The Greatest Opportunity In T...5 Reasons Our Children Are About To Miss Out On The Greatest Opportunity In T...
5 Reasons Our Children Are About To Miss Out On The Greatest Opportunity In T...
 
CIL Stats Workshop April1 2022 Abram Silk.pdf
CIL Stats Workshop April1 2022 Abram Silk.pdfCIL Stats Workshop April1 2022 Abram Silk.pdf
CIL Stats Workshop April1 2022 Abram Silk.pdf
 
OSF x Highway Africa: New forms of storytelling, distribution and revenue
OSF x Highway Africa: New forms of storytelling, distribution and revenueOSF x Highway Africa: New forms of storytelling, distribution and revenue
OSF x Highway Africa: New forms of storytelling, distribution and revenue
 
Did You Know 4 Nov 2007
Did You Know 4 Nov 2007Did You Know 4 Nov 2007
Did You Know 4 Nov 2007
 
Print, pixels & people 10a
Print, pixels & people 10aPrint, pixels & people 10a
Print, pixels & people 10a
 
The New Future | 2014 Global Leadership Summit
The New Future | 2014 Global Leadership SummitThe New Future | 2014 Global Leadership Summit
The New Future | 2014 Global Leadership Summit
 
Targeting Millennials
Targeting MillennialsTargeting Millennials
Targeting Millennials
 
I Was A Guest Lecturer at Yeditepe University MBA Program in Turkey
I Was A Guest Lecturer at Yeditepe University MBA Program in TurkeyI Was A Guest Lecturer at Yeditepe University MBA Program in Turkey
I Was A Guest Lecturer at Yeditepe University MBA Program in Turkey
 
Social Media: Efficient Tool or Wasteful Distraction?
Social Media: Efficient Tool or Wasteful Distraction?Social Media: Efficient Tool or Wasteful Distraction?
Social Media: Efficient Tool or Wasteful Distraction?
 
Bmma digital push pull.pptx
Bmma digital push pull.pptxBmma digital push pull.pptx
Bmma digital push pull.pptx
 
Annual Summit for Digital Innovation in Education 2019
Annual Summit for Digital Innovation in Education 2019Annual Summit for Digital Innovation in Education 2019
Annual Summit for Digital Innovation in Education 2019
 
WWHD: What Would Harvey Do? An introduction to social media using the (hypoth...
WWHD: What Would Harvey Do? An introduction to social media using the (hypoth...WWHD: What Would Harvey Do? An introduction to social media using the (hypoth...
WWHD: What Would Harvey Do? An introduction to social media using the (hypoth...
 
2015 Arts Midwest Workshop: Embracing the Digital Age
2015 Arts Midwest Workshop: Embracing the Digital Age2015 Arts Midwest Workshop: Embracing the Digital Age
2015 Arts Midwest Workshop: Embracing the Digital Age
 
Solace se draft final
Solace se draft finalSolace se draft final
Solace se draft final
 
The Week: Wrong questions, Right answers
The Week:  Wrong questions, Right answersThe Week:  Wrong questions, Right answers
The Week: Wrong questions, Right answers
 

More from Domino Data Lab

What's in your workflow? Bringing data science workflows to business analysis...
What's in your workflow? Bringing data science workflows to business analysis...What's in your workflow? Bringing data science workflows to business analysis...
What's in your workflow? Bringing data science workflows to business analysis...Domino Data Lab
 
The Proliferation of New Database Technologies and Implications for Data Scie...
The Proliferation of New Database Technologies and Implications for Data Scie...The Proliferation of New Database Technologies and Implications for Data Scie...
The Proliferation of New Database Technologies and Implications for Data Scie...Domino Data Lab
 
Racial Bias in Policing: an analysis of Illinois traffic stops data
Racial Bias in Policing: an analysis of Illinois traffic stops dataRacial Bias in Policing: an analysis of Illinois traffic stops data
Racial Bias in Policing: an analysis of Illinois traffic stops dataDomino Data Lab
 
Data Quality Analytics: Understanding what is in your data, before using it
Data Quality Analytics: Understanding what is in your data, before using itData Quality Analytics: Understanding what is in your data, before using it
Data Quality Analytics: Understanding what is in your data, before using itDomino Data Lab
 
Supporting innovation in insurance with randomized experimentation
Supporting innovation in insurance with randomized experimentationSupporting innovation in insurance with randomized experimentation
Supporting innovation in insurance with randomized experimentationDomino Data Lab
 
Leveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive IndustryLeveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive IndustryDomino Data Lab
 
Summertime Analytics: Predicting E. coli and West Nile Virus
Summertime Analytics: Predicting E. coli and West Nile VirusSummertime Analytics: Predicting E. coli and West Nile Virus
Summertime Analytics: Predicting E. coli and West Nile VirusDomino Data Lab
 
Reproducible Dashboards and other great things to do with Jupyter
Reproducible Dashboards and other great things to do with JupyterReproducible Dashboards and other great things to do with Jupyter
Reproducible Dashboards and other great things to do with JupyterDomino Data Lab
 
GeoViz: A Canvas for Data Science
GeoViz: A Canvas for Data ScienceGeoViz: A Canvas for Data Science
GeoViz: A Canvas for Data ScienceDomino Data Lab
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Domino Data Lab
 
Doing your first Kaggle (Python for Big Data sets)
Doing your first Kaggle (Python for Big Data sets)Doing your first Kaggle (Python for Big Data sets)
Doing your first Kaggle (Python for Big Data sets)Domino Data Lab
 
Leveraged Analytics at Scale
Leveraged Analytics at ScaleLeveraged Analytics at Scale
Leveraged Analytics at ScaleDomino Data Lab
 
How I Learned to Stop Worrying and Love Linked Data
How I Learned to Stop Worrying and Love Linked DataHow I Learned to Stop Worrying and Love Linked Data
How I Learned to Stop Worrying and Love Linked DataDomino Data Lab
 
Software Engineering for Data Scientists
Software Engineering for Data ScientistsSoftware Engineering for Data Scientists
Software Engineering for Data ScientistsDomino Data Lab
 
Moving Data Science from an Event to A Program: Considerations in Creating Su...
Moving Data Science from an Event to A Program: Considerations in Creating Su...Moving Data Science from an Event to A Program: Considerations in Creating Su...
Moving Data Science from an Event to A Program: Considerations in Creating Su...Domino Data Lab
 
Building Data Analytics pipelines in the cloud using serverless technology
Building Data Analytics pipelines in the cloud using serverless technologyBuilding Data Analytics pipelines in the cloud using serverless technology
Building Data Analytics pipelines in the cloud using serverless technologyDomino Data Lab
 
Leveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science ToolsLeveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science ToolsDomino Data Lab
 
Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino Data Lab
 
The Role and Importance of Curiosity in Data Science
The Role and Importance of Curiosity in Data ScienceThe Role and Importance of Curiosity in Data Science
The Role and Importance of Curiosity in Data ScienceDomino Data Lab
 

More from Domino Data Lab (20)

What's in your workflow? Bringing data science workflows to business analysis...
What's in your workflow? Bringing data science workflows to business analysis...What's in your workflow? Bringing data science workflows to business analysis...
What's in your workflow? Bringing data science workflows to business analysis...
 
The Proliferation of New Database Technologies and Implications for Data Scie...
The Proliferation of New Database Technologies and Implications for Data Scie...The Proliferation of New Database Technologies and Implications for Data Scie...
The Proliferation of New Database Technologies and Implications for Data Scie...
 
Racial Bias in Policing: an analysis of Illinois traffic stops data
Racial Bias in Policing: an analysis of Illinois traffic stops dataRacial Bias in Policing: an analysis of Illinois traffic stops data
Racial Bias in Policing: an analysis of Illinois traffic stops data
 
Data Quality Analytics: Understanding what is in your data, before using it
Data Quality Analytics: Understanding what is in your data, before using itData Quality Analytics: Understanding what is in your data, before using it
Data Quality Analytics: Understanding what is in your data, before using it
 
Supporting innovation in insurance with randomized experimentation
Supporting innovation in insurance with randomized experimentationSupporting innovation in insurance with randomized experimentation
Supporting innovation in insurance with randomized experimentation
 
Leveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive IndustryLeveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive Industry
 
Summertime Analytics: Predicting E. coli and West Nile Virus
Summertime Analytics: Predicting E. coli and West Nile VirusSummertime Analytics: Predicting E. coli and West Nile Virus
Summertime Analytics: Predicting E. coli and West Nile Virus
 
Reproducible Dashboards and other great things to do with Jupyter
Reproducible Dashboards and other great things to do with JupyterReproducible Dashboards and other great things to do with Jupyter
Reproducible Dashboards and other great things to do with Jupyter
 
GeoViz: A Canvas for Data Science
GeoViz: A Canvas for Data ScienceGeoViz: A Canvas for Data Science
GeoViz: A Canvas for Data Science
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field
 
Doing your first Kaggle (Python for Big Data sets)
Doing your first Kaggle (Python for Big Data sets)Doing your first Kaggle (Python for Big Data sets)
Doing your first Kaggle (Python for Big Data sets)
 
Leveraged Analytics at Scale
Leveraged Analytics at ScaleLeveraged Analytics at Scale
Leveraged Analytics at Scale
 
How I Learned to Stop Worrying and Love Linked Data
How I Learned to Stop Worrying and Love Linked DataHow I Learned to Stop Worrying and Love Linked Data
How I Learned to Stop Worrying and Love Linked Data
 
Software Engineering for Data Scientists
Software Engineering for Data ScientistsSoftware Engineering for Data Scientists
Software Engineering for Data Scientists
 
Making Big Data Smart
Making Big Data SmartMaking Big Data Smart
Making Big Data Smart
 
Moving Data Science from an Event to A Program: Considerations in Creating Su...
Moving Data Science from an Event to A Program: Considerations in Creating Su...Moving Data Science from an Event to A Program: Considerations in Creating Su...
Moving Data Science from an Event to A Program: Considerations in Creating Su...
 
Building Data Analytics pipelines in the cloud using serverless technology
Building Data Analytics pipelines in the cloud using serverless technologyBuilding Data Analytics pipelines in the cloud using serverless technology
Building Data Analytics pipelines in the cloud using serverless technology
 
Leveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science ToolsLeveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science Tools
 
Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...
 
The Role and Importance of Curiosity in Data Science
The Role and Importance of Curiosity in Data ScienceThe Role and Importance of Curiosity in Data Science
The Role and Importance of Curiosity in Data Science
 

Recently uploaded

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 

Recently uploaded (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 

How to Use Data Science to Affect Company Change

  • 1. 1 How to use data science to affect company change Dr. Jonathan Roberts Chief Innovation Officer, Dotdash
  • 2. First convince people that science can be applied to their problems 2
  • 3. First convince people that science can be applied to their problems 3 Physics is all about extracting meaning from messy time series data
  • 4. First convince people that science can be applied to their problems 4 Physics is all about extracting meaning from messy time series data Turns out the internet has some messy time series data too
  • 5. Provide clear simple answers to complex problems 5
  • 6. Spend as much timing getting to the right question, as getting to the right answer. 6
  • 7. Spend as much timing getting to the right question, as getting to the right answer. Asking the right question is rarer than being able to answer it 7
  • 8. Don’t mix up machine learning and human learning. 8
  • 9. Don’t mix up machine learning and human learning. Interpretability matters. Never use a neural net if a linear fit will work. 9
  • 10. Keep results concise. Provide the problem, the answer, and the uncertainties. 10
  • 11. Make every slide title the conclusion. Your slides will be taken out of context. 11
  • 12. The result? You get a lot of questions • What makes people click on a link? • Is there a ‘best’ length of a piece of content? • Do twitter followers affect search traction? • Is the concept of a ‘kitten’ more closely related the concept of a ‘cat’ or the concept of ‘cuteness’? 12
  • 13. The result? You get a lot of questions • What makes people click on a link? • Is there a ‘best’ length of a piece of content? • Do twitter followers affect search traction? • Is the concept of a ‘kitten’ more closely related the concept of a ‘cat’ or the concept of ‘cuteness’? What is About.com? 13
  • 14. A Little History - and a Problem • About is one year older than Google 14
  • 15. A Little History - and a Problem • About is one year older than Google • It was sold by the New York Times in 2012 15
  • 16. A Little History - and a Problem • About is one year older than Google • It was sold by the New York Times in 2012 • When IAC bought it, there were millions of pieces of content, covering over a thousand topics, read by millions of people every day. 16
  • 17. A Little History - and a Problem • About is one year older than Google • It was sold by the New York Times in 2012 • When IAC bought it, there were millions of pieces of content, covering over a thousand topics, read by millions of people every day. • We had 1000 writers, 200 full time staff, 10 editors, and 1 problem 17
  • 18. A Little History - and a Problem • About is one year older than Google • It was sold by the New York Times in 2012 • When IAC bought it, there were millions of pieces of content, covering over a thousand topics, read by millions of people every day. • We had 1000 writers, 200 full time staff, 10 editors, and 1 problem: What was About.com? 18
  • 19. Understanding our company was a big data problem. 1. Categorise all the content 2. Provide one clear simple plot 3. Use our content to understand our audience and tell data driven stories. 19
  • 20. The interests of the internet Since Jan 1, 2000
  • 21. Don’t just provide a data point. Tell a story 21
  • 22. Are American’s more interested in gymnastics during an Olympics? 22
  • 23. Are American’s more interested in gymnastics during an Olympics? Or in football during the Superbowl? 23
  • 24. Are American’s more interested in gymnastics during an Olympics? Or in football during the Superbowl? It’s Gymnastics. For every Olympics in the 21st Century. 24
  • 25. “We don’t have a millennial audience” 25
  • 26. “We don’t have a millennial audience” Yes we do – we are representative of the internet. And let me tell you about them. 26
  • 27. Millennial women are 3x more interested in going to Paris than non-millennial women 27
  • 28. Millennial women are 3x more interested in going to Paris than non-millennial women Millennial guys are just as un-interested in going to Paris as non-millennial guys. 28
  • 29. Build a stable baseline, and look for exceptions. • We don’t need to use February 2017 to predict March 2017. • We can use 20 years of Marches to predict each day of March 2017. • Every day we know whether the country is behaving as expected. 29
  • 30. On November 9th the world changed • Health interest dropped 40% in one hour • It did not return to normal for three days • We see the same pattern after the Super Bowl, and after holidays The election gave the country a three day hangover 30
  • 31. So when you listen to 20 years of data, what does it tell you? • One great recipe is better than 10 slight variations We focused on deepening our best content • Updating good content beats writing new articles Our articles are now updated at least every six months • Get out of the way of users, and answer the question they have right now We prioritised page speed, clean simple design, and recommendations off the user’s intent • You can’t be all things to all people About.com is the wrong product for today’s internet 31
  • 32. We listened 32 In 2016 we broke up About.
  • 33. 33 • With a stable baseline, you can see the world change: • Through Q3, traffic in 2016 followed the daily prediction within a few percent • We launched Lifewire on October 15th • After a short (expected) dip – visits went through the roof. And it totally worked. Lifewire traffic vs. Seasonal Prediction
  • 34. All five mature launches are now top 10 sites, and by far the fastest growing in their categories. 34
  • 35. Conclusions • Companies (and the internet) are big data problems. • Executives aren’t used to seeing clear answers on complex systems. • Spend as much time on the right question, as on the right answer. • Don’t sacrifice human learning at the expense of machine learning. • One simple plot is much more valuable than a deep dive into methodology. • Find a way to tell a story: anyone can retell a story, very few can retell a research paper. 35
  • 36. thanks.Dr Jon Roberts | jroberts@dotdash.com 36