SlideShare ist ein Scribd-Unternehmen logo
1 von 15
Downloaden Sie, um offline zu lesen
Motivation
• end-of-term projects, and even capstone experiences, can
become run-of-the-mill
• students seeking to please the instructor, sometimes
assuming instructor already knows how to do the analysis;
and so their input isn’t really important.
• depending on project, the best students might not get to
“shine”
• grades are important, but can discourage risk-taking
• there are few opportunities to wrestle with complex, real data
What is ASA DataFest?
• A celebration of data!
• A competition for teams of undergraduates to
find insight and meaning in a rich and complex
data set.
• A data hackathon
A typical DataFest
• Friday Night
• Meet the data
• Friday night through Sunday afternoon
• Work furiously. Eat.
• Talk to roving statisticians from industry 

and academics
• Sunday afternoon
• 5 minute presentations to judges
• Winners announced
Data
• 2011: Los Angeles Police Department Arrest Reports

Make a policy recommendation to reduce crime in Los Angeles.
• 2012: kiva.com lending data

What motivates people to lend money, and what factors are associated with paying loans?
• 2013: eHarmony dating data

What qualities do people look for in prospective dates?
• 2014: GridPoint energy consumption data

How can clients best save money and energy?
• 2015: edmunds.com

Detect insights into the process of car shopping to make shopping process easier for visitors.
• 2016: Ticketmaster

How can fans be better connected to the concerts they wish to attend?
Prizes
• Best Insight
• Best Visualization
• Best Use of External Data
Not StatsFest
• Not about statistical modeling
• No pre-defined “correct” outcome
• Many access points for students at different
levels
• Emphasis on data
• “fast” analysis
Why DataFest?
• Friendly competition brings out best
• “Group work” in a setting that actually requires
teamwork
• Access to complex data that isn’t available to
(most) classrooms
• Cultural indoctrination (the “secret sauce”?)
Choosing the data
• The data must have a personality!
• a spokeperson explains why the data are important
and what they hope to learn
• Many variables (p more important than n)
• Aim for about 1 GB
• Context is key: accessible, interesting, cool
• 5-6 months time working with data donor to prep data
“Secret Sauce”
• “To my mind, the crucial but unappreciated
methodology driving predictive modeling’s
succcess is…the Common Task Framework” 

— D. Donoho “50 Years of Data Science”
CTF Key Features
• Shared data
• A set of competitors
• Judges



In Donoho’s setting, the goal is prediction. But
more generally, DF encourages improvement
through shared information between communities.
community
Professionals

Undergrads
Faculty Grad students
Alumni
hosting your own
amstat.org/education/datafest
More
 https://www.causeweb.org/cause/ecots/ecots16/posters/d/3.
Video
Articles

Weitere ähnliche Inhalte

Was ist angesagt?

Research output in Irish H.E. academic libraries 2000-2015
Research output in Irish H.E. academic libraries 2000-2015 Research output in Irish H.E. academic libraries 2000-2015
Research output in Irish H.E. academic libraries 2000-2015 Terry O'Brien
 
Student research behavior — prototype application (at CIL)
Student research behavior — prototype application (at CIL)Student research behavior — prototype application (at CIL)
Student research behavior — prototype application (at CIL)danw421
 
"Research output in Irish H.E.academic libraries 2000-2015" Terry O’Brien & K...
"Research output in Irish H.E.academic libraries 2000-2015" Terry O’Brien & K..."Research output in Irish H.E.academic libraries 2000-2015" Terry O’Brien & K...
"Research output in Irish H.E.academic libraries 2000-2015" Terry O’Brien & K...CONUL Conference
 
Requirements for Learning Analytics
Requirements for Learning AnalyticsRequirements for Learning Analytics
Requirements for Learning AnalyticsTore Hoel
 
Clear expectations and habits of mind: a self-evaluation checklist for studen...
Clear expectations and habits of mind: a self-evaluation checklist for studen...Clear expectations and habits of mind: a self-evaluation checklist for studen...
Clear expectations and habits of mind: a self-evaluation checklist for studen...IL Group (CILIP Information Literacy Group)
 
Is there a statistically significant relationship between library resource ac...
Is there a statistically significant relationship between library resource ac...Is there a statistically significant relationship between library resource ac...
Is there a statistically significant relationship between library resource ac...northerncollaboration
 
i-Collaborate - A system for Building Callaborative Group Processes to Enhanc...
i-Collaborate - A system for Building Callaborative Group Processes to Enhanc...i-Collaborate - A system for Building Callaborative Group Processes to Enhanc...
i-Collaborate - A system for Building Callaborative Group Processes to Enhanc...CITE
 
Xiao Hu "Learning Analytics Initiatives"
Xiao Hu "Learning Analytics Initiatives"Xiao Hu "Learning Analytics Initiatives"
Xiao Hu "Learning Analytics Initiatives"CITE
 
Case Study: Increasing Access through OER Adoption
Case Study: Increasing Access through OER AdoptionCase Study: Increasing Access through OER Adoption
Case Study: Increasing Access through OER AdoptionJeremy Anderson
 
Job Embedded Presentation
Job Embedded PresentationJob Embedded Presentation
Job Embedded PresentationNina Franco
 
Data Inference
Data InferenceData Inference
Data InferenceL H
 
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...CITE
 
Carter ACSPRI July2016
Carter ACSPRI July2016Carter ACSPRI July2016
Carter ACSPRI July2016Jackie Carter
 
cause effect diagram of a sample problem
cause effect diagram of a sample problemcause effect diagram of a sample problem
cause effect diagram of a sample problemImran Sajol
 
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"CITE
 
College Board #DeliveringOpportunity Presentation - 3-5-14
College Board #DeliveringOpportunity Presentation - 3-5-14College Board #DeliveringOpportunity Presentation - 3-5-14
College Board #DeliveringOpportunity Presentation - 3-5-14CollegeBoardSM
 
Designing Effective Research Assignments
Designing Effective Research AssignmentsDesigning Effective Research Assignments
Designing Effective Research AssignmentsSuzanne Bernsten
 
Crafting a research agenda (in memes)
Crafting a research agenda (in memes) Crafting a research agenda (in memes)
Crafting a research agenda (in memes) George Veletsianos
 

Was ist angesagt? (20)

Research output in Irish H.E. academic libraries 2000-2015
Research output in Irish H.E. academic libraries 2000-2015 Research output in Irish H.E. academic libraries 2000-2015
Research output in Irish H.E. academic libraries 2000-2015
 
Strategic Metrics
Strategic MetricsStrategic Metrics
Strategic Metrics
 
Kyrillidou Session One NISO Training Assessment Practices
Kyrillidou Session One NISO Training Assessment PracticesKyrillidou Session One NISO Training Assessment Practices
Kyrillidou Session One NISO Training Assessment Practices
 
Student research behavior — prototype application (at CIL)
Student research behavior — prototype application (at CIL)Student research behavior — prototype application (at CIL)
Student research behavior — prototype application (at CIL)
 
"Research output in Irish H.E.academic libraries 2000-2015" Terry O’Brien & K...
"Research output in Irish H.E.academic libraries 2000-2015" Terry O’Brien & K..."Research output in Irish H.E.academic libraries 2000-2015" Terry O’Brien & K...
"Research output in Irish H.E.academic libraries 2000-2015" Terry O’Brien & K...
 
Requirements for Learning Analytics
Requirements for Learning AnalyticsRequirements for Learning Analytics
Requirements for Learning Analytics
 
Clear expectations and habits of mind: a self-evaluation checklist for studen...
Clear expectations and habits of mind: a self-evaluation checklist for studen...Clear expectations and habits of mind: a self-evaluation checklist for studen...
Clear expectations and habits of mind: a self-evaluation checklist for studen...
 
Is there a statistically significant relationship between library resource ac...
Is there a statistically significant relationship between library resource ac...Is there a statistically significant relationship between library resource ac...
Is there a statistically significant relationship between library resource ac...
 
i-Collaborate - A system for Building Callaborative Group Processes to Enhanc...
i-Collaborate - A system for Building Callaborative Group Processes to Enhanc...i-Collaborate - A system for Building Callaborative Group Processes to Enhanc...
i-Collaborate - A system for Building Callaborative Group Processes to Enhanc...
 
Xiao Hu "Learning Analytics Initiatives"
Xiao Hu "Learning Analytics Initiatives"Xiao Hu "Learning Analytics Initiatives"
Xiao Hu "Learning Analytics Initiatives"
 
Case Study: Increasing Access through OER Adoption
Case Study: Increasing Access through OER AdoptionCase Study: Increasing Access through OER Adoption
Case Study: Increasing Access through OER Adoption
 
Job Embedded Presentation
Job Embedded PresentationJob Embedded Presentation
Job Embedded Presentation
 
Data Inference
Data InferenceData Inference
Data Inference
 
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
 
Carter ACSPRI July2016
Carter ACSPRI July2016Carter ACSPRI July2016
Carter ACSPRI July2016
 
cause effect diagram of a sample problem
cause effect diagram of a sample problemcause effect diagram of a sample problem
cause effect diagram of a sample problem
 
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
 
College Board #DeliveringOpportunity Presentation - 3-5-14
College Board #DeliveringOpportunity Presentation - 3-5-14College Board #DeliveringOpportunity Presentation - 3-5-14
College Board #DeliveringOpportunity Presentation - 3-5-14
 
Designing Effective Research Assignments
Designing Effective Research AssignmentsDesigning Effective Research Assignments
Designing Effective Research Assignments
 
Crafting a research agenda (in memes)
Crafting a research agenda (in memes) Crafting a research agenda (in memes)
Crafting a research agenda (in memes)
 

Andere mochten auch

Customer churn using aml (002)
Customer churn using aml (002)Customer churn using aml (002)
Customer churn using aml (002)CCG
 
H2O World - Solving Customer Churn with Machine Learning - Julian Bharadwaj
H2O World - Solving Customer Churn with Machine Learning - Julian BharadwajH2O World - Solving Customer Churn with Machine Learning - Julian Bharadwaj
H2O World - Solving Customer Churn with Machine Learning - Julian BharadwajSri Ambati
 
Churn prediction data modeling
Churn prediction data modelingChurn prediction data modeling
Churn prediction data modelingPierre Gutierrez
 
11 commandments revised
11 commandments revised11 commandments revised
11 commandments revisedColleen Young
 
Digital Defense for Activists (and the rest of us)
Digital Defense for Activists (and the rest of us)Digital Defense for Activists (and the rest of us)
Digital Defense for Activists (and the rest of us)Michele Chubirka
 
Amazon Machine Learning Case Study: Predicting Customer Churn
Amazon Machine Learning Case Study: Predicting Customer ChurnAmazon Machine Learning Case Study: Predicting Customer Churn
Amazon Machine Learning Case Study: Predicting Customer ChurnAmazon Web Services
 
AWS re:Invent 2016: Predicting Customer Churn with Amazon Machine Learning (M...
AWS re:Invent 2016: Predicting Customer Churn with Amazon Machine Learning (M...AWS re:Invent 2016: Predicting Customer Churn with Amazon Machine Learning (M...
AWS re:Invent 2016: Predicting Customer Churn with Amazon Machine Learning (M...Amazon Web Services
 
The Good, The Bad and The Greys of Data Visualisation Design
The Good, The Bad and The Greys of Data Visualisation DesignThe Good, The Bad and The Greys of Data Visualisation Design
The Good, The Bad and The Greys of Data Visualisation DesignAndy Kirk
 
Beyond Churn Prediction : An Introduction to uplift modeling
Beyond Churn Prediction : An Introduction to uplift modelingBeyond Churn Prediction : An Introduction to uplift modeling
Beyond Churn Prediction : An Introduction to uplift modelingPierre Gutierrez
 
Collaborative Filtering with Spark
Collaborative Filtering with SparkCollaborative Filtering with Spark
Collaborative Filtering with SparkChris Johnson
 
Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyChris Johnson
 
Time Series Analysis and Mining with R
Time Series Analysis and Mining with RTime Series Analysis and Mining with R
Time Series Analysis and Mining with RYanchang Zhao
 

Andere mochten auch (13)

Customer churn using aml (002)
Customer churn using aml (002)Customer churn using aml (002)
Customer churn using aml (002)
 
H2O World - Solving Customer Churn with Machine Learning - Julian Bharadwaj
H2O World - Solving Customer Churn with Machine Learning - Julian BharadwajH2O World - Solving Customer Churn with Machine Learning - Julian Bharadwaj
H2O World - Solving Customer Churn with Machine Learning - Julian Bharadwaj
 
Churn prediction data modeling
Churn prediction data modelingChurn prediction data modeling
Churn prediction data modeling
 
11 commandments revised
11 commandments revised11 commandments revised
11 commandments revised
 
Digital Defense for Activists (and the rest of us)
Digital Defense for Activists (and the rest of us)Digital Defense for Activists (and the rest of us)
Digital Defense for Activists (and the rest of us)
 
Amazon Machine Learning Case Study: Predicting Customer Churn
Amazon Machine Learning Case Study: Predicting Customer ChurnAmazon Machine Learning Case Study: Predicting Customer Churn
Amazon Machine Learning Case Study: Predicting Customer Churn
 
AWS re:Invent 2016: Predicting Customer Churn with Amazon Machine Learning (M...
AWS re:Invent 2016: Predicting Customer Churn with Amazon Machine Learning (M...AWS re:Invent 2016: Predicting Customer Churn with Amazon Machine Learning (M...
AWS re:Invent 2016: Predicting Customer Churn with Amazon Machine Learning (M...
 
The Good, The Bad and The Greys of Data Visualisation Design
The Good, The Bad and The Greys of Data Visualisation DesignThe Good, The Bad and The Greys of Data Visualisation Design
The Good, The Bad and The Greys of Data Visualisation Design
 
Beyond Churn Prediction : An Introduction to uplift modeling
Beyond Churn Prediction : An Introduction to uplift modelingBeyond Churn Prediction : An Introduction to uplift modeling
Beyond Churn Prediction : An Introduction to uplift modeling
 
Collaborative Filtering with Spark
Collaborative Filtering with SparkCollaborative Filtering with Spark
Collaborative Filtering with Spark
 
Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at Spotify
 
Churn Predictive Modelling
Churn Predictive ModellingChurn Predictive Modelling
Churn Predictive Modelling
 
Time Series Analysis and Mining with R
Time Series Analysis and Mining with RTime Series Analysis and Mining with R
Time Series Analysis and Mining with R
 

Ähnlich wie Rob Gould - The ASA DataFest: Learning by Doing

Data to Insights with Gogo's Data Science Lead
Data to Insights with Gogo's Data Science LeadData to Insights with Gogo's Data Science Lead
Data to Insights with Gogo's Data Science LeadPromotable
 
Trendspotting: Helping you make sense of large information sources
Trendspotting: Helping you make sense of large information sourcesTrendspotting: Helping you make sense of large information sources
Trendspotting: Helping you make sense of large information sourcesMarieke Guy
 
Will Bigger and Better Data Help Deliver More Major Donors?
Will Bigger and Better Data Help Deliver More Major Donors?Will Bigger and Better Data Help Deliver More Major Donors?
Will Bigger and Better Data Help Deliver More Major Donors?Azadi Sheridan
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data ScienceThinkful
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Thinkful
 
Ola presentation to guide discussion includes personas
Ola presentation to guide discussion includes personasOla presentation to guide discussion includes personas
Ola presentation to guide discussion includes personasStephen Abram
 
How to Use Data for Product Success with Jet.com Data Manager
How to Use Data for Product Success with Jet.com Data ManagerHow to Use Data for Product Success with Jet.com Data Manager
How to Use Data for Product Success with Jet.com Data ManagerProduct School
 
Wagner Analytics Bb World2012
Wagner Analytics Bb World2012Wagner Analytics Bb World2012
Wagner Analytics Bb World2012Ellen Wagner
 
Gathering Feedback in a Hybrid Environment
Gathering Feedback in a Hybrid EnvironmentGathering Feedback in a Hybrid Environment
Gathering Feedback in a Hybrid EnvironmentTechSoupConnectLondo
 
Foundational Strategies for Trust in Big Data Part 2: Understanding Your Data
Foundational Strategies for Trust in Big Data Part 2: Understanding Your DataFoundational Strategies for Trust in Big Data Part 2: Understanding Your Data
Foundational Strategies for Trust in Big Data Part 2: Understanding Your DataPrecisely
 
Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Thinkful
 
UX Burlington 2017: Exploratory Research in UX Design
UX Burlington 2017: Exploratory Research in UX DesignUX Burlington 2017: Exploratory Research in UX Design
UX Burlington 2017: Exploratory Research in UX DesignSarah Fathallah
 
Big Data for Small Businesses
Big Data for Small BusinessesBig Data for Small Businesses
Big Data for Small BusinessesVivastream
 
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopUsing Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopLynn Connaway
 
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopUsing Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopOCLC
 
Data Collection for Research Based Organizations to Aid Research!
Data Collection for Research Based Organizations to Aid Research!Data Collection for Research Based Organizations to Aid Research!
Data Collection for Research Based Organizations to Aid Research!NTEN
 
Case Study: "Making Sense of Data at Any Size"
Case Study: "Making Sense of Data at Any Size"Case Study: "Making Sense of Data at Any Size"
Case Study: "Making Sense of Data at Any Size"iMedia Connection
 

Ähnlich wie Rob Gould - The ASA DataFest: Learning by Doing (20)

Data to Insights with Gogo's Data Science Lead
Data to Insights with Gogo's Data Science LeadData to Insights with Gogo's Data Science Lead
Data to Insights with Gogo's Data Science Lead
 
Trendspotting: Helping you make sense of large information sources
Trendspotting: Helping you make sense of large information sourcesTrendspotting: Helping you make sense of large information sources
Trendspotting: Helping you make sense of large information sources
 
Will Bigger and Better Data Help Deliver More Major Donors?
Will Bigger and Better Data Help Deliver More Major Donors?Will Bigger and Better Data Help Deliver More Major Donors?
Will Bigger and Better Data Help Deliver More Major Donors?
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
 
Ola presentation to guide discussion includes personas
Ola presentation to guide discussion includes personasOla presentation to guide discussion includes personas
Ola presentation to guide discussion includes personas
 
How to Use Data for Product Success with Jet.com Data Manager
How to Use Data for Product Success with Jet.com Data ManagerHow to Use Data for Product Success with Jet.com Data Manager
How to Use Data for Product Success with Jet.com Data Manager
 
Wagner Analytics Bb World2012
Wagner Analytics Bb World2012Wagner Analytics Bb World2012
Wagner Analytics Bb World2012
 
Gathering Feedback in a Hybrid Environment
Gathering Feedback in a Hybrid EnvironmentGathering Feedback in a Hybrid Environment
Gathering Feedback in a Hybrid Environment
 
Digital Economics
Digital EconomicsDigital Economics
Digital Economics
 
Digital Economics
Digital EconomicsDigital Economics
Digital Economics
 
Foundational Strategies for Trust in Big Data Part 2: Understanding Your Data
Foundational Strategies for Trust in Big Data Part 2: Understanding Your DataFoundational Strategies for Trust in Big Data Part 2: Understanding Your Data
Foundational Strategies for Trust in Big Data Part 2: Understanding Your Data
 
Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)
 
UX Burlington 2017: Exploratory Research in UX Design
UX Burlington 2017: Exploratory Research in UX DesignUX Burlington 2017: Exploratory Research in UX Design
UX Burlington 2017: Exploratory Research in UX Design
 
Big Data for Small Businesses
Big Data for Small BusinessesBig Data for Small Businesses
Big Data for Small Businesses
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopUsing Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
 
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopUsing Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
 
Data Collection for Research Based Organizations to Aid Research!
Data Collection for Research Based Organizations to Aid Research!Data Collection for Research Based Organizations to Aid Research!
Data Collection for Research Based Organizations to Aid Research!
 
Case Study: "Making Sense of Data at Any Size"
Case Study: "Making Sense of Data at Any Size"Case Study: "Making Sense of Data at Any Size"
Case Study: "Making Sense of Data at Any Size"
 

Kürzlich hochgeladen

Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...JojoEDelaCruz
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSMae Pangan
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Projectjordimapav
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationRosabel UA
 
Dust Of Snow By Robert Frost Class-X English CBSE
Dust Of Snow By Robert Frost Class-X English CBSEDust Of Snow By Robert Frost Class-X English CBSE
Dust Of Snow By Robert Frost Class-X English CBSEaurabinda banchhor
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
EMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxEMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxElton John Embodo
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataBabyAnnMotar
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 

Kürzlich hochgeladen (20)

Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHS
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Project
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translation
 
Dust Of Snow By Robert Frost Class-X English CBSE
Dust Of Snow By Robert Frost Class-X English CBSEDust Of Snow By Robert Frost Class-X English CBSE
Dust Of Snow By Robert Frost Class-X English CBSE
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
EMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxEMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docx
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped data
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 

Rob Gould - The ASA DataFest: Learning by Doing

  • 1.
  • 2. Motivation • end-of-term projects, and even capstone experiences, can become run-of-the-mill • students seeking to please the instructor, sometimes assuming instructor already knows how to do the analysis; and so their input isn’t really important. • depending on project, the best students might not get to “shine” • grades are important, but can discourage risk-taking • there are few opportunities to wrestle with complex, real data
  • 3. What is ASA DataFest? • A celebration of data! • A competition for teams of undergraduates to find insight and meaning in a rich and complex data set. • A data hackathon
  • 4. A typical DataFest • Friday Night • Meet the data • Friday night through Sunday afternoon • Work furiously. Eat. • Talk to roving statisticians from industry 
 and academics • Sunday afternoon • 5 minute presentations to judges • Winners announced
  • 5. Data • 2011: Los Angeles Police Department Arrest Reports
 Make a policy recommendation to reduce crime in Los Angeles. • 2012: kiva.com lending data
 What motivates people to lend money, and what factors are associated with paying loans? • 2013: eHarmony dating data
 What qualities do people look for in prospective dates? • 2014: GridPoint energy consumption data
 How can clients best save money and energy? • 2015: edmunds.com
 Detect insights into the process of car shopping to make shopping process easier for visitors. • 2016: Ticketmaster
 How can fans be better connected to the concerts they wish to attend?
  • 6. Prizes • Best Insight • Best Visualization • Best Use of External Data
  • 7. Not StatsFest • Not about statistical modeling • No pre-defined “correct” outcome • Many access points for students at different levels • Emphasis on data • “fast” analysis
  • 8. Why DataFest? • Friendly competition brings out best • “Group work” in a setting that actually requires teamwork • Access to complex data that isn’t available to (most) classrooms • Cultural indoctrination (the “secret sauce”?)
  • 9. Choosing the data • The data must have a personality! • a spokeperson explains why the data are important and what they hope to learn • Many variables (p more important than n) • Aim for about 1 GB • Context is key: accessible, interesting, cool • 5-6 months time working with data donor to prep data
  • 10. “Secret Sauce” • “To my mind, the crucial but unappreciated methodology driving predictive modeling’s succcess is…the Common Task Framework” 
 — D. Donoho “50 Years of Data Science”
  • 11. CTF Key Features • Shared data • A set of competitors • Judges
 
 In Donoho’s setting, the goal is prediction. But more generally, DF encourages improvement through shared information between communities.
  • 14.