SlideShare ist ein Scribd-Unternehmen logo
1 von 17
Oxford Internet Institute Summer Doctoral Programme
iSchool, University of Toronto
July 2013
Big Data in the Social Sciences
Slides produced with:
Ralph Schroeder, Professor & MSc Programme Director
Dr Eric T. Meyer
Research Fellow & DPhil Programme Director
eric.meyer@oii.ox.ac.uk
http://www.oii.ox.ac.uk/people/?id=120
@etmeyer
Source: Leonard John Matthews, CC-BY-SA (http://www.flickr.com/photos/mythoto/3033590171)
Big data are data that are unprecedented in scale and scope
in relation to a given phenomenon.They are often streams
of data (rather than fixed datasets), accumulating large volumes,
often at high velocity.
Is the tail of the availability of big data and computational
methods wagging the dog of good research questions and
advancing social science?
If not, how do big data advance research?
What are the opportunities and challenges?
Source: http://www.flickr.com/photos/nakedcharlton/597075830/ Source: http://www.flickr.com/photos/jamescridland/613445810/
BusinessValue versus AcademicValue
Strategic Knowledge
• Generally time-limited (with exceptions)
• Value comes from knowing what your competitors don’t
• Often has high monetary value if it can be exploited
Durable Knowledge
• Less time-limited (with exceptions)
• Value comes from adding to the world’s knowledge (the global
brain is cumulative/scientific)
• Rarely has direct monetary value, but has value in terms of
creating the possibility both of future knowledge and of future
exploitation and commercial uses
BusinessValue versus AcademicValue
Marketing  Tailoring
Forecasting  Prediction
ComplexTrends  Linking datasets plus modelling
From Big Data to Big (Hi-res) Picture
Is there a risk that the (big data) tail will
wag the (research) dog?
Image source: Leonard John Matthews, CC-BY-SA
(http://www.flickr.com/photos/mythoto/3033590171)
?
?
?
?
?
?
?
?
?
“Surprisingly, the distribution of
types of search query did not vary
significantly across the different
Lifestyle Groups (p>0.01).”
Source: Waller, V. (2011). “Not Just Information:Who Searches for What on the Search Engine Google?” Journal of the American Society for Information Science &
Technology 62(4): 761-775.
Case 1: Search engine behaviour
Waller’s analysis ofAustralian Google Users
Key findings:
Mainly leisure
< 2% contemporary issues
No perceptible ‘class’ differences
Novel advance:
Unprecedented insight into what people search for
Challenge:
Replicability
Securing access to commercial data
J Michel et al. Science 2011;331:176-182
Case 2: Large-scale text analysis
Michel et al. ‘culturomic’ analysis of 5 Million Digitized Google
Books and Heuser & Le-Khac of 2779 19th Century British
Novels
Key findings:
Patterns of key terms
Industrialization tied to shift from abstract to concrete
words
Novel advance:
Replicability, extension to other areas, systematic
analysis of cultural materials
Challenge:
Data quality
Case 3:Twitter-bots
OII master’s students Alexander Furnas and Devin Gaffney saw a large spike in then-US
presidential candidate Mitt Romney’sTwitter followers, and decided to look at the new
followers:
Furnas, A. and Gaffney, D. (2012). ‘Statistical Probability That Mitt Romney's New Twitter Followers Are Just Normal Users: 0%’. The Atlantic, July
31, http://www.theatlantic.com/technology/archive/2012/07/statistical-probability-that-mitt-romneys-new-twitter-followers-are-just-normal-users-0/260539/ (accessed August 31, 2012).
Conclusions
Savage and Burrows?, who ask are commercial data outpacing
social science?
Boyd and Crawford?, who ask if big data raise ethical and
epistemological conundrums?
... Not entirely ...
The connection between research technologies and the
advance of knowledge
The threats and opportunities represented by unprecedented
windows into people’s minds and thoughts
Does this lead to more ‘scientific’ (i.e. cumulative) social
sciences and humanities?
See http://www.oii.ox.ac.uk/research/projects/?id=98
Additional readings and references
Bond, Robert et al. (2012). ‘A 61-million-person experiment in social influence and political
mobilization’, Nature 489: 295–298.
Bruns,A. and Liang,Y.E. (2012). ‘Tools and methods for capturingTwitter data during natural disasters’, First
Monday, 17 (4 – 2), http://firstmonday.org/htbin/cgiwrap/bin/ojs/index.php/fm/article/viewArticle/3937/3193
Furnas, A. and Gaffney, D. (2012). ‘Statistical ProbabilityThat Mitt Romney's NewTwitter Followers Are Just
Normal Users: 0%’. The Atlantic, July 31, http://www.theatlantic.com/technology/archive/2012/07/statistical-
probability-that-mitt-romneys-new-twitter-followers-are-just-normal-users-0/260539/ (accessed August
31, 2012).
Giles, J. (2012). ‘Making the Links: From E-mails to Social Networks, the DigitalTraces left Life in the
ModernWorld areTransforming Social Science’, Nature, 488: 448-50.
Kwak, H. et al. (2010). ‘What isTwitter, a Social Network or a News Media?’ Proceedings of the 19th
InternationalWorldWide Web (WWW) Conference, April 26-30, 2010, Raleigh NC.
Manyika, J. et al. (2011). ‘Big data: the next frontier for innovation, competition and productivity’, McKinsey
Global Institute, available at: http://www.mckinsey.com/insights/mgi/research/technology_and_innovation/
big_data_the_next_frontier_for_innovation (last accessed August 29, 2012).
Silver, Nate. (2012). The Signal and the Noise:The Art and Science of Prediction. London:Allen Lane.
Tancer, B. (2009). Click:What Millions of People are Doing Online andWhy It Matters. NewYork: Harper
Collins, 2009.
Wu, S. , J.M. Hofman,W.A. Mason, and D.J. Watts, (2011). ‘Who says what to whom on twitter’, Proceedings
of the 20th international conference onWorldWideWeb. (on DuncanWatts
webpage, http://research.microsoft.com/en-us/people/duncan/, last accessed August 29, 2012).
With support from:
Dr Eric T. Meyer
Research Fellow & DPhil Programme Director
eric.meyer@oii.ox.ac.uk
http://www.oii.ox.ac.uk/people/?id=120
@etmeyer

Weitere ähnliche Inhalte

Was ist angesagt?

Twitter Data Analytics
Twitter Data AnalyticsTwitter Data Analytics
Twitter Data Analytics
rupika08
 
Social Media Use by Canadian Academic Librarians
Social Media Use by Canadian Academic LibrariansSocial Media Use by Canadian Academic Librarians
Social Media Use by Canadian Academic Librarians
CARLsurvey2010
 
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Digital Methods Initiative
 

Was ist angesagt? (18)

Social Media for Researchers
Social Media for ResearchersSocial Media for Researchers
Social Media for Researchers
 
Rogers studyingpoliticalissues mar2014_optimized_ii_
Rogers studyingpoliticalissues mar2014_optimized_ii_Rogers studyingpoliticalissues mar2014_optimized_ii_
Rogers studyingpoliticalissues mar2014_optimized_ii_
 
2014 ATHS Summer
2014 ATHS Summer2014 ATHS Summer
2014 ATHS Summer
 
Big data in social sciences and IT developments (ethics considerations)
Big data in social sciences and IT developments (ethics considerations)Big data in social sciences and IT developments (ethics considerations)
Big data in social sciences and IT developments (ethics considerations)
 
빅데이터 시대의 미디어&커뮤니케이션 교육과 연구
빅데이터 시대의 미디어&커뮤니케이션 교육과 연구빅데이터 시대의 미디어&커뮤니케이션 교육과 연구
빅데이터 시대의 미디어&커뮤니케이션 교육과 연구
 
e-Research: A Social Informatics Perspective
e-Research: A Social Informatics Perspectivee-Research: A Social Informatics Perspective
e-Research: A Social Informatics Perspective
 
Twitter Data Analytics
Twitter Data AnalyticsTwitter Data Analytics
Twitter Data Analytics
 
Crim 4391 homeland security fall15
Crim 4391 homeland security fall15Crim 4391 homeland security fall15
Crim 4391 homeland security fall15
 
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
 
Social Media Use by Canadian Academic Librarians
Social Media Use by Canadian Academic LibrariansSocial Media Use by Canadian Academic Librarians
Social Media Use by Canadian Academic Librarians
 
Shall we use social media for our research?
Shall we use social media for our research?Shall we use social media for our research?
Shall we use social media for our research?
 
What Does Your Repository Do? Measuring and Calculating Impact
What Does Your Repository Do?  Measuring and Calculating ImpactWhat Does Your Repository Do?  Measuring and Calculating Impact
What Does Your Repository Do? Measuring and Calculating Impact
 
Digital Methods by Richard Rogers
Digital Methods by Richard RogersDigital Methods by Richard Rogers
Digital Methods by Richard Rogers
 
Yimei zhu poster for methods fair 2012
Yimei zhu poster for methods fair 2012Yimei zhu poster for methods fair 2012
Yimei zhu poster for methods fair 2012
 
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
 
Small Worlds Social Graphs Social Media
Small Worlds Social Graphs Social MediaSmall Worlds Social Graphs Social Media
Small Worlds Social Graphs Social Media
 
Scholarly Social Machines
Scholarly Social MachinesScholarly Social Machines
Scholarly Social Machines
 
Mike Thelwall: Introduction to Webometrics
Mike Thelwall: Introduction to WebometricsMike Thelwall: Introduction to Webometrics
Mike Thelwall: Introduction to Webometrics
 

Andere mochten auch

Andere mochten auch (7)

The Internet is Big Data: How internet research has changed our understandin...
The Internet is Big Data: How internet research has changed our understandin...The Internet is Big Data: How internet research has changed our understandin...
The Internet is Big Data: How internet research has changed our understandin...
 
The Internet, Science, and Transformations of Knowledge
The Internet, Science, and Transformations of KnowledgeThe Internet, Science, and Transformations of Knowledge
The Internet, Science, and Transformations of Knowledge
 
FIA Budapest - Meyer
FIA Budapest - MeyerFIA Budapest - Meyer
FIA Budapest - Meyer
 
Reinventing Research? Information Practices in the Humanites Information Prof...
Reinventing Research? Information Practices in the Humanites Information Prof...Reinventing Research? Information Practices in the Humanites Information Prof...
Reinventing Research? Information Practices in the Humanites Information Prof...
 
OII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
OII Summer Doctoral Programme 2010: Global brain by Meyer & SchroederOII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
OII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
 
Quantifying the impacts of investment in humanities archives
Quantifying the impacts of investment in humanities archivesQuantifying the impacts of investment in humanities archives
Quantifying the impacts of investment in humanities archives
 
The End(s) of e-Research
The End(s) of e-ResearchThe End(s) of e-Research
The End(s) of e-Research
 

Ähnlich wie Meyer Big Data SDP13

Altmetrics: Listening & Giving Voice to Ideas with Social Media Data
Altmetrics: Listening & Giving Voice to Ideas with Social Media DataAltmetrics: Listening & Giving Voice to Ideas with Social Media Data
Altmetrics: Listening & Giving Voice to Ideas with Social Media Data
Toronto Metropolitan University
 
Ralph schroeder and eric meyer
Ralph schroeder and eric meyerRalph schroeder and eric meyer
Ralph schroeder and eric meyer
oiisdp
 
Grounded theory meets big data: One way to marry ethnography and digital methods
Grounded theory meets big data: One way to marry ethnography and digital methodsGrounded theory meets big data: One way to marry ethnography and digital methods
Grounded theory meets big data: One way to marry ethnography and digital methods
Citizens in the Making
 
Respond to these two classmates’ posts. 1. After reading thi.docx
Respond to these two classmates’ posts. 1. After reading thi.docxRespond to these two classmates’ posts. 1. After reading thi.docx
Respond to these two classmates’ posts. 1. After reading thi.docx
daynamckernon
 
Respond to at least two of your classmates’ posts. 1. After .docx
Respond to at least two of your classmates’ posts. 1. After .docxRespond to at least two of your classmates’ posts. 1. After .docx
Respond to at least two of your classmates’ posts. 1. After .docx
daynamckernon
 
After reading this journal article regarding ethics of interne.docx
After reading this journal article regarding ethics of interne.docxAfter reading this journal article regarding ethics of interne.docx
After reading this journal article regarding ethics of interne.docx
rosiecabaniss
 
RUNNING HEAD BIG DATA IN SOCIAL MEDIA .docx
RUNNING HEAD BIG DATA IN SOCIAL MEDIA                            .docxRUNNING HEAD BIG DATA IN SOCIAL MEDIA                            .docx
RUNNING HEAD BIG DATA IN SOCIAL MEDIA .docx
cheryllwashburn
 

Ähnlich wie Meyer Big Data SDP13 (20)

Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science Knowledge
 
Challenges in-archiving-twitter
Challenges in-archiving-twitterChallenges in-archiving-twitter
Challenges in-archiving-twitter
 
Altmetrics: Listening & Giving Voice to Ideas with Social Media Data
Altmetrics: Listening & Giving Voice to Ideas with Social Media DataAltmetrics: Listening & Giving Voice to Ideas with Social Media Data
Altmetrics: Listening & Giving Voice to Ideas with Social Media Data
 
Research with Social Media Data: Stewardship & Ethical Considerations
Research with Social Media Data: Stewardship & Ethical ConsiderationsResearch with Social Media Data: Stewardship & Ethical Considerations
Research with Social Media Data: Stewardship & Ethical Considerations
 
Ralph schroeder and eric meyer
Ralph schroeder and eric meyerRalph schroeder and eric meyer
Ralph schroeder and eric meyer
 
Grounded theory meets big data: One way to marry ethnography and digital methods
Grounded theory meets big data: One way to marry ethnography and digital methodsGrounded theory meets big data: One way to marry ethnography and digital methods
Grounded theory meets big data: One way to marry ethnography and digital methods
 
Respond to these two classmates’ posts. 1. After reading thi.docx
Respond to these two classmates’ posts. 1. After reading thi.docxRespond to these two classmates’ posts. 1. After reading thi.docx
Respond to these two classmates’ posts. 1. After reading thi.docx
 
“Big data” in human services organisations: Practical problems and ethical di...
“Big data” in human services organisations: Practical problems and ethical di...“Big data” in human services organisations: Practical problems and ethical di...
“Big data” in human services organisations: Practical problems and ethical di...
 
Social Media Research Methods
Social Media Research MethodsSocial Media Research Methods
Social Media Research Methods
 
Respond to at least two of your classmates’ posts. 1. After .docx
Respond to at least two of your classmates’ posts. 1. After .docxRespond to at least two of your classmates’ posts. 1. After .docx
Respond to at least two of your classmates’ posts. 1. After .docx
 
Who are We Studying: Humans or Bots?
Who are We Studying: Humans or Bots? Who are We Studying: Humans or Bots?
Who are We Studying: Humans or Bots?
 
After reading this journal article regarding ethics of interne.docx
After reading this journal article regarding ethics of interne.docxAfter reading this journal article regarding ethics of interne.docx
After reading this journal article regarding ethics of interne.docx
 
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESBROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
 
The evolution of research on social media
The evolution of research on social mediaThe evolution of research on social media
The evolution of research on social media
 
Decomposing Social and Semantic Networks in Emerging “Big Data” Research
Decomposing Social and Semantic Networks in Emerging “Big Data” ResearchDecomposing Social and Semantic Networks in Emerging “Big Data” Research
Decomposing Social and Semantic Networks in Emerging “Big Data” Research
 
Easy Data, Hard Data? Twitter Research and the Politics of Data Access
Easy Data, Hard Data? Twitter Research and the Politics of Data AccessEasy Data, Hard Data? Twitter Research and the Politics of Data Access
Easy Data, Hard Data? Twitter Research and the Politics of Data Access
 
What do we get from Twitter - and what not?
What do we get from Twitter - and what not?What do we get from Twitter - and what not?
What do we get from Twitter - and what not?
 
RUNNING HEAD BIG DATA IN SOCIAL MEDIA .docx
RUNNING HEAD BIG DATA IN SOCIAL MEDIA                            .docxRUNNING HEAD BIG DATA IN SOCIAL MEDIA                            .docx
RUNNING HEAD BIG DATA IN SOCIAL MEDIA .docx
 
New Media, New Ethics - ICA 2012
New Media, New Ethics - ICA 2012New Media, New Ethics - ICA 2012
New Media, New Ethics - ICA 2012
 
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
 

Mehr von Eric Meyer

MLA 2013: Social Science Tools for understanding the impact of the digital hu...
MLA 2013: Social Science Tools for understanding the impact of the digital hu...MLA 2013: Social Science Tools for understanding the impact of the digital hu...
MLA 2013: Social Science Tools for understanding the impact of the digital hu...
Eric Meyer
 

Mehr von Eric Meyer (19)

Digital Research and Big Data: Is the Tail Wagging the Dog?
Digital Research and Big Data: Is the Tail Wagging the Dog?Digital Research and Big Data: Is the Tail Wagging the Dog?
Digital Research and Big Data: Is the Tail Wagging the Dog?
 
MLA 2013: Social Science Tools for understanding the impact of the digital hu...
MLA 2013: Social Science Tools for understanding the impact of the digital hu...MLA 2013: Social Science Tools for understanding the impact of the digital hu...
MLA 2013: Social Science Tools for understanding the impact of the digital hu...
 
tidsrdhoxss2012
tidsrdhoxss2012tidsrdhoxss2012
tidsrdhoxss2012
 
DTC-OII Ethnography Online 2011
DTC-OII Ethnography Online 2011DTC-OII Ethnography Online 2011
DTC-OII Ethnography Online 2011
 
JISC-WW1
JISC-WW1JISC-WW1
JISC-WW1
 
TIDSR-DHOx
TIDSR-DHOxTIDSR-DHOx
TIDSR-DHOx
 
i3 Conference Keynote, Aberdeen
i3 Conference Keynote, Aberdeeni3 Conference Keynote, Aberdeen
i3 Conference Keynote, Aberdeen
 
Reinventing Research? Information Practices in the Humanites Launch
Reinventing Research? Information Practices in the Humanites LaunchReinventing Research? Information Practices in the Humanites Launch
Reinventing Research? Information Practices in the Humanites Launch
 
Virtual Environments and the Future of Collaboration
Virtual Environments and the Future of CollaborationVirtual Environments and the Future of Collaboration
Virtual Environments and the Future of Collaboration
 
Scholarship in the Digital Age
Scholarship in the Digital AgeScholarship in the Digital Age
Scholarship in the Digital Age
 
Sharing ideas and sharing data: Researchers and Web 2.0
Sharing ideas and sharing data: Researchers and Web 2.0Sharing ideas and sharing data: Researchers and Web 2.0
Sharing ideas and sharing data: Researchers and Web 2.0
 
WWWoH
WWWoHWWWoH
WWWoH
 
TIDSR
TIDSRTIDSR
TIDSR
 
ASIS&T ProQuest Dissertation of the Year Award Presentation
ASIS&T ProQuest Dissertation of the Year Award PresentationASIS&T ProQuest Dissertation of the Year Award Presentation
ASIS&T ProQuest Dissertation of the Year Award Presentation
 
Moving from small science to big science: Social and organizational impedimen...
Moving from small science to big science: Social and organizational impedimen...Moving from small science to big science: Social and organizational impedimen...
Moving from small science to big science: Social and organizational impedimen...
 
e-Research 2.0: Taking the measure of Web 2.0 in e-Research
e-Research 2.0: Taking the measure of Web 2.0 in e-Researche-Research 2.0: Taking the measure of Web 2.0 in e-Research
e-Research 2.0: Taking the measure of Web 2.0 in e-Research
 
Talking 'bout a revolution: Framing e-Research as a computerization movement
Talking 'bout a revolution: Framing e-Research as a computerization movementTalking 'bout a revolution: Framing e-Research as a computerization movement
Talking 'bout a revolution: Framing e-Research as a computerization movement
 
Gauging the Impact of e-Research in the Social Sciences
Gauging the Impact of e-Research in the Social SciencesGauging the Impact of e-Research in the Social Sciences
Gauging the Impact of e-Research in the Social Sciences
 
e-Social Science as an Experience Technology: Distance From, and Attitudes To...
e-Social Science as an Experience Technology: Distance From, and Attitudes To...e-Social Science as an Experience Technology: Distance From, and Attitudes To...
e-Social Science as an Experience Technology: Distance From, and Attitudes To...
 

Kürzlich hochgeladen

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 

Meyer Big Data SDP13

  • 1. Oxford Internet Institute Summer Doctoral Programme iSchool, University of Toronto July 2013 Big Data in the Social Sciences Slides produced with: Ralph Schroeder, Professor & MSc Programme Director Dr Eric T. Meyer Research Fellow & DPhil Programme Director eric.meyer@oii.ox.ac.uk http://www.oii.ox.ac.uk/people/?id=120 @etmeyer
  • 2. Source: Leonard John Matthews, CC-BY-SA (http://www.flickr.com/photos/mythoto/3033590171)
  • 3. Big data are data that are unprecedented in scale and scope in relation to a given phenomenon.They are often streams of data (rather than fixed datasets), accumulating large volumes, often at high velocity. Is the tail of the availability of big data and computational methods wagging the dog of good research questions and advancing social science? If not, how do big data advance research? What are the opportunities and challenges?
  • 4. Source: http://www.flickr.com/photos/nakedcharlton/597075830/ Source: http://www.flickr.com/photos/jamescridland/613445810/
  • 5. BusinessValue versus AcademicValue Strategic Knowledge • Generally time-limited (with exceptions) • Value comes from knowing what your competitors don’t • Often has high monetary value if it can be exploited
  • 6. Durable Knowledge • Less time-limited (with exceptions) • Value comes from adding to the world’s knowledge (the global brain is cumulative/scientific) • Rarely has direct monetary value, but has value in terms of creating the possibility both of future knowledge and of future exploitation and commercial uses BusinessValue versus AcademicValue
  • 7. Marketing  Tailoring Forecasting  Prediction ComplexTrends  Linking datasets plus modelling From Big Data to Big (Hi-res) Picture
  • 8. Is there a risk that the (big data) tail will wag the (research) dog? Image source: Leonard John Matthews, CC-BY-SA (http://www.flickr.com/photos/mythoto/3033590171)
  • 9. ? ? ? ? ? ? ? ? ? “Surprisingly, the distribution of types of search query did not vary significantly across the different Lifestyle Groups (p>0.01).” Source: Waller, V. (2011). “Not Just Information:Who Searches for What on the Search Engine Google?” Journal of the American Society for Information Science & Technology 62(4): 761-775.
  • 10. Case 1: Search engine behaviour Waller’s analysis ofAustralian Google Users Key findings: Mainly leisure < 2% contemporary issues No perceptible ‘class’ differences Novel advance: Unprecedented insight into what people search for Challenge: Replicability Securing access to commercial data
  • 11. J Michel et al. Science 2011;331:176-182
  • 12. Case 2: Large-scale text analysis Michel et al. ‘culturomic’ analysis of 5 Million Digitized Google Books and Heuser & Le-Khac of 2779 19th Century British Novels Key findings: Patterns of key terms Industrialization tied to shift from abstract to concrete words Novel advance: Replicability, extension to other areas, systematic analysis of cultural materials Challenge: Data quality
  • 13. Case 3:Twitter-bots OII master’s students Alexander Furnas and Devin Gaffney saw a large spike in then-US presidential candidate Mitt Romney’sTwitter followers, and decided to look at the new followers: Furnas, A. and Gaffney, D. (2012). ‘Statistical Probability That Mitt Romney's New Twitter Followers Are Just Normal Users: 0%’. The Atlantic, July 31, http://www.theatlantic.com/technology/archive/2012/07/statistical-probability-that-mitt-romneys-new-twitter-followers-are-just-normal-users-0/260539/ (accessed August 31, 2012).
  • 14. Conclusions Savage and Burrows?, who ask are commercial data outpacing social science? Boyd and Crawford?, who ask if big data raise ethical and epistemological conundrums? ... Not entirely ... The connection between research technologies and the advance of knowledge The threats and opportunities represented by unprecedented windows into people’s minds and thoughts Does this lead to more ‘scientific’ (i.e. cumulative) social sciences and humanities?
  • 16. Additional readings and references Bond, Robert et al. (2012). ‘A 61-million-person experiment in social influence and political mobilization’, Nature 489: 295–298. Bruns,A. and Liang,Y.E. (2012). ‘Tools and methods for capturingTwitter data during natural disasters’, First Monday, 17 (4 – 2), http://firstmonday.org/htbin/cgiwrap/bin/ojs/index.php/fm/article/viewArticle/3937/3193 Furnas, A. and Gaffney, D. (2012). ‘Statistical ProbabilityThat Mitt Romney's NewTwitter Followers Are Just Normal Users: 0%’. The Atlantic, July 31, http://www.theatlantic.com/technology/archive/2012/07/statistical- probability-that-mitt-romneys-new-twitter-followers-are-just-normal-users-0/260539/ (accessed August 31, 2012). Giles, J. (2012). ‘Making the Links: From E-mails to Social Networks, the DigitalTraces left Life in the ModernWorld areTransforming Social Science’, Nature, 488: 448-50. Kwak, H. et al. (2010). ‘What isTwitter, a Social Network or a News Media?’ Proceedings of the 19th InternationalWorldWide Web (WWW) Conference, April 26-30, 2010, Raleigh NC. Manyika, J. et al. (2011). ‘Big data: the next frontier for innovation, competition and productivity’, McKinsey Global Institute, available at: http://www.mckinsey.com/insights/mgi/research/technology_and_innovation/ big_data_the_next_frontier_for_innovation (last accessed August 29, 2012). Silver, Nate. (2012). The Signal and the Noise:The Art and Science of Prediction. London:Allen Lane. Tancer, B. (2009). Click:What Millions of People are Doing Online andWhy It Matters. NewYork: Harper Collins, 2009. Wu, S. , J.M. Hofman,W.A. Mason, and D.J. Watts, (2011). ‘Who says what to whom on twitter’, Proceedings of the 20th international conference onWorldWideWeb. (on DuncanWatts webpage, http://research.microsoft.com/en-us/people/duncan/, last accessed August 29, 2012).
  • 17. With support from: Dr Eric T. Meyer Research Fellow & DPhil Programme Director eric.meyer@oii.ox.ac.uk http://www.oii.ox.ac.uk/people/?id=120 @etmeyer

Hinweis der Redaktion

  1. Show increasing value graph and something showing store of knowledge
  2. Show increasing value graph and something showing store of knowledge
  3. Show increasing value graph and something showing store of knowledge
  4. Implications for STSPolicy implicationsPractice implicationsIntermediation (algorithms between researchers and their artifacts) and disintermediation (more direct access to traces of people’s behaviour)Researchers getting both more distant from their objects of research and closer to their objects of research, and both have important implicationsEngagement with businessStrategic value versus academic valueShort term business knowledge versus durable knowledge