SlideShare ist ein Scribd-Unternehmen logo
1 von 35
Downloaden Sie, um offline zu lesen
Big Data in the Arts
and Humanities
Andrew Prescott, University of Glasgow
AHRC Theme Leader for Digital Transformations
Big Data in a Transdisciplinary Perspective

7th Herrenhausen Conference of the Volkswagen Foundation
25 March 2015
Neurone activity in the brain of a zebra fish embryo.
Each video sequence is one terabyte in size.
Ahrens, M. B. & Keller, P. J. Nature Meth. http://dx.doi.org/10.1038/NMETH.2434 (2013)
The high frequency telescopes of the Square Kilometre Array will
produce 1 exabyte per day (more than current global internet traffic) in
first phase. This will eventually rise to many Petabits (1015) per
second, more than 10 times the current global internet traffic
BIG HUMANITIES DATASETS
Sound and Video:
• Shoa Holocaust Survivors testimonials collection is 20 terabytes
(cf. Sloan Digital Sky Survey 10 terabytes)
• The BBC’s digital assets are estimated at about 52 petabytes of
data
Structured data:
• US National Archives and Records Administration: 142 TB of
data; estimated 347 PB by 2022
• Ancestry holds 14 billion records and is adding 2 million records
daily. Brightsolid's (Findmypast) new data centre in Aberdeen
will have 400 petabytes of storage
• Web archives: multi-petabyte
Linguistic corpora:
• Corpus of American Contemporary English: 450 million words
• Wikipedia Corpus: 1.9 billion words
• Google American books n-grams: 155 billion words
THE CHANGING NATURE OF THE PRIMARY
MATERIALS OF HUMANITIES RESEARCH
• The papers of the British prime minister William Ewart
Gladstone (1809-1898): approx. 160,000 documents
in 762 volumes.
• Margaret Thatcher archive: 1 million documents in
3,000 boxes occupying 300 metres of shelving
• Enron Corporation Corpus, acquired by Federal
Energy Regulatory Commission during enquiry into
corporation’s collapse. Approx. 600,000 e-mails
generated by 158 employees; about 423MB (zipped).
Electronic records from the Executive Office of the President during the
second presidency of George W. Bush: 82 TB of data; 200+ million e-
mail messages; 3+ million digital photographs; 30+ million other
electronic records
http://www.georgewbushlibrary.smu.edu/Research/Electronic-Records.aspx
###### Begin Original ARMS Header ######

RECORD TYPE: PRESIDENTIAL (NOTES MAIL)

CREATOR:Sandy Kress ( CN=Sandy Kress/OU=OPD/O=EOP [ OPD ] ) CREATION DATE/TIME:14-JUN-2001 17:13:17.00

SUBJECT:: Education statement

TO:Claire E. Buchan ( CN=Claire E. Buchan/OU=WHO/O=EOP@EOP [ WHO ] ) READ:UNKNOWN

###### End Original ARMS Header ######
---------------------- Forwarded by Sandy Kress/OPD/EOP on 06/14/2001 05:13 PM ---------------------------
Sarah Pfeifer 06/14/2001 04:59:34 PM Record Type: Record
To: Sarah E. Youssef/OPD/EOP@EOP, Brian R. Besanceney/OPD/EOP@EOP, Sandy Kress/OPD/EOP@EOP

cc:

Subject: Education statement
---------------------- Forwarded by Sarah Pfeifer/OPD/EOP on 06/14/2001 04:59 PM ---------------------------
Sarah Pfeifer 06/14/2001 04:59:00 PM Record Type: Record
To: See the distribution list at the bottom of this message cc:

Subject: Education statement
This statement has been approved by the President. Harriet called me several minutes ago with one last change, which I have incorporated.
Message Sent To:_____________________________________________________________ Harriet Miers/WHO/EOP@EOP
John Gardner/WHO/EOP@EOP Barbara A. Barclay/WHO/EOP@EOP Debra D. Bird/WHO/EOP@EOP Carolyn E. Cleveland/WHO/
EOP@EOP
E-mail by B. Alexander (Sandy) Kress, Senior Adviser to President
George W. Bush on Education, concerning the drafting of the No Child
Left Behind Act in 2001
http://www.georgewbushlibrary.smu.edu/en/Research/Electronic-Records/Email.aspx#Email
• Visualisation of relationship
between terms in Wikileaks
Significant Action Reports
real to Iraq
• Big data: ‘whose size forces
us to look beyond the tried-
and true methods that are
prevalent at that
time’ (Jacobs, 2009)
• Illustrate how big data is
already a current issue for
humanities researchers
• Suggests humanities
becoming not only more
quantitative, but also more
visual, haptic and
exploratory
collateral exposure..?POSSIBLE INFORMATION
media diversion..?POSSIBLE INFORMATION
Extract from project publication for Insurance.AES256 by Michael Takeo
Magruder (2011), using Wikileaks material to reflect on issues of
information freedom and secrecy in today's ever-shifting media landscape.
http://www.takeo.org/nspace/2011-insurance_aes256/
Portfolio of Big Data projects funded by UK
Arts and Humanities Research Council,
2014-15
• Dealing with large textual corpora: UK statute law; mining
the history of medicine
• Linking existing databases: Snapdrgn; Big Data History of
Music
• Annotation of unstructured data: DEEP film access;
optical music recognition; Lost Visions
• Visualisation: International crime fiction; Seeing Data
• Critical study of data: Our Data Ourselves; Secret Life of a
Weather Datum
Portfolio of Big Data projects funded by UK
Arts and Humanities Research Council,
2014-15
• Mapping: Literary History of Edinburgh;
• Internet of Things: archaeological 3D imaging; Tangible Memories
• Reflects range of activities currently used in ‘Big Humanities’.
• Does anything link these together methodologically? Do they
represent anything different from what we have previously done?
• Is there a ‘Big Data moment’, or is it simply that data and
expertise is now available on a larger scale?
• What distinctive contributions can the arts and humanities make
to the Big Data debates?
HAVE WE BEEN HERE FOR A LONG TIME?
• If Big Data is defined as data whose
size requires us to look beyond tried
methods, it has been with us since
antiquity
• Invention of writing linked to government
need to manage information
• 1086: Detailed register of property in
Domesday Book
• 12th century: development of pipe rolls
and use of counters in government
accounting
• 13th century: alphabetisation of the bible
by a team of Dominican friars
WHY BIG DATA IS DIFFERENT
• Historical examples like Domesday Book or census were
inventories; descriptive and backward-looking
• The aim of Big Data techniques is predictive: ‘We know what
you are going to do tomorrow’ (credit score agency)
• Results derive from quantity of data rather than quality; methods
‘inherently inexact but the vast amount of data compensates for
the imperfections’ (Mayer-Schonberger, p. 187)
• Ignores causal relationships and looks for co-relations e.g. how
lifestyle factors predict likelihood of adhering to medical
prediction
EXAMPLES OF PREDICTIVE ANALYTICS
• Driven largely by finance and retail, but rapidly spreading into other
sectors
• Chicago: Automated Preventive Rodent Baiting Program analyses 31
indicators to predict where rodent infestations will occur
• New York: predicting where unlicensed building conversions have
occurred to target inspections and issue vacate orders
• Chicago: Predictive Policing System
• AHRC programme includes projects on online betting on election
results, and on legislation
• AHRC-Nesta project to use predictive analytics to improve museum
attendance
Use of big data techniques in choosing film directors,
cast, crew, etc.: the-numbers.com
Use of predictive analytics to ‘optimise scripts’ in film and TV:
epagogix.com
John Wiley considering using IBM Pure Data analytics in similar way
for scientific and academic publishing
CHALLENGES OF BIG DATA TO THE ARTS
AND HUMANITIES
• Not simply about role of quantification or scientific method in arts and
humanities
• Challenges assumptions about role of information in research: if data
is big enough, messy or poorly curated data need not be an issue
• Questions existing research methods: ‘data-driven research’
• Undermines assumptions about causality and human agency
• Role of retail and financial agencies in developing these methods - the
enclosure of data
• Challenges existing critical and theoretical frameworks: not ‘end of
theory’ but ‘big data needs big theory’
HOW THE ARTS AND HUMANITIES CAN
ADDRESS BIG DATA CHALLENGES
• Developing new theoretical frameworks and responses: critical
data studies
• Providing models in areas such as causality and ‘messiness of
data’
• Exploring the spaces and flow of big data
• Promoting moral values of humanities research in a big data world
• Role of design
• ‘Radical contextualisation’ of big data
• Humanisation of big data
THE NEED FOR BIG THEORY
• Chris Anderson in Wired 2008: ‘Out with every theory of human
behavior, from linguistics to sociology. Forget taxonomy, ontology,
and psychology. Who knows why people do what they do? The point
is they do it, and we can track and measure it with unprecedented
fidelity. With enough data, the numbers speak for themselves’.
• New York Times, 2010: ‘The next big idea in language, history and
the arts? Data. Members of a new generation of digitally savvy
humanists argue it is time to stop looking for inspiration in the next
political or philosophical ‘ism’ and start exploring how technology is
changing our understanding of the liberal arts. This latest frontier is
about method, they say, using powerful technologies and vast stores
of digitised materials that previous humanities scholars did not have’.
• Charles Darwin (cited by Callebut): ‘all observation must be for or
against some view if it is to be of any service’
THE NEED FOR BIG THEORY
• Bowker (2006): Raw data is both an oxymoron and a
bad idea; to the contrary, data should be cooked with
care
• Huggett (2014): Data are not 'out there', waiting to be
discovered; if anything, data are waiting to be created.
Information about the past is situated, contingent, and
incomplete; data are theory-laden, and relationships
are constantly changing depending on context.
• Kitchen and Lauriault (2014): Data are situated,
contingent, relational, and framed, and used
contextually to try and achieve certain aims and goals
CRITICAL DATA STUDIES
Dalton and Thatcher, What does a critical data studies look like,
and why do we care? Seven points for a critical approach to ‘big
data (Society and Space, 2014)
1. situate data regimes in time and space 

2. expose data as inherently political and whose interests they
serve 

3. unpack the complex, non-deterministic relationship between
data and society 

4. illustrate the ways in which data are never raw 

5. expose the fallacies that data can speak for themselves and
that big data will replace small data 

6. explore how new data regimes can be used in socially
progressive ways 

7. examine how academia engages with new data regimes and
the opportunities of such engagement
lifeofdata.org.uk
big-social-data.net
Our Data Ourselves
RETHINKING THE IMPLICATIONS OF BIG DATA
• Is a switch from causality to co-relation so radical?
• As long ago as 1946, the historian Marc Bloch argued
against the ‘idol of origins’ and sought a history with
stronger social and cultural understanding
• Pioneering work of humanities scholarship such as
Annales School of historians has lot to contribute in terms
of integrating methodology, data and new techniques
• Continued importance of critical understanding of data, as
Google flu trends controversy illustrates
• Experience of humanities scholars in dealing with complex
and messy historical datasets potentially very relevant
Visualisation of ontology for linking information
about people in the ancient world developed by
the Standards for Networking Ancient
Prosopographies project:
snapdrgn.net
seeingdata.org: includes videos on ‘Making Sense
of Data Visualisations’
Big Data in the Arts and Humanities
Erica Savig, M.Arch.
PhD Candidate, Cancer Biology
Stanford University
Lab of Garry P. Nolan
National Science Foundation Graduate
Research Fellow
Stanford Graduate Research Fellow
Common Design
Strategies for Exploring
Signaling Networks in
Biology and Intellectual
Geographies in History
Nicole Coleman
Director, Humanities + Design
Stanford University
Component
and Behavior
for Protein 1
Component
and Behavior
for Protein 2
Component
and Behavior
for Protein 3
Parametric Modeling Quantitatively Maps Single Cell Protein
Levels to Individual Qualitative Components
Michael Takeo Magruder, Data Flower: www.takeo.org
Fabio Lattanzi Antinori
The Obelisk, 2012
http://
fabiolattanziantinori.co
m/obelisk.php
co-curate.ncl.ac.ukpararchive.com
bloodaxe.ncl.ac.uk affectivedigitalhistories.org.uk
Tim Hitchcock on Big Data, Small Data and Meaning
(historyonics.blogspot.co.uk):
‘Big Data’ supposedly lets you get away with dirty data.  In contrast,
humanists do read the data; and do so with a sharp eye for its
individual rhythms and peculiarities – its weirdness. 
In the rush towards 'Big Data' – the Longue durée, and automated
network analysis; towards a vision of Humanist scholarship in which
Bayesian probability is as significant as biblical allusion, the most
urgent need seems to me to be to find the tools that allow us to do the
job of close reading of all the small data that goes to make the bigger
variety…we need to be able to contextualise every single word in a
representation of every word, ever. Every gesture contextualised in the
collective record all gestures; and every brushstroke, in the collective
knowledge of every painting. 
Towards a ‘radical contextualisation’: Mapping Metaphor
with the Historical Thesaurus of the English Language
http://blogs.arts.gla.ac.uk/metaphor/
tangible-memories.com

Weitere ähnliche Inhalte

Was ist angesagt?

20130805 Activating Linked Open Data in Libraries Archives and Museums
20130805 Activating Linked Open Data in Libraries Archives and Museums20130805 Activating Linked Open Data in Libraries Archives and Museums
20130805 Activating Linked Open Data in Libraries Archives and Museumsandrea huang
 
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital TextBridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital TextBeth Plale
 
Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014Beth Plale
 
Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
The HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data FrameworkThe HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data FrameworkRobert H. McDonald
 
Beyond Preservation: Situating Archaeological Data in Professional Practice
Beyond Preservation: Situating Archaeological Data in Professional PracticeBeyond Preservation: Situating Archaeological Data in Professional Practice
Beyond Preservation: Situating Archaeological Data in Professional PracticeEric Kansa
 
Data management planning: UK policies and beyond
Data management planning: UK policies and beyondData management planning: UK policies and beyond
Data management planning: UK policies and beyondMartin Donnelly
 
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...lljohnston
 
LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...
LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...
LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...PrattSILS
 
The Importance of Marketing Digital Collections
The Importance of Marketing Digital CollectionsThe Importance of Marketing Digital Collections
The Importance of Marketing Digital CollectionsChristine Madsen
 
Data, librarians, and services
Data, librarians, and servicesData, librarians, and services
Data, librarians, and servicesAndrew Treloar
 
Introduction for skills seminar on Search and Data Mining, Master of European...
Introduction for skills seminar on Search and Data Mining, Master of European...Introduction for skills seminar on Search and Data Mining, Master of European...
Introduction for skills seminar on Search and Data Mining, Master of European...Gerben Zaagsma
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamPlatforma Otwartej Nauki
 
Knowledge Organization | LIS653 | Fall 2017
Knowledge Organization | LIS653 | Fall 2017Knowledge Organization | LIS653 | Fall 2017
Knowledge Organization | LIS653 | Fall 2017PrattSILS
 
Big Data in the Arts and Humanities: Stirling presentation
Big Data in the Arts and Humanities: Stirling presentationBig Data in the Arts and Humanities: Stirling presentation
Big Data in the Arts and Humanities: Stirling presentationAndrew Prescott
 
The Content Mine (presented at UKSG)
The Content Mine (presented at UKSG)The Content Mine (presented at UKSG)
The Content Mine (presented at UKSG)petermurrayrust
 
Data sharing and data management – what are they all about?
Data sharing and data management –  what are they all about?Data sharing and data management –  what are they all about?
Data sharing and data management – what are they all about?Belinda Weaver
 

Was ist angesagt? (20)

20130805 Activating Linked Open Data in Libraries Archives and Museums
20130805 Activating Linked Open Data in Libraries Archives and Museums20130805 Activating Linked Open Data in Libraries Archives and Museums
20130805 Activating Linked Open Data in Libraries Archives and Museums
 
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital TextBridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
 
Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014
 
Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & Museums
 
The HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data FrameworkThe HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
 
Beyond Preservation: Situating Archaeological Data in Professional Practice
Beyond Preservation: Situating Archaeological Data in Professional PracticeBeyond Preservation: Situating Archaeological Data in Professional Practice
Beyond Preservation: Situating Archaeological Data in Professional Practice
 
Data management planning: UK policies and beyond
Data management planning: UK policies and beyondData management planning: UK policies and beyond
Data management planning: UK policies and beyond
 
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
 
LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...
LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...
LIS 653 Knowledge Organization | Pratt Institute School of Information | Fall...
 
The Importance of Marketing Digital Collections
The Importance of Marketing Digital CollectionsThe Importance of Marketing Digital Collections
The Importance of Marketing Digital Collections
 
Christine borgman keynote
Christine borgman keynoteChristine borgman keynote
Christine borgman keynote
 
Data, librarians, and services
Data, librarians, and servicesData, librarians, and services
Data, librarians, and services
 
Open Notebook Science
Open Notebook ScienceOpen Notebook Science
Open Notebook Science
 
Introduction for skills seminar on Search and Data Mining, Master of European...
Introduction for skills seminar on Search and Data Mining, Master of European...Introduction for skills seminar on Search and Data Mining, Master of European...
Introduction for skills seminar on Search and Data Mining, Master of European...
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, Potsdam
 
Knowledge Organization | LIS653 | Fall 2017
Knowledge Organization | LIS653 | Fall 2017Knowledge Organization | LIS653 | Fall 2017
Knowledge Organization | LIS653 | Fall 2017
 
Big Data in the Arts and Humanities: Stirling presentation
Big Data in the Arts and Humanities: Stirling presentationBig Data in the Arts and Humanities: Stirling presentation
Big Data in the Arts and Humanities: Stirling presentation
 
The Content Mine (presented at UKSG)
The Content Mine (presented at UKSG)The Content Mine (presented at UKSG)
The Content Mine (presented at UKSG)
 
Curation is for cytomics
Curation is for cytomicsCuration is for cytomics
Curation is for cytomics
 
Data sharing and data management – what are they all about?
Data sharing and data management –  what are they all about?Data sharing and data management –  what are they all about?
Data sharing and data management – what are they all about?
 

Andere mochten auch

Alternative Postgraduate Careers
Alternative Postgraduate CareersAlternative Postgraduate Careers
Alternative Postgraduate CareersAndrew Prescott
 
What Happens When the Internet of Things Meets the Middle Ages?
What Happens When the Internet of Things Meets the Middle Ages?What Happens When the Internet of Things Meets the Middle Ages?
What Happens When the Internet of Things Meets the Middle Ages?Andrew Prescott
 
Use of Analytics by Netflix - Case Study
Use of Analytics by Netflix - Case StudyUse of Analytics by Netflix - Case Study
Use of Analytics by Netflix - Case StudySaket Toshniwal
 
[Report] The Social Media ROI Cookbook, by Susan Etlinger
[Report] The Social Media ROI Cookbook, by Susan Etlinger[Report] The Social Media ROI Cookbook, by Susan Etlinger
[Report] The Social Media ROI Cookbook, by Susan EtlingerAltimeter, a Prophet Company
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBernard Marr
 

Andere mochten auch (6)

Alternative Postgraduate Careers
Alternative Postgraduate CareersAlternative Postgraduate Careers
Alternative Postgraduate Careers
 
What Happens When the Internet of Things Meets the Middle Ages?
What Happens When the Internet of Things Meets the Middle Ages?What Happens When the Internet of Things Meets the Middle Ages?
What Happens When the Internet of Things Meets the Middle Ages?
 
Use of Analytics by Netflix - Case Study
Use of Analytics by Netflix - Case StudyUse of Analytics by Netflix - Case Study
Use of Analytics by Netflix - Case Study
 
[Report] The Social Media ROI Cookbook, by Susan Etlinger
[Report] The Social Media ROI Cookbook, by Susan Etlinger[Report] The Social Media ROI Cookbook, by Susan Etlinger
[Report] The Social Media ROI Cookbook, by Susan Etlinger
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should Know
 

Ähnlich wie Big Data in the Arts and Humanities

Digital Humanities and “Digital” Social Sciences
Digital Humanities and “Digital” Social SciencesDigital Humanities and “Digital” Social Sciences
Digital Humanities and “Digital” Social SciencesChantal van Son
 
Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Stella Wisdom
 
Rebecca Grant DPASSH presentation 2015
Rebecca Grant DPASSH presentation 2015Rebecca Grant DPASSH presentation 2015
Rebecca Grant DPASSH presentation 2015dri_ireland
 
Words and More Words: Challenges of Big Data by Prof. Edie Rasmussen
Words and More Words: Challenges of Big Data by Prof. Edie RasmussenWords and More Words: Challenges of Big Data by Prof. Edie Rasmussen
Words and More Words: Challenges of Big Data by Prof. Edie Rasmussenwkwsci-research
 
Digital Scholarship Intersection Scale Social Machines
Digital Scholarship Intersection Scale Social MachinesDigital Scholarship Intersection Scale Social Machines
Digital Scholarship Intersection Scale Social MachinesDavid De Roure
 
Data, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of ChileData, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of ChileLEARN Project
 
Four Corners of the Big Tent
Four Corners of the Big TentFour Corners of the Big Tent
Four Corners of the Big TentJohn Bradley
 
Arin6912 – Digital Research And Publishing Presentation
Arin6912 – Digital Research And Publishing PresentationArin6912 – Digital Research And Publishing Presentation
Arin6912 – Digital Research And Publishing Presentationklfagan
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Jisc
 
Bigdataforesight
BigdataforesightBigdataforesight
Bigdataforesightsuresh sood
 
Roger Malina isea keynote 2012
Roger Malina isea keynote 2012Roger Malina isea keynote 2012
Roger Malina isea keynote 2012roger malina
 
Introduction to Big Data and Data Science
Introduction to Big Data and Data ScienceIntroduction to Big Data and Data Science
Introduction to Big Data and Data ScienceFeyzi R. Bagirov
 
When will there be a digital revolution in the humanities?
When will there be a digital revolution in the humanities?When will there be a digital revolution in the humanities?
When will there be a digital revolution in the humanities?Martin Wynne
 
European librarians theatre - Social Media Spotlight
European librarians theatre - Social Media SpotlightEuropean librarians theatre - Social Media Spotlight
European librarians theatre - Social Media SpotlightJulien Houssiere
 

Ähnlich wie Big Data in the Arts and Humanities (20)

Digital Humanities and “Digital” Social Sciences
Digital Humanities and “Digital” Social SciencesDigital Humanities and “Digital” Social Sciences
Digital Humanities and “Digital” Social Sciences
 
Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods
 
Rebecca Grant DPASSH presentation 2015
Rebecca Grant DPASSH presentation 2015Rebecca Grant DPASSH presentation 2015
Rebecca Grant DPASSH presentation 2015
 
Words and More Words: Challenges of Big Data by Prof. Edie Rasmussen
Words and More Words: Challenges of Big Data by Prof. Edie RasmussenWords and More Words: Challenges of Big Data by Prof. Edie Rasmussen
Words and More Words: Challenges of Big Data by Prof. Edie Rasmussen
 
Homelessness Data Discussion
Homelessness Data DiscussionHomelessness Data Discussion
Homelessness Data Discussion
 
Ongoing Research in Data Studies
Ongoing Research in Data StudiesOngoing Research in Data Studies
Ongoing Research in Data Studies
 
Digital Scholarship Intersection Scale Social Machines
Digital Scholarship Intersection Scale Social MachinesDigital Scholarship Intersection Scale Social Machines
Digital Scholarship Intersection Scale Social Machines
 
Data, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of ChileData, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of Chile
 
Four Corners of the Big Tent
Four Corners of the Big TentFour Corners of the Big Tent
Four Corners of the Big Tent
 
Making data more human
Making data more humanMaking data more human
Making data more human
 
AHRC CDP Digital Humanities 101
AHRC CDP Digital Humanities 101  AHRC CDP Digital Humanities 101
AHRC CDP Digital Humanities 101
 
Arin6912 – Digital Research And Publishing Presentation
Arin6912 – Digital Research And Publishing PresentationArin6912 – Digital Research And Publishing Presentation
Arin6912 – Digital Research And Publishing Presentation
 
101 This is Digital Scholarship 2016
101 This is Digital Scholarship 2016101 This is Digital Scholarship 2016
101 This is Digital Scholarship 2016
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
 
Bigdataforesight
BigdataforesightBigdataforesight
Bigdataforesight
 
Roger Malina isea keynote 2012
Roger Malina isea keynote 2012Roger Malina isea keynote 2012
Roger Malina isea keynote 2012
 
Introduction to Big Data and Data Science
Introduction to Big Data and Data ScienceIntroduction to Big Data and Data Science
Introduction to Big Data and Data Science
 
Data stories
Data storiesData stories
Data stories
 
When will there be a digital revolution in the humanities?
When will there be a digital revolution in the humanities?When will there be a digital revolution in the humanities?
When will there be a digital revolution in the humanities?
 
European librarians theatre - Social Media Spotlight
European librarians theatre - Social Media SpotlightEuropean librarians theatre - Social Media Spotlight
European librarians theatre - Social Media Spotlight
 

Mehr von Andrew Prescott

Researching Freemasonry in a Time of Coronavirus: Resources and Opportunities
Researching Freemasonry in a Time of Coronavirus: Resources and OpportunitiesResearching Freemasonry in a Time of Coronavirus: Resources and Opportunities
Researching Freemasonry in a Time of Coronavirus: Resources and OpportunitiesAndrew Prescott
 
Artistic Practice and The Archive
Artistic Practice and The ArchiveArtistic Practice and The Archive
Artistic Practice and The ArchiveAndrew Prescott
 
Is Search the Right Way?
Is Search the Right Way?Is Search the Right Way?
Is Search the Right Way?Andrew Prescott
 
Medieval Studies: Some Hopes and Fears for the Future
Medieval Studies: Some Hopes and Fears for the FutureMedieval Studies: Some Hopes and Fears for the Future
Medieval Studies: Some Hopes and Fears for the FutureAndrew Prescott
 
New Modernist Editing meeting
New Modernist Editing meetingNew Modernist Editing meeting
New Modernist Editing meetingAndrew Prescott
 
New Materialities of the Book
New Materialities of the BookNew Materialities of the Book
New Materialities of the BookAndrew Prescott
 
Avoiding the Rear View Mirror
Avoiding the Rear View MirrorAvoiding the Rear View Mirror
Avoiding the Rear View MirrorAndrew Prescott
 
Challenges in the Digital Humanities
Challenges in the Digital HumanitiesChallenges in the Digital Humanities
Challenges in the Digital HumanitiesAndrew Prescott
 
Doing the Digital: How Scholars Learned to Stop Worrying and Love the Computer
Doing the Digital: How Scholars Learned to Stop Worrying and Love the ComputerDoing the Digital: How Scholars Learned to Stop Worrying and Love the Computer
Doing the Digital: How Scholars Learned to Stop Worrying and Love the ComputerAndrew Prescott
 
What are the Digital Humanities and what use are they to me?
What are the Digital Humanities and what use are they to me?What are the Digital Humanities and what use are they to me?
What are the Digital Humanities and what use are they to me?Andrew Prescott
 
The Arts and Humanities in a Digital Age: Disruptions and Continuities
The Arts and Humanities in a Digital Age: Disruptions and ContinuitiesThe Arts and Humanities in a Digital Age: Disruptions and Continuities
The Arts and Humanities in a Digital Age: Disruptions and ContinuitiesAndrew Prescott
 
Digital Transformations: keynote talk to Listening Experience Database Sympos...
Digital Transformations: keynote talk to Listening Experience Database Sympos...Digital Transformations: keynote talk to Listening Experience Database Sympos...
Digital Transformations: keynote talk to Listening Experience Database Sympos...Andrew Prescott
 
AHRC Digital Transformations theme: the Story So Far
AHRC Digital Transformations theme: the Story So FarAHRC Digital Transformations theme: the Story So Far
AHRC Digital Transformations theme: the Story So FarAndrew Prescott
 
Beyond the REF: the Role of Repositories
Beyond the REF: the Role of RepositoriesBeyond the REF: the Role of Repositories
Beyond the REF: the Role of RepositoriesAndrew Prescott
 

Mehr von Andrew Prescott (20)

Researching Freemasonry in a Time of Coronavirus: Resources and Opportunities
Researching Freemasonry in a Time of Coronavirus: Resources and OpportunitiesResearching Freemasonry in a Time of Coronavirus: Resources and Opportunities
Researching Freemasonry in a Time of Coronavirus: Resources and Opportunities
 
Working with Archives
Working with ArchivesWorking with Archives
Working with Archives
 
Artistic Practice and The Archive
Artistic Practice and The ArchiveArtistic Practice and The Archive
Artistic Practice and The Archive
 
Is Search the Right Way?
Is Search the Right Way?Is Search the Right Way?
Is Search the Right Way?
 
Medieval Studies: Some Hopes and Fears for the Future
Medieval Studies: Some Hopes and Fears for the FutureMedieval Studies: Some Hopes and Fears for the Future
Medieval Studies: Some Hopes and Fears for the Future
 
New Modernist Editing meeting
New Modernist Editing meetingNew Modernist Editing meeting
New Modernist Editing meeting
 
New Materialities of the Book
New Materialities of the BookNew Materialities of the Book
New Materialities of the Book
 
Prescottleicesterquad
PrescottleicesterquadPrescottleicesterquad
Prescottleicesterquad
 
Avoiding the Rear View Mirror
Avoiding the Rear View MirrorAvoiding the Rear View Mirror
Avoiding the Rear View Mirror
 
Challenges in the Digital Humanities
Challenges in the Digital HumanitiesChallenges in the Digital Humanities
Challenges in the Digital Humanities
 
Prescott Emda2015
Prescott Emda2015Prescott Emda2015
Prescott Emda2015
 
Sustainability
SustainabilitySustainability
Sustainability
 
Doing the Digital: How Scholars Learned to Stop Worrying and Love the Computer
Doing the Digital: How Scholars Learned to Stop Worrying and Love the ComputerDoing the Digital: How Scholars Learned to Stop Worrying and Love the Computer
Doing the Digital: How Scholars Learned to Stop Worrying and Love the Computer
 
New Materialities
New MaterialitiesNew Materialities
New Materialities
 
What are the Digital Humanities and what use are they to me?
What are the Digital Humanities and what use are they to me?What are the Digital Humanities and what use are they to me?
What are the Digital Humanities and what use are they to me?
 
The Arts and Humanities in a Digital Age: Disruptions and Continuities
The Arts and Humanities in a Digital Age: Disruptions and ContinuitiesThe Arts and Humanities in a Digital Age: Disruptions and Continuities
The Arts and Humanities in a Digital Age: Disruptions and Continuities
 
Digital Transformations: keynote talk to Listening Experience Database Sympos...
Digital Transformations: keynote talk to Listening Experience Database Sympos...Digital Transformations: keynote talk to Listening Experience Database Sympos...
Digital Transformations: keynote talk to Listening Experience Database Sympos...
 
Interdisciplinarity
InterdisciplinarityInterdisciplinarity
Interdisciplinarity
 
AHRC Digital Transformations theme: the Story So Far
AHRC Digital Transformations theme: the Story So FarAHRC Digital Transformations theme: the Story So Far
AHRC Digital Transformations theme: the Story So Far
 
Beyond the REF: the Role of Repositories
Beyond the REF: the Role of RepositoriesBeyond the REF: the Role of Repositories
Beyond the REF: the Role of Repositories
 

Kürzlich hochgeladen

How to Add a many2many Relational Field in Odoo 17
How to Add a many2many Relational Field in Odoo 17How to Add a many2many Relational Field in Odoo 17
How to Add a many2many Relational Field in Odoo 17Celine George
 
Philosophy of Education and Educational Philosophy
Philosophy of Education  and Educational PhilosophyPhilosophy of Education  and Educational Philosophy
Philosophy of Education and Educational PhilosophyShuvankar Madhu
 
3.21.24 The Origins of Black Power.pptx
3.21.24  The Origins of Black Power.pptx3.21.24  The Origins of Black Power.pptx
3.21.24 The Origins of Black Power.pptxmary850239
 
General views of Histopathology and step
General views of Histopathology and stepGeneral views of Histopathology and step
General views of Histopathology and stepobaje godwin sunday
 
Patient Counselling. Definition of patient counseling; steps involved in pati...
Patient Counselling. Definition of patient counseling; steps involved in pati...Patient Counselling. Definition of patient counseling; steps involved in pati...
Patient Counselling. Definition of patient counseling; steps involved in pati...raviapr7
 
CapTechU Doctoral Presentation -March 2024 slides.pptx
CapTechU Doctoral Presentation -March 2024 slides.pptxCapTechU Doctoral Presentation -March 2024 slides.pptx
CapTechU Doctoral Presentation -March 2024 slides.pptxCapitolTechU
 
5 charts on South Africa as a source country for international student recrui...
5 charts on South Africa as a source country for international student recrui...5 charts on South Africa as a source country for international student recrui...
5 charts on South Africa as a source country for international student recrui...CaraSkikne1
 
M-2- General Reactions of amino acids.pptx
M-2- General Reactions of amino acids.pptxM-2- General Reactions of amino acids.pptx
M-2- General Reactions of amino acids.pptxDr. Santhosh Kumar. N
 
Human-AI Co-Creation of Worked Examples for Programming Classes
Human-AI Co-Creation of Worked Examples for Programming ClassesHuman-AI Co-Creation of Worked Examples for Programming Classes
Human-AI Co-Creation of Worked Examples for Programming ClassesMohammad Hassany
 
Patterns of Written Texts Across Disciplines.pptx
Patterns of Written Texts Across Disciplines.pptxPatterns of Written Texts Across Disciplines.pptx
Patterns of Written Texts Across Disciplines.pptxMYDA ANGELICA SUAN
 
In - Vivo and In - Vitro Correlation.pptx
In - Vivo and In - Vitro Correlation.pptxIn - Vivo and In - Vitro Correlation.pptx
In - Vivo and In - Vitro Correlation.pptxAditiChauhan701637
 
The basics of sentences session 10pptx.pptx
The basics of sentences session 10pptx.pptxThe basics of sentences session 10pptx.pptx
The basics of sentences session 10pptx.pptxheathfieldcps1
 
Benefits & Challenges of Inclusive Education
Benefits & Challenges of Inclusive EducationBenefits & Challenges of Inclusive Education
Benefits & Challenges of Inclusive EducationMJDuyan
 
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdfP4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdfYu Kanazawa / Osaka University
 
2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx
2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx
2024.03.23 What do successful readers do - Sandy Millin for PARK.pptxSandy Millin
 
Prescribed medication order and communication skills.pptx
Prescribed medication order and communication skills.pptxPrescribed medication order and communication skills.pptx
Prescribed medication order and communication skills.pptxraviapr7
 
CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...
CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...
CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...Nguyen Thanh Tu Collection
 
How to Add a New Field in Existing Kanban View in Odoo 17
How to Add a New Field in Existing Kanban View in Odoo 17How to Add a New Field in Existing Kanban View in Odoo 17
How to Add a New Field in Existing Kanban View in Odoo 17Celine George
 
HED Office Sohayok Exam Question Solution 2023.pdf
HED Office Sohayok Exam Question Solution 2023.pdfHED Office Sohayok Exam Question Solution 2023.pdf
HED Office Sohayok Exam Question Solution 2023.pdfMohonDas
 

Kürzlich hochgeladen (20)

How to Add a many2many Relational Field in Odoo 17
How to Add a many2many Relational Field in Odoo 17How to Add a many2many Relational Field in Odoo 17
How to Add a many2many Relational Field in Odoo 17
 
Philosophy of Education and Educational Philosophy
Philosophy of Education  and Educational PhilosophyPhilosophy of Education  and Educational Philosophy
Philosophy of Education and Educational Philosophy
 
Personal Resilience in Project Management 2 - TV Edit 1a.pdf
Personal Resilience in Project Management 2 - TV Edit 1a.pdfPersonal Resilience in Project Management 2 - TV Edit 1a.pdf
Personal Resilience in Project Management 2 - TV Edit 1a.pdf
 
3.21.24 The Origins of Black Power.pptx
3.21.24  The Origins of Black Power.pptx3.21.24  The Origins of Black Power.pptx
3.21.24 The Origins of Black Power.pptx
 
General views of Histopathology and step
General views of Histopathology and stepGeneral views of Histopathology and step
General views of Histopathology and step
 
Patient Counselling. Definition of patient counseling; steps involved in pati...
Patient Counselling. Definition of patient counseling; steps involved in pati...Patient Counselling. Definition of patient counseling; steps involved in pati...
Patient Counselling. Definition of patient counseling; steps involved in pati...
 
CapTechU Doctoral Presentation -March 2024 slides.pptx
CapTechU Doctoral Presentation -March 2024 slides.pptxCapTechU Doctoral Presentation -March 2024 slides.pptx
CapTechU Doctoral Presentation -March 2024 slides.pptx
 
5 charts on South Africa as a source country for international student recrui...
5 charts on South Africa as a source country for international student recrui...5 charts on South Africa as a source country for international student recrui...
5 charts on South Africa as a source country for international student recrui...
 
M-2- General Reactions of amino acids.pptx
M-2- General Reactions of amino acids.pptxM-2- General Reactions of amino acids.pptx
M-2- General Reactions of amino acids.pptx
 
Human-AI Co-Creation of Worked Examples for Programming Classes
Human-AI Co-Creation of Worked Examples for Programming ClassesHuman-AI Co-Creation of Worked Examples for Programming Classes
Human-AI Co-Creation of Worked Examples for Programming Classes
 
Patterns of Written Texts Across Disciplines.pptx
Patterns of Written Texts Across Disciplines.pptxPatterns of Written Texts Across Disciplines.pptx
Patterns of Written Texts Across Disciplines.pptx
 
In - Vivo and In - Vitro Correlation.pptx
In - Vivo and In - Vitro Correlation.pptxIn - Vivo and In - Vitro Correlation.pptx
In - Vivo and In - Vitro Correlation.pptx
 
The basics of sentences session 10pptx.pptx
The basics of sentences session 10pptx.pptxThe basics of sentences session 10pptx.pptx
The basics of sentences session 10pptx.pptx
 
Benefits & Challenges of Inclusive Education
Benefits & Challenges of Inclusive EducationBenefits & Challenges of Inclusive Education
Benefits & Challenges of Inclusive Education
 
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdfP4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
 
2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx
2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx
2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx
 
Prescribed medication order and communication skills.pptx
Prescribed medication order and communication skills.pptxPrescribed medication order and communication skills.pptx
Prescribed medication order and communication skills.pptx
 
CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...
CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...
CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...
 
How to Add a New Field in Existing Kanban View in Odoo 17
How to Add a New Field in Existing Kanban View in Odoo 17How to Add a New Field in Existing Kanban View in Odoo 17
How to Add a New Field in Existing Kanban View in Odoo 17
 
HED Office Sohayok Exam Question Solution 2023.pdf
HED Office Sohayok Exam Question Solution 2023.pdfHED Office Sohayok Exam Question Solution 2023.pdf
HED Office Sohayok Exam Question Solution 2023.pdf
 

Big Data in the Arts and Humanities

  • 1. Big Data in the Arts and Humanities Andrew Prescott, University of Glasgow AHRC Theme Leader for Digital Transformations Big Data in a Transdisciplinary Perspective
 7th Herrenhausen Conference of the Volkswagen Foundation 25 March 2015
  • 2. Neurone activity in the brain of a zebra fish embryo. Each video sequence is one terabyte in size. Ahrens, M. B. & Keller, P. J. Nature Meth. http://dx.doi.org/10.1038/NMETH.2434 (2013)
  • 3. The high frequency telescopes of the Square Kilometre Array will produce 1 exabyte per day (more than current global internet traffic) in first phase. This will eventually rise to many Petabits (1015) per second, more than 10 times the current global internet traffic
  • 4. BIG HUMANITIES DATASETS Sound and Video: • Shoa Holocaust Survivors testimonials collection is 20 terabytes (cf. Sloan Digital Sky Survey 10 terabytes) • The BBC’s digital assets are estimated at about 52 petabytes of data Structured data: • US National Archives and Records Administration: 142 TB of data; estimated 347 PB by 2022 • Ancestry holds 14 billion records and is adding 2 million records daily. Brightsolid's (Findmypast) new data centre in Aberdeen will have 400 petabytes of storage • Web archives: multi-petabyte Linguistic corpora: • Corpus of American Contemporary English: 450 million words • Wikipedia Corpus: 1.9 billion words • Google American books n-grams: 155 billion words
  • 5. THE CHANGING NATURE OF THE PRIMARY MATERIALS OF HUMANITIES RESEARCH • The papers of the British prime minister William Ewart Gladstone (1809-1898): approx. 160,000 documents in 762 volumes. • Margaret Thatcher archive: 1 million documents in 3,000 boxes occupying 300 metres of shelving • Enron Corporation Corpus, acquired by Federal Energy Regulatory Commission during enquiry into corporation’s collapse. Approx. 600,000 e-mails generated by 158 employees; about 423MB (zipped).
  • 6. Electronic records from the Executive Office of the President during the second presidency of George W. Bush: 82 TB of data; 200+ million e- mail messages; 3+ million digital photographs; 30+ million other electronic records http://www.georgewbushlibrary.smu.edu/Research/Electronic-Records.aspx
  • 7. ###### Begin Original ARMS Header ######
 RECORD TYPE: PRESIDENTIAL (NOTES MAIL)
 CREATOR:Sandy Kress ( CN=Sandy Kress/OU=OPD/O=EOP [ OPD ] ) CREATION DATE/TIME:14-JUN-2001 17:13:17.00
 SUBJECT:: Education statement
 TO:Claire E. Buchan ( CN=Claire E. Buchan/OU=WHO/O=EOP@EOP [ WHO ] ) READ:UNKNOWN
 ###### End Original ARMS Header ###### ---------------------- Forwarded by Sandy Kress/OPD/EOP on 06/14/2001 05:13 PM --------------------------- Sarah Pfeifer 06/14/2001 04:59:34 PM Record Type: Record To: Sarah E. Youssef/OPD/EOP@EOP, Brian R. Besanceney/OPD/EOP@EOP, Sandy Kress/OPD/EOP@EOP
 cc:
 Subject: Education statement ---------------------- Forwarded by Sarah Pfeifer/OPD/EOP on 06/14/2001 04:59 PM --------------------------- Sarah Pfeifer 06/14/2001 04:59:00 PM Record Type: Record To: See the distribution list at the bottom of this message cc:
 Subject: Education statement This statement has been approved by the President. Harriet called me several minutes ago with one last change, which I have incorporated. Message Sent To:_____________________________________________________________ Harriet Miers/WHO/EOP@EOP John Gardner/WHO/EOP@EOP Barbara A. Barclay/WHO/EOP@EOP Debra D. Bird/WHO/EOP@EOP Carolyn E. Cleveland/WHO/ EOP@EOP E-mail by B. Alexander (Sandy) Kress, Senior Adviser to President George W. Bush on Education, concerning the drafting of the No Child Left Behind Act in 2001 http://www.georgewbushlibrary.smu.edu/en/Research/Electronic-Records/Email.aspx#Email
  • 8. • Visualisation of relationship between terms in Wikileaks Significant Action Reports real to Iraq • Big data: ‘whose size forces us to look beyond the tried- and true methods that are prevalent at that time’ (Jacobs, 2009) • Illustrate how big data is already a current issue for humanities researchers • Suggests humanities becoming not only more quantitative, but also more visual, haptic and exploratory
  • 9. collateral exposure..?POSSIBLE INFORMATION media diversion..?POSSIBLE INFORMATION Extract from project publication for Insurance.AES256 by Michael Takeo Magruder (2011), using Wikileaks material to reflect on issues of information freedom and secrecy in today's ever-shifting media landscape. http://www.takeo.org/nspace/2011-insurance_aes256/
  • 10. Portfolio of Big Data projects funded by UK Arts and Humanities Research Council, 2014-15 • Dealing with large textual corpora: UK statute law; mining the history of medicine • Linking existing databases: Snapdrgn; Big Data History of Music • Annotation of unstructured data: DEEP film access; optical music recognition; Lost Visions • Visualisation: International crime fiction; Seeing Data • Critical study of data: Our Data Ourselves; Secret Life of a Weather Datum
  • 11. Portfolio of Big Data projects funded by UK Arts and Humanities Research Council, 2014-15 • Mapping: Literary History of Edinburgh; • Internet of Things: archaeological 3D imaging; Tangible Memories • Reflects range of activities currently used in ‘Big Humanities’. • Does anything link these together methodologically? Do they represent anything different from what we have previously done? • Is there a ‘Big Data moment’, or is it simply that data and expertise is now available on a larger scale? • What distinctive contributions can the arts and humanities make to the Big Data debates?
  • 12. HAVE WE BEEN HERE FOR A LONG TIME? • If Big Data is defined as data whose size requires us to look beyond tried methods, it has been with us since antiquity • Invention of writing linked to government need to manage information • 1086: Detailed register of property in Domesday Book • 12th century: development of pipe rolls and use of counters in government accounting • 13th century: alphabetisation of the bible by a team of Dominican friars
  • 13. WHY BIG DATA IS DIFFERENT • Historical examples like Domesday Book or census were inventories; descriptive and backward-looking • The aim of Big Data techniques is predictive: ‘We know what you are going to do tomorrow’ (credit score agency) • Results derive from quantity of data rather than quality; methods ‘inherently inexact but the vast amount of data compensates for the imperfections’ (Mayer-Schonberger, p. 187) • Ignores causal relationships and looks for co-relations e.g. how lifestyle factors predict likelihood of adhering to medical prediction
  • 14. EXAMPLES OF PREDICTIVE ANALYTICS • Driven largely by finance and retail, but rapidly spreading into other sectors • Chicago: Automated Preventive Rodent Baiting Program analyses 31 indicators to predict where rodent infestations will occur • New York: predicting where unlicensed building conversions have occurred to target inspections and issue vacate orders • Chicago: Predictive Policing System • AHRC programme includes projects on online betting on election results, and on legislation • AHRC-Nesta project to use predictive analytics to improve museum attendance
  • 15. Use of big data techniques in choosing film directors, cast, crew, etc.: the-numbers.com
  • 16. Use of predictive analytics to ‘optimise scripts’ in film and TV: epagogix.com John Wiley considering using IBM Pure Data analytics in similar way for scientific and academic publishing
  • 17. CHALLENGES OF BIG DATA TO THE ARTS AND HUMANITIES • Not simply about role of quantification or scientific method in arts and humanities • Challenges assumptions about role of information in research: if data is big enough, messy or poorly curated data need not be an issue • Questions existing research methods: ‘data-driven research’ • Undermines assumptions about causality and human agency • Role of retail and financial agencies in developing these methods - the enclosure of data • Challenges existing critical and theoretical frameworks: not ‘end of theory’ but ‘big data needs big theory’
  • 18. HOW THE ARTS AND HUMANITIES CAN ADDRESS BIG DATA CHALLENGES • Developing new theoretical frameworks and responses: critical data studies • Providing models in areas such as causality and ‘messiness of data’ • Exploring the spaces and flow of big data • Promoting moral values of humanities research in a big data world • Role of design • ‘Radical contextualisation’ of big data • Humanisation of big data
  • 19. THE NEED FOR BIG THEORY • Chris Anderson in Wired 2008: ‘Out with every theory of human behavior, from linguistics to sociology. Forget taxonomy, ontology, and psychology. Who knows why people do what they do? The point is they do it, and we can track and measure it with unprecedented fidelity. With enough data, the numbers speak for themselves’. • New York Times, 2010: ‘The next big idea in language, history and the arts? Data. Members of a new generation of digitally savvy humanists argue it is time to stop looking for inspiration in the next political or philosophical ‘ism’ and start exploring how technology is changing our understanding of the liberal arts. This latest frontier is about method, they say, using powerful technologies and vast stores of digitised materials that previous humanities scholars did not have’. • Charles Darwin (cited by Callebut): ‘all observation must be for or against some view if it is to be of any service’
  • 20. THE NEED FOR BIG THEORY • Bowker (2006): Raw data is both an oxymoron and a bad idea; to the contrary, data should be cooked with care • Huggett (2014): Data are not 'out there', waiting to be discovered; if anything, data are waiting to be created. Information about the past is situated, contingent, and incomplete; data are theory-laden, and relationships are constantly changing depending on context. • Kitchen and Lauriault (2014): Data are situated, contingent, relational, and framed, and used contextually to try and achieve certain aims and goals
  • 21. CRITICAL DATA STUDIES Dalton and Thatcher, What does a critical data studies look like, and why do we care? Seven points for a critical approach to ‘big data (Society and Space, 2014) 1. situate data regimes in time and space 
 2. expose data as inherently political and whose interests they serve 
 3. unpack the complex, non-deterministic relationship between data and society 
 4. illustrate the ways in which data are never raw 
 5. expose the fallacies that data can speak for themselves and that big data will replace small data 
 6. explore how new data regimes can be used in socially progressive ways 
 7. examine how academia engages with new data regimes and the opportunities of such engagement
  • 24. RETHINKING THE IMPLICATIONS OF BIG DATA • Is a switch from causality to co-relation so radical? • As long ago as 1946, the historian Marc Bloch argued against the ‘idol of origins’ and sought a history with stronger social and cultural understanding • Pioneering work of humanities scholarship such as Annales School of historians has lot to contribute in terms of integrating methodology, data and new techniques • Continued importance of critical understanding of data, as Google flu trends controversy illustrates • Experience of humanities scholars in dealing with complex and messy historical datasets potentially very relevant
  • 25. Visualisation of ontology for linking information about people in the ancient world developed by the Standards for Networking Ancient Prosopographies project: snapdrgn.net
  • 26. seeingdata.org: includes videos on ‘Making Sense of Data Visualisations’
  • 28. Erica Savig, M.Arch. PhD Candidate, Cancer Biology Stanford University Lab of Garry P. Nolan National Science Foundation Graduate Research Fellow Stanford Graduate Research Fellow Common Design Strategies for Exploring Signaling Networks in Biology and Intellectual Geographies in History Nicole Coleman Director, Humanities + Design Stanford University
  • 29. Component and Behavior for Protein 1 Component and Behavior for Protein 2 Component and Behavior for Protein 3 Parametric Modeling Quantitatively Maps Single Cell Protein Levels to Individual Qualitative Components
  • 30. Michael Takeo Magruder, Data Flower: www.takeo.org
  • 31. Fabio Lattanzi Antinori The Obelisk, 2012 http:// fabiolattanziantinori.co m/obelisk.php
  • 33. Tim Hitchcock on Big Data, Small Data and Meaning (historyonics.blogspot.co.uk): ‘Big Data’ supposedly lets you get away with dirty data.  In contrast, humanists do read the data; and do so with a sharp eye for its individual rhythms and peculiarities – its weirdness.  In the rush towards 'Big Data' – the Longue durée, and automated network analysis; towards a vision of Humanist scholarship in which Bayesian probability is as significant as biblical allusion, the most urgent need seems to me to be to find the tools that allow us to do the job of close reading of all the small data that goes to make the bigger variety…we need to be able to contextualise every single word in a representation of every word, ever. Every gesture contextualised in the collective record all gestures; and every brushstroke, in the collective knowledge of every painting. 
  • 34. Towards a ‘radical contextualisation’: Mapping Metaphor with the Historical Thesaurus of the English Language http://blogs.arts.gla.ac.uk/metaphor/