8. The Big Picture
More people
Moremachines
Big Data
Big Compute
Conventional
Computation
“Big Social”
Social Networks
e-infrastructure
online
R&D
Big Data
Production
& Analytics
deeply
about
society
9. Research Councils UK and Big
Data
▶ „Big data is a term for a collection of datasets
so large and complex that it is beyond the
ability of typical database software tools to
capture, store, manage, and analyse them.
„Big‟ is not defined as being larger than a
certain number of „bytes‟ because as
technology advances over time, the size of
datasets that qualify as big data will also
increase‟ (RCUK)
12. Research benefits of new data
▶ Undertaking research on pressing policy-related issues
without the need for new data collection
• Food consumption, social background and obesity
• Energy consumption, housing type and climatic
conditions
• Rural location, private/public transport alternatives and
incomes
• School attainment, higher education participation,
subject choices, student debt and later incomes
▶ New data such as social media enable us to ask big
questions, about big populations, and in real time – this is
transformative
22. Real life is and must be full of all kinds of social
constraint – the very processes from which
society arises. Computers can help if we use
them to create abstract social machines on the
Web: processes in which the people do the
creative work and the machine does the
administration... The stage is set for an
evolutionary growth of new social engines. The
ability to create new forms of social process
would be given to the world at large, and
development would be rapid.
Berners-Lee, Weaving the Web, 1999 (pp. 172–175)
The Order of Social
Machines
23. SOCIAM: The Theory and Practice of Social Machines is funded by the UK Engineering and Physical Sciences Research Council
(EPSRC) under grant number EPJ017728/1 and comprises the Universities of Southampton, Oxford and Edinburgh. See sociam.org
24. Physical World
(people and devices)
Building a Social Machine
Design and
Composition
Participation and
Data supply
Model of social interaction
Virtual World
(Network of
social interactions)
Dave Robertson
31. Big data elephant versus sense-making
network?
The challenge is to foster the co-constituted socio-technical
system on the right i.e. a computationally-enabled sense-
making network of expertise, data, models, visualisations
and narratives
Iain Buchan
33. Take homes
▶ There are multiple shifts in scholarship occurring:
– Volumes of data and associated automation
– Computational infrastructure and realtime analytics
– Dataflows vs datasets (and curation infrastructure)
– Correlation vs causation
– Responsible Innovation
– Machine-to-Machine and Internet of Things
▶ Social Machines provide an approach to co-
design and analysis in the evolving knowledge
infrastructure
EPSRC: Under ‘Big Data’ we are considering both very large and also complex data, including dynamic and heterogenous data from all the various sources including sensors, social media, industry etc.
ESRC was allocated 64m and much of this is being used to set up the ESRC Big Data Network. The ESRC’s Big Data Network will support the development of a network of innovative investments which will strengthen the UK’s competitive advantage in Big Data for the social sciences. The core aim of this network is to facilitate access to different types of data and thereby stimulate innovative research and develop new methods to undertake that research. Although you should note that diagram it is only illustrative in terms of how the UKDS and ADS will work across – that is still under discussion; and only illustrative in the number of Business and Local Government Data Research.This network has been divided into three phases. In Phase 1 of the Big Data Network the ESRC has invested in the development of the Administrative Data Research Network (ADRN) which will provide access to de-identified administrative data collected by government departments for research use – focus of this meeting and all your grants.A few words about Phase 2 and 3 before we pass to Vanessa to talk about the ADRN some more. Phase 2is currently bring commissioned and will deal primarily with business data and/ or local government data. Phase 3, further details of which will be released in the last autumn / winter and will focus primarily on third sector data and social media data. It is expected that there will be opportunities for interaction across all elements of the ESRC Big Data Network and that they will all work together around the wider objectives of facilitating access to different forms of data and of ensuring maximum impact is generated from the use of that data for the mutual benefit of data owners and researchers, and through the research facilitated by the Network, benefit society and the economy more generally.
Thanks to Simon Hettrick for additional input to this slide.