SlideShare ist ein Scribd-Unternehmen logo
1 von 30
A centre of expertise in digital information management
www.ukoln.ac.uk
UKOLN is supported by:
Evolution or revolution?
The changing data landscape
Dr Liz Lyon, Director, UKOLN, University of Bath, UK
Associate Director, UK Digital Curation Centre
1st
DCC Regional Roadshow, Bath, November 2010
.
This work is licensed under a Creative Commons Licence
Attribution-ShareAlike 2.0
“Data sets
are becoming
the new
instruments
of science”
Dan Atkins, Univ Michigan
Digital data
as the new
special
collections?
Sayeed Choudhury, Johns Hopkins
Research data :
institutional
crown jewels?
http://www.flickr.com/photos/lifes__too_short__to__drink__cheap__wine/4754234186/
Perspectives
• Environmental scan
– Scale and complexity
– Infrastructure
– Open science
• Policy
– Funders
– Institutions
– Ethics & IP
• Practice Challenges
– Storage
– Incentives
– Costs & Sustainability
http://www.flickr.com/photos/thegreenalbum/3997609142/
Big
science
PDB
GenBank
UniProt
Pfam
Spreadsheets, Notebooks
Local, Lost
High throughput experimental
methods
Industrial scale
Commons based production
Publicly data sets
Cherry picked results
Preserved
CATH, SCOP
(Protein
Structure
Classification)
ChemSpider
Data collections
Slide: Carole Goble
A centre of expertise in digital information management
www.ukoln.ac.uk
Structural Sciences
Infrastructure
Infrastructure Roadmap
Cross Organisations
Infrastructure Roadmap
Cross Disciplines
Infrastructure Roadmap
Open Science
Open Laboratories
A centre of expertise in digital information management
www.ukoln.ac.uk
• Faculty work with public
• Smartphone apps facilitation
• Societal benefits
Citizen as scientist
Validate
results data
Policy
INCREMENTAL ProjectInstitutional perspective
• Creating & organising data
• Storage and access
• Back-up
• Preservation
• Sharing and re-use
The majority of people felt
that some form of policy or
guidance was needed....
Jeff Haywood, RDMF V October 2010
http://www.dcc.ac.uk/sites/default/files/documents/RDMF/RDMF5/Haywood.pdf
“While many researchers are
positive about sharing data
in
principle, they are almost
universally reluctant in
practice. ..... using these
data to publish results before
anyone else is the
primary way of gaining
prestige in nearly all INCREMENTAL Project
“Data
sharing was
more readily
discussed by
early career
researchers.”
“In our view, CRU should have been
more open with its raw data…”
Data is headline news
JISC FoI FAQ
P4 medicine:
Predictive,
Personalised,
Preventive,
Participatory.
Leroy Hood –
Institute for Systems Biology
Your genome is basis for
your medical record
Open data and ethics
• Direct-to-Consumer kits
• Informed consent?
• Privacy?
• UC Berkeley initiative
• Implications for HE
students & staff?
Policy Gaps...
• Is Policy
disconnected from
Practice?
– Data Sharing
– Data Licensing
– Ethics and Privacy
– Citizen Science &
Public Engagement
– Data Storage,
Selection & Appraisal
– Data Citation and
Attribution
“Departments don’t have guidelines or
norms for personal back-up and researcher
procedure, knowledge and diligence varies
tremendously. Many have experienced
moderate to catastrophic data loss”
Incremental Project Report, June 2010
http://www.flickr.com/photos/mattimattila/3003324844/
Data storage...
The case for cloud computing in genome
informatics. Lincoln D Stein, May 2010
– Scaleable
– Cost-effective (rent on-demand)
– Secure (privacy and IPR)
– Robust and resilient
– Low entry barrier / ease-of-use
– Has data-handling / transfer /
analysis capability
• Cloud services?
Your data in the cloud
Incentivising
data
management
Sustainability:
Who owns?
Who benefits?
Who selects?
Who preserves?
Who pays?
KRDS
Chicago Mart Plaza, 6-8 December 2010
Thank you…

Weitere ähnliche Inhalte

Was ist angesagt?

Workshop at Oxford on publishing for early career researchers - April 2011
Workshop at Oxford on publishing for early career researchers - April 2011Workshop at Oxford on publishing for early career researchers - April 2011
Workshop at Oxford on publishing for early career researchers - April 2011
Jisc
 

Was ist angesagt? (20)

The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science Landscape
 
Mantra for Change - IASSIST 2011
Mantra for Change - IASSIST 2011Mantra for Change - IASSIST 2011
Mantra for Change - IASSIST 2011
 
Workshop at Oxford on publishing for early career researchers - April 2011
Workshop at Oxford on publishing for early career researchers - April 2011Workshop at Oxford on publishing for early career researchers - April 2011
Workshop at Oxford on publishing for early career researchers - April 2011
 
RDM LIASA webinar
RDM LIASA webinarRDM LIASA webinar
RDM LIASA webinar
 
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM ToolkitLEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
 
Immersive informatics - research data management at Pitt iSchool and Carnegie...
Immersive informatics - research data management at Pitt iSchool and Carnegie...Immersive informatics - research data management at Pitt iSchool and Carnegie...
Immersive informatics - research data management at Pitt iSchool and Carnegie...
 
From Open Data to Open Science, by Geoffrey Boulton
 From Open Data to Open Science, by Geoffrey Boulton From Open Data to Open Science, by Geoffrey Boulton
From Open Data to Open Science, by Geoffrey Boulton
 
Curating the Scholarly Record: Data Management and Research Libraries
Curating the Scholarly Record: Data Management and Research LibrariesCurating the Scholarly Record: Data Management and Research Libraries
Curating the Scholarly Record: Data Management and Research Libraries
 
LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM PolicyLEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina Leonelli
 
Data management: The new frontier for libraries
Data management: The new frontier for librariesData management: The new frontier for libraries
Data management: The new frontier for libraries
 
The Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotThe Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data Pilot
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?
 
The big picture: reputation, rankings, assessment, and the role of libraries
The big picture: reputation, rankings, assessment, and the role of librariesThe big picture: reputation, rankings, assessment, and the role of libraries
The big picture: reputation, rankings, assessment, and the role of libraries
 
What does open science mean? A stakeholder perspective
What does open science mean? A stakeholder perspectiveWhat does open science mean? A stakeholder perspective
What does open science mean? A stakeholder perspective
 
The culture of researchData
The culture of researchDataThe culture of researchData
The culture of researchData
 
Hands-On Data Management Planning for Life Sciences
Hands-On Data Management Planning for Life SciencesHands-On Data Management Planning for Life Sciences
Hands-On Data Management Planning for Life Sciences
 
How can we ensure research data is re-usable? The role of Publishers in Resea...
How can we ensure research data is re-usable? The role of Publishers in Resea...How can we ensure research data is re-usable? The role of Publishers in Resea...
How can we ensure research data is re-usable? The role of Publishers in Resea...
 
Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster
 
Infrastructure, Standards, and Policies for Research Data Management
Infrastructure, Standards, and Policies for Research Data Management Infrastructure, Standards, and Policies for Research Data Management
Infrastructure, Standards, and Policies for Research Data Management
 

Andere mochten auch

Andere mochten auch (8)

Codes, Clouds & Constellations: Open Science in the Data Decade
Codes, Clouds & Constellations: Open Science in the Data DecadeCodes, Clouds & Constellations: Open Science in the Data Decade
Codes, Clouds & Constellations: Open Science in the Data Decade
 
CMOs Share How The Job Has Gotten Easier
CMOs Share How The Job Has Gotten EasierCMOs Share How The Job Has Gotten Easier
CMOs Share How The Job Has Gotten Easier
 
Facing the Data Challenge: Institutions, Disciplines, Services and Risks
Facing the Data Challenge: Institutions, Disciplines, Services and RisksFacing the Data Challenge: Institutions, Disciplines, Services and Risks
Facing the Data Challenge: Institutions, Disciplines, Services and Risks
 
Open Science at Genome Scale
Open Science at Genome ScaleOpen Science at Genome Scale
Open Science at Genome Scale
 
Top marketing reading list
Top marketing reading listTop marketing reading list
Top marketing reading list
 
Tax Assist Budget Summary2011
Tax Assist Budget Summary2011Tax Assist Budget Summary2011
Tax Assist Budget Summary2011
 
TOP brand challenge facing B2B companies
TOP brand challenge facing B2B companies TOP brand challenge facing B2B companies
TOP brand challenge facing B2B companies
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decade
 

Ähnlich wie Evolution or revolution? The changing data landscape

DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
University of California Curation Center
 

Ähnlich wie Evolution or revolution? The changing data landscape (20)

Open Data in a Global Ecosystem
Open Data in a Global EcosystemOpen Data in a Global Ecosystem
Open Data in a Global Ecosystem
 
Open Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practicesOpen Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practices
 
Simon hodson
Simon hodsonSimon hodson
Simon hodson
 
Informatics Transform : Re-engineering Libraries for the Data Decade
Informatics Transform : Re-engineering Libraries for the Data DecadeInformatics Transform : Re-engineering Libraries for the Data Decade
Informatics Transform : Re-engineering Libraries for the Data Decade
 
Diabetes Data Science
Diabetes Data ScienceDiabetes Data Science
Diabetes Data Science
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
 
Partnering for Research Data
Partnering for Research DataPartnering for Research Data
Partnering for Research Data
 
Magle data curation in libraries
Magle data curation in librariesMagle data curation in libraries
Magle data curation in libraries
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Fsci 2018 monday30_july_am6
Fsci 2018 monday30_july_am6Fsci 2018 monday30_july_am6
Fsci 2018 monday30_july_am6
 
Why we care about research data? Why we share?
Why we care about research data? Why we share?Why we care about research data? Why we share?
Why we care about research data? Why we share?
 
The Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIHThe Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIH
 
Managing and Sharing Research Data
Managing and Sharing Research DataManaging and Sharing Research Data
Managing and Sharing Research Data
 
Research Data Management: Part 1, Principles & Responsibilities
Research Data Management: Part 1, Principles & ResponsibilitiesResearch Data Management: Part 1, Principles & Responsibilities
Research Data Management: Part 1, Principles & Responsibilities
 
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarThe Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
 
Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)
 
Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...
 
Univ of Miami CTSI: Citizen science seminar; Oct 2014
Univ of Miami CTSI: Citizen science seminar; Oct 2014Univ of Miami CTSI: Citizen science seminar; Oct 2014
Univ of Miami CTSI: Citizen science seminar; Oct 2014
 

Kürzlich hochgeladen

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 

Evolution or revolution? The changing data landscape

Hinweis der Redaktion

  1. CATH database (http://www.cathdb.info/). Its about protein structure classification, so it is quite niche. SCOP (http://scop.mrc-lmb.cam.ac.uk/scop), the other protein structure classification database. Its information is manually curated by Alexey Murzin and his group at the MRC Laboratory of Molecular Biology in Cambridge. Its web page still looks like it belongs in the late 90s. ChemSpider was originally a homemade project but was bought out by the Royal Society of Chemistry when they realised what a valuable resource it was to chemists. Massive centralisation – clouds and curated core facilitiesMassive decentralisation – sticks, spreadsheets, wikis