SlideShare ist ein Scribd-Unternehmen logo
1 von 9
| 1
Anita de Waard 0000-0002-9034-4119
VP Research Data Collaborations
Elsevier RDM Services
a.dewaard@elsevier.com
Big Data PI Meeting
March 16, 2016
Real-World Data
Challenges:
Moving Towards
Richer Data Ecosystems
| 2
ESGF-
VL
ESGF
ESG-
CET
ESG-II
ESG-I
Usable
capabilities
Future
capabilities
Prototype
capabilities
1999-2001
2001-2006
2006-2011
2011-2020
2020-
Planned Earth System Grid System Evolution
Planned Earth System Grid System Data Archival
Model
Intercomparison
Projects
Remote Sensing,
In Situ, Climatology,
Diagnostics, Ecosystem,
Hydrology, Biology,
Etc.
Petabytes (1015) Exabytes (1018)
1999 20222017
Centralized Archive Distributed Data Ecosystem Virtual Laboratory
Source: Dean Williams, Lawrence Livermore/ESGF, March 1st 2017
Trend # 1: Repositories are becoming virtual labs
| 3
Trend # 2: Scientists are Moving ‘Beyond Downloads’
| 4
Trend # 3: Computers are scientists, too!
“intelligent systems for computer-aided
discovery can complement and integrate
into the insight generation loop in
scalable ways…”
http://ieeexplore.ieee.org/abstract/document/7515118/: Computer-Aided Discovery: Toward Scientific Insight Generation with Machine Support
“This work combines time series Principal
Component Analysis with InSAR to constrain
the space of possible model explanations on
current empirical data sets and achieve a better
identification of deformation patterns”
| 5
Raising many technical/organisational/policy questions:
• Is Long-Tail Data + Semantics = Big Data?
• Is Data Science a field, or a skill? (A department, or a class?)
• Are supercomputing centers research departments or bits of infrastructure? (And if
infrastructure, are they part of IT? (“Oh, no, anything but that!”)
• Are repositories places to store outputs, or places where science is conducted?
• If so, how are repositories and HPC’s recognised and rewarded?
• How can we keep track of (micro)provenance of parts of data sets?
• Should we explore Blockchain technology for this? (“Oh no, anything but that!”)
• Is a piece of software part of the University’s Research Outputs?
• If so, how do we reward brilliant coders who blog, but don’t write?
• How do we reward (virtual) collaboration?
• Why won’t those damn scientists share their data?
• Who will own the Data Science Cloud: Amazon? Or the joint HPC’s (NDS??) Is NIH
Data Commons the Model? Or is this a free for all? What is the role of commercial
parties?
• Is data curation/stewardship a part of science, or a glorified administrator's job?
• What is the role of libraries, in all this?
• And why the hell is a publisher talking about it?
| 6 6
Inst. Data
Repositorie(s)
Lab
ELN(s)
Data
Journal
Data search
Link to article
Journal
Find
Topic
Identify
gaps
Plan &
Fund
Discover data, people,
methods & protocols
Collect, analyze &
vizualize
Store, preserve
& share
Publish
Prepare, reproduce,
re-use & benchmark
Domain-specific
Repositories
General search
Faculty
LIMS
Data
center
Inst. Data
Repositorie(s)
Lab
ELN(s)
Data
Journal
Data search
Data Management
Plans
Metadata, methods &
protocols ready for
preservation and publishing
Link to article
Journal
Publish data
(under embargo)
Secure
discoverability
in & outside
the institution
Plan each step from
experiment to publish
Domain-specific
Repositories
General search
What Elsevier is Interested in: Supporting RDM Networks
| 7
Biological Pathways extracted via
semantic text mining
A upregulates B
B upregulates C
C increases disease D
Normalizing vocabularies required: proteins, diseases, drugs, chemicals
A  B  C  D
Bioactivities
through text analysis
IC50 6.3nM, kinase binding assay
10mM concentration
Chemical Structures
And Properties
InChi,
Name
NCBI,
Uniprot
EMTREE
ReaxysTree,
Structures
What Elsevier is Interested in: Knowledge Graphs in Life
Science
| 8
What Elsevier is Interested in: Knowledgegraphs in Research
| 9
Thank you!
Links to things we’re involved with:
• https://www.elsevier.com/connect/10-aspects-of-highly-effective-research-data
• https://www.elsevier.com/about/open-science/research-data
• https://www.hivebench.com
• https://data.mendeley.com/
• https://datasearch.elsevier.com/
• https://www.elsevier.com/books-and-journals/content-innovation/data-base-
linking
• http://www.journals.elsevier.com/softwarex/
• https://www.elsevier.com/physical-sciences/earth-and-planetary-sciences/the-
2015-international-data-rescue-award-in-the-geosciences
• https://rd-alliance.org/groups/rdawds-publishing-data-services-wg.html
• https://www.force11.org/
• http://www.nationaldataservice.org/
• https://rd-alliance.org/
Anita de Waard, a.dewaard@elsevier.com

Weitere ähnliche Inhalte

Was ist angesagt?

Executive Summary - Data Management Hub
Executive Summary - Data Management HubExecutive Summary - Data Management Hub
Executive Summary - Data Management Hub
Denis Parfenov
 
Global registries initiative frumkin omodei
Global registries initiative frumkin omodeiGlobal registries initiative frumkin omodei
Global registries initiative frumkin omodei
ASIS&T
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management Ecosystem
John Kunze
 

Was ist angesagt? (20)

December 9, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types - Pa...
December 9, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types - Pa...December 9, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types - Pa...
December 9, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types - Pa...
 
Implementing Archivematica, research data network
Implementing Archivematica, research data networkImplementing Archivematica, research data network
Implementing Archivematica, research data network
 
BioSharing - Update - Feb2016
BioSharing - Update - Feb2016BioSharing - Update - Feb2016
BioSharing - Update - Feb2016
 
Rots RDAP11 Data Archives in Federal Agencies
Rots RDAP11 Data Archives in Federal AgenciesRots RDAP11 Data Archives in Federal Agencies
Rots RDAP11 Data Archives in Federal Agencies
 
Wheeler & Benedict -- Enabling the Preservation Relay
Wheeler & Benedict -- Enabling the Preservation RelayWheeler & Benedict -- Enabling the Preservation Relay
Wheeler & Benedict -- Enabling the Preservation Relay
 
Research data management for masters and ph d students
Research data management for masters and ph d studentsResearch data management for masters and ph d students
Research data management for masters and ph d students
 
On community-standards, data curation and scholarly communication" Stanford M...
On community-standards, data curation and scholarly communication" Stanford M...On community-standards, data curation and scholarly communication" Stanford M...
On community-standards, data curation and scholarly communication" Stanford M...
 
Organising and Documenting Data
Organising and Documenting DataOrganising and Documenting Data
Organising and Documenting Data
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data Sharing
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Publishing the Full Research Data Lifecycle
Publishing the Full Research Data LifecyclePublishing the Full Research Data Lifecycle
Publishing the Full Research Data Lifecycle
 
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
 
Executive Summary - Data Management Hub
Executive Summary - Data Management HubExecutive Summary - Data Management Hub
Executive Summary - Data Management Hub
 
Global registries initiative frumkin omodei
Global registries initiative frumkin omodeiGlobal registries initiative frumkin omodei
Global registries initiative frumkin omodei
 
Baker - Evolution of Data Products and Designated Audiences
Baker - Evolution of Data Products and Designated AudiencesBaker - Evolution of Data Products and Designated Audiences
Baker - Evolution of Data Products and Designated Audiences
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management Ecosystem
 
EDI Training Module 2: EDI Project
EDI Training Module 2:  EDI ProjectEDI Training Module 2:  EDI Project
EDI Training Module 2: EDI Project
 
No more waiting! Tools that work Today to reveal dataset use
No more waiting!  Tools that work Today to reveal dataset useNo more waiting!  Tools that work Today to reveal dataset use
No more waiting! Tools that work Today to reveal dataset use
 
Data Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim ClarkData Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim Clark
 
Smith RDAP11 NSF Data Management Plan Case Studies
Smith RDAP11 NSF Data Management Plan Case StudiesSmith RDAP11 NSF Data Management Plan Case Studies
Smith RDAP11 NSF Data Management Plan Case Studies
 

Andere mochten auch

Andere mochten auch (20)

Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016
 
BD2K @ NIH - A Vision Through 2020
BD2K @ NIH - A Vision Through 2020BD2K @ NIH - A Vision Through 2020
BD2K @ NIH - A Vision Through 2020
 
Moving Forward with Open Data Science - SWOT Analysis
Moving Forward with Open Data Science - SWOT AnalysisMoving Forward with Open Data Science - SWOT Analysis
Moving Forward with Open Data Science - SWOT Analysis
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
 
Nothing can be added to you, nor anything taken from you
Nothing can be added to you, nor anything taken from youNothing can be added to you, nor anything taken from you
Nothing can be added to you, nor anything taken from you
 
Innovatiegericht inkopen: zo doe je dat!
Innovatiegericht inkopen: zo doe je dat!Innovatiegericht inkopen: zo doe je dat!
Innovatiegericht inkopen: zo doe je dat!
 
Selected_Architectural_Designs_2010
Selected_Architectural_Designs_2010Selected_Architectural_Designs_2010
Selected_Architectural_Designs_2010
 
Disemination course barcelona.pdf
Disemination course barcelona.pdfDisemination course barcelona.pdf
Disemination course barcelona.pdf
 
Military Professional Human Resources of all types
Military Professional Human Resources of all typesMilitary Professional Human Resources of all types
Military Professional Human Resources of all types
 
Estatuto del Profesor Universitario (Art. 321-328)
Estatuto del Profesor Universitario (Art. 321-328)Estatuto del Profesor Universitario (Art. 321-328)
Estatuto del Profesor Universitario (Art. 321-328)
 
Low GI carbs – Can sugars play a role? The example of Palatinose™ (isomaltul...
Low GI carbs – Can sugars play a role? The example of Palatinose™ (isomaltul...Low GI carbs – Can sugars play a role? The example of Palatinose™ (isomaltul...
Low GI carbs – Can sugars play a role? The example of Palatinose™ (isomaltul...
 
ゴースト暗算を簡略化してみた
ゴースト暗算を簡略化してみたゴースト暗算を簡略化してみた
ゴースト暗算を簡略化してみた
 
MMS 2015: Deploy mac os x os with sccm (002) final
MMS 2015: Deploy mac os x os with sccm (002) finalMMS 2015: Deploy mac os x os with sccm (002) final
MMS 2015: Deploy mac os x os with sccm (002) final
 
Whatsapp en las empresas
Whatsapp en las empresasWhatsapp en las empresas
Whatsapp en las empresas
 
How to register in Imagine Cup Bahrain 2017?
How to register in Imagine Cup Bahrain 2017?How to register in Imagine Cup Bahrain 2017?
How to register in Imagine Cup Bahrain 2017?
 
Let's level up with gamification
Let's level up with gamificationLet's level up with gamification
Let's level up with gamification
 
An introduction of different types of glasses
An introduction of different types of glassesAn introduction of different types of glasses
An introduction of different types of glasses
 
Napolcom reviewer e book 2015
Napolcom reviewer e book 2015Napolcom reviewer e book 2015
Napolcom reviewer e book 2015
 

Ähnlich wie Real-World Data Challenges: Moving Towards Richer Data Ecosystems

Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble
 
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Carole Goble
 
NCME Big Data in Education
NCME Big Data  in EducationNCME Big Data  in Education
NCME Big Data in Education
Philip Piety
 

Ähnlich wie Real-World Data Challenges: Moving Towards Richer Data Ecosystems (20)

Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARL
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Big Data for Library Services (2017)
Big Data for Library Services (2017)Big Data for Library Services (2017)
Big Data for Library Services (2017)
 
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
 
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at ScaleFull Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?
 
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data Commons
 
Converged IT and Data Commons
Converged IT and Data CommonsConverged IT and Data Commons
Converged IT and Data Commons
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Rscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsRscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libs
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 
NFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIRNFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIR
 
Open Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon HodsonOpen Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon Hodson
 
Thoughts on Knowledge Graphs & Deeper Provenance
Thoughts on Knowledge Graphs  & Deeper ProvenanceThoughts on Knowledge Graphs  & Deeper Provenance
Thoughts on Knowledge Graphs & Deeper Provenance
 
Responsible conduct of research: Data Management
Responsible conduct of research: Data ManagementResponsible conduct of research: Data Management
Responsible conduct of research: Data Management
 
Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"
 
NCME Big Data in Education
NCME Big Data  in EducationNCME Big Data  in Education
NCME Big Data in Education
 

Mehr von Anita de Waard

The habits of highly successful data:
The habits of highly successful data: The habits of highly successful data:
The habits of highly successful data:
Anita de Waard
 

Mehr von Anita de Waard (20)

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data Commons
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring Guidelines
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
 
History of the future
History of the futureHistory of the future
History of the future
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of Publishing
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly Publishing
 
RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest Group
 
The Rocky Road to Reuse
The Rocky Road to ReuseThe Rocky Road to Reuse
The Rocky Road to Reuse
 
Argumentation in biology papers
Argumentation in biology papersArgumentation in biology papers
Argumentation in biology papers
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
 
Ten Habits of Highly Effective Data
Ten Habits of Highly Effective DataTen Habits of Highly Effective Data
Ten Habits of Highly Effective Data
 
Ten Habits of Highly Successful Data
Ten Habits of Highly Successful DataTen Habits of Highly Successful Data
Ten Habits of Highly Successful Data
 
How to persuade with data
How to persuade with dataHow to persuade with data
How to persuade with data
 
Ten habits of highly effective data
Ten habits of highly effective dataTen habits of highly effective data
Ten habits of highly effective data
 
The habits of highly successful data:
The habits of highly successful data: The habits of highly successful data:
The habits of highly successful data:
 

Kürzlich hochgeladen

Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
gindu3009
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Lokesh Kothari
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Sérgio Sacani
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
LeenakshiTyagi
 

Kürzlich hochgeladen (20)

PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 

Real-World Data Challenges: Moving Towards Richer Data Ecosystems

  • 1. | 1 Anita de Waard 0000-0002-9034-4119 VP Research Data Collaborations Elsevier RDM Services a.dewaard@elsevier.com Big Data PI Meeting March 16, 2016 Real-World Data Challenges: Moving Towards Richer Data Ecosystems
  • 2. | 2 ESGF- VL ESGF ESG- CET ESG-II ESG-I Usable capabilities Future capabilities Prototype capabilities 1999-2001 2001-2006 2006-2011 2011-2020 2020- Planned Earth System Grid System Evolution Planned Earth System Grid System Data Archival Model Intercomparison Projects Remote Sensing, In Situ, Climatology, Diagnostics, Ecosystem, Hydrology, Biology, Etc. Petabytes (1015) Exabytes (1018) 1999 20222017 Centralized Archive Distributed Data Ecosystem Virtual Laboratory Source: Dean Williams, Lawrence Livermore/ESGF, March 1st 2017 Trend # 1: Repositories are becoming virtual labs
  • 3. | 3 Trend # 2: Scientists are Moving ‘Beyond Downloads’
  • 4. | 4 Trend # 3: Computers are scientists, too! “intelligent systems for computer-aided discovery can complement and integrate into the insight generation loop in scalable ways…” http://ieeexplore.ieee.org/abstract/document/7515118/: Computer-Aided Discovery: Toward Scientific Insight Generation with Machine Support “This work combines time series Principal Component Analysis with InSAR to constrain the space of possible model explanations on current empirical data sets and achieve a better identification of deformation patterns”
  • 5. | 5 Raising many technical/organisational/policy questions: • Is Long-Tail Data + Semantics = Big Data? • Is Data Science a field, or a skill? (A department, or a class?) • Are supercomputing centers research departments or bits of infrastructure? (And if infrastructure, are they part of IT? (“Oh, no, anything but that!”) • Are repositories places to store outputs, or places where science is conducted? • If so, how are repositories and HPC’s recognised and rewarded? • How can we keep track of (micro)provenance of parts of data sets? • Should we explore Blockchain technology for this? (“Oh no, anything but that!”) • Is a piece of software part of the University’s Research Outputs? • If so, how do we reward brilliant coders who blog, but don’t write? • How do we reward (virtual) collaboration? • Why won’t those damn scientists share their data? • Who will own the Data Science Cloud: Amazon? Or the joint HPC’s (NDS??) Is NIH Data Commons the Model? Or is this a free for all? What is the role of commercial parties? • Is data curation/stewardship a part of science, or a glorified administrator's job? • What is the role of libraries, in all this? • And why the hell is a publisher talking about it?
  • 6. | 6 6 Inst. Data Repositorie(s) Lab ELN(s) Data Journal Data search Link to article Journal Find Topic Identify gaps Plan & Fund Discover data, people, methods & protocols Collect, analyze & vizualize Store, preserve & share Publish Prepare, reproduce, re-use & benchmark Domain-specific Repositories General search Faculty LIMS Data center Inst. Data Repositorie(s) Lab ELN(s) Data Journal Data search Data Management Plans Metadata, methods & protocols ready for preservation and publishing Link to article Journal Publish data (under embargo) Secure discoverability in & outside the institution Plan each step from experiment to publish Domain-specific Repositories General search What Elsevier is Interested in: Supporting RDM Networks
  • 7. | 7 Biological Pathways extracted via semantic text mining A upregulates B B upregulates C C increases disease D Normalizing vocabularies required: proteins, diseases, drugs, chemicals A  B  C  D Bioactivities through text analysis IC50 6.3nM, kinase binding assay 10mM concentration Chemical Structures And Properties InChi, Name NCBI, Uniprot EMTREE ReaxysTree, Structures What Elsevier is Interested in: Knowledge Graphs in Life Science
  • 8. | 8 What Elsevier is Interested in: Knowledgegraphs in Research
  • 9. | 9 Thank you! Links to things we’re involved with: • https://www.elsevier.com/connect/10-aspects-of-highly-effective-research-data • https://www.elsevier.com/about/open-science/research-data • https://www.hivebench.com • https://data.mendeley.com/ • https://datasearch.elsevier.com/ • https://www.elsevier.com/books-and-journals/content-innovation/data-base- linking • http://www.journals.elsevier.com/softwarex/ • https://www.elsevier.com/physical-sciences/earth-and-planetary-sciences/the- 2015-international-data-rescue-award-in-the-geosciences • https://rd-alliance.org/groups/rdawds-publishing-data-services-wg.html • https://www.force11.org/ • http://www.nationaldataservice.org/ • https://rd-alliance.org/ Anita de Waard, a.dewaard@elsevier.com

Hinweis der Redaktion

  1. Outline: Some Trends Some Questions What Elsevier is interested in, and doing
  2. Example – your eln being able to publish protocols directly - easing the resaerchers burden