SlideShare ist ein Scribd-Unternehmen logo
1 von 38
Downloaden Sie, um offline zu lesen
Wolfgang KuchinkeWolfgang Kuchinke
University Duesseldorf, Duesseldorf, GermanyUniversity Duesseldorf, Duesseldorf, Germany
CORBEL ProjectCORBEL Project
W. Kuchinke (2018)
Repositories in an Open Data EcosystemRepositories in an Open Data Ecosystem
ECRIN – CORBEL WP 3.3 Working Group MeetingECRIN – CORBEL WP 3.3 Working Group Meeting
14. Jun 2018, Paris, France14. Jun 2018, Paris, France
2
W. Kuchinke (2018)
Open Data – Open
Science
Towards an ecosystem for Open Data and
Sensitive Data
3
W. Kuchinke (2018)
Open data is data that can be freely
used, shared and built-on by
anyone, anywhere, for any purpose.
Data sharing is the precondition for
the reproducibility of research
results.
Open Definition (http://opendefinition.org/okd/)
4
W. Kuchinke (2018)
For reproducibility and progress of research, data sharing is
critical. Providers of human data (e.g. publicly or privately
funded repositories and data archives) should fulfill their
social responsibility with data donors when their shareable
data conforms to the FAIR (findable, accessible,
interoperable, reusable) principles
FAIR data framework
5
W. Kuchinke (2018)
Research data, metadata and data management plans are part of Open
Research Data Management. Research data can contain a wide diversity of
collected information: text or numerical data, biosamples, images,
questionnaires, recorded videos, models, software, reports, workflows, etc.
All information about data type and format of the information needs to be
described. For this purpose, data need to be complemented by proper
metadata.
Metadata are essential to recover and reuse research data. Metadata
standards allow the interoperability across different systems, like repositories.
Metadata can be classified in 3 main types: descriptive, administrative, and
structural. Descriptive metadata serve to discovery and understand a data
source, and refers for example tothe title, author, publication date or abstract,
like, for example the Dublin Core Schema
The Importance of Metadata for
Open Research Data Management
6
W. Kuchinke (2018)
Conceptual representation of
the life cycle of data in biomedical data repositories (secure
storage of biomedical research and
healthcare-related data) from the moment of data generation,
through their utilization and transformation into useful
information, publication and finally their long-term archiving or
destruction
Data life cycle
7W. Kuchinke (2018)
Data Ecosystem
Repositories are the core components of an Open
Data Ecosystem
Many tools and data services support repositories
Different aspects
FAIR principles
Open data and clinical trials data should be stored
together
Cloud storage should be enabled
Analysis tools should be provided
Different data repositories should be connected to
each other
8
W. Kuchinke (2018)
FAIR data framework
Fig. Modified from: Dep. Med Inf. UMG, Groningen
Data Sources Data
Integration
and Data
Curation
Data
Storage
Data Usage
ePatient Record
Clinical Trial Data
Registry Data
Patient Reported Outcome
Sensor Data
Biomaterial Data
eLab Data
Lifestyle Data, Weather,
Medication, Social Media
Transform
Ontology match
Linkage
Data
Warehousing
Data Marts
Data management
and Analysis
Open Data
Data query
Data visualisation
Data analysis
Collaboration
Therapy Board
Data transfer
Data sharing
Publication
Data Governance
Persistent
Identifiers
Privacy Metadata harvesting Data Dictionary
Consent
Management
Identity Management Anonymisation Pseudonymisation
Data Annotation
9
W. Kuchinke (2018)
Move Data Governance to Data Generation step in the
data life cycle
Data Sources Data
Integration
and Data
Curation
Data
Storage
Data Usage
ePatient Record
Clinical Trial Data
Registry Data
Patient Reported Outcome
Sensor Data
Biomaterial Data
eLab Data
Lifestyle Data, Weather,
Medication, Social Media
Transform
Ontology match
Linkage
Data
Warehousing
Data Marts
Data management
and Analysis
Open Data
Data query
Data visualisation
Data analysis
Collaboration
Therapy Board
Data transfer
Data sharing
Publication
Data Governance
Persistent
Identifiers
Privacy Metadata harvesting Data Dictionary
Consent
Management
Identity Management Anonymisation Pseudonymisation
Data Annotation
Data
Governance
for privacy
protection
already at
the step of
data
generation
10
W. Kuchinke (2018)
Persistent Identifiers, Metadata, Privacy protection
become part of data generation
Build-in Data governance and Privacy protection
Data Sources Data
Integration
and Data
Curation
Data
Storage
Data Usage
ePatient Record
Clinical Trial Data
Registry Data
Patient Reported Outcome
Sensor Data
Biomaterial Data
eLab Data
Lifestyle Data, Weather,
Medication, Social Media
Transform
Ontology match
Linkage
Data
Warehousing
Data Marts
Data management
and Analysis
Open Data
Data query
Data visualisation
Data analysis
Collaboration
Therapy Board
Data transfer
Data sharing
Publication
Data Governance
Persistent
Identifiers
Privacy protection
Metadata harvesting
Data Dictionary
Consent
Management
Identity Management
Anonymisation
Pseudonymisation
Data Annotation
11
W. Kuchinke (2018)
Components of the
Repository Data
Ecosystem
An ecosystem suitable even for
Sensitive Data
12
W. Kuchinke (2018)
The Comprehensive Knowledge Archive Network (CKAN) is an open-
source open data portal for the storage and distribution of open
data.
Aimed at data publishers who make their data open and available. The
system is used both as a public platform on Datahub and in various
government data catalogues (e.g. UK's data.gov.uk, Dutch National
Data Register, the United States government's Data.gov and the
Australian government's Gov 2.0).
https://ckan.org/
What is CKAN?
13W. Kuchinke (2018)
CKAN
Open-source data portal platform
Developed by the OKFN (Open Knowledge
Foundation)
It is a complete out-of-the-box software solution
Tools to streamline publishing, sharing, finding and
using data
CKAN includes a web interface and the CKAN
Action API
Visualizations for structured data resources (such
as CSV files)
14W. Kuchinke (2018)
CKAN and reusability of healthcare data
Catalog and metadata saved in CKAN can be harvested based on
the OAI-PMH
Through the CKAN cloud environment, wearable and stationary
sensor data stored in individual CKANs can be integrated
Analysis and integration of clinical data of users based on diagnostic
data saved on the CKAN-based cloud
Prediction of situations, events, and incidents
15
W. Kuchinke (2018)
The Hyve is a company that provides professional IT services for open
source biomedical informatics solutions, to enhance the quality and
impact of research by enabling scientists in life sciences and healthcare
research to properly use open source software, open data and open
standards.
https://thehyve.nl/
What is the Hyve?
16W. Kuchinke (2018)
Tools developed by the Hyve
Portfolio of open source tools and products that facilitate FAIR
research data
FAIR Research Data Management in academic hospitals
Research Data Marts
I2b2 / tranSMART and cBioPortal (for oncology-focus medical
centers)
a robust research data warehouse can be established, which
exposes a unified patient-centric view of clinical and molecular
data for research & analysis
17
W. Kuchinke (2018)
tranSMART is an open-source data warehouse designed to store large
amounts of clinical data from clinical trials, as well as data from basic
research. In tranSMART data can be examined for translational
research purposes. tranSMART is built on top of the i2b2 platform, a
clinical data warehouse employing the i2b2 star model. Each of the data
types (e.g., gene expression, SNP or metabolomics) retain its specific
data structure.
What is tranSMART?
18W. Kuchinke (2018)
tranSMART data warehouse
Designed for use in individual clinical studies with hundreds or
thousands participants in which maybe tens of thousands
observations were gathered
tranSMART is also being adopted by hospitals and large population
studies
Large population study is the Netherlands Twin Register (NTR)
adding indexes, creating partitions, addition of bit strings, Saving
subject sets for single and combined queries, Splitting a query
19W. Kuchinke (2018)
Glowing Bear: the new tranSMART UI
Sponsored by Pfizer, Sanofi, Abbvie and Roche
Cross-study and ontology term support 
Support for time series and longitudinal data 
Possibility of saving queries and re-executing them later
Cohort builder
20
W. Kuchinke (2018)
The Dataverse is an open source web application to share, preserve,
cite, explore and analyze research data. Researchers, authors,
publishers, data distributors, and their institutions receive appropriate
credit via a data citation mechanism including a persistent identifier
(e.g., DOI, or Handle). A Dataverse repository hosts multiple
dataverses; each dataverse contains dataset(s) or other dataverses,
and each dataset contains descriptive metadata together with the data.
https://dataverse.org/
What is Dataverse?
21W. Kuchinke (2018)
Dataverse
Open source web application for sharing, citing, analyzing, and
preserving research data
Developed by the Data Science team at the Institute for Quantitative
Social Science
Dataverse code is open-source and free
Supports DataCite and other citation standards, such as ORCID
Creates a Digital Object Identifier (DOI) upon deposit
22W. Kuchinke (2018)
Dataverse repositories
Harvard Dataverse Network hosts the world's largest collection of
social science research
A Dataverse repository is the software installation, which hosts
multiple dataverses
Each dataverse contains datasets, and each dataset contains
descriptive metadata and data files
Dataverses may contain other dataverses
23W. Kuchinke (2018)
Dataverse datasets
A dataset in Dataverse is a container for data, documentation, code,
and the corresponding metadata which describe the dataset
From: http://guides.dataverse.org/en/latest/user/dataset-management.html
24W. Kuchinke (2018)
Dataverse and Cloud Storage
Dataverse installations can be configured to facilitate cloud-based
storage and computing
Default configuration for Dataverse uses a local file system for
storing data
Cloud-enabled Dataverse installation can use a Swift object storage
database for its data
This allows users to perform computations on data using an
integrated cloud environment
25W. Kuchinke (2018)
Example: DataverseNL
Service for archiving and publishing research data on several levels
faculties, institutions, research groups, projects within Dutch
universities
Possibility to store and share online a large variety of scientific data,
independent of file format, in a secure way
Not suitable for storing (privacy) sensitive data
PSI (Ψ): A Private data Sharing Interface
Privacy Tools Research Group (Harvard)
26W. Kuchinke (2018)
figshare
figshare helps academic institutions store, share and manage all of
their research output
Integrate into your CRIS/RIMS, institutional repository and archiving
solution
All research on figshare can be pushed to any institutional repository
Control how content is shared internally and publically
figshare is hosted on Amazon Web Services but we can also
integrate with centralized cloud
27W. Kuchinke (2018)
●figshare for academic institutions
figshare helps academic institutions store, share and manage their
research output
Integrates into institution’s CRIS/RIMS, the institutional repository
and archiving solutions
All research on figshare can be pushed to any institutional repository
Control how content is shared internally and publicly
figshare is hosted on Amazon Web Services but can also integrate
with a centralized cloud
28W. Kuchinke (2018)
Example: University of Sheffield
Custom portal to manage research data
29W. Kuchinke (2018)
Example: University of Salford / Manchester
Custom portal of figshare
30W. Kuchinke (2018)
OSF (Open Science Framework)
Cloud-based management of projects
View all projects from one dashboard.
Quickly share files
Share key project information and allow others to use and cite it.
See project changes
View project analytics
Archive data
31
W. Kuchinke (2018)
Analysis of the
repository ecosystem
components
32W. Kuchinke (2018)
●Role of Repositories in the Data Ecosystem
A multitude of services and tools to support research data
repositories
Different types of repositories are connected and supplement each
other in the storage, release and sharing of data with different
degree of protection and ownership
Tools to analyze, browse and visualize data should be integrated
Real World Data must be smoothly integrated into the research data
cycle
Data governance and data privacy protection play an important
role
New and efficient tools for anonymisation and data obfuscation
are necessary
33W. Kuchinke (2018)
Overview
Research Data Sharing and Storage Services
A multitude of services and tools support research data repositories to form
an open data ecosystem
Modified from: Instituuts Data Management Plannen, Groningen
During research
During research
After research
After research
BeeHub
B2SAFE
SurfDrive
Local ICT
Services
CLARIN INL
DANS
4TU. Centre for
ResearchData
Zenodo
B2SHARE
figshare
SURF
addgene
Brainmap.org
NeuroVault
OpenMRI
MycoBank
Language
Archive
DataFirst
Dataverse NL
CancerData
DRYAD
Connectome
SeaDataNet
nesstar
TalkBank
OpenML
OpenClinica
Curate
Science
EVIDENCIO
OSF
CRCNS
Dataverse
BeeHub
RUG GeoData
InstitutionDisciplin
34
W. Kuchinke (2018)
Real world evidence (RWE) in medicine means evidence obtained
from real world data (RWD), which is data obtained outside the
context of randomized controlled trials (RCTs); it is generated during
the routine clinical practice. Real world data is stored in Electronic
Health Records (EHR), medical claims or billing activities databases,
registries, patient-generated data, mobile devices, etc. In addition, it
may be derived from retrospective or prospective observational studies
and observational registries.
The necessity for RWD is based on the fact that clinical trials cannot
account for the entire patient population of a particular disease. Patients
suffering from comorbidities or belonging to a special geographic
region, have genetic variations or high age do not in general participate
in any clinical trials.
What is Real World Evidence?
35
W. Kuchinke (2018)
The management of human health and diseases, including
policy and decision making and the development of efficient
healthcare systems demand support by efficient and
rigorous evidence-based investigation and evaluation of
research results. Data are therefore central to further
improvements in public health, primary and hospital care, and
especially for the advancement of personalized medicine.
Relevant data should be collected as part of the usual
healthcare, from routine administrative sources and research
studies. Data governance and data privacy protection
should begin as early as possible, ideally during data
generation.
Rigorous evidence-based investigation
36W. Kuchinke (2018)
Dealing with Real World Clinical Data
Real World Clinical Data play an important role for research
For Patient Reported Outcomes and for Sensor Data
open source RADAR stack (RADAR-CNS project)
RADAR-base Management Portal is a one-stop shop for
managing remote patient monitoring studies
The RADAR Android apps to directly exchange data with the
patients and other care providers
Kafka-based stack: message transport system
European health data networks
Observational Health Data Sciences and Informatics, Observational
Medical Outcomes Partnership (OMOP)
37W. Kuchinke (2018)
Results of Ecosystem Analysis
It doesn‘t matter where one stores data
Everything is connected
Institutional repositories (dataverses), data marts, general
repositories, domain specific repositories, figshare for data sharing
An ecosystem for open data management
Covers complete data life cycle
Complete projects are supported
FAIR data as basis
tranSMART as integration hub for analysis
Integration of data governance and privacy protection at the stage of
data generation
But can sensitive data really be integrated?
Not yet convincingly shown!
38W. Kuchinke (2018)
Contact
Wolfgang Kuchinke
Heinrich-Heine University Düsseldorf, Düsseldorf,
Germany
wolfgang.kuchinke@uni-duesseldorf.de
Presentation contains additional material for explanation and workshop.

Weitere ähnliche Inhalte

Was ist angesagt?

Research data management : Open Research Data pilot, data management (plans),...
Research data management : Open Research Data pilot, data management (plans),...Research data management : Open Research Data pilot, data management (plans),...
Research data management : Open Research Data pilot, data management (plans),...Leon Osinski
 
Big data service architecture: a survey
Big data service architecture: a surveyBig data service architecture: a survey
Big data service architecture: a surveyssuser0191d4
 
EDI Training Module 12: Learn to Cite and Link Your Data
EDI Training Module 12:  Learn to Cite and Link Your DataEDI Training Module 12:  Learn to Cite and Link Your Data
EDI Training Module 12: Learn to Cite and Link Your DataEnvironmental Data Initiative
 
dkNET Poster Experimental Biology 2019
dkNET Poster Experimental Biology 2019dkNET Poster Experimental Biology 2019
dkNET Poster Experimental Biology 2019dkNET
 
Who will use the open data? Mark Humphries keynote
Who will use the open data? Mark Humphries keynoteWho will use the open data? Mark Humphries keynote
Who will use the open data? Mark Humphries keynoteJisc RDM
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataTom Plasterer
 
ElN - repository integration at the University of Goettingen
ElN - repository integration at the University of GoettingenElN - repository integration at the University of Goettingen
ElN - repository integration at the University of Goettingenrmacneil88
 
Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019Kees van Bochove
 
Linked Data for Biopharma
Linked Data for BiopharmaLinked Data for Biopharma
Linked Data for BiopharmaTom Plasterer
 
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...Tom Plasterer
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE
 
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021dkNET
 
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...Tom Plasterer
 
Preparing your data for sharing and publishing
Preparing your data for sharing and publishingPreparing your data for sharing and publishing
Preparing your data for sharing and publishingVarsha Khodiyar
 
DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE
 
Research Data Management, Open Data and Zenodo - 6th National Open Access Con...
Research Data Management, Open Data and Zenodo - 6th National Open Access Con...Research Data Management, Open Data and Zenodo - 6th National Open Access Con...
Research Data Management, Open Data and Zenodo - 6th National Open Access Con...Pedro Príncipe
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordJisc
 

Was ist angesagt? (20)

Research data management : Open Research Data pilot, data management (plans),...
Research data management : Open Research Data pilot, data management (plans),...Research data management : Open Research Data pilot, data management (plans),...
Research data management : Open Research Data pilot, data management (plans),...
 
Big data service architecture: a survey
Big data service architecture: a surveyBig data service architecture: a survey
Big data service architecture: a survey
 
EDI Training Module 12: Learn to Cite and Link Your Data
EDI Training Module 12:  Learn to Cite and Link Your DataEDI Training Module 12:  Learn to Cite and Link Your Data
EDI Training Module 12: Learn to Cite and Link Your Data
 
dkNET Poster Experimental Biology 2019
dkNET Poster Experimental Biology 2019dkNET Poster Experimental Biology 2019
dkNET Poster Experimental Biology 2019
 
Who will use the open data? Mark Humphries keynote
Who will use the open data? Mark Humphries keynoteWho will use the open data? Mark Humphries keynote
Who will use the open data? Mark Humphries keynote
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* Data
 
ElN - repository integration at the University of Goettingen
ElN - repository integration at the University of GoettingenElN - repository integration at the University of Goettingen
ElN - repository integration at the University of Goettingen
 
Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019
 
Open Science: What, why, how?
Open Science: What, why, how? Open Science: What, why, how?
Open Science: What, why, how?
 
Linked Data for Biopharma
Linked Data for BiopharmaLinked Data for Biopharma
Linked Data for Biopharma
 
How to elaborate a data management plan
How to elaborate a data management planHow to elaborate a data management plan
How to elaborate a data management plan
 
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data Sharing
 
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021
 
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
 
Preparing your data for sharing and publishing
Preparing your data for sharing and publishingPreparing your data for sharing and publishing
Preparing your data for sharing and publishing
 
DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?
 
Research Data Management, Open Data and Zenodo - 6th National Open Access Con...
Research Data Management, Open Data and Zenodo - 6th National Open Access Con...Research Data Management, Open Data and Zenodo - 6th National Open Access Con...
Research Data Management, Open Data and Zenodo - 6th National Open Access Con...
 
Enabling simultaneous analysis of multiple cohort studies: A BRISSKit use case
Enabling simultaneous analysis of multiple cohort studies: A BRISSKit use caseEnabling simultaneous analysis of multiple cohort studies: A BRISSKit use case
Enabling simultaneous analysis of multiple cohort studies: A BRISSKit use case
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published record
 

Ähnlich wie Repositories in an Open Data Ecosystem

Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Dag Endresen
 
Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...African Open Science Platform
 
Museum collections as research data - October 2019
Museum collections as research data - October 2019Museum collections as research data - October 2019
Museum collections as research data - October 2019Dag Endresen
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonAfrican Open Science Platform
 
British Library Datasets Programme 2010
British Library Datasets Programme 2010British Library Datasets Programme 2010
British Library Datasets Programme 2010ALISS
 
Winning Horizon 2020 with Open Science
Winning Horizon 2020 with Open ScienceWinning Horizon 2020 with Open Science
Winning Horizon 2020 with Open ScienceMartin Donnelly
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...Robert Grossman
 
Gobinda Chowdhury
Gobinda ChowdhuryGobinda Chowdhury
Gobinda Chowdhurymaredata
 
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The HyveOpen Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The HyveKees van Bochove
 
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Kees van Bochove
 
Ross Wilkinson - Data Publication: Australian and Global Policy Developments
Ross Wilkinson - Data Publication: Australian and Global Policy DevelopmentsRoss Wilkinson - Data Publication: Australian and Global Policy Developments
Ross Wilkinson - Data Publication: Australian and Global Policy DevelopmentsWiley
 
CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...CINECAProject
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)Dag Endresen
 
2011-12-02 Open PHACTS at STM Innovation
2011-12-02 Open PHACTS at STM Innovation2011-12-02 Open PHACTS at STM Innovation
2011-12-02 Open PHACTS at STM Innovationopen_phacts
 
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Kees van Bochove
 
From Deadly E. coli to Endangered Polar Bear: GigaScience Provides First Cita...
From Deadly E. coli to Endangered Polar Bear: GigaScience Provides First Cita...From Deadly E. coli to Endangered Polar Bear: GigaScience Provides First Cita...
From Deadly E. coli to Endangered Polar Bear: GigaScience Provides First Cita...GigaScience, BGI Hong Kong
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeLizLyon
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...Dr. Haxel Consult
 

Ähnlich wie Repositories in an Open Data Ecosystem (20)

Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Open science curriculum for students, June 2019
Open science curriculum for students, June 2019
 
Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...
 
Museum collections as research data - October 2019
Museum collections as research data - October 2019Museum collections as research data - October 2019
Museum collections as research data - October 2019
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon Hodson
 
British Library Datasets Programme 2010
British Library Datasets Programme 2010British Library Datasets Programme 2010
British Library Datasets Programme 2010
 
Winning Horizon 2020 with Open Science
Winning Horizon 2020 with Open ScienceWinning Horizon 2020 with Open Science
Winning Horizon 2020 with Open Science
 
Simon hodson
Simon hodsonSimon hodson
Simon hodson
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
 
Gobinda Chowdhury
Gobinda ChowdhuryGobinda Chowdhury
Gobinda Chowdhury
 
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The HyveOpen Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
 
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
 
Ross Wilkinson - Data Publication: Australian and Global Policy Developments
Ross Wilkinson - Data Publication: Australian and Global Policy DevelopmentsRoss Wilkinson - Data Publication: Australian and Global Policy Developments
Ross Wilkinson - Data Publication: Australian and Global Policy Developments
 
CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)
 
2011-12-02 Open PHACTS at STM Innovation
2011-12-02 Open PHACTS at STM Innovation2011-12-02 Open PHACTS at STM Innovation
2011-12-02 Open PHACTS at STM Innovation
 
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
 
From Deadly E. coli to Endangered Polar Bear: GigaScience Provides First Cita...
From Deadly E. coli to Endangered Polar Bear: GigaScience Provides First Cita...From Deadly E. coli to Endangered Polar Bear: GigaScience Provides First Cita...
From Deadly E. coli to Endangered Polar Bear: GigaScience Provides First Cita...
 
Nicole Nogoy at the Auckland BMC RoadShow
Nicole Nogoy at the Auckland BMC RoadShowNicole Nogoy at the Auckland BMC RoadShow
Nicole Nogoy at the Auckland BMC RoadShow
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decade
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
 

Mehr von Wolfgang Kuchinke

Temporal relations in queries of ehr data for research
Temporal relations in queries of ehr data for researchTemporal relations in queries of ehr data for research
Temporal relations in queries of ehr data for researchWolfgang Kuchinke
 
Secure access to biomedical data sources for legal data sharing-kuchinke
Secure access to biomedical data sources for legal data sharing-kuchinkeSecure access to biomedical data sources for legal data sharing-kuchinke
Secure access to biomedical data sources for legal data sharing-kuchinkeWolfgang Kuchinke
 
Computer System Validation with privacy zones, e-source and clinical trials b...
Computer System Validation with privacy zones, e-source and clinical trials b...Computer System Validation with privacy zones, e-source and clinical trials b...
Computer System Validation with privacy zones, e-source and clinical trials b...Wolfgang Kuchinke
 
Personalized medicine tools for clinical trials - kuchinke
Personalized medicine tools for clinical trials - kuchinkePersonalized medicine tools for clinical trials - kuchinke
Personalized medicine tools for clinical trials - kuchinkeWolfgang Kuchinke
 
Computer System Validation - privacy zones, eSource and EHR data in clinical ...
Computer System Validation - privacy zones, eSource and EHR data in clinical ...Computer System Validation - privacy zones, eSource and EHR data in clinical ...
Computer System Validation - privacy zones, eSource and EHR data in clinical ...Wolfgang Kuchinke
 
Use of personalized medicine tools for clinical research networks
Use of personalized medicine tools for clinical research networksUse of personalized medicine tools for clinical research networks
Use of personalized medicine tools for clinical research networksWolfgang Kuchinke
 
Kuchinke - Learning Health System (LHS) in Europe - Introduction and meeting ...
Kuchinke - Learning Health System (LHS) in Europe - Introduction and meeting ...Kuchinke - Learning Health System (LHS) in Europe - Introduction and meeting ...
Kuchinke - Learning Health System (LHS) in Europe - Introduction and meeting ...Wolfgang Kuchinke
 
Regulations, privacy, security for data bridges - Kuchinke
Regulations, privacy, security for data bridges - KuchinkeRegulations, privacy, security for data bridges - Kuchinke
Regulations, privacy, security for data bridges - KuchinkeWolfgang Kuchinke
 
Agile Computer System Validation of software products
Agile Computer System Validation of software productsAgile Computer System Validation of software products
Agile Computer System Validation of software productsWolfgang Kuchinke
 
Reverse Engineering of Clinical Trials to Improve Research
Reverse Engineering of Clinical Trials to Improve ResearchReverse Engineering of Clinical Trials to Improve Research
Reverse Engineering of Clinical Trials to Improve ResearchWolfgang Kuchinke
 
Introduction to CTIM - the Clinical Trial Information Mediator
Introduction to CTIM - the Clinical Trial Information MediatorIntroduction to CTIM - the Clinical Trial Information Mediator
Introduction to CTIM - the Clinical Trial Information MediatorWolfgang Kuchinke
 
Standard based Electronic Archiving for Clinical Trials
Standard based Electronic Archiving for Clinical TrialsStandard based Electronic Archiving for Clinical Trials
Standard based Electronic Archiving for Clinical TrialsWolfgang Kuchinke
 
Increased Ethical Demands for Patient Empowerment in Personalised Medicine
Increased Ethical Demands for Patient Empowerment in Personalised MedicineIncreased Ethical Demands for Patient Empowerment in Personalised Medicine
Increased Ethical Demands for Patient Empowerment in Personalised MedicineWolfgang Kuchinke
 
Ethical concerns caused by integrative patient empowerment services
Ethical concerns caused by integrative patient empowerment servicesEthical concerns caused by integrative patient empowerment services
Ethical concerns caused by integrative patient empowerment servicesWolfgang Kuchinke
 
Regulations, privacy and security requirements - Legal interoperability for d...
Regulations, privacy and security requirements - Legal interoperability for d...Regulations, privacy and security requirements - Legal interoperability for d...
Regulations, privacy and security requirements - Legal interoperability for d...Wolfgang Kuchinke
 
Service Integration for Research Infrastructures by Reciprocal Usage
Service Integration for Research Infrastructures by Reciprocal UsageService Integration for Research Infrastructures by Reciprocal Usage
Service Integration for Research Infrastructures by Reciprocal UsageWolfgang Kuchinke
 
Zone model for data privacy and confidentiality in medical research
Zone model for data privacy and confidentiality in medical researchZone model for data privacy and confidentiality in medical research
Zone model for data privacy and confidentiality in medical researchWolfgang Kuchinke
 
CDISC Use Case Workshop Archiving of Studies
CDISC Use Case Workshop Archiving of StudiesCDISC Use Case Workshop Archiving of Studies
CDISC Use Case Workshop Archiving of StudiesWolfgang Kuchinke
 

Mehr von Wolfgang Kuchinke (18)

Temporal relations in queries of ehr data for research
Temporal relations in queries of ehr data for researchTemporal relations in queries of ehr data for research
Temporal relations in queries of ehr data for research
 
Secure access to biomedical data sources for legal data sharing-kuchinke
Secure access to biomedical data sources for legal data sharing-kuchinkeSecure access to biomedical data sources for legal data sharing-kuchinke
Secure access to biomedical data sources for legal data sharing-kuchinke
 
Computer System Validation with privacy zones, e-source and clinical trials b...
Computer System Validation with privacy zones, e-source and clinical trials b...Computer System Validation with privacy zones, e-source and clinical trials b...
Computer System Validation with privacy zones, e-source and clinical trials b...
 
Personalized medicine tools for clinical trials - kuchinke
Personalized medicine tools for clinical trials - kuchinkePersonalized medicine tools for clinical trials - kuchinke
Personalized medicine tools for clinical trials - kuchinke
 
Computer System Validation - privacy zones, eSource and EHR data in clinical ...
Computer System Validation - privacy zones, eSource and EHR data in clinical ...Computer System Validation - privacy zones, eSource and EHR data in clinical ...
Computer System Validation - privacy zones, eSource and EHR data in clinical ...
 
Use of personalized medicine tools for clinical research networks
Use of personalized medicine tools for clinical research networksUse of personalized medicine tools for clinical research networks
Use of personalized medicine tools for clinical research networks
 
Kuchinke - Learning Health System (LHS) in Europe - Introduction and meeting ...
Kuchinke - Learning Health System (LHS) in Europe - Introduction and meeting ...Kuchinke - Learning Health System (LHS) in Europe - Introduction and meeting ...
Kuchinke - Learning Health System (LHS) in Europe - Introduction and meeting ...
 
Regulations, privacy, security for data bridges - Kuchinke
Regulations, privacy, security for data bridges - KuchinkeRegulations, privacy, security for data bridges - Kuchinke
Regulations, privacy, security for data bridges - Kuchinke
 
Agile Computer System Validation of software products
Agile Computer System Validation of software productsAgile Computer System Validation of software products
Agile Computer System Validation of software products
 
Reverse Engineering of Clinical Trials to Improve Research
Reverse Engineering of Clinical Trials to Improve ResearchReverse Engineering of Clinical Trials to Improve Research
Reverse Engineering of Clinical Trials to Improve Research
 
Introduction to CTIM - the Clinical Trial Information Mediator
Introduction to CTIM - the Clinical Trial Information MediatorIntroduction to CTIM - the Clinical Trial Information Mediator
Introduction to CTIM - the Clinical Trial Information Mediator
 
Standard based Electronic Archiving for Clinical Trials
Standard based Electronic Archiving for Clinical TrialsStandard based Electronic Archiving for Clinical Trials
Standard based Electronic Archiving for Clinical Trials
 
Increased Ethical Demands for Patient Empowerment in Personalised Medicine
Increased Ethical Demands for Patient Empowerment in Personalised MedicineIncreased Ethical Demands for Patient Empowerment in Personalised Medicine
Increased Ethical Demands for Patient Empowerment in Personalised Medicine
 
Ethical concerns caused by integrative patient empowerment services
Ethical concerns caused by integrative patient empowerment servicesEthical concerns caused by integrative patient empowerment services
Ethical concerns caused by integrative patient empowerment services
 
Regulations, privacy and security requirements - Legal interoperability for d...
Regulations, privacy and security requirements - Legal interoperability for d...Regulations, privacy and security requirements - Legal interoperability for d...
Regulations, privacy and security requirements - Legal interoperability for d...
 
Service Integration for Research Infrastructures by Reciprocal Usage
Service Integration for Research Infrastructures by Reciprocal UsageService Integration for Research Infrastructures by Reciprocal Usage
Service Integration for Research Infrastructures by Reciprocal Usage
 
Zone model for data privacy and confidentiality in medical research
Zone model for data privacy and confidentiality in medical researchZone model for data privacy and confidentiality in medical research
Zone model for data privacy and confidentiality in medical research
 
CDISC Use Case Workshop Archiving of Studies
CDISC Use Case Workshop Archiving of StudiesCDISC Use Case Workshop Archiving of Studies
CDISC Use Case Workshop Archiving of Studies
 

Kürzlich hochgeladen

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 

Kürzlich hochgeladen (20)

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 

Repositories in an Open Data Ecosystem

  • 1. Wolfgang KuchinkeWolfgang Kuchinke University Duesseldorf, Duesseldorf, GermanyUniversity Duesseldorf, Duesseldorf, Germany CORBEL ProjectCORBEL Project W. Kuchinke (2018) Repositories in an Open Data EcosystemRepositories in an Open Data Ecosystem ECRIN – CORBEL WP 3.3 Working Group MeetingECRIN – CORBEL WP 3.3 Working Group Meeting 14. Jun 2018, Paris, France14. Jun 2018, Paris, France
  • 2. 2 W. Kuchinke (2018) Open Data – Open Science Towards an ecosystem for Open Data and Sensitive Data
  • 3. 3 W. Kuchinke (2018) Open data is data that can be freely used, shared and built-on by anyone, anywhere, for any purpose. Data sharing is the precondition for the reproducibility of research results. Open Definition (http://opendefinition.org/okd/)
  • 4. 4 W. Kuchinke (2018) For reproducibility and progress of research, data sharing is critical. Providers of human data (e.g. publicly or privately funded repositories and data archives) should fulfill their social responsibility with data donors when their shareable data conforms to the FAIR (findable, accessible, interoperable, reusable) principles FAIR data framework
  • 5. 5 W. Kuchinke (2018) Research data, metadata and data management plans are part of Open Research Data Management. Research data can contain a wide diversity of collected information: text or numerical data, biosamples, images, questionnaires, recorded videos, models, software, reports, workflows, etc. All information about data type and format of the information needs to be described. For this purpose, data need to be complemented by proper metadata. Metadata are essential to recover and reuse research data. Metadata standards allow the interoperability across different systems, like repositories. Metadata can be classified in 3 main types: descriptive, administrative, and structural. Descriptive metadata serve to discovery and understand a data source, and refers for example tothe title, author, publication date or abstract, like, for example the Dublin Core Schema The Importance of Metadata for Open Research Data Management
  • 6. 6 W. Kuchinke (2018) Conceptual representation of the life cycle of data in biomedical data repositories (secure storage of biomedical research and healthcare-related data) from the moment of data generation, through their utilization and transformation into useful information, publication and finally their long-term archiving or destruction Data life cycle
  • 7. 7W. Kuchinke (2018) Data Ecosystem Repositories are the core components of an Open Data Ecosystem Many tools and data services support repositories Different aspects FAIR principles Open data and clinical trials data should be stored together Cloud storage should be enabled Analysis tools should be provided Different data repositories should be connected to each other
  • 8. 8 W. Kuchinke (2018) FAIR data framework Fig. Modified from: Dep. Med Inf. UMG, Groningen Data Sources Data Integration and Data Curation Data Storage Data Usage ePatient Record Clinical Trial Data Registry Data Patient Reported Outcome Sensor Data Biomaterial Data eLab Data Lifestyle Data, Weather, Medication, Social Media Transform Ontology match Linkage Data Warehousing Data Marts Data management and Analysis Open Data Data query Data visualisation Data analysis Collaboration Therapy Board Data transfer Data sharing Publication Data Governance Persistent Identifiers Privacy Metadata harvesting Data Dictionary Consent Management Identity Management Anonymisation Pseudonymisation Data Annotation
  • 9. 9 W. Kuchinke (2018) Move Data Governance to Data Generation step in the data life cycle Data Sources Data Integration and Data Curation Data Storage Data Usage ePatient Record Clinical Trial Data Registry Data Patient Reported Outcome Sensor Data Biomaterial Data eLab Data Lifestyle Data, Weather, Medication, Social Media Transform Ontology match Linkage Data Warehousing Data Marts Data management and Analysis Open Data Data query Data visualisation Data analysis Collaboration Therapy Board Data transfer Data sharing Publication Data Governance Persistent Identifiers Privacy Metadata harvesting Data Dictionary Consent Management Identity Management Anonymisation Pseudonymisation Data Annotation Data Governance for privacy protection already at the step of data generation
  • 10. 10 W. Kuchinke (2018) Persistent Identifiers, Metadata, Privacy protection become part of data generation Build-in Data governance and Privacy protection Data Sources Data Integration and Data Curation Data Storage Data Usage ePatient Record Clinical Trial Data Registry Data Patient Reported Outcome Sensor Data Biomaterial Data eLab Data Lifestyle Data, Weather, Medication, Social Media Transform Ontology match Linkage Data Warehousing Data Marts Data management and Analysis Open Data Data query Data visualisation Data analysis Collaboration Therapy Board Data transfer Data sharing Publication Data Governance Persistent Identifiers Privacy protection Metadata harvesting Data Dictionary Consent Management Identity Management Anonymisation Pseudonymisation Data Annotation
  • 11. 11 W. Kuchinke (2018) Components of the Repository Data Ecosystem An ecosystem suitable even for Sensitive Data
  • 12. 12 W. Kuchinke (2018) The Comprehensive Knowledge Archive Network (CKAN) is an open- source open data portal for the storage and distribution of open data. Aimed at data publishers who make their data open and available. The system is used both as a public platform on Datahub and in various government data catalogues (e.g. UK's data.gov.uk, Dutch National Data Register, the United States government's Data.gov and the Australian government's Gov 2.0). https://ckan.org/ What is CKAN?
  • 13. 13W. Kuchinke (2018) CKAN Open-source data portal platform Developed by the OKFN (Open Knowledge Foundation) It is a complete out-of-the-box software solution Tools to streamline publishing, sharing, finding and using data CKAN includes a web interface and the CKAN Action API Visualizations for structured data resources (such as CSV files)
  • 14. 14W. Kuchinke (2018) CKAN and reusability of healthcare data Catalog and metadata saved in CKAN can be harvested based on the OAI-PMH Through the CKAN cloud environment, wearable and stationary sensor data stored in individual CKANs can be integrated Analysis and integration of clinical data of users based on diagnostic data saved on the CKAN-based cloud Prediction of situations, events, and incidents
  • 15. 15 W. Kuchinke (2018) The Hyve is a company that provides professional IT services for open source biomedical informatics solutions, to enhance the quality and impact of research by enabling scientists in life sciences and healthcare research to properly use open source software, open data and open standards. https://thehyve.nl/ What is the Hyve?
  • 16. 16W. Kuchinke (2018) Tools developed by the Hyve Portfolio of open source tools and products that facilitate FAIR research data FAIR Research Data Management in academic hospitals Research Data Marts I2b2 / tranSMART and cBioPortal (for oncology-focus medical centers) a robust research data warehouse can be established, which exposes a unified patient-centric view of clinical and molecular data for research & analysis
  • 17. 17 W. Kuchinke (2018) tranSMART is an open-source data warehouse designed to store large amounts of clinical data from clinical trials, as well as data from basic research. In tranSMART data can be examined for translational research purposes. tranSMART is built on top of the i2b2 platform, a clinical data warehouse employing the i2b2 star model. Each of the data types (e.g., gene expression, SNP or metabolomics) retain its specific data structure. What is tranSMART?
  • 18. 18W. Kuchinke (2018) tranSMART data warehouse Designed for use in individual clinical studies with hundreds or thousands participants in which maybe tens of thousands observations were gathered tranSMART is also being adopted by hospitals and large population studies Large population study is the Netherlands Twin Register (NTR) adding indexes, creating partitions, addition of bit strings, Saving subject sets for single and combined queries, Splitting a query
  • 19. 19W. Kuchinke (2018) Glowing Bear: the new tranSMART UI Sponsored by Pfizer, Sanofi, Abbvie and Roche Cross-study and ontology term support  Support for time series and longitudinal data  Possibility of saving queries and re-executing them later Cohort builder
  • 20. 20 W. Kuchinke (2018) The Dataverse is an open source web application to share, preserve, cite, explore and analyze research data. Researchers, authors, publishers, data distributors, and their institutions receive appropriate credit via a data citation mechanism including a persistent identifier (e.g., DOI, or Handle). A Dataverse repository hosts multiple dataverses; each dataverse contains dataset(s) or other dataverses, and each dataset contains descriptive metadata together with the data. https://dataverse.org/ What is Dataverse?
  • 21. 21W. Kuchinke (2018) Dataverse Open source web application for sharing, citing, analyzing, and preserving research data Developed by the Data Science team at the Institute for Quantitative Social Science Dataverse code is open-source and free Supports DataCite and other citation standards, such as ORCID Creates a Digital Object Identifier (DOI) upon deposit
  • 22. 22W. Kuchinke (2018) Dataverse repositories Harvard Dataverse Network hosts the world's largest collection of social science research A Dataverse repository is the software installation, which hosts multiple dataverses Each dataverse contains datasets, and each dataset contains descriptive metadata and data files Dataverses may contain other dataverses
  • 23. 23W. Kuchinke (2018) Dataverse datasets A dataset in Dataverse is a container for data, documentation, code, and the corresponding metadata which describe the dataset From: http://guides.dataverse.org/en/latest/user/dataset-management.html
  • 24. 24W. Kuchinke (2018) Dataverse and Cloud Storage Dataverse installations can be configured to facilitate cloud-based storage and computing Default configuration for Dataverse uses a local file system for storing data Cloud-enabled Dataverse installation can use a Swift object storage database for its data This allows users to perform computations on data using an integrated cloud environment
  • 25. 25W. Kuchinke (2018) Example: DataverseNL Service for archiving and publishing research data on several levels faculties, institutions, research groups, projects within Dutch universities Possibility to store and share online a large variety of scientific data, independent of file format, in a secure way Not suitable for storing (privacy) sensitive data PSI (Ψ): A Private data Sharing Interface Privacy Tools Research Group (Harvard)
  • 26. 26W. Kuchinke (2018) figshare figshare helps academic institutions store, share and manage all of their research output Integrate into your CRIS/RIMS, institutional repository and archiving solution All research on figshare can be pushed to any institutional repository Control how content is shared internally and publically figshare is hosted on Amazon Web Services but we can also integrate with centralized cloud
  • 27. 27W. Kuchinke (2018) ●figshare for academic institutions figshare helps academic institutions store, share and manage their research output Integrates into institution’s CRIS/RIMS, the institutional repository and archiving solutions All research on figshare can be pushed to any institutional repository Control how content is shared internally and publicly figshare is hosted on Amazon Web Services but can also integrate with a centralized cloud
  • 28. 28W. Kuchinke (2018) Example: University of Sheffield Custom portal to manage research data
  • 29. 29W. Kuchinke (2018) Example: University of Salford / Manchester Custom portal of figshare
  • 30. 30W. Kuchinke (2018) OSF (Open Science Framework) Cloud-based management of projects View all projects from one dashboard. Quickly share files Share key project information and allow others to use and cite it. See project changes View project analytics Archive data
  • 31. 31 W. Kuchinke (2018) Analysis of the repository ecosystem components
  • 32. 32W. Kuchinke (2018) ●Role of Repositories in the Data Ecosystem A multitude of services and tools to support research data repositories Different types of repositories are connected and supplement each other in the storage, release and sharing of data with different degree of protection and ownership Tools to analyze, browse and visualize data should be integrated Real World Data must be smoothly integrated into the research data cycle Data governance and data privacy protection play an important role New and efficient tools for anonymisation and data obfuscation are necessary
  • 33. 33W. Kuchinke (2018) Overview Research Data Sharing and Storage Services A multitude of services and tools support research data repositories to form an open data ecosystem Modified from: Instituuts Data Management Plannen, Groningen During research During research After research After research BeeHub B2SAFE SurfDrive Local ICT Services CLARIN INL DANS 4TU. Centre for ResearchData Zenodo B2SHARE figshare SURF addgene Brainmap.org NeuroVault OpenMRI MycoBank Language Archive DataFirst Dataverse NL CancerData DRYAD Connectome SeaDataNet nesstar TalkBank OpenML OpenClinica Curate Science EVIDENCIO OSF CRCNS Dataverse BeeHub RUG GeoData InstitutionDisciplin
  • 34. 34 W. Kuchinke (2018) Real world evidence (RWE) in medicine means evidence obtained from real world data (RWD), which is data obtained outside the context of randomized controlled trials (RCTs); it is generated during the routine clinical practice. Real world data is stored in Electronic Health Records (EHR), medical claims or billing activities databases, registries, patient-generated data, mobile devices, etc. In addition, it may be derived from retrospective or prospective observational studies and observational registries. The necessity for RWD is based on the fact that clinical trials cannot account for the entire patient population of a particular disease. Patients suffering from comorbidities or belonging to a special geographic region, have genetic variations or high age do not in general participate in any clinical trials. What is Real World Evidence?
  • 35. 35 W. Kuchinke (2018) The management of human health and diseases, including policy and decision making and the development of efficient healthcare systems demand support by efficient and rigorous evidence-based investigation and evaluation of research results. Data are therefore central to further improvements in public health, primary and hospital care, and especially for the advancement of personalized medicine. Relevant data should be collected as part of the usual healthcare, from routine administrative sources and research studies. Data governance and data privacy protection should begin as early as possible, ideally during data generation. Rigorous evidence-based investigation
  • 36. 36W. Kuchinke (2018) Dealing with Real World Clinical Data Real World Clinical Data play an important role for research For Patient Reported Outcomes and for Sensor Data open source RADAR stack (RADAR-CNS project) RADAR-base Management Portal is a one-stop shop for managing remote patient monitoring studies The RADAR Android apps to directly exchange data with the patients and other care providers Kafka-based stack: message transport system European health data networks Observational Health Data Sciences and Informatics, Observational Medical Outcomes Partnership (OMOP)
  • 37. 37W. Kuchinke (2018) Results of Ecosystem Analysis It doesn‘t matter where one stores data Everything is connected Institutional repositories (dataverses), data marts, general repositories, domain specific repositories, figshare for data sharing An ecosystem for open data management Covers complete data life cycle Complete projects are supported FAIR data as basis tranSMART as integration hub for analysis Integration of data governance and privacy protection at the stage of data generation But can sensitive data really be integrated? Not yet convincingly shown!
  • 38. 38W. Kuchinke (2018) Contact Wolfgang Kuchinke Heinrich-Heine University Düsseldorf, Düsseldorf, Germany wolfgang.kuchinke@uni-duesseldorf.de Presentation contains additional material for explanation and workshop.