SlideShare ist ein Scribd-Unternehmen logo
1 von 52
www.geant.org
www.geant.org
1 |
Click to edit Master title style
• Click to edit Master text styles
• Second level
• Third level
• Fourth level
• Fifth level
01/04/2022 1
Introduction to Open Science and EOSC
www.geant.org
Sarah Jones
EOSC Engagement Manager
sarah.jones@geant.org
Twitter: @sarahroams
Predictive Epigenetics PEP-NET training network
1st April 2020
www.geant.org
www.geant.org
Go to menti.com
and enter code
3330 9014
2 |
www.geant.org
www.geant.org
3 |
The what and why of
FAIR and Open Science
Image by Michael Longmire https://unsplash.com/photos/L9EV3OogLh0
www.geant.org
www.geant.org
“science carried out and communicated in a manner which
allows others to contribute, collaborate and add to the research
effort, with all kinds of data, results and protocols made freely
available at different stages of the research process.”
Research Information Network, Open Science case studies
www.rin.ac.uk/our-work/data-management-and-curation/
open-science-case-studies
Defining Open Science
4 |
www.geant.org
www.geant.org
The spectrum of Open Science
5 |
CC-BY Andreas Neuhold
https://commons.wikimedia.org/wiki/File:Open_Science_-_Prinzipien.png
www.geant.org
www.geant.org
Why open access?
Open Access Explained!
www.youtube.com/watch?v=L5rVH1KGBCY
www.geant.org
www.geant.org
• Free, immediate, online access to the results of research
• Two routes to make sure anyone can access your papers
– Gold route: paying APCs to ensure publishers makes copy open
– Green route: self-archiving Open Access copy in repository
• Find out what your publisher allows on SHERPA RoMEO
– www.sherpa.ac.uk/romeo
Open access to publications
www.geant.org
www.geant.org
Open data
 make your stuff available on the Web (whatever format) under an open licence
 make it available as structured data (e.g. Excel instead of a scan of a table)
 use non-proprietary formats (e.g. CSV instead of Excel)
 use URIs to denote things, so that people can point at your stuff
 link your data to other data to provide context
Tim Berners-Lee’s proposal for five star open data - http://5stardata.info
“Open data and content can be freely used, modified
and shared by anyone for any purpose”
http://opendefinition.org
www.geant.org
www.geant.org
• Documenting and sharing workflows and methods
• Sharing code and tools to allow others to reproduce work
• Using web based tools to facilitate collaboration and interaction from the
outside world in your research
• Using tools like MyExperiment and Taverna
Open methods
www.geant.org
www.geant.org
Reliance on specialist research software
Slide from Neil Chue-Hong, Software Sustainability Institute
56%
71%
Do you use research
software?
What would happen to your
research without software
Survey of researchers from 15 UK Russell Group universities conducted
by SSI between August - October 2014. DOI: 10.5281/zenodo.14809
Develop their
own software
Have no formal
software training
www.geant.org
www.geant.org
Design
Experiment
Analysis
Publication
Release
Openness at every stage of research
Open science image CC BY-SA 3.0 by Greg Emmerich www.flickr.com/photos/gemmerich/6365692655
Change the typical
lifecycle
Publish earlier and
release more
Papers + Data +
Methods + Code…
Support
reproducibility
www.geant.org
www.geant.org
Degrees of openness
Open Restricted Closed
Content that can be
freely used, modified and
shared by anyone
for any purpose
Limits on who can use the data,
how or for what purpose
- Charges for use
- Data sharing agreements
- Restrictive licences
- Peer-to-peer exchange
- …
Five star open data

Unable to share
Under embargo
www.geant.org
www.geant.org
• FAIR ≠ Open
• FAIR ensures data can be found, understood and reused
• Data can be shared under restrictions & still be FAIR
"As open as possible, as closed as necessary"
And what is FAIR?
13 |
Image CC-BY-SA by SangyaPundir Image CC-BY by European Commission FAIR data expert group
www.geant.org
www.geant.org
What FAIR means: 15 principles
Findable
F1. (meta)data are assigned a globally unique and eternally
persistent identifier.
F2. data are described with rich metadata.
F3. (meta)data are registered or indexed in a searchable resource.
F4. metadata specify the data identifier.
Interoperable
I1. (meta)data use a formal, accessible, shared, and broadly
applicable language for knowledge representation.
I2. (meta)data use vocabularies that follow FAIR principles.
I3. (meta)data include qualified references to other (meta)data.
Accessible
A1 (meta)data are retrievable by their identifier using a
standardized communications protocol.
A1.1 the protocol is open, free, and universally implementable.
A1.2 the protocol allows for an authentication and authorization
procedure, where necessary.
A2 metadata are accessible, even when the data are no longer
available.
Reusable
R1. meta(data) have a plurality of accurate and relevant attributes.
R1.1. (meta)data are released with a clear and accessible data
usage license.
R1.2. (meta)data are associated with their provenance.
R1.3. (meta)data meet domain-relevant community standards.
Slide CC-BY by Erik Schultes, Leiden UMC
doi: 10.1038/sdata.2016.18
www.geant.org
www.geant.org
The FAIR data principles explained
• Clarifications from GO FAIR
• Each principle is a link to
further clarification,
examples and context
https://www.go-fair.org/fair-
principles
R1. Meta(data) are richly described with a plurality of accurate and relevant
attributes
• By giving data many ‘labels’, it will be much easier to find and reuse the data.
• Provide not just metadata that allows discovery, but also metadata that richly
describes the context under which that data was generated
• “plurality” indicates that metadata should be as generous as possible, even to the
point of providing information that may seem irrelevant.
www.geant.org
www.geant.org
• Findable
- Persistent Identifier
- Metadata online
• Accessible
- Data online
- Restrictions where needed
• Interoperable
- Use standards, controlled vocabs
- Common (open) formats
• Reusable
- Rich documentation
- Clear usage licence
FAIR data checklist
https://doi.org/10.5281/zenodo.5111307
www.geant.org
www.geant.org
• Various research communities have been sharing their
data in a ‘FAIR’ way long before the term emerged
• Meaningful and memorable articulation of concepts
• Natural desire to want to be ‘fair’
• FAIR is gaining significant international traction
FAIR is nothing new
www.geant.org
www.geant.org
Science has always been open
18 |
www.geant.org
www.geant.org
More scientific breakthroughs
www.nytimes.com/2010/08/13/health/research/13alzheimer.html?pagewanted=all&_r=0
“It was unbelievable. Its not science
the way most of us have practiced in
our careers. But we all realised that
we would never get biomarkers
unless all of us parked our egos and
intellectual property noses outside
the door and agreed that all of our
data would be public immediately.”
Dr John Trojanowski, University of Pennsylvania
www.geant.org
www.geant.org
A study that analysed the citation counts of 10,555 papers on gene expression
studies that created microarray data, showed:
“studies that made data available in a public repository
received 9% more citations than similar studies for
which the data was not made available”
Data reuse and the open data citation advantage,
Piwowar, H. & Vision, T. https://peerj.com/articles/175
Get a citation advantage
www.geant.org
www.geant.org
Increased use and economic benefit
Up to 2008 Since 2009
• Freely available over the internet
• Google Earth now uses the images
• Transmission of 2,100,000
scenes per year.
• Estimated to have created value for the
environmental management industry of
$935 million, with direct benefit of more
than $100 million per year to the US
economy
• Has stimulated the development of
applications from a large number of
companies worldwide
The case of NASA Landsat satellite imagery of the Earth’s surface:
http://earthobservatory.nasa.gov/IOTD/view.php?id=83394&src=ve
• Sold through the US Geological
Survey for US$600 per scene
• Sales of 19,000 scenes per year
• Annual revenue of $11.4 million
www.geant.org
www.geant.org
“Open Research Europe requires open
access to research data supporting
articles under the principle ‘as open
as possible, as closed as necessary’,
according to the policy of Horizon
Europe. Data should be deposited in
trusted data repositories.”
Funder imperatives...
https://open-research-europe.ec.europa.eu/for-
authors/data-guidelines#opendata
www.geant.org
www.geant.org
But there are also opportunity costs
By Emilio Bruna
http://brunalab.org/blog/2014/09/04/the-opportunity-
cost-of-my-openscience-was-35-hours-690
For his paper he calculated the following:
1. Double checking the main dataset and
reformatting to submit to Dryad: 5 hours
2. Creating complementary file and preparing
metadata: 3 hours
3. Submission of these two files and the
metadata to Dryad: 45 minutes
4. Preparing a map of the locations: 1 hour
5. Submission of map to Figshare: 15 minutes
6. Cleaning up and documenting the code,
uploading it to GitHub: 25 hours
7. Cost of archiving in Dryad: US$90
8. Page Charges: $600
www.geant.org
www.geant.org
• EC and Member States committed to FAIR and Open
• Pursue this in research policy and grant conditions
• Lots of investment in infrastructure to support data sharing
• Ultimately supports the science ecosystem and ensures
greater return on investment
FAIR and Open both central to EOSC
24 |
www.geant.org
www.geant.org
Go to menti.com
and enter code
3330 9014
25 |
www.geant.org
www.geant.org
What is EOSC?
Image: Martin Reisch https://unsplash.com/photos/6DivtP_WRYs
www.geant.org
www.geant.org
• Collaboration between European
Commission and Member States to
“make Open Science the new normal”
• Established EOSC Association as legal
entity to govern and oversee the
implementation
• Huge investment in infrastructure –
€350 million in initial development
phase and at least €1 billion co-
investment foreseen for next 7 years
Large EC initiative
27 |
EOSC
Association
Steering
Board
European
Commission
Long history of political agreements and activity
Lots of groundwork since 2015
• Council Conclusions
• Expert Group reports
• EC documents
• Major investment in EOSC
related projects to develop the
infrastructure and platform
www.geant.org
www.geant.org
29
www.geant.org
www.geant.org
• A web of FAIR data and services
• Federation of eInfra and Research
Infrastructures (RIs)
• Environment in which data can be
brought together with services to
perform analyses and address
societal challenges
The EOSC platform
www.geant.org
www.geant.org
Aims to enable multidisciplinary discovery & use
Disconnected silos to a
federated infrastructure
providing added value to
researchers
www.geant.org
www.geant.org
FAIR is central to principles in EOSC
• Is the glue that connects data & services
• Requirement for FAIR to support reuse
• Use community standards
• Share all types of output (openly)
www.geant.org
www.geant.org
Many projects and initiatives contributing…
www.geant.org
www.geant.org
Current state of the EOSC platform –
still a work in progress!
34 |
Image: Xuan Nuygen https://unsplash.com/photos/i2M4JeyBIV8
www.geant.org
www.geant.org
• Currently the primary resource for
navigating EOSC
• https://eosc-portal.eu
• Includes a virtual tour for new users
• Catalogue and marketplace is how
you discover, access and compose
resources
EOSC Portal
www.geant.org
www.geant.org
Search by scientific domains or categories of services
Navigating resources
Access to free storage, compute and support services
C-SCALE will federate compute
and data resources from the
Copernicus DIAS, the national
Collaborative Ground
Segments and the European
Open Science Cloud (EOSC)
towards a European open
source Big (Copernicus) Data
Analytics platform:
- Storage services: up to 12 PB
- Cloud services: up to
17,728,500 CPU hours
- HPC/HTC services: up to
3,100,000 CPU hours
- GPU services: up to 6,000
GPU hours
DICE makes available a set of
data management services (and
associated resources) for
researchers and research
communities from any scientific
domain including:
- Data archives (up to 25 PB)
- Policies based data archives (up
to 17 PB)
- Personal and project
workspaces (up to 5 PB)
- Data repository services for
data sharing (up to 8 PB)
- Data discovery services (with
PID and DOI services and
metadata harvesting)
EGI-ACE will deliver the EOSC
Compute Platform and will
contribute to the EOSC Data
Commons. Services offered
include: compute and storage
resources, compute platform
services, data management
services and related user support
and training.
The total capacity that EGI-ACE
makes available through the call
between 2021-2023 is:
- 80,000,000 CPU hours
- 250,000 GPU hours
- 20 PB storage
support to Argos DMP service by
drafting discipline specific DMPs,
Horizon Europe DMP support
set your own community
research gateway
(connect.openaire.eu) and
Zenodo communities
access open science metrics for
your projects, institution,
community
service to anonymise your data
and comply with GDPR
support and mentoring on
Horizon Europe open access
mandates
Provides three core services for
Research Lifecycle Management:
- ROHub: tool to facilitate the
exchange of information across the
scientific community.
- Text Enrichment and Mining:
service which automatically extracts
valuable information and metadata
from bibliographic sources and
other text documents
- Datacube technology for Earth
Observation (EO) data
management: efficient access to
extensive collections of multi-
temporal and multi-dimensional EO
imagery, also allowing
interoperability among the different
information layers.
https://marketplace.eosc-portal.eu
www.geant.org
www.geant.org
Finding research data
• Currently via community repositories and catalogues
• These are being aggregated to offer a cross-search in EOSC…
www.geant.org
www.geant.org
EOSC Future is using AI techniques to make recommendations to users:
• relevant projects, data, publications, training materials
• potential collaborators (people, task forces, communities)
Recommendations based on
• viewing history
• order history
• general popularity
• popularity among users with
a similar background/interests
Recommendations for users
www.geant.org
www.geant.org
• Federated identity management – ease of single sign on
• Access to a greater number of services
• Funding provided to pay for compute e.g. EGI-ACE, DICE
• Discovery of related data from other disciplines / sectors
• Greater ability to collaborate and address key research
questions
Benefits of EOSC for researchers
40
www.geant.org
www.geant.org
Go to menti.com
and enter code
3330 9014
41 |
www.geant.org
www.geant.org
42 |
How to practice Open Science
Image: Jetshoots.com https://unsplash.com/photos/VdOO4_HFTWM
www.geant.org
www.geant.org
1. Choose your dataset(s)
– What can you may open? You may need to revisit this step if you
encounter problems later.
2. Apply an open license
– Determine what IP exists. Apply a suitable licence e.g. CC-BY
3. Make the data available
– Provide the data in a suitable format. Use repositories.
•
4.Make it discoverable
– Post on the web, register in catalogues…
How to make data open?
https://okfn.org
www.geant.org
www.geant.org
https://www.dcc.ac.uk/guidance/how-guides/license-research-data
License research data openly
This DCC guide outlines the pros and
cons of each approach and gives practical
advice on how to implement your licence
CREATIVE COMMONS LIMITATIONS
NC Non-Commercial
What counts as commercial?
ND No Derivatives
Severely restricts use
These clauses are not open licenses
European Commission
guidelines point to:
or
www.geant.org
www.geant.org
Answer questions to determine which licence(s) are
appropriate to use
EUDAT licensing tool
https://ufal.github.io/public-license-selector
www.geant.org
www.geant.org
www.fosteropenscience.eu/content/re3data-demo
Deposit in a data repository
http://databib.org
http://service.re3data.org/search
Re3data is one registry of repositories that can be searched to
find a relevant home for your data. FAIRsharing is another.
www.geant.org
www.geant.org
• Look for provision from your community, university, publisher, funder etc
• Check they match your particular data needs: e.g. formats accepted;
mixture of Open and Restricted Access.
• See if they provide guidance on how to cite the deposited data.
• Do they assign a persistent & globally unique identifier for sustainable
citations and to links back to particular researchers and grants?
• Look for certification as a ‘Trustworthy Digital Repository’ with an explicit
ambition to keep the data available in long term.
How to select a repository?
www.geant.org
www.geant.org
Metadata Standards Directory
Broad, disciplinary listing of standards
and tools. Maintained by RDA group
http://rd-alliance.github.io/metadata-directory
Use metadata standards
FAIRsharing
• A portal of data standards,
databases, and policies
• Focused on life, environmental
and biomedical sciences
https://fairsharing.org
www.geant.org
www.geant.org
If you want your data to be re-used and sustainable in the long-
term, you typically want to opt for open, non-proprietary formats.
Choose appropriate file formats
Type Recommended Avoid for data sharing
Tabular data CSV, TSV, SPSS portable Excel
Text Plain text, HTML, RTF
PDF/A only if layout matters
Word
Media Container: MP4, Ogg
Codec: Theora, Dirac, FLAC
Quicktime
H264
Images TIFF, JPEG2000, PNG GIF, JPG
Structured data XML, RDF RDBMS
Further examples:
https://ukdataservice.ac.uk/learning-hub/research-data-management/format-
your-data/recommended-formats
www.geant.org
www.geant.org
Managing and sharing data:
a best practice guide
https://dam.ukdataservice.ac.uk/media/622417/managingsharing.pdf
www.geant.org
www.geant.org
More on life science tools
and infrastructure coming
up in Susanna’s talk
51 |
Image: Sangharsh Lohakare https://unsplash.com/photos/Iy7QyzOs1bo
www.geant.org
www.geant.org
Click to edit Master title style
• Click to edit Master text styles
• Second level
• Third level
• Fourth level
• Fifth level
01/04/2022 52
Thank you
www.geant.org
Any questions?
© GÉANT Association on behalf of the GN4 Phase 2 project (GN4-2).
The research leading to these results has received funding from
the European Union’s Horizon 2020 research and innovation
programme under Grant Agreement No. 731122 (GN4-2). 52 |

Weitere ähnliche Inhalte

Was ist angesagt?

What is Open Science
What is Open ScienceWhat is Open Science
What is Open ScienceNancy Pontika
 
Understanding Open Science: Definitions and framework
Understanding Open Science: Definitions and framework Understanding Open Science: Definitions and framework
Understanding Open Science: Definitions and framework Nancy Pontika
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceANOOP V S
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceCarole Goble
 
What it means to be FAIR
What it means to be FAIRWhat it means to be FAIR
What it means to be FAIRSarah Jones
 
FAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDAFAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDASarah Jones
 
Day 1 (Lecture 3): Predictive Analytics in Healthcare
Day 1 (Lecture 3): Predictive Analytics in HealthcareDay 1 (Lecture 3): Predictive Analytics in Healthcare
Day 1 (Lecture 3): Predictive Analytics in HealthcareAseda Owusua Addai-Deseh
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationDatabricks
 
Education data mining presentation
Education data mining presentationEducation data mining presentation
Education data mining presentationNishabhanot1
 
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...Edureka!
 
Big Data Platforms: An Overview
Big Data Platforms: An OverviewBig Data Platforms: An Overview
Big Data Platforms: An OverviewC. Scyphers
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceSrishti44
 
20190528 - Guidelines for Trustworthy AI
20190528 - Guidelines for Trustworthy AI20190528 - Guidelines for Trustworthy AI
20190528 - Guidelines for Trustworthy AIBrussels Legal Hackers
 
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Lauri Eloranta
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientistVijayMohan Vasu
 
How to start: Setting up an open access repository in 22 steps
How to start: Setting up an open access repository in 22 stepsHow to start: Setting up an open access repository in 22 steps
How to start: Setting up an open access repository in 22 stepsIryna Kuchma
 
Introduction to Topological Data Analysis
Introduction to Topological Data AnalysisIntroduction to Topological Data Analysis
Introduction to Topological Data AnalysisHendri Karisma
 
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptxNeo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptxNeo4j
 

Was ist angesagt? (20)

What is Open Science
What is Open ScienceWhat is Open Science
What is Open Science
 
Understanding Open Science: Definitions and framework
Understanding Open Science: Definitions and framework Understanding Open Science: Definitions and framework
Understanding Open Science: Definitions and framework
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Artificial Intelligence AI in Libraries Training for Innovation Webinar
Artificial Intelligence  AI in Libraries Training for Innovation WebinarArtificial Intelligence  AI in Libraries Training for Innovation Webinar
Artificial Intelligence AI in Libraries Training for Innovation Webinar
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
What it means to be FAIR
What it means to be FAIRWhat it means to be FAIR
What it means to be FAIR
 
FAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDAFAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDA
 
Day 1 (Lecture 3): Predictive Analytics in Healthcare
Day 1 (Lecture 3): Predictive Analytics in HealthcareDay 1 (Lecture 3): Predictive Analytics in Healthcare
Day 1 (Lecture 3): Predictive Analytics in Healthcare
 
Preparing Data for Sharing: The FAIR Principles
Preparing Data for Sharing: The FAIR PrinciplesPreparing Data for Sharing: The FAIR Principles
Preparing Data for Sharing: The FAIR Principles
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with Alation
 
Education data mining presentation
Education data mining presentationEducation data mining presentation
Education data mining presentation
 
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
 
Big Data Platforms: An Overview
Big Data Platforms: An OverviewBig Data Platforms: An Overview
Big Data Platforms: An Overview
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
20190528 - Guidelines for Trustworthy AI
20190528 - Guidelines for Trustworthy AI20190528 - Guidelines for Trustworthy AI
20190528 - Guidelines for Trustworthy AI
 
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientist
 
How to start: Setting up an open access repository in 22 steps
How to start: Setting up an open access repository in 22 stepsHow to start: Setting up an open access repository in 22 steps
How to start: Setting up an open access repository in 22 steps
 
Introduction to Topological Data Analysis
Introduction to Topological Data AnalysisIntroduction to Topological Data Analysis
Introduction to Topological Data Analysis
 
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptxNeo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
 

Ähnlich wie Introduction to Open Science and EOSC

Managing and sharing data: lessons from the European context
Managing and sharing data: lessons from the European contextManaging and sharing data: lessons from the European context
Managing and sharing data: lessons from the European contextSarah Jones
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open scienceSarah Jones
 
I o dav data workshop prof wafula final 19.9.17
I o dav data workshop prof wafula final 19.9.17I o dav data workshop prof wafula final 19.9.17
I o dav data workshop prof wafula final 19.9.17Tom Nyongesa
 
Open science and its advocacy
Open science and its advocacyOpen science and its advocacy
Open science and its advocacySarah Jones
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonAfrican Open Science Platform
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonAfrican Open Science Platform
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataAnita de Waard
 
Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...African Open Science Platform
 
Figshare for institutions presentation swets customer day 2014
Figshare for institutions   presentation swets customer day 2014Figshare for institutions   presentation swets customer day 2014
Figshare for institutions presentation swets customer day 2014Swetsbelgie
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonAfrican Open Science Platform
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?LEARN Project
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing dataSarah Jones
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataARDC
 
FAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basicsFAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basicsOpenAIRE
 
From Open Access to Open data, our initiatives
From Open Access to Open data, our initiativesFrom Open Access to Open data, our initiatives
From Open Access to Open data, our initiativesJohannes Keizer
 

Ähnlich wie Introduction to Open Science and EOSC (20)

Managing and sharing data: lessons from the European context
Managing and sharing data: lessons from the European contextManaging and sharing data: lessons from the European context
Managing and sharing data: lessons from the European context
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open science
 
Open Science
Open ScienceOpen Science
Open Science
 
I o dav data workshop prof wafula final 19.9.17
I o dav data workshop prof wafula final 19.9.17I o dav data workshop prof wafula final 19.9.17
I o dav data workshop prof wafula final 19.9.17
 
Open science and its advocacy
Open science and its advocacyOpen science and its advocacy
Open science and its advocacy
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon Hodson
 
FAIR play?
FAIR play? FAIR play?
FAIR play?
 
BLC & Digital Science: Mark Hahnel, Figshare
BLC & Digital Science: Mark Hahnel, FigshareBLC & Digital Science: Mark Hahnel, Figshare
BLC & Digital Science: Mark Hahnel, Figshare
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...
 
Figshare for institutions presentation swets customer day 2014
Figshare for institutions   presentation swets customer day 2014Figshare for institutions   presentation swets customer day 2014
Figshare for institutions presentation swets customer day 2014
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
Open Science - Global Perspectives/Simon Hodson
Open Science - Global Perspectives/Simon HodsonOpen Science - Global Perspectives/Simon Hodson
Open Science - Global Perspectives/Simon Hodson
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) data
 
FAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basicsFAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basics
 
Better Data for a Better World
Better Data for a Better WorldBetter Data for a Better World
Better Data for a Better World
 
From Open Access to Open data, our initiatives
From Open Access to Open data, our initiativesFrom Open Access to Open data, our initiatives
From Open Access to Open data, our initiatives
 

Mehr von Sarah Jones

Data training tips and tricks
Data training tips and tricksData training tips and tricks
Data training tips and tricksSarah Jones
 
EOSC and libraries
EOSC and librariesEOSC and libraries
EOSC and librariesSarah Jones
 
EOSC Association priorities and activities
EOSC Association priorities and activitiesEOSC Association priorities and activities
EOSC Association priorities and activitiesSarah Jones
 
Reflections on Open Science
Reflections on Open ScienceReflections on Open Science
Reflections on Open ScienceSarah Jones
 
MAR comments analysis
MAR comments analysisMAR comments analysis
MAR comments analysisSarah Jones
 
EOSC-MAR-update.pptx
EOSC-MAR-update.pptxEOSC-MAR-update.pptx
EOSC-MAR-update.pptxSarah Jones
 
Why is EOSC so hard?
Why is EOSC so hard?Why is EOSC so hard?
Why is EOSC so hard?Sarah Jones
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchersSarah Jones
 
Is Europe ready for Open Science
Is Europe ready for Open ScienceIs Europe ready for Open Science
Is Europe ready for Open ScienceSarah Jones
 
DMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessonsDMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessonsSarah Jones
 
Do & don't of supporting Open Science
Do & don't of supporting Open ScienceDo & don't of supporting Open Science
Do & don't of supporting Open ScienceSarah Jones
 
Why institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIRWhy institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIRSarah Jones
 
It takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commonsIt takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commonsSarah Jones
 
DMPTuuli - what's new?
DMPTuuli - what's new?DMPTuuli - what's new?
DMPTuuli - what's new?Sarah Jones
 
DCC and FAIR initiatives
DCC and FAIR initiativesDCC and FAIR initiatives
DCC and FAIR initiativesSarah Jones
 
Reflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDCReflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDCSarah Jones
 
Future EOSC roadmap
Future EOSC roadmapFuture EOSC roadmap
Future EOSC roadmapSarah Jones
 
Global Open Research Commons IG
Global Open Research Commons IGGlobal Open Research Commons IG
Global Open Research Commons IGSarah Jones
 
Global Research Data Initiatives
Global Research Data InitiativesGlobal Research Data Initiatives
Global Research Data InitiativesSarah Jones
 

Mehr von Sarah Jones (20)

Data training tips and tricks
Data training tips and tricksData training tips and tricks
Data training tips and tricks
 
EOSC and libraries
EOSC and librariesEOSC and libraries
EOSC and libraries
 
EOSC Association priorities and activities
EOSC Association priorities and activitiesEOSC Association priorities and activities
EOSC Association priorities and activities
 
Reflections on Open Science
Reflections on Open ScienceReflections on Open Science
Reflections on Open Science
 
MAR comments analysis
MAR comments analysisMAR comments analysis
MAR comments analysis
 
EOSC-MAR-update.pptx
EOSC-MAR-update.pptxEOSC-MAR-update.pptx
EOSC-MAR-update.pptx
 
Why is EOSC so hard?
Why is EOSC so hard?Why is EOSC so hard?
Why is EOSC so hard?
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
 
Is Europe ready for Open Science
Is Europe ready for Open ScienceIs Europe ready for Open Science
Is Europe ready for Open Science
 
DMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessonsDMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessons
 
Do & don't of supporting Open Science
Do & don't of supporting Open ScienceDo & don't of supporting Open Science
Do & don't of supporting Open Science
 
Why institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIRWhy institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIR
 
It takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commonsIt takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commons
 
DMPTuuli - what's new?
DMPTuuli - what's new?DMPTuuli - what's new?
DMPTuuli - what's new?
 
DCC and FAIR initiatives
DCC and FAIR initiativesDCC and FAIR initiatives
DCC and FAIR initiatives
 
Reflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDCReflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDC
 
Future EOSC roadmap
Future EOSC roadmapFuture EOSC roadmap
Future EOSC roadmap
 
Global Open Research Commons IG
Global Open Research Commons IGGlobal Open Research Commons IG
Global Open Research Commons IG
 
EOSC work plan
EOSC work planEOSC work plan
EOSC work plan
 
Global Research Data Initiatives
Global Research Data InitiativesGlobal Research Data Initiatives
Global Research Data Initiatives
 

Kürzlich hochgeladen

AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfOverkill Security
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 

Kürzlich hochgeladen (20)

AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 

Introduction to Open Science and EOSC

  • 1. www.geant.org www.geant.org 1 | Click to edit Master title style • Click to edit Master text styles • Second level • Third level • Fourth level • Fifth level 01/04/2022 1 Introduction to Open Science and EOSC www.geant.org Sarah Jones EOSC Engagement Manager sarah.jones@geant.org Twitter: @sarahroams Predictive Epigenetics PEP-NET training network 1st April 2020
  • 3. www.geant.org www.geant.org 3 | The what and why of FAIR and Open Science Image by Michael Longmire https://unsplash.com/photos/L9EV3OogLh0
  • 4. www.geant.org www.geant.org “science carried out and communicated in a manner which allows others to contribute, collaborate and add to the research effort, with all kinds of data, results and protocols made freely available at different stages of the research process.” Research Information Network, Open Science case studies www.rin.ac.uk/our-work/data-management-and-curation/ open-science-case-studies Defining Open Science 4 |
  • 5. www.geant.org www.geant.org The spectrum of Open Science 5 | CC-BY Andreas Neuhold https://commons.wikimedia.org/wiki/File:Open_Science_-_Prinzipien.png
  • 6. www.geant.org www.geant.org Why open access? Open Access Explained! www.youtube.com/watch?v=L5rVH1KGBCY
  • 7. www.geant.org www.geant.org • Free, immediate, online access to the results of research • Two routes to make sure anyone can access your papers – Gold route: paying APCs to ensure publishers makes copy open – Green route: self-archiving Open Access copy in repository • Find out what your publisher allows on SHERPA RoMEO – www.sherpa.ac.uk/romeo Open access to publications
  • 8. www.geant.org www.geant.org Open data  make your stuff available on the Web (whatever format) under an open licence  make it available as structured data (e.g. Excel instead of a scan of a table)  use non-proprietary formats (e.g. CSV instead of Excel)  use URIs to denote things, so that people can point at your stuff  link your data to other data to provide context Tim Berners-Lee’s proposal for five star open data - http://5stardata.info “Open data and content can be freely used, modified and shared by anyone for any purpose” http://opendefinition.org
  • 9. www.geant.org www.geant.org • Documenting and sharing workflows and methods • Sharing code and tools to allow others to reproduce work • Using web based tools to facilitate collaboration and interaction from the outside world in your research • Using tools like MyExperiment and Taverna Open methods
  • 10. www.geant.org www.geant.org Reliance on specialist research software Slide from Neil Chue-Hong, Software Sustainability Institute 56% 71% Do you use research software? What would happen to your research without software Survey of researchers from 15 UK Russell Group universities conducted by SSI between August - October 2014. DOI: 10.5281/zenodo.14809 Develop their own software Have no formal software training
  • 11. www.geant.org www.geant.org Design Experiment Analysis Publication Release Openness at every stage of research Open science image CC BY-SA 3.0 by Greg Emmerich www.flickr.com/photos/gemmerich/6365692655 Change the typical lifecycle Publish earlier and release more Papers + Data + Methods + Code… Support reproducibility
  • 12. www.geant.org www.geant.org Degrees of openness Open Restricted Closed Content that can be freely used, modified and shared by anyone for any purpose Limits on who can use the data, how or for what purpose - Charges for use - Data sharing agreements - Restrictive licences - Peer-to-peer exchange - … Five star open data  Unable to share Under embargo
  • 13. www.geant.org www.geant.org • FAIR ≠ Open • FAIR ensures data can be found, understood and reused • Data can be shared under restrictions & still be FAIR "As open as possible, as closed as necessary" And what is FAIR? 13 | Image CC-BY-SA by SangyaPundir Image CC-BY by European Commission FAIR data expert group
  • 14. www.geant.org www.geant.org What FAIR means: 15 principles Findable F1. (meta)data are assigned a globally unique and eternally persistent identifier. F2. data are described with rich metadata. F3. (meta)data are registered or indexed in a searchable resource. F4. metadata specify the data identifier. Interoperable I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation. I2. (meta)data use vocabularies that follow FAIR principles. I3. (meta)data include qualified references to other (meta)data. Accessible A1 (meta)data are retrievable by their identifier using a standardized communications protocol. A1.1 the protocol is open, free, and universally implementable. A1.2 the protocol allows for an authentication and authorization procedure, where necessary. A2 metadata are accessible, even when the data are no longer available. Reusable R1. meta(data) have a plurality of accurate and relevant attributes. R1.1. (meta)data are released with a clear and accessible data usage license. R1.2. (meta)data are associated with their provenance. R1.3. (meta)data meet domain-relevant community standards. Slide CC-BY by Erik Schultes, Leiden UMC doi: 10.1038/sdata.2016.18
  • 15. www.geant.org www.geant.org The FAIR data principles explained • Clarifications from GO FAIR • Each principle is a link to further clarification, examples and context https://www.go-fair.org/fair- principles R1. Meta(data) are richly described with a plurality of accurate and relevant attributes • By giving data many ‘labels’, it will be much easier to find and reuse the data. • Provide not just metadata that allows discovery, but also metadata that richly describes the context under which that data was generated • “plurality” indicates that metadata should be as generous as possible, even to the point of providing information that may seem irrelevant.
  • 16. www.geant.org www.geant.org • Findable - Persistent Identifier - Metadata online • Accessible - Data online - Restrictions where needed • Interoperable - Use standards, controlled vocabs - Common (open) formats • Reusable - Rich documentation - Clear usage licence FAIR data checklist https://doi.org/10.5281/zenodo.5111307
  • 17. www.geant.org www.geant.org • Various research communities have been sharing their data in a ‘FAIR’ way long before the term emerged • Meaningful and memorable articulation of concepts • Natural desire to want to be ‘fair’ • FAIR is gaining significant international traction FAIR is nothing new
  • 19. www.geant.org www.geant.org More scientific breakthroughs www.nytimes.com/2010/08/13/health/research/13alzheimer.html?pagewanted=all&_r=0 “It was unbelievable. Its not science the way most of us have practiced in our careers. But we all realised that we would never get biomarkers unless all of us parked our egos and intellectual property noses outside the door and agreed that all of our data would be public immediately.” Dr John Trojanowski, University of Pennsylvania
  • 20. www.geant.org www.geant.org A study that analysed the citation counts of 10,555 papers on gene expression studies that created microarray data, showed: “studies that made data available in a public repository received 9% more citations than similar studies for which the data was not made available” Data reuse and the open data citation advantage, Piwowar, H. & Vision, T. https://peerj.com/articles/175 Get a citation advantage
  • 21. www.geant.org www.geant.org Increased use and economic benefit Up to 2008 Since 2009 • Freely available over the internet • Google Earth now uses the images • Transmission of 2,100,000 scenes per year. • Estimated to have created value for the environmental management industry of $935 million, with direct benefit of more than $100 million per year to the US economy • Has stimulated the development of applications from a large number of companies worldwide The case of NASA Landsat satellite imagery of the Earth’s surface: http://earthobservatory.nasa.gov/IOTD/view.php?id=83394&src=ve • Sold through the US Geological Survey for US$600 per scene • Sales of 19,000 scenes per year • Annual revenue of $11.4 million
  • 22. www.geant.org www.geant.org “Open Research Europe requires open access to research data supporting articles under the principle ‘as open as possible, as closed as necessary’, according to the policy of Horizon Europe. Data should be deposited in trusted data repositories.” Funder imperatives... https://open-research-europe.ec.europa.eu/for- authors/data-guidelines#opendata
  • 23. www.geant.org www.geant.org But there are also opportunity costs By Emilio Bruna http://brunalab.org/blog/2014/09/04/the-opportunity- cost-of-my-openscience-was-35-hours-690 For his paper he calculated the following: 1. Double checking the main dataset and reformatting to submit to Dryad: 5 hours 2. Creating complementary file and preparing metadata: 3 hours 3. Submission of these two files and the metadata to Dryad: 45 minutes 4. Preparing a map of the locations: 1 hour 5. Submission of map to Figshare: 15 minutes 6. Cleaning up and documenting the code, uploading it to GitHub: 25 hours 7. Cost of archiving in Dryad: US$90 8. Page Charges: $600
  • 24. www.geant.org www.geant.org • EC and Member States committed to FAIR and Open • Pursue this in research policy and grant conditions • Lots of investment in infrastructure to support data sharing • Ultimately supports the science ecosystem and ensures greater return on investment FAIR and Open both central to EOSC 24 |
  • 26. www.geant.org www.geant.org What is EOSC? Image: Martin Reisch https://unsplash.com/photos/6DivtP_WRYs
  • 27. www.geant.org www.geant.org • Collaboration between European Commission and Member States to “make Open Science the new normal” • Established EOSC Association as legal entity to govern and oversee the implementation • Huge investment in infrastructure – €350 million in initial development phase and at least €1 billion co- investment foreseen for next 7 years Large EC initiative 27 | EOSC Association Steering Board European Commission
  • 28. Long history of political agreements and activity Lots of groundwork since 2015 • Council Conclusions • Expert Group reports • EC documents • Major investment in EOSC related projects to develop the infrastructure and platform
  • 30. www.geant.org www.geant.org • A web of FAIR data and services • Federation of eInfra and Research Infrastructures (RIs) • Environment in which data can be brought together with services to perform analyses and address societal challenges The EOSC platform
  • 31. www.geant.org www.geant.org Aims to enable multidisciplinary discovery & use Disconnected silos to a federated infrastructure providing added value to researchers
  • 32. www.geant.org www.geant.org FAIR is central to principles in EOSC • Is the glue that connects data & services • Requirement for FAIR to support reuse • Use community standards • Share all types of output (openly)
  • 33. www.geant.org www.geant.org Many projects and initiatives contributing…
  • 34. www.geant.org www.geant.org Current state of the EOSC platform – still a work in progress! 34 | Image: Xuan Nuygen https://unsplash.com/photos/i2M4JeyBIV8
  • 35. www.geant.org www.geant.org • Currently the primary resource for navigating EOSC • https://eosc-portal.eu • Includes a virtual tour for new users • Catalogue and marketplace is how you discover, access and compose resources EOSC Portal
  • 36. www.geant.org www.geant.org Search by scientific domains or categories of services Navigating resources
  • 37. Access to free storage, compute and support services C-SCALE will federate compute and data resources from the Copernicus DIAS, the national Collaborative Ground Segments and the European Open Science Cloud (EOSC) towards a European open source Big (Copernicus) Data Analytics platform: - Storage services: up to 12 PB - Cloud services: up to 17,728,500 CPU hours - HPC/HTC services: up to 3,100,000 CPU hours - GPU services: up to 6,000 GPU hours DICE makes available a set of data management services (and associated resources) for researchers and research communities from any scientific domain including: - Data archives (up to 25 PB) - Policies based data archives (up to 17 PB) - Personal and project workspaces (up to 5 PB) - Data repository services for data sharing (up to 8 PB) - Data discovery services (with PID and DOI services and metadata harvesting) EGI-ACE will deliver the EOSC Compute Platform and will contribute to the EOSC Data Commons. Services offered include: compute and storage resources, compute platform services, data management services and related user support and training. The total capacity that EGI-ACE makes available through the call between 2021-2023 is: - 80,000,000 CPU hours - 250,000 GPU hours - 20 PB storage support to Argos DMP service by drafting discipline specific DMPs, Horizon Europe DMP support set your own community research gateway (connect.openaire.eu) and Zenodo communities access open science metrics for your projects, institution, community service to anonymise your data and comply with GDPR support and mentoring on Horizon Europe open access mandates Provides three core services for Research Lifecycle Management: - ROHub: tool to facilitate the exchange of information across the scientific community. - Text Enrichment and Mining: service which automatically extracts valuable information and metadata from bibliographic sources and other text documents - Datacube technology for Earth Observation (EO) data management: efficient access to extensive collections of multi- temporal and multi-dimensional EO imagery, also allowing interoperability among the different information layers. https://marketplace.eosc-portal.eu
  • 38. www.geant.org www.geant.org Finding research data • Currently via community repositories and catalogues • These are being aggregated to offer a cross-search in EOSC…
  • 39. www.geant.org www.geant.org EOSC Future is using AI techniques to make recommendations to users: • relevant projects, data, publications, training materials • potential collaborators (people, task forces, communities) Recommendations based on • viewing history • order history • general popularity • popularity among users with a similar background/interests Recommendations for users
  • 40. www.geant.org www.geant.org • Federated identity management – ease of single sign on • Access to a greater number of services • Funding provided to pay for compute e.g. EGI-ACE, DICE • Discovery of related data from other disciplines / sectors • Greater ability to collaborate and address key research questions Benefits of EOSC for researchers 40
  • 42. www.geant.org www.geant.org 42 | How to practice Open Science Image: Jetshoots.com https://unsplash.com/photos/VdOO4_HFTWM
  • 43. www.geant.org www.geant.org 1. Choose your dataset(s) – What can you may open? You may need to revisit this step if you encounter problems later. 2. Apply an open license – Determine what IP exists. Apply a suitable licence e.g. CC-BY 3. Make the data available – Provide the data in a suitable format. Use repositories. • 4.Make it discoverable – Post on the web, register in catalogues… How to make data open? https://okfn.org
  • 44. www.geant.org www.geant.org https://www.dcc.ac.uk/guidance/how-guides/license-research-data License research data openly This DCC guide outlines the pros and cons of each approach and gives practical advice on how to implement your licence CREATIVE COMMONS LIMITATIONS NC Non-Commercial What counts as commercial? ND No Derivatives Severely restricts use These clauses are not open licenses European Commission guidelines point to: or
  • 45. www.geant.org www.geant.org Answer questions to determine which licence(s) are appropriate to use EUDAT licensing tool https://ufal.github.io/public-license-selector
  • 46. www.geant.org www.geant.org www.fosteropenscience.eu/content/re3data-demo Deposit in a data repository http://databib.org http://service.re3data.org/search Re3data is one registry of repositories that can be searched to find a relevant home for your data. FAIRsharing is another.
  • 47. www.geant.org www.geant.org • Look for provision from your community, university, publisher, funder etc • Check they match your particular data needs: e.g. formats accepted; mixture of Open and Restricted Access. • See if they provide guidance on how to cite the deposited data. • Do they assign a persistent & globally unique identifier for sustainable citations and to links back to particular researchers and grants? • Look for certification as a ‘Trustworthy Digital Repository’ with an explicit ambition to keep the data available in long term. How to select a repository?
  • 48. www.geant.org www.geant.org Metadata Standards Directory Broad, disciplinary listing of standards and tools. Maintained by RDA group http://rd-alliance.github.io/metadata-directory Use metadata standards FAIRsharing • A portal of data standards, databases, and policies • Focused on life, environmental and biomedical sciences https://fairsharing.org
  • 49. www.geant.org www.geant.org If you want your data to be re-used and sustainable in the long- term, you typically want to opt for open, non-proprietary formats. Choose appropriate file formats Type Recommended Avoid for data sharing Tabular data CSV, TSV, SPSS portable Excel Text Plain text, HTML, RTF PDF/A only if layout matters Word Media Container: MP4, Ogg Codec: Theora, Dirac, FLAC Quicktime H264 Images TIFF, JPEG2000, PNG GIF, JPG Structured data XML, RDF RDBMS Further examples: https://ukdataservice.ac.uk/learning-hub/research-data-management/format- your-data/recommended-formats
  • 50. www.geant.org www.geant.org Managing and sharing data: a best practice guide https://dam.ukdataservice.ac.uk/media/622417/managingsharing.pdf
  • 51. www.geant.org www.geant.org More on life science tools and infrastructure coming up in Susanna’s talk 51 | Image: Sangharsh Lohakare https://unsplash.com/photos/Iy7QyzOs1bo
  • 52. www.geant.org www.geant.org Click to edit Master title style • Click to edit Master text styles • Second level • Third level • Fourth level • Fifth level 01/04/2022 52 Thank you www.geant.org Any questions? © GÉANT Association on behalf of the GN4 Phase 2 project (GN4-2). The research leading to these results has received funding from the European Union’s Horizon 2020 research and innovation programme under Grant Agreement No. 731122 (GN4-2). 52 |

Hinweis der Redaktion

  1. Journal prices have outpaced inflation by more than 250% over the past 30 years 15 entire disciplines where the average price for one journal for one year is over £1000 (chemistry £4227, physics £3229). Journal called tetrahedron that’s over £40,000 Irrational to think that scientists are paid by government to do research and then the papers are locked away behind paywalls. Journals don’t do the research, employ the people or pay the reviewers.
  2. In the last four years, we have investigated and understood the challenges of the UK research community. Anecdotally, we had a lot of evidence for people working in this area that researchers relied on software, but there had been no studies conducted. So we did this ourselves. Two areas of interest, do you use software and possibly more important, what would happen to your research without software – this is 170,000 researchers in the UK who could not conduct their software without software. This is more than just a reliance on Word or web browsers – specialist software is written into the research workflows of people from psychology to physics, from the life sciences to literature. The reliance isn’t confined to the “traditionally” computationally intensive subjects, it’s a feature of all disciplines. This means that 140,000 researchers are relying on their own coding skills.
  3. Certain research communities have also seen the benefit of sharing data as it speeds up the process of discovery. This article shows how researchers in the field of Alzheimer’s research have agreed as a community to share data immediately to make scientific breakthroughs.
  4. There’s also a citation advantage for individual researchers. This study by Heather Piwowar and Todd Vision looked at 10,555 paper of gene expression studies that had shared the associated microarray data. Those studies that shared data received 9% more citations.
  5. There’s also an economic benefit, as seen by the case of the NASA landsat satellite images. These were sold until 2008 for $600 a scene. Now they’re freely available and used by Google Earth. Previously they sold 19,000 images a year, whereas now they transmit 2.1 million. The revenue has gone up incredibly too from $11.4 million to an estimated value of $935 million with direct benefit of more than $100 million. The release has also stimulated the development of applications from companies worldwide. This case study comes from the Royal Society Report on Science as an Open Enterprise.
  6. The background to this is about making the most of the data that has been created through publicly funded research. The guidelines speak of: Improved quality of results Greater efficiency Faster to market = faster growth Improved transparency of the scientific process
  7. It’s not all positive though – otherwise why isn’t everyone already doing this. There is a certain amount of effort and cost to open science, which this blog post by Emilio Bruna highlights. He calculated the cost of sharing his data for one paper and came to a total of 35 hours and $690. He breaks this down into the cost of preparing the dataset, creating complementary metadata and associated files, cleaning up and documenting the code (which involves a big mental leap), and the charges applied.
  8. Still a question we are asking ourselves but some commonality of vision is sticking. I like this picture as it represents some of that for me: Federation of services Interconnecting / interoperable User in the centre Greenfield site? Open to ideas / creativity?
  9. Guidance from the DCC can also help researchers to understand data licensing. This guide outlines the pros and cons of each approach e.g. the limitations of some CC options The OA guidelines under Horizon 2020 point to CC-0 or CC-BY as a straightforward and effective way to make it possible for others to mine, exploit and reproduce the data. See p11 at: http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-pilot-guide_en.pdf