2. âOpen science is the idea that scientific knowledge of all kinds should
be openly shared as early as is practical in the discovery process.â
Michael Nielsen
3. Open science
Open access to publications
Open data
Open source
Open educational
resources
Open peer review
Open notebook
science
Citizen science
4. European Commission (2013):
âOpen access can be defined as the practice of providing on-line access to
scientificinformation that is free of charge to the end-user and that is re-usable.
In the context of research and innovation, 'scientific information' can refer to
(i) peer-reviewed scientific research articles (published in scholarly journals)
or
(ii) research data (data underlying publications, curated data and/or raw data).â
Guidelines on Open Access to Scientific Publications and Research Data
in Horizon 2020. Version 16 December 2013.
5. What are the benefits of open publications and data?
ď§ open publications and data reach a broader audience â they have more impact
ď§ people from outside of the academic community can use them â teachers, students,
doctors, entrepreneurs, non-governmental organizations
ď§ researchers representing far away scientific disciplines have access â facilitates
interdisciplinary studies
ď§ open data can be used in novel, unexpected ways, and included in larger datasets
ď§ archiving â safe long-term storage of digital copies of files
ď§ faster scholarly communication â research moves faster, there is less duplication of
results, less competitiveness, more cooperation, also international cooperation
ď§ easier to identify cases of scientific misconduct or fraud, and to check reproducibility
ď§ the use of open licenses allows text and data mining
7. âBy âopen accessâ to [peer-reviewed research literature], we mean its free
availability on the public internet, permitting any users to read, download, copy,
distribute, print, search, or link to the full texts of these articles, crawl them for
indexing, pass them as data to software, or use them for any other lawful purpose,
without financial, legal, or technical barriers
other than those inseparable from gaining access to the internet itself.
The only constraint on reproduction and distribution, and the only role for
copyright in this domain, should be to
give authors control over the integrity of their work and the right to be properly
acknowledged and cited.â
Budapest Open Access Initiative (BOAI), 2002
8. 1. Open repositories
The author makes the publication available through self-archiving â parallel to
its publication in any kind of research journal
2. Open access journals
The publisher makes the publication available directly from the journalâs site.
9. Open access to publications
Gold road:
Open journals and books
Green road:
Open repositories
10. Open access journals
An OA journal grants access to the full texts of all papers it publishes,
e.g. through its own website.
PLOS â Public Library of Science
https://www.plos.org/
BMC â BioMed Central
https://www.biomedcentral.com
eLife
https://elifesciences.org/
16. Publication repositories
An OA repository is where the author makes the publication
available, through self-archiving
â parallel to its publication in any chosen research journal.
17. Publication repositories
Institutional â run by a university, research institute, granting agency â
they collect the whole research output of the institution in one place.
Disciplinary â accepts deposits from researchers affiliated with any
institution, as long as they fit in the topic.
18. Institutional repositories
DASH â Digital Access to Scholarship at Harvard
https://dash.harvard.edu/
AMUR â Adam Mickiewicz University Repository
https://repozytorium.amu.edu.pl/
PubMed Central â run by the NIH (National Institutes of Health)
19. Disciplinary repositories
SSRN â Social Sciences Research Network
www.ssrn.com
arXiv â Physics, Mathematics, Computer Science,
Quantitative Biology, Quantitative Finance and
Statistics
http://arxiv.org/
20. What about papers that donât have a suitable
institutional nor disciplinary repository?
21. General repositories
(catch-all)
CeON Repository â publication repository,
run by ICM University of Warsaw
https://depot.ceon.pl/
Zenodo â publication and data repository,
run at CERN, funded by the EC
https://zenodo.org/
Figshare â publication and data repository,
run by a private company
https://figshare.com/
23. Financing of open repositories
Author
Repository
Reader
Institution
open
repository
(e.g. university, research institute, consortium
of institutions, granting agency, government, âŚ)
25. How to deposit your paper in a repository?
⢠Choose a repository
⢠Do you have the rights necessary to disseminate the paper?
⢠Deposit your paper: enter the metadata required by the repository
and upload your file
27. âResearch data is data that is collected, observed, or created,
for purposes of analysis to produce original research results.â
Definitions of research data
ââŚthe recorded factual material commonly accepted in the
scientific community as necessary to validate research findings.â
28. Numerical data
Text documents, lab notes
Questionnaires, responses, transcripts
Audiotapes, videotapes
Photographs, films
Artifacts, specimens, samples
Models, algorithms, scripts
Simulation results
Methodologies and workflows
Examples of research data:
29. Numerical data
Text documents, lab notes
Questionnaires, responses, transcripts
Audiotapes, videotapes
Photographs, films
Artifacts, specimens, samples
Models, algorithms, scripts
Simulation results
Methodologies and workflows
Examples of research data
The focus [in the context of open access] is on research data
that is available in digital form.
30. âOpen data and content can be freely used,
modified, and shared by anyone for any purpose.â
Open Knowledge Foundation
The Open Definition:
What is open data?
32. Research data archives / repositories
1. Secure long-term storage â trusted archive or repository
2. Stable location â stable digital identifier, e.g. DOI (digital
object identifier)
3. Visibility â well known among interested researchers,
well visible in search engines
33. Digital research data repositories
⢠specialized
⢠disciplinary
⢠institutional
⢠general (catch-all)
37. General data repositories
National research data repository:
Netherlands
National research data repository:
Poland
General repository:
publications and data
General repository:
publications and data
39. ⢠Articles describing data (data descriptors)
⢠The data itself is deposited in a repository
⢠Some journals also allow you to attach your data as Supplementary Material
Data journals
ď This is an addition to the repository system, not an alternative
40. Should all data be open? No.
But data existence should always be open:
⢠Allows discovery & negotiation on use
⢠Avoids pointless replication
Slide adapted from Kevin Ashley, DCC, CC-BY
Privacy protection (human subjects!)
National security issues
Interference with commercialization plans
Ethical issues, protection of endangered species, ...
41. How to deposit your data in a repository?
⢠Choose a repository
⢠Do you have the rights necessary to disseminate the data?
⢠Deposit your data: enter the metadata required by the repository and
upload your file(s)
Preparing sensible metadata and documentation and
organizing your files may take time!
42.
43. Mandate on open access to publications in H2020
Under Horizon 2020, each beneficiary must ensure open access to all peer-
reviewed scientific publications relating to its results.
In order to comply with this requirement, beneficiaries must, at the very least,
ensure that their publications, if any, can be read online, downloaded and printed.
However, as any additional rights such as the right to copy, distribute, search, link,
crawl, and mine increase the utility of the accessible publication, beneficiaries
should make every effort to provide for as many of them as possible.
44. Open research data pilot in H2020
âParticipating projects are required to deposit the research data described
above, preferably into a research data repository.
As far as possible, projects must then take measures to enable for third
parties to access, mine, exploit, reproduce and disseminate (free of
charge for any user) this research data.
One straightforward and effective way of doing this is to attach a Creative
Commons Licence (CC-BY or CC0 tool) to the data deposited.â
45. Open research data pilot in H2020
Projects may opt-out, butâŚ
âThe use of a detailed data management plan covering individual datasets is
required for funded projects participating in the Open Research Data Pilot.â
46. Thank you for your attention
Contact:
m.hoffman-sommer@icm.edu.pl
Hinweis der Redaktion
âOpen Science is the practice of science in such a way that others can collaborate and contribute, where research data, lab notes and other research processes are freely available, under terms that enable reuse, redistribution and reproduction of the research and its underlying data and methods.â
FOSTER project
âOpen access can be defined as the practice of providing on-line access to scientific1 information that is free of charge to the end-user and that is re-usable. In the context of research and innovation, 'scientific information' can refer to (i) peer-reviewed scientific research articles (published in scholarly journals) or (ii) research data (data underlying publications, curated data and/or raw data)â.
TEI is an XML-based standard for digital representation of texts.
Co postulowaĹ BOAI? JakÄ formÄ realizacji otwartego dostÄpu proponowaĹ? 2 moĹźliwoĹci: nastÄpny slajd.
1 â repozytoria â istotne sÄ kwestie prawne â skoro u kogoĹ coĹ publikujemy, no to w jakiĹ sposĂłb siÄ z tym wydawcÄ na coĹ umawiamy â o tym Krzysiek.
2 â czasopisma â istotne sÄ kwestie finansowania â skÄ d wezmÄ siÄ Ĺrodki na proces publikacyjny â o tym MichaĹ.
Open repositories:
The author makes the publication available through self-archiving â parallel to its publication in any kind of research journal
Open access journals:
The publisher makes the publication available directly from the journalâs site.
PLOS: Biology, Genetics, Pathogens, Medecine, Computational Biology, ONE
BMC:
eLife: Howard Hughes Medical Institute, Max-Planck Gesellschaft, Wellcome Trust.
Jak kaĹźdy rejestr: niekompletne!
PLOS: Biology, Genetics, Pathogens, Medecine, Computational Biology, ONE
BMC:
eLife: Howard Hughes Medical Institute, Max-Planck Gesellschaft, Wellcome Trust.
PMC: Kilka milionĂłw artykuĹĂłw.
Gromadzi wszystkie artykuĹy z badaĹ przez siebie finansowanych â sÄ deponowane albo bezpoĹrednio przez wydawcĂłw, albo przez autorĂłw.
Dodatkowo przyjmuje: finansowane przez inne wybrane wspĂłĹpracujÄ ce agencje grantowe; lub publikowane przez wybrane wspĂłĹpracujÄ ce czasopisma.
Samoarchiwizacja: tylko wtedy, gdy finansowanie z odpowiedniej agencji, ale czasopismo nie wspĂłĹpracuje i nie zdeponowaĹo w imieniu autora.
TrochÄ zaciera siÄ tu róşnica miÄdzy zielonÄ a zĹotÄ drogÄ . Stopniowa ewolucja od inst. do dziedzinowego.
W efekcie: doĹÄ dobra kontrola jakoĹci â tylko wybrane czasopisma albo wybrane instytucje grantowe.
arXiv â 1991
SSRN â 1992
Ĺťe Zenodo = publikacje + dane
OpowiedzieÄ o agregatorach â OpenAIRE!
To wszystko sÄ publiczne pieniÄ dze.
TEI is an XML-based standard for digital representation of texts.
US OSTP
Boston University, US
Dane publiczne â ze ĹşrĂłdeĹ rzÄ dowych i z administracji publicznej. Different especially in legal terms â who owns it.
Cultural: Data about cultural works and artefacts â for example titles and authors â and generally collected and held by galleries, libraries, archives and museums.
Science: Data that is produced as part of scientific research from astronomy to zoology.
Finance: Data such as government accounts (expenditure and revenue) and information on financial markets (stocks, shares, bonds etc).
Statistics: Data produced by statistical offices such as the census and key socioeconomic indicators.
Weather: The many types of information used to understand and predict the weather and climate.
Environment: Information related to the natural environment such presence and level of pollutants, the quality and rivers and seas.
Transport: Data such as timetables, routes, on-time statistics.
What do they provide us with?
Godne zaufania: instytucja nie zniknie, sensowne backupy, dbaĹoĹÄ o rzeczywistÄ dostÄpnoĹÄ (migracja do nowych formatĂłw albo na nowe rodzaje noĹnikĂłw, odĹwieĹźanie noĹnikĂłw, itd.)
TrwaĹa lokalizacja â moĹźliwoĹÄ cytowania danych.
Zalety wzglÄdem Supplementary Data lub wĹasnej strony internetowej.
So: is open sharing of research data a new phenomenon? No. It has always existed in the scientific community, and for several decades now has taken place through computer databases. Here we have 3 examples.
Liczne inicjatywy istniejÄ ce nawet od kilkudziesiÄciu lat â gĹĂłwnie repozytoria danych, np. GenBank (1982 w obecnej formie; NCBI â czÄĹÄ biblioteki NIH), Protein Data Bank (od 1971; wtedy Brookhaven Nat. Lab i Cambridge Univ.).
Obecnie czÄsto wymieniane pod wspĂłlnym hasĹem: Otwarta Nauka â Otwarty dostÄp do wynikĂłw badaĹ naukowych (cytat z czegoĹ z KE).
Od 1989: virtually every journal requires deposition of coordinates and experimental data as a prerequisite to publication (PDB).
Od 198???: czasopisma wymagajÄ , by sekwencje DNA byĹy deponowane.
PaleoBioDB (1998)
UK Data Archive: zaĹoĹźone w 1967 przez SSRC, oraz University of Essex. Otwarte dla wszystkich.
OgĂłlne: przyjmujÄ kaĹźdy rodzaj danych.
DANS: finansowane przez akademiÄ nauk.
Zenodo: w CERNie, finansowane przez KE.
RepOD: w ICMie.
FigShare: prywatna firma.
I will try to use H2020 policies as an exemplification of certain trends that exist in science worldwide â because in its policies the EC actually follows existing trends.
And here we are approaching legal issues.
And here we are approaching legal issues.
And here we see that everyone should know what a DMP is.
Krzysztof will tell you when you actually have the rights to disseminate your papers and data, and what to do in order to make them reusable.