1. Data > Research (Dinkum Science)
Andy Turner
http://bit.ly/ResearchDataChampion
Turner, A.G.D. (2018) FAIR Data > Dinkum Research. Presentation slides for the Keynote originally titled
“FAIR Data > Dinkum Science” (to be) presented at the Jisc Research Data Champions Day, 2018-03-26.
Judicious Universal Scientific Thorough
2. Outline
● Getting to know each other (GTKEO)
● Background (B)
● Key points (KP)
● Plans (P)
GTKEO 1/5
3. I am a researcher based in the School of Geography
at the University of Leeds
GTKEO 2/5
4. Who are we?
● We are not just here!
○ Live Stream
■ Channel
● How diverse are we?
● Who are our employers?
○ Universities, information/infrastructure organisations, independents (journalists etc.)
● Who are we reporting to/representing?
● What roles do we have that are relevant to this keynote talk and the day?
○ Administration/Library (research integrity; Ethics, Research Data Management (RDM)...)
○ Research (data; assimilation, processing: reviewers; publication, grant applications...)
○ Teaching (research postgraduate supervision, dissertation supervision...)
GTKEO 3/5
6. Subject Map of Science from
http://www.plosone.org/article/info:doi/10.1371/journal.pone.0039464?imageURI=info:doi/10.1371/journal.pone.0039464.g001
GTKEO 5/5
7. My Jisc Research Data Champion background
● Specialism: computational geography (20 years of experience)
● Open Science Advocate
○ Background in e-Research (e-Science)
■ Adoptee of more open research protocols
○ Data sharing evangeliser
● Java programmer
○ I use: Integrated Development Environments, Build and deployment tools, Versioning
Repositories
○ I try to develop well documented, well tested code
● Research Data Leeds
○ The Research Data Management Service at the University of Leeds
○ Research Data Advisor
● Research Ethics Administration
B 1/8
8. ● Directions for Research Data Management in UK Universities Meeting,
Cambridge, November 2014
○ http://bit.ly/D4RDM14
● Jisc Research Data Network Event York 2017-06-27 and 2017-06-28
○ https://docs.google.com/document/d/1uAZcttlUVhj8mwnAxq9aeTJ7RtXr0RSEVciXjic4AAg/edi
t
● Engaging Researchers in Good Data Management Workshop Cambridge
2017-11-15
○ https://docs.google.com/document/d/1-
ZxvNzB3dU8mYD4s79pIubJAFvzy0sHVFCSoSz5iaFE/edit#
B 2/8
9. Background to this talk
● Original brief:
○ A ‘keynote’ talk on Open Science and Research Data
● What is “Open” Science?
○ Is there such a thing as “closed” science?
■ There is research that is not accessible, but science is something that is communicated
to form (general) knowledge (even though some knowledge is dangerous!
○ My friend Jon Tennant and Knowledge Exchange
■ “Open Science is Just Science Done Right”
○ “Open Science is the practice of science in such a way that others can collaborate and contribute, where
research data, lab notes and other research processes are freely available, under terms that enable reuse,
redistribution and reproduction of the research and its underlying data and methods.”
○ “Open science is the movement to make scientific research, data and dissemination accessible to all levels of an inquiring
society.”
B 3/8
10. ● Panton Principles (2009)
○ https://en.wikipedia.org/wiki/Panton_Principles
○ https://pantonprinciples.org/
● Open Scholarship
○ Knowledge Exchange Open Scholarship Workshop, Paris, September 2017
■ http://bit.ly/OpenScholarship2017-09-27
■ An account of the Knowledge Exchange workshop on Open Scholarship, Paris 2017 and
a menu for possible actions for 2018 and beyond.
● http://repository.jisc.ac.uk/6792/1/Knowledge_Exchange_Moving_from_ambition_t
o_reality_Feb_2018.pdf
B 4/8
11. ● FAIR Data Principles
○ https://www.force11.org/group/fairgroup/fairprinciples
○ Findable
■ F1. (meta)data are assigned a globally unique and eternally persistent identifier.
■ F2. data are described with rich metadata.
■ F3. (meta)data are registered or indexed in a searchable resource.
■ F4. metadata specify the data identifier.
B 5/8
12. ○ Accessible
■ A1 (meta)data are retrievable by their identifier using a standardized communications protocol.
● A1.1 the protocol is open, free, and universally implementable.
● A1.2 the protocol allows for an authentication and authorization procedure, where necessary.
■ A2 metadata are accessible, even when the data are no longer available.
○ Interoperable
■ I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge
representation.
■ I2. (meta)data use vocabularies that follow FAIR principles.
■ I3. (meta)data include qualified references to other (meta)data.
B 6/8
13. ○ Re-usable
■ R1. meta(data) have a plurality of accurate and relevant attributes.
● R1.1. (meta)data are released with a clear and accessible data usage license.
● R1.2. (meta)data are associated with their provenance.
■ R1.3. (meta)data meet domain-relevant community standards.
● “As open as possible as closed as necessary”
○ http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-
oa-data-mgt_en.pdf
B 7/8
14. Open Science Origins
● The Antarctic Treaty
(1959) specifies in
Article III that
scientific
observations and
results made in
Antarctica must be
'made freely
available.'
B 8/8
15. Data management for reproducible research
● How do I get your data
○ Online repositories with versioning and APIs for data access
● How do I use your data?
○ Documentation, metadata, common data formats, data packages
● How do I trust your data?
○ Record of provenance and processing, versioning
● How do I build on your data?
○ Record of provenance and processing, compatible content, linkable to other data
● What am I allowed to do with your data?
○ Licences, terms of use, data access agreements, ethics
KP 1/4
16. Code management for reproducible research
● How do I get your code?
○ Online repositories and persistent archives with versioning support
● How do I use your code?
○ Documentation, examples, packages, virtual machines, containers
● How do I trust your code?
○ Tests, examples, readable code
● How do I build on your code?
○ Documentation, readable code, tests
● What am I allowed to do with your code?
○ Licence
KP 2/4
17. Opening up and revolutionising Scholarly
Communication
● Scholarly communication was arguably slow to embrace the Web. Despite
this embrace being something of an original motivation for it.
○ It seems like the Web revolutionised the world and is now finally revolutionising scholarly
communications.
● Revolution are sometimes slow and take longer that you might expect.
● Empires close to collapse are often at their most violent!
● More Open Peer Review
○ Why not make an open pledge:
■ http://www.openaccesspledge.com/
■ ?
KP 3/4
18. You/we are not alone :-)
● Open Knowledge
● Force 11 Scholarly Communications Institute
● Knowledge Exchange
● Open Science Framework
● Research Data Alliance
KP 4/4
19. Plans
● Research Data Management Case Studies
○ Colleagues and Students at/from Leeds
● Research Data Audit
○ Research Data Registry/Catalogue
○ Digital Asset Framework
● Research Data Advocacy
● Lead by example and surf the Open Science wave
P 1/1
21. Collaborative note taking for the event
● Research Data Champions Day 2018-03-26
○ http://bit.ly/RDCD18
22. Links
● O’Reilly, M. (2017) Data as code Data management for reproducible research
○ http://rse.ac.uk/conf2017/wp-content/uploads/sites/2/2017/11/Data-as-code-Data-
management-for-reproducible-research-Martin-OReilly.pdf
● “We are entering an era of open science” says EU Vice President Neelie
Kroes at launch of new global Research Data Alliance
○ https://blog.okfn.org/2013/03/21/we-are-entering-an-era-of-open-science-says-eu-vp-neelie-
kroes/
23. Thank you for your attention, contributions and
participation here at Jisc HQ
● A big shout out to our friends in Cardiff at the Software Sustainability Institute
Collaborations Workshop #CollabW18
○ https://www.software.ac.uk/cw18
24. Open Science @openscience Mar 15
"Anyone, anywhere in the world should have free, unhindered access to not just my research, but to the research of every great and
enquiring mind across the spectrum of human understanding." – Prof. Stephen Hawking
Images
FAIR By SangyaPundir - Own work, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=53414062
Monash godess https://blogs.monash.edu/someofourparts/files/2014/02/DOJ-ITN-corner.jpg
Using logarithmic maps of the universe from Princeton and images from NASA, Pablo Carlos Budassi created an image that shows the observable universe in one disc. https://futurism.com/universe-looks-like-one-image/
https://commons.wikimedia.org/wiki/File:Wikimedia-research-blank.png Wikimedia Foundation, Wikimedia-research-blank, CC BY-SA 3.0
Rigour image adapted from https://emc2andallthat.wordpress.com/2014/07/22/weasel-words-in-education-part-5-rigour/
I want to find out a bit about who is engaging with this keynote directly focussing on the overlaps between the various roles we have in the realm of generating and exchanging knowledge and information that might helps us understand the past, present and future.
Image
https://eigenblogger.com/2010/09/10/post1594/
Images:
https://goo.gl/images/12GZTW
https://goo.gl/images/5uYJXG
https://goo.gl/images/qDcyXg
Images
Open Science is Just Science Done Right https://twitter.com/shefw/status/973220499575787520