This document discusses building a data legacy by managing research data across one's career. It outlines challenges with 20th century approaches and a vision for 21st century research focused on data sharing. National and international developments supporting open data and research are described. The Australian Research Data Commons (ARDC) is presented as an investment to enable digital, data-driven research. Researchers are encouraged to make their data findable, accessible, interoperable and reusable to accelerate discovery and build a lasting data legacy.
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Managing Your Data Legacy for 21st Century Research
1. BUILDING YOUR DATA LEGACY
MANAGING DATA ACROSS YOUR CAREER
SLIDES: @VALUEMGMT CC-BY*: DR RICHARD FERRERS, ARDC
*except where indicated.
A Vision for 21st Century Research
2. ARDC - IMAGINING THE RESEARCH DATA COMMONS
20TH CENTURY RESEARCH
▸ Citations -> Grants
▸ Results -> Journal -> Citations
▸ Data -> drawer
▸ Focus: journal impact factor, citations,
grants
=> Research in the age of the Journal
3. ARDC - EMPOWERING 21ST CENTURY RESEARCH
WHAT IS THE PROBLEM WITH 20TH CENTURY RESEARCH?
▸ Scarcity and declining government research funding
▸ Power of journals, long delays in publishing
▸ Strong research competition for grants; low return on effort
▸ Deluge of literature; not enough time
▸ Incentives: promotion, funding due to citations, journal impact factor
=> stress, stress, stress
4. ARDC - ACCELERATING RESEARCH
A VISION FOR 21ST CENTURY RESEARCH
▸ data intensive; data commons (FAIR)
▸ globally collaborative (FA)
▸ cross-disciplinary (IR)
▸ Focus; data/software/services ->
impact -> value creation
=> research in the age of the Internet
machine-actionable | scalable
5. OVERVIEW
▸ 1.The Problem; old v new; (Why?)
▸ 2.The Context; national and international perspectives (What?)
▸ 3.The AU Government solution; NCRIS -> ARDC (How?)
▸ 4.The ARDC Solution; the Australian Research Data Commons
▸ 5.Your personal Solution; Managing your data legacy (What next?)
6. 1.1 “WE STAND AT THE
DAWN OF A NEW
GENERATION”
Internet, smartphone - enabled,
carbon constrained, 7.6B shipmates
connected => an internet world
8. ARDC - CATALOGUING AUSTRALIA’S RESEARCH DATA
1.3 WHERE IS YOUR FOCUS? ABUNDANCE OR SCARCITY…
Scarce Abundant
time bandwidth
$$ data/compute
grants people / effort
citations problems
9. BANDWIDTH ABUNDANCE - DATA OVER FIBRE OPTIC CABLE
1975: 45 Mb /sec over 10km (Wikipedia
2012: 1,000,000,000 Mb /sec over 50km cable (Wikipedia)
A 20,000,000x improvement in under 40 years.
Powering the internet.
11. 1.4 THE DATA DELUGE
Copyright xtremesport4u.xom Permission requested
12. ARDC - FOR DATA DRIVEN RESEARCH
1.5 THE DATA DELUGE EVIDENCE - DATA DOIS MINTED (JULY ’18) / PAPERS
World AU Figshare CrossRef
Total 14,000,000 265,000* 1,000,000 100M
2018 1,700,000 50,000 163,000 5.6M
2017 3,200,000 45,000 318,000 7M
* Paradisec = 228,000; one project, ten staff.Source: stats.datacite.org
13. 2. NATIONAL AND INTERNATIONAL
DEVELOPMENTS
TOWARDS 21ST CENTURY RESEARCH
14. ARDC - DIGITALLY TRANSFORMING, ACCELERATING AUSTRALIA'S 21ST CENTURY RESEARCH
2. NATIONAL / INTERNATIONAL DEVELOPMENTS IN DATA, RESEARCH
▸ Int’l - Royal Society (UK) - Science as an Open
Enterprise (2012)
▸ AU - National Data Statement (NISA)
(Dec 2015) | National Data Commissioner (Jul ’18)
▸ Int’l - China; Statement on Scientific Data
Management Measures (April 2018)
▸ Int’l - US; Statement of National Academies
- Science, Engineering and Medicine - Open Science by Design.
(July 2018)
16. RESEARCH DATA ALLIANCE - 7,000
SCIENTISTS; 137 COUNTRIES;
BUILDING BRIDGES TO DATA SHARING
2.2 RD-A in a nutshell
ARDC - ENABLING 21ST CENTURY RESEARCH
17. 2.3 AU National Data Statement (2015); innovation.gov.au
ARDC - FOR DATA INTENSIVE RESEARCH
”THE AUSTRALIAN GOVERNMENT COMMITS TO OPTIMISE THE USE AND
REUSE OF PUBLIC DATA.…
AUSTRALIAN GOVERNMENT ENTITIES WILL:
- MAKE NON-SENSITIVE DATA OPEN BY DEFAULT … [FOR] INNOVATION
AND PRODUCTIVITY [FOR] ALL …
- WHERE POSSIBLE, ENSURE NON-SENSITIVE PUBLICLY FUNDED
RESEARCH DATA IS MADE OPEN FOR USE AND REUSE”
18. ”THE NEW NATIONAL DATA COMMISSIONER WILL BE THE TRUSTED
OVERSEER OF A NEW DATA SHARING AND RELEASE FRAMEWORK,
ALLOWING AUSTRALIA TO REALISE THE FULL POTENTIAL OF DATA
WHILE MAINTAINING PUBLIC TRUST IN THE DATA SYSTEM” (LINK)
2.4 National Data Commissioner appointed 01.07.18
ARDC - DATA COMPUTE CLOUD PEOPLE
19. LET THE OPENING OF SCIENTIFIC DATA
BECOME THE NORM ”ALL SCIENTIFIC DATA
GENERATED IN CHINA MUST BE SUBMITTED TO
GOVERNMENT-SANCTIONED DATA CENTERS BEFORE
APPEARING IN PUBLICATIONS [FROM TODAY].” SCIENCE
2.5 Scientific Data Management Measures - China State Council
17.03.18
ARDC - POWERING 21ST CENTURY RESEARCH
20. “RESEARCH FUNDERS SHOULD PROVIDE
EXPLICIT AND CONSISTENT SUPPORT
FOR PRACTICES ... THAT FACILITATE
[OPEN SCIENCE] SHIFT IN CULTURE AND
INCENTIVES…” (P.130)
2.6 US National Academies of Science, Engineering and Medicine
(Jul 2018)
ARDC - ENABLING 21ST CENTURY RESEARCH
21. “RESEARCH INSTITUTIONS SHOULD WORK TO
CREATE A CULTURE THAT ACTIVELY SUPPORT
OPEN SCIENCE… BY BETTER REWARDING &
SUPPORTING RESEARCHERS ENGAGED IN
OPEN SCIENCE PRACTICES…” (P.130)
2.7 US National Academies of Science, Engineering and Medicine
(Jul 2018)
ARDC - ENABLING 21ST CENTURY RESEARCH
23. 3. NCRIS - NATIONAL RESEARCH
INFRASTRUCTURE INVESTMENT
ENABLING 21ST CENTURY RESEARCH
24. 3.WHAT IS NCRIS?
$1.9 BILLION CAPITAL (2018 - 30) [1]
$1.5 BILLION OPERATING (2015 - 25) [2]
Implementing 21st Century Research
in Australia - the Federal Government response
National Collaborative Research Infrastructure Strategy
25. 3.1 NCRIS -“GREAT RESEARCH
INFRASTRUCTURE ATTRACTS AND
NURTURES TALENT AND UNDERWRITES A
NATION’S REPUTATION FOR HIGH-
IMPACT RESEARCH”
AU Chief Scientist Alan Finkel, National Research Infrastructure
Roadmap (2016)
ARDC - ACCELERATING RESEARCH
26. ARDC - POWERING 21ST CENTURY RESEARCH
3.2 NCRIS NINE FOCUS AREAS
…. Australia … as an emerging or established global leader:
•Digital Data and eResearch Platforms
•Platforms for Humanities, Arts and Social
Sciences
•Characterisation
•Advanced Fabrication and Manufacturing
•Advanced Physics and Astronomy
•Earth and Environmental Systems
•Biosecurity
•Complex Biology
•Therapeutic Development
Dept of Education
27. 4. ARDC: AN NCRIS INVESTMENT
DIGITAL DATA AND ERESEARCH PLATFORMS
28. ARDC - DIGITALLY TRANSFORMING RESEARCH
4.1 WHAT IS ARDC?
▸ $20M per annum operating
▸ $77M capital investment
▸ Cloud compute | storage
▸ Research data management
▸ Five year Strategic Planning
- about to commence
30. ARDC - ENABLING 21ST CENTURY RESEARCH
4.2 DATA ENHANCED VIRTUAL LABS
Marine | Astro | Geo | Eco
Humanities / Social Science
Bio
Characterisation Imaging
Agriculture (Link)
Link to details
31. ARDC - TALKING DATA
4.3 WHAT COULD GO WRONG WITH YOUR DATA?
▸ Forgotten
▸ Lost
▸ Ignored
▸ Undocumented
32. ARDC - ENABLING 21ST CENTURY RESEARCH
4.4 WHAT DO I DO? RSCH DATA MANAGEMENT
=> FAIR data
▸ Findable | Accessible
▸ Interoperable | Reuseable
33. 5. AS A 21ST CENTURY RESEARCHER
MANAGING YOUR DATA LEGACY - AT DEAKIN
34. ARDC - TRAINING RESEARCHERS IN FAIR DATA
5.1 HOW DO I DO - FAIR DATA?
▸ Describe your data (FA)
▸ Licence your data (R)
▸ Publish your data descriptions to your CV (A)
▸ Store your data (safely, long term, identifiably)
▸ Promote your data (A)
▸ Connect your data (grants, publications,
literature, problem statements) (I)
=> Estimated Cost: 1 FTE per 100 Rschrs
36. ARDC - ENVISIONING 21ST CENTURY RESEARCH
5.3 WHAT CAN I DO RIGHT NOW?
▸ Get an ORCiD - a permanent identifier for you
eg RICHARD FERRERS, PAUL WONG
▸ Describe your data assets in your CV (LinkedIn)
▸ Get DOIs for your most valuable data (Figshare,
DRO - Deakin Research Online)
▸ Cite your data in your publications eg AJTDE
▸ Link your data descriptions to publications,
grants, your research team, key literature
37. ARDC - VISIONARY THINKING
5.4 WHAT CAN I THINK ABOUT DOING NEXT?
▸ Start your new research with a Literature
Review and a Data Review
▸ Submit your data to a Data Journal
▸ Learn about Making Data FAIR
▸ Consider if your next research data can be
OPEN, and if not FAIR
▸ Link your ORCid to Research Impact Story
38. ARDC - VISIONARY THINKING
5.5 HOW CAN I BUILD A DATA LEGACY?
▸ History of your research data (FA); new vision
▸ Linked from your CV (FA); new problems
▸ Encourage your research community to agree
on data standards (IR); new process
▸ Engage your user community to reuse and
extend your data; (IR) new tools/services
▸ Train the next generation to build on your
work; new researchers
39. ARDC - ENVISIONING 21ST CENTURY RESEARCH
5.6 KEY TAKEAWAYS
▸ DATA IS OUR VISION FOR THE FUTURE
▸ DATA IS ABUNDANT
▸ DATA NEEDS DESCRIPTION AND STORAGE
▸ DATA NEEDS TO BE CONNECTED
▸ EXPLOIT NATIONAL INVESTMENT IN DATA
▸ MAKE YOUR DATA FAIR OR OPEN
▸ BUILD YOUR DATA LEGACY - YOUR CV
40. “IN 100 YEARS, IT WILL BE MY DATA
THAT IS MY LASTING LEGACY.”
5.6 Prof. Craig Johnson,
ARDC - EMPOWERING 21ST CENTURY RESEARCH
41. WHAT WILL BE YOUR LASTING LEGACY?
5.7 Challenge for a Deakin Early Career Researchers
ARDC - HERE TO HELP
43. … SUPPORT THOSE AROUND YOU TO
MAKE THE CHANGE TO 21ST CENTURY
RESEARCH…
5.9 We are all journeying together
ARDC - ENABLING 21ST CENTURY RESEARCH
44. RICHARD FERRERS
RESEARCH DATA SPECIALIST
MYDATA - GOOGLE: FERRERS FIGSHARE
ARDC - ACCELERATING 21ST CENTURY RESEARCH
ORCiD: LinkedIn
Figshare: MyData
@valuemgmt