Can we predict dependencies using domain information?
Identity Awareness: Toward an Invisible e-Infrastructure for Identifying Data and Authors
1. Identity Awareness:
Toward an Invisible e-Infrastructure
for Identifying Data and Authors
Amir Aryani, Adrian Burton
Australian National Data Service
2. Identity Awareness
• Connecting data to
• Researchers
• Grants
• Publications
• Licence
ODIN Project
• Interoperability between
• ORCID
• DataCite
4. “My vision is a scientific community that
does not waste resources on recreating
data that have already been produced, in
particular if public money has helped to
collect those data in the first place.”
Neelie Kroes, Vice-President of the European Commission, Digital Agenda
5. Research Data Australia (RDA)
Number of published research
collections in RDA
40,000"
30,000"
20,000"
10,000"
0"
2009)11" 2011)09" 2012)01" 2012)05" 2012)07" 2012)09" 2012)10"
7. Identity Awareness
means knowing how to
• Identify the researchers who contributed to a dataset
• Identify the publications that use a dataset
• Identify the related grant or the research project
• Identify the licence for a dataset
Researcher Licence
Data
Grants and
Publication
projects
7
8. RDA Quality Model for Data
RIF-CS Elements Requirement 1 2 3
1 registry object Required * * *
2 originating source Required * * *
3 group Required * * *
4 key Required * * *
5 collection type Required * * *
6 name/title Required * *
7 related party (researcher or organisation) Required * *
8 description Required * *
9 location/address Required * *
10 rights (Licence) Required * *
11 activity (grant or research project) Required if available *
12 subject Recommended *
13 spatial coverage Recommended *
14 temporal coverage Recommended *
15 citation Recommended *
16 identifier Recommended *
17. Data Creator,
Researcher, Author
Birth Cohort Study
dataset
Non- Birth Cohort
Study dataset
Derived dataset
Grey Literature
1958
Published article
Citation
Data Creator
Derived Data Creator
External Data input
Author: Grey lit
External Data
Author: Article
(Census, Health etc )
1970
18. Data Creator,
Researcher, Author
Birth Cohort Study
dataset
Non- Birth Cohort
Study dataset
Derived dataset
Grey Literature
1958
Published article
Citation
Data Creator
Derived Data Creator
External Data input
Author: Grey lit
External Data
Author: Article
(Census, Health etc )
1970
19. Data Creator,
Researcher, Author
Birth Cohort Study
dataset
Non- Birth Cohort
Study dataset
Derived dataset
Grey Literature
1958
Published article
Citation
Data Creator
Derived Data Creator
External Data input
Author: Grey lit
External Data
Author: Artticle
(Census, Health etc )
1970
20. Data Creator,
Researcher, Author
Birth Cohort Study
dataset
Non- Birth Cohort
Study dataset
Derived dataset
Grey Literature
1958
Published article
Citation
Data Creator
Derived Data Creator
External Data input
Author: Grey lit
External Data
Author: Article
(Census, Health etc )
1970
21. Acknowledgment
ANDS is supported by the The ODIN project is funded by the
Australian Government through European Union under FP7 call
the National Collaborative INFRA-2012-3.3 (Grant Agreement
Research Infrastructure Strategy number 312788)
Program and the Education
Investment Fund (EIF) Super
Science Initiative
22. Conclusion
Enabling identity awareness is an international challenge
that requires a collaborative effort. ANDS encourage your
collaboration in this area and particularly to investigate
these questions:
• How can we measure and improve identity awareness
of research data?
• How can we measure and improve data reuse?
• How can we measure research impact?
• How can your organisation take advantage of the some
of the emerging global identity infrastructures?