Identity Awareness:
Toward an Invisible e-Infrastructure
for Identifying Data and Authors
Amir Aryani, Adrian Burton
Australian National Data Service
Identity Awareness
• Connecting data to
  • Researchers
  • Grants
  • Publications
  • Licence

ODIN Project
• Interoperability between
  • ORCID
  • DataCite
Identity Awareness
“My vision is a scientific community that
does not waste resources on recreating
data that have already been produced, in
particular if public money has helped to
collect those data in the first place.”
Neelie Kroes, Vice-President of the European Commission, Digital Agenda
Research Data Australia (RDA)

Number of published research collections in RDA
[Chart: number of collections (y-axis, 0 to 40,000) over time (x-axis ticks: 2009-11, 2011-09, 2012-01, 2012-05, 2012-07, 2012-09, 2012-10)]
Research Data Australia
Coverage of Published Research Collections
Identity Awareness
means knowing how to
• Identify the researchers who contributed to a dataset
• Identify the publications that use a dataset
• Identify the related grant or the research project
• Identify the licence for a dataset

[Diagram: Data linked to Researcher, Licence, Publication, and Grants and projects]
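
The four links listed on this slide can be sketched as a small record type. This is only an illustration: the class name, field names, and the `identity_gaps` helper are hypothetical and are not taken from RIF-CS, ORCID, or DataCite.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DatasetRecord:
    """Hypothetical dataset record; field names are illustrative only."""
    title: str
    researchers: List[str] = field(default_factory=list)   # researcher identifiers
    publications: List[str] = field(default_factory=list)  # publication identifiers
    grants: List[str] = field(default_factory=list)        # grant or project identifiers
    licence: Optional[str] = None                           # licence name or URL


def identity_gaps(record: DatasetRecord) -> List[str]:
    """Return the kinds of link that are still missing for this dataset."""
    gaps = []
    if not record.researchers:
        gaps.append("researcher")
    if not record.publications:
        gaps.append("publication")
    if not record.grants:
        gaps.append("grant or project")
    if not record.licence:
        gaps.append("licence")
    return gaps


# Placeholder values, not real identifiers.
example = DatasetRecord(
    title="Example collection",
    researchers=["researcher-id-1"],
    licence="example-licence",
)
print(identity_gaps(example))
```

In this sketch a dataset counts as fully "identity aware" when `identity_gaps` returns an empty list.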
RDA Quality Model for Data
     RIF-CS Elements                            Requirement             1   2   3
1    registry object                            Required                *   *   *
2    originating source                         Required                *   *   *
3    group                                      Required                *   *   *
4    key                                        Required                *   *   *
5    collection type                            Required                *   *   *
6    name/title                                 Required                    *   *
7    related party (researcher or organisation) Required                    *   *
8    description                                Required                    *   *
9    location/address                           Required                    *   *
10   rights (Licence)                           Required                    *   *
11   activity (grant or research project)       Required if available           *
12   subject                                    Recommended                     *
13   spatial coverage                           Recommended                     *
14   temporal coverage                          Recommended                     *
15   citation                                   Recommended                     *
16   identifier                                 Recommended                     *
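
A minimal sketch of how the table above could be applied, assuming that columns 1, 2, and 3 are cumulative quality levels and that a record reaches a level only when every element marked with * in that column is present. Both assumptions are an interpretation of the slide, not a statement of how RDA computes quality. The element names are shortened from the table, and `quality_level` is a hypothetical helper. The sketch also treats "Required if available" and "Recommended" elements as mandatory for level 3, which may be stricter than the actual model.

```python
from typing import Iterable

# Elements marked (*) in each column of the table; names shortened from the slide.
LEVEL_1 = {"registry object", "originating source", "group", "key", "collection type"}
LEVEL_2 = LEVEL_1 | {"name/title", "related party", "description",
                     "location/address", "rights"}
LEVEL_3 = LEVEL_2 | {"activity", "subject", "spatial coverage",
                     "temporal coverage", "citation", "identifier"}


def quality_level(present: Iterable[str]) -> int:
    """Return the highest level (0 to 3) whose marked elements are all present."""
    have = set(present)
    level = 0
    for n, required in enumerate((LEVEL_1, LEVEL_2, LEVEL_3), start=1):
        if required <= have:   # subset test: every marked element is present
            level = n
        else:
            break              # levels are cumulative, so stop at the first miss
    return level


# A record with every level-2 element except "rights" only reaches level 1.
print(quality_level(LEVEL_2 - {"rights"}))
```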
ODIN Project
Open Researcher & Contributor ID

Connecting researcher to
• Work (Publication) (Data)
• Grants
• Affiliations
• Patents

Partners:
• American Physical Society
• CrossRef
• Elsevier
• Thomson Reuters
• Wellcome Trust
• …
DataCite
1,104,998 Digital Object Identifiers (DOI) by DataCite
The information on this slide was captured on 30 Oct 2012
High energy physics data
Social Sciences Cohort Data
[Diagram: cohort data network, with the labels 1958, 1970, and External Data (Census, Health etc)]

Legend:
• Data Creator, Researcher, Author
• Birth Cohort Study dataset
• Non-Birth Cohort Study dataset
• Derived dataset
• Grey Literature
• Published article

• Citation
• Data Creator
• Derived Data Creator
• External Data input
• Author: Grey lit
• Author: Article
Acknowledgment
ANDS is supported by the Australian Government through the National Collaborative Research Infrastructure Strategy Program and the Education Investment Fund (EIF) Super Science Initiative.

The ODIN project is funded by the European Union under FP7 call INFRA-2012-3.3 (Grant Agreement number 312788).
Conclusion
Enabling identity awareness is an international challenge
that requires a collaborative effort. ANDS encourages your
collaboration in this area, particularly in investigating
these questions:

• How can we measure and improve identity awareness
  of research data?
• How can we measure and improve data reuse?
• How can we measure research impact?
• How can your organisation take advantage of some
  of the emerging global identity infrastructures?
