SlideShare a Scribd company logo
1 of 10
IPNI & PhytoKeys
   Integration
 Nicky Nicolson (RBG Kew)
What is IPNI?
Nomenclator for vascular plants.
Collaboration btw RBG Kew (UK), Harvard
  University Herbaria (US) and Australian
  National Botanic Garden, Canberra (AU)
Composed of three parts:
  • Data
  • Expertise
  • Services
What data does IPNI hold?
• What data types:
  – ICN governed nomenclatural acts
  – Standardised author list
  – Publications
• Which groups:
  – Vascular plants
• Which ranks:
  – Family and below
How is data entered?
• Data entry:
  – From literature scanning, journals received by
    library at Kew, Harvard, Canberra
  – User reports of missing nomenclatural acts,
    usually accompanied by a link to digitised
    literature page (BHL)
• How many?
  – About 7400 names entered in average year
  – About 6100 nomenclatural acts published / year
  – … of these about 2800 are tax. novs.
Curation - after data entry
• Full audit history on core objects – names /
  authors / publications.
• Average 300,000 edits on name records / year
• Standardisation effort ongoing :
  – Assessment of nomenclatural status
  – Epithet
  – Author citation
  – Publication title
  – Collation
  – Year
Current Phytokeys “integration”
• Phytokeys staff email details to IPNI
• IPNI editor creates record and returns IDs to
  Phytokeys
• ID embedded in publication

              email != integration

…but it is an opportunity to converse about the
 content of the nomenclatural act, and an
 opportunity to correct if necessary
Future Phytokeys integration
• Phytokeys submits structured (XML) message
  to IPNI service
• IPNI service creates record “on-demand” and
  returns ID to Phytokeys in structured response
• ID embedded in publication

IPNI retains control of un-suppression

No human communication – but we need to still
 have the opportunity to correct
Evaluating it
Benefits
• Nomenclatural problems resolved pre-
  publication (workflow slower, but quality
  higher)
• IPNI editorial role switched from keying to
  checking
• IPNI identifiers seeded into literature
• Published data more usable
• Useful (automated) route into IPNI
Costs (some but far smaller) :
• Development / testing time
Future
• Extend this model to work with other
  publishers
• A step towards registration? This changes the
  game:
  – Currently: a name missed is to IPNI's detriment -
    our dataset is deficient
  – With registration: a name missed will not be valid
    under the code

More Related Content

What's hot

Open Access NBIC Workshop April 19, 2011
Open Access NBIC Workshop April 19, 2011Open Access NBIC Workshop April 19, 2011
Open Access NBIC Workshop April 19, 2011Philip Bourne
 
ScienceDirect Presentation: Seton Hall
ScienceDirect Presentation: Seton HallScienceDirect Presentation: Seton Hall
ScienceDirect Presentation: Seton Hallrachelmccullough
 
Enabling Semantically Aware Software Applications
Enabling Semantically Aware Software Applications Enabling Semantically Aware Software Applications
Enabling Semantically Aware Software Applications Trish Whetzel
 
RefWorks-Excel-RefWorks - deleting duplicates made easy?
RefWorks-Excel-RefWorks - deleting duplicates made easy?RefWorks-Excel-RefWorks - deleting duplicates made easy?
RefWorks-Excel-RefWorks - deleting duplicates made easy?judithgulpers
 
RDA - Long Tail Data Interest Group - NPG Scientitic Data oveview
RDA - Long Tail Data Interest Group - NPG Scientitic Data oveviewRDA - Long Tail Data Interest Group - NPG Scientitic Data oveview
RDA - Long Tail Data Interest Group - NPG Scientitic Data oveviewSusanna-Assunta Sansone
 
Leicester Research Archive (LRA): the work of a repository administrator
Leicester Research Archive (LRA): the work of a repository administratorLeicester Research Archive (LRA): the work of a repository administrator
Leicester Research Archive (LRA): the work of a repository administratorGaz Johnson
 

What's hot (7)

Open Access NBIC Workshop April 19, 2011
Open Access NBIC Workshop April 19, 2011Open Access NBIC Workshop April 19, 2011
Open Access NBIC Workshop April 19, 2011
 
ScienceDirect Presentation: Seton Hall
ScienceDirect Presentation: Seton HallScienceDirect Presentation: Seton Hall
ScienceDirect Presentation: Seton Hall
 
Enabling Semantically Aware Software Applications
Enabling Semantically Aware Software Applications Enabling Semantically Aware Software Applications
Enabling Semantically Aware Software Applications
 
RefWorks-Excel-RefWorks - deleting duplicates made easy?
RefWorks-Excel-RefWorks - deleting duplicates made easy?RefWorks-Excel-RefWorks - deleting duplicates made easy?
RefWorks-Excel-RefWorks - deleting duplicates made easy?
 
Accessing The Materials You Need
Accessing The Materials You NeedAccessing The Materials You Need
Accessing The Materials You Need
 
RDA - Long Tail Data Interest Group - NPG Scientitic Data oveview
RDA - Long Tail Data Interest Group - NPG Scientitic Data oveviewRDA - Long Tail Data Interest Group - NPG Scientitic Data oveview
RDA - Long Tail Data Interest Group - NPG Scientitic Data oveview
 
Leicester Research Archive (LRA): the work of a repository administrator
Leicester Research Archive (LRA): the work of a repository administratorLeicester Research Archive (LRA): the work of a repository administrator
Leicester Research Archive (LRA): the work of a repository administrator
 

Viewers also liked

Linq 2013 plenary_keynote_bates
Linq 2013 plenary_keynote_batesLinq 2013 plenary_keynote_bates
Linq 2013 plenary_keynote_batesLINQ_Conference
 
How to deliver rich, real-time apps - AppsWorld 2014
How to deliver rich, real-time apps - AppsWorld 2014How to deliver rich, real-time apps - AppsWorld 2014
How to deliver rich, real-time apps - AppsWorld 2014Andy Piper
 
Iref franchisee-presentation
Iref franchisee-presentationIref franchisee-presentation
Iref franchisee-presentationreddvise
 
Linq 2013 session_red_1_kameas
Linq 2013 session_red_1_kameasLinq 2013 session_red_1_kameas
Linq 2013 session_red_1_kameasLINQ_Conference
 
Build a shower cubicle
Build a shower cubicleBuild a shower cubicle
Build a shower cubicleZulaiha Amaria
 

Viewers also liked (6)

Linq 2013 plenary_keynote_bates
Linq 2013 plenary_keynote_batesLinq 2013 plenary_keynote_bates
Linq 2013 plenary_keynote_bates
 
How to deliver rich, real-time apps - AppsWorld 2014
How to deliver rich, real-time apps - AppsWorld 2014How to deliver rich, real-time apps - AppsWorld 2014
How to deliver rich, real-time apps - AppsWorld 2014
 
Iref franchisee-presentation
Iref franchisee-presentationIref franchisee-presentation
Iref franchisee-presentation
 
Linq 2013 session_red_1_kameas
Linq 2013 session_red_1_kameasLinq 2013 session_red_1_kameas
Linq 2013 session_red_1_kameas
 
Build a shower cubicle
Build a shower cubicleBuild a shower cubicle
Build a shower cubicle
 
Me and my movies presentation
Me and my movies presentationMe and my movies presentation
Me and my movies presentation
 

Similar to IPNI PhytoKeys integration

Advancing the International Plant Names Index (IPNI)
Advancing the International Plant Names Index (IPNI) Advancing the International Plant Names Index (IPNI)
Advancing the International Plant Names Index (IPNI) nickyn
 
Bibliographic References in BHL
Bibliographic References in BHLBibliographic References in BHL
Bibliographic References in BHLWilliam Ulate
 
Elsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing IndustryElsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing IndustryAntonio Gulli
 
ICIC 2013 Conference Proceedings Antony Williams Royal Society of Chemistry
ICIC 2013 Conference Proceedings Antony Williams Royal Society of ChemistryICIC 2013 Conference Proceedings Antony Williams Royal Society of Chemistry
ICIC 2013 Conference Proceedings Antony Williams Royal Society of ChemistryDr. Haxel Consult
 
Semantics as a service at EMBL-EBI
Semantics as a service at EMBL-EBISemantics as a service at EMBL-EBI
Semantics as a service at EMBL-EBISimon Jupp
 
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011sspeiser
 
Crushing, Blending, and Stretching Transactional Data
Crushing, Blending, and Stretching Transactional DataCrushing, Blending, and Stretching Transactional Data
Crushing, Blending, and Stretching Transactional DataRay Schwartz
 
DOIs for African Partner Journals
DOIs for African Partner JournalsDOIs for African Partner Journals
DOIs for African Partner JournalsCarol Anne Meyer
 
New Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsNew Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsJohn Kunze
 
Accelerating Delivery of Data Products - The EBSCO Way
Accelerating Delivery of Data Products - The EBSCO WayAccelerating Delivery of Data Products - The EBSCO Way
Accelerating Delivery of Data Products - The EBSCO WayMongoDB
 
Do you Need a New System? Jane Burke at ALIA 2013
Do you Need a New System? Jane Burke at ALIA 2013Do you Need a New System? Jane Burke at ALIA 2013
Do you Need a New System? Jane Burke at ALIA 2013ProQuest
 
Globus in European Life Science
Globus in European Life ScienceGlobus in European Life Science
Globus in European Life ScienceGlobus
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsKen Karapetyan
 

Similar to IPNI PhytoKeys integration (20)

Advancing the International Plant Names Index (IPNI)
Advancing the International Plant Names Index (IPNI) Advancing the International Plant Names Index (IPNI)
Advancing the International Plant Names Index (IPNI)
 
Bibliographic References in BHL
Bibliographic References in BHLBibliographic References in BHL
Bibliographic References in BHL
 
Oct 15 NISO Webinar: 21st Century Resource Sharing: Which Inter-Library Loan ...
Oct 15 NISO Webinar: 21st Century Resource Sharing: Which Inter-Library Loan ...Oct 15 NISO Webinar: 21st Century Resource Sharing: Which Inter-Library Loan ...
Oct 15 NISO Webinar: 21st Century Resource Sharing: Which Inter-Library Loan ...
 
Elsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing IndustryElsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing Industry
 
ICIC 2013 Conference Proceedings Antony Williams Royal Society of Chemistry
ICIC 2013 Conference Proceedings Antony Williams Royal Society of ChemistryICIC 2013 Conference Proceedings Antony Williams Royal Society of Chemistry
ICIC 2013 Conference Proceedings Antony Williams Royal Society of Chemistry
 
Big data challenges associated with building a national data repository for c...
Big data challenges associated with building a national data repository for c...Big data challenges associated with building a national data repository for c...
Big data challenges associated with building a national data repository for c...
 
Semantics as a service at EMBL-EBI
Semantics as a service at EMBL-EBISemantics as a service at EMBL-EBI
Semantics as a service at EMBL-EBI
 
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
 
Crushing, Blending, and Stretching Transactional Data
Crushing, Blending, and Stretching Transactional DataCrushing, Blending, and Stretching Transactional Data
Crushing, Blending, and Stretching Transactional Data
 
Martone acs presentation
Martone acs presentationMartone acs presentation
Martone acs presentation
 
DOIs for African Partner Journals
DOIs for African Partner JournalsDOIs for African Partner Journals
DOIs for African Partner Journals
 
New Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsNew Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data Citations
 
eScience Resources for the Chemistry Community from the Royal Society of Chem...
eScience Resources for the Chemistry Community from the Royal Society of Chem...eScience Resources for the Chemistry Community from the Royal Society of Chem...
eScience Resources for the Chemistry Community from the Royal Society of Chem...
 
Accelerating Delivery of Data Products - The EBSCO Way
Accelerating Delivery of Data Products - The EBSCO WayAccelerating Delivery of Data Products - The EBSCO Way
Accelerating Delivery of Data Products - The EBSCO Way
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
 
Do you Need a New System? Jane Burke at ALIA 2013
Do you Need a New System? Jane Burke at ALIA 2013Do you Need a New System? Jane Burke at ALIA 2013
Do you Need a New System? Jane Burke at ALIA 2013
 
Globus in European Life Science
Globus in European Life ScienceGlobus in European Life Science
Globus in European Life Science
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
eScience at the Royal Society of Chemistry and our current initiatives
eScience at the Royal Society of Chemistry and our current initiativeseScience at the Royal Society of Chemistry and our current initiatives
eScience at the Royal Society of Chemistry and our current initiatives
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
 

More from nickyn

829 tdwg-2015-nicolson-kew-strings-to-things
829 tdwg-2015-nicolson-kew-strings-to-things829 tdwg-2015-nicolson-kew-strings-to-things
829 tdwg-2015-nicolson-kew-strings-to-thingsnickyn
 
Rda p5-env-plenary-nn
Rda p5-env-plenary-nnRda p5-env-plenary-nn
Rda p5-env-plenary-nnnickyn
 
Challenges in developing names services - RDA
Challenges in developing names services - RDAChallenges in developing names services - RDA
Challenges in developing names services - RDAnickyn
 
Kew at the pro-iBiosphere data hackathon
Kew at the pro-iBiosphere data hackathonKew at the pro-iBiosphere data hackathon
Kew at the pro-iBiosphere data hackathonnickyn
 
names-backbone-graph-TDWG
names-backbone-graph-TDWGnames-backbone-graph-TDWG
names-backbone-graph-TDWGnickyn
 
A names backbone - a graph of taxonomy
A names backbone - a graph of taxonomyA names backbone - a graph of taxonomy
A names backbone - a graph of taxonomynickyn
 
Services and Kew's (names) data
Services and Kew's (names) dataServices and Kew's (names) data
Services and Kew's (names) datanickyn
 
Building a names backbone
Building a names backboneBuilding a names backbone
Building a names backbonenickyn
 

More from nickyn (8)

829 tdwg-2015-nicolson-kew-strings-to-things
829 tdwg-2015-nicolson-kew-strings-to-things829 tdwg-2015-nicolson-kew-strings-to-things
829 tdwg-2015-nicolson-kew-strings-to-things
 
Rda p5-env-plenary-nn
Rda p5-env-plenary-nnRda p5-env-plenary-nn
Rda p5-env-plenary-nn
 
Challenges in developing names services - RDA
Challenges in developing names services - RDAChallenges in developing names services - RDA
Challenges in developing names services - RDA
 
Kew at the pro-iBiosphere data hackathon
Kew at the pro-iBiosphere data hackathonKew at the pro-iBiosphere data hackathon
Kew at the pro-iBiosphere data hackathon
 
names-backbone-graph-TDWG
names-backbone-graph-TDWGnames-backbone-graph-TDWG
names-backbone-graph-TDWG
 
A names backbone - a graph of taxonomy
A names backbone - a graph of taxonomyA names backbone - a graph of taxonomy
A names backbone - a graph of taxonomy
 
Services and Kew's (names) data
Services and Kew's (names) dataServices and Kew's (names) data
Services and Kew's (names) data
 
Building a names backbone
Building a names backboneBuilding a names backbone
Building a names backbone
 

IPNI PhytoKeys integration

  • 1. IPNI & PhytoKeys Integration Nicky Nicolson (RBG Kew)
  • 2. What is IPNI? Nomenclator for vascular plants. Collaboration btw RBG Kew (UK), Harvard University Herbaria (US) and Australian National Botanic Garden, Canberra (AU) Composed of three parts: • Data • Expertise • Services
  • 3. What data does IPNI hold? • What data types: – ICN governed nomenclatural acts – Standardised author list – Publications • Which groups: – Vascular plants • Which ranks: – Family and below
  • 4.
  • 5. How is data entered? • Data entry: – From literature scanning, journals received by library at Kew, Harvard, Canberra – User reports of missing nomenclatural acts, usually accompanied by a link to digitised literature page (BHL) • How many? – About 7400 names entered in average year – About 6100 nomenclatural acts published / year – … of these about 2800 are tax. novs.
  • 6. Curation - after data entry • Full audit history on core objects – names / authors / publications. • Average 300,000 edits on name records / year • Standardisation effort ongoing : – Assessment of nomenclatural status – Epithet – Author citation – Publication title – Collation – Year
  • 7. Current Phytokeys “integration” • Phytokeys staff email details to IPNI • IPNI editor creates record and returns IDs to Phytokeys • ID embedded in publication email != integration …but it is an opportunity to converse about the content of the nomenclatural act, and an opportunity to correct if necessary
  • 8. Future Phytokeys integration • Phytokeys submits structured (XML) message to IPNI service • IPNI service creates record “on-demand” and returns ID to Phytokeys in structured response • ID embedded in publication IPNI retains control of un-suppression No human communication – but we need to still have the opportunity to correct
  • 9. Evaluating it Benefits • Nomenclatural problems resolved pre- publication (workflow slower, but quality higher) • IPNI editorial role switched from keying to checking • IPNI identifiers seeded into literature • Published data more usable • Useful (automated) route into IPNI Costs (some but far smaller) : • Development / testing time
  • 10. Future • Extend this model to work with other publishers • A step towards registration? This changes the game: – Currently: a name missed is to IPNI's detriment - our dataset is deficient – With registration: a name missed will not be valid under the code

Editor's Notes

  1. Provider of objective nomenclatural facts – the basis for taxonomic work. Scope (vasc plants) important – botanical code is wider, and Phytokeys scope is wider. IPNI is not just a dataset – it is actively / expertly curated
  2. Standardised author Standardised publication Distribution form type Details about the type and where it is held Links to associated records – this name is a validation of an earlier name. The eariler invalid record is annotated with the relevant code article Full record history on all names Data available in a structured format
  3. Stats derived from 2004 onwards. Most names aren’t entered until the hard copy arrives at K / HUH library – we estimate at most 2 year time lag between publication data and entry to IPNI.
  4. We’ve now 10 years worth of audit log data.
  5. Question: will resolving of nomenclatural problems pre-publication be maintained on automation?