Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts

•Download as PPTX, PDF•

1 like•236 views

On 16 June 2016, V. Senderov and L. Penev held a webinar presenting two novel workflows developed at Pensoft Publishers and used in the Biodiversity Data Journal; (1) automatic import of specimen records into manuscripts, and (2) automatic generation of data paper manuscripts from Ecologocal Metadata Language (EML) metadata. The aim of the webinar was to familiarize the public with the workflows, and motivate them from a scientific standpoint. The title of the webinar was: "Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts." Integrated Digitized Biocollections (iDigBio) is the leading US-based aggregator of biocollections data. They hold regular webinars and workshops aimed at improving biodiversity informatics knowledge, which are attended by collection managers, scientists, and IT personnel. Thus, making a presentation for them was an excellent way of making our research and tools-development efforts widely known and getting feedback from the community. Our efforts, which are part of the larger PhD project of V. Senderov to build an Open Biodiversity Knowledge Management System (OBKMS) (Senderov and Penev 2016), focused on two areas: optimizing the workflow of specimen or occurrence data and optimizing the workflow of dataset metadata. The results of these efforts are that now it is possible to directly import specimen or occurrence record information from GBIF, BOLD Systems, iDigBio, or PlutoF into ARPHA via a record identifier. No manual copying or retyping is required. Moreover, it is possible to generate a data paper manuscript from recent Ecological Metadata Langauge (EML) versions in ARPHA. The data paper concept is discussed throughly in Chavan and Penev (2011).

Data & Analytics

Online direct import of specimen records from
iDigBio infrastructure into taxonomic
manuscripts
Lyubomir Penev , Viktor Senderov
Institute for Biodiversity and Ecosystem Research, Bulgarian Academy of
Sciences, Sofia, Bulgaria & Pensoft Publishers
penev@pensoft.net
Pensoft & iDigBio Webinar, 16 June 2015

Poll results
0 1 2 3 4 5 6
Librarian or data manager
Student or scientist in biology
Data or computer scientist
IT specialist, database admin
Other
Please select what best describes you

Data deluge: We now sample more data
than we can digest (analyze, publish & use)
Drawings: slavenapeneva.com

Data import
Authoring
Peer-review
Publication
Dissemination
+
Next-Gen taxonomy requires Next-Gen publishing
All within a single
online collaborative
platform!
ARPHA Writing Tool & Biodiversity Data
Journal facilitate data publishing & re-use

REPOSITORIES
ARPHA WRITING TOOL
MANUSCRIPT
PULISHED ARTICLE

Step 1:
Step 1: Start a taxonomic manuscript in
ARPHA, and open a taxon treatment

Step 1:
Step 2a: Click at the Materials section within
the treatment

Step 2b: Three ways to import specimen
occurrence records into a manuscript

Step 3: Import from iDigBio (or GBIF, or BOLD,
or PlutoF) using record ID(s)

WHY import & publish specimen records in
this way?
Avoid re-typing errors and save time
Tracking (provenance) information is saved in occurrenceDetails
Mobilization, peer-review and publication of small data
Data downloadable anytime as CSV file
Machine-readable and harvestable (from the XML version of the
published article)
Automatically exported in Darwin Core Archive
Automatically exported to and indexed by GBIF on the day of the
publication
Interoperable in DarwinCore standard
Re-usable (new opportunities for collaboration)
Increase discoverability, visibility, and citation of authors’ work

This is how data look like in the published
paper

Can we generate and import an entire
manuscript?

http://arpha.pensoft.net/dev/
Allows to import different types of manuscripts
from XML. E.g.:
• Software Description
• Taxonomic Paper
• Data paper
For collaborations please contact us at
info@pensoft.net
For developers and data managers:
Pensoft API

Pensoft developers team
European Commission: EUBON FP7 Project
European Commission: PhD Financed through the
EU Marie-Sklodovska-Curie Program Grant
Agreement Nr. 642241
Slavena Peneva (drawings and design)
Our sincere thanks are due to

What's hot

FAIR Computational WorkflowsCarole Goble

Data and Donuts: Data cleaning with OpenRefineC. Tobin Magle

Let’s go on a FAIR safari!Carole Goble

Providing Tools for Author Evaluation - A case studyinscit2006

Research Objects Tutorial (TPDL)dgarijo

Research Objects in Scientific Publicationsdgarijo

Introduction to FAIRDOMCarole Goble

The FAIRDOM Commons for Systems BiologyFAIRDOM

Capturing the context: one small(ish step for modellers, one giant leap for m...FAIRDOM

Crosslinks ericmeeks

LibMeter Case 01 DE Univ Duesseldorf ULBDLibMeter

[Final] project presentationFederation University

Creating abstractions from scientific workflows: PhD symposium 2015dgarijo

Reproducible Research: how could Research Objects helpCarole Goble

Making your data good enough for sharing.FAIRDOM

Entity Linking Combining Open Source Annotatorspruiz_

Report of the second FAIRDOM foundryFAIRDOM

JASM2014 talk - "Phinch: An interactive, exploratory data visualization for e...Holly Bik

Reproducible researchC. Tobin Magle

Reflections on a (slightly unusual) multi-disciplinary academic careerCarole Goble

What's hot (20)

FAIR Computational Workflows

Data and Donuts: Data cleaning with OpenRefine

Let’s go on a FAIR safari!

Providing Tools for Author Evaluation - A case study

Research Objects Tutorial (TPDL)

Research Objects in Scientific Publications

Introduction to FAIRDOM

The FAIRDOM Commons for Systems Biology

Capturing the context: one small(ish step for modellers, one giant leap for m...

Crosslinks

LibMeter Case 01 DE Univ Duesseldorf ULBD

[Final] project presentation

Creating abstractions from scientific workflows: PhD symposium 2015

Reproducible Research: how could Research Objects help

Making your data good enough for sharing.

Entity Linking Combining Open Source Annotators

Report of the second FAIRDOM foundry

JASM2014 talk - "Phinch: An interactive, exploratory data visualization for e...

Reproducible research

Reflections on a (slightly unusual) multi-disciplinary academic career

Similar to Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts

XML-based editorial workflow, or how to extract more value from the same source?vbrant

Author's workflow and the role of open accessPaola Gargiulo

VIVO at the University of Idahoanniegaines

ACS 248th Paper 146 VIVO/ScientistsDB Integration into EurekaStuart Chalk

From Open Access to Open Science: from the Viewpoint of a Scholarly PublisherPensoft Publishers

Introduction to OpenAIRE services and the OpenAIRE Research GraphOpenAIRE

Proof of Concept for Learning Analytics InteroperabilityOpen Cyber University of Korea

COPO - Collaborative Open Plant Omics, by Rob DaveyAIMS (Agricultural Information Management Standards)

U-Boot community analysisxulioc

Electronic Library Bremen – state & focus of developmentMartin Blenkle

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...Carole Goble

Biocatalogue, FileQuirks, MyExperimentJerzy

Access the world’s research outputs through the CORE API Matteo Cancellieri

OpenAIRE: Science. Set Free, Iryna Kuchma, EIFLPlatforma Otwartej Nauki

Bots & spidersMaté Ongenaert

EOSC-Life Workflow CollaboratoryCarole Goble

Data management for researchersDirk Roorda

Software Analytics: Data Analytics for Software EngineeringTao Xie

Alabi2008presentationbirdsnare

EIA Biodiversity Data MobilisationVishwas Chavan

Similar to Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts (20)

XML-based editorial workflow, or how to extract more value from the same source?

Author's workflow and the role of open access

VIVO at the University of Idaho

ACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka

From Open Access to Open Science: from the Viewpoint of a Scholarly Publisher

Introduction to OpenAIRE services and the OpenAIRE Research Graph

Proof of Concept for Learning Analytics Interoperability

COPO - Collaborative Open Plant Omics, by Rob Davey

U-Boot community analysis

Electronic Library Bremen – state & focus of development

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...

Biocatalogue, FileQuirks, MyExperiment

Access the world’s research outputs through the CORE API

OpenAIRE: Science. Set Free, Iryna Kuchma, EIFL

Bots & spiders

EOSC-Life Workflow Collaboratory

Data management for researchers

Software Analytics: Data Analytics for Software Engineering

Alabi2008presentation

EIA Biodiversity Data Mobilisation

Recently uploaded

办理(UC毕业证书)堪培拉大学毕业证成绩单原版一比一z xss

Real-Time AI Streaming - AI Max PrincetonTimothy Spann

Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics

INTRODUCTION TO Natural language processingsocarem879

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics

Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter

在线办理WLU毕业证罗瑞尔大学毕业证成绩单留信学历认证nhjeo1gg

NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali

Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03

IMA MSN - Medical Students Network (2).pptxdolaknnilon

Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy

原版1:1定制南十字星大学毕业证（SCU毕业证）#文凭成绩单#真实留信学历认证永久存档208367051

Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics

FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone

NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics

Business Analytics using Microsoft Excelysmaelreyes

Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16

Semantic Shed - Squashing and Squeezing.pptxMike Bennett

Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen

MK KOMUNIKASI DATA (TI)komdat komdat.docxUnduhUnggah1

Recently uploaded (20)

办理(UC毕业证书)堪培拉大学毕业证成绩单原版一比一

Real-Time AI Streaming - AI Max Princeton

Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...

INTRODUCTION TO Natural language processing

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...

Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...

在线办理WLU毕业证罗瑞尔大学毕业证成绩单留信学历认证

NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...

Top 5 Best Data Analytics Courses In Queens

IMA MSN - Medical Students Network (2).pptx

Student profile product demonstration on grades, ability, well-being and mind...

原版1:1定制南十字星大学毕业证（SCU毕业证）#文凭成绩单#真实留信学历认证永久存档

Predicting Salary Using Data Science: A Comprehensive Analysis.pdf

FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024

NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...

Business Analytics using Microsoft Excel

Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh

Semantic Shed - Squashing and Squeezing.pptx

Data Factory in Microsoft Fabric (MsBIP #82)

MK KOMUNIKASI DATA (TI)komdat komdat.docx

Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts

1. Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts Lyubomir Penev , Viktor Senderov Institute for Biodiversity and Ecosystem Research, Bulgarian Academy of Sciences, Sofia, Bulgaria & Pensoft Publishers penev@pensoft.net Pensoft & iDigBio Webinar, 16 June 2015

2. Poll results 0 1 2 3 4 5 6 Librarian or data manager Student or scientist in biology Data or computer scientist IT specialist, database admin Other Please select what best describes you

3. Data deluge: We now sample more data than we can digest (analyze, publish & use) Drawings: slavenapeneva.com

4. Data import Authoring Peer-review Publication Dissemination + Next-Gen taxonomy requires Next-Gen publishing All within a single online collaborative platform! ARPHA Writing Tool & Biodiversity Data Journal facilitate data publishing & re-use

5. REPOSITORIES ARPHA WRITING TOOL MANUSCRIPT PULISHED ARTICLE

6. Step 1: Step 1: Start a taxonomic manuscript in ARPHA, and open a taxon treatment

7. Step 1: Step 2a: Click at the Materials section within the treatment

8. Step 2b: Three ways to import specimen occurrence records into a manuscript

9. Step 3: Import from iDigBio (or GBIF, or BOLD, or PlutoF) using record ID(s)

10. Where to take record IDs from iDigBio?

11. Where to take record IDs from iDigBio?

12. WHY import & publish specimen records in this way? Avoid re-typing errors and save time Tracking (provenance) information is saved in occurrenceDetails Mobilization, peer-review and publication of small data Data downloadable anytime as CSV file Machine-readable and harvestable (from the XML version of the published article) Automatically exported in Darwin Core Archive Automatically exported to and indexed by GBIF on the day of the publication Interoperable in DarwinCore standard Re-usable (new opportunities for collaboration) Increase discoverability, visibility, and citation of authors’ work

13. This is how data look like in the published paper

14. Mapping & visualization

15. Easy export, harvesting & re-use

16. Live demo

17. Can we generate and import an entire manuscript?

18. Live demo

19. http://arpha.pensoft.net/dev/ Allows to import different types of manuscripts from XML. E.g.: • Software Description • Taxonomic Paper • Data paper For collaborations please contact us at info@pensoft.net For developers and data managers: Pensoft API

20. Pensoft developers team European Commission: EUBON FP7 Project European Commission: PhD Financed through the EU Marie-Sklodovska-Curie Program Grant Agreement Nr. 642241 Slavena Peneva (drawings and design) Our sincere thanks are due to

21. I Open Science! PLAZI

Editor's Notes

Hello, If you wish to view the video recording of the live presentation, please visit http://idigbio.adobeconnect.com/p7sg0aym3e3/. More information from iDigBio can be found at the webinar information page. Enjoy, Viktor
This fictionalized workflow presents the flow of information content about biodiversity specimens or biodiversity occurrences from the data portals GBIF, BOLD Systems, iDigBio, and PlutoF, through user-interface elements in ARPHA to textualized content in a research manuscript in Biodiversity Data Journal. In the next few slides, we illustrate the workflow using the example of iDigBio.

Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts

Similar to Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts (20)

Recently uploaded

Recently uploaded (20)

Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts

Editor's Notes