SlideShare ist ein Scribd-Unternehmen logo
1 von 42
Publishing Biodiversity:
The interplay between Scratchpads and
    the new Biodiversity Data Journal

 Koureas D.N.1, Rycroft S. 1, Baker E. 1, Livermore L. 1, Scott B. 1,
  Heaton A.1, Bouton K.1, Penev L.2, Roberts D.1 and Smith V.S.1


                       1
                         The Natural History Museum London
                       2
                         Pensoft Publishers
Our current taxonomic data production

       •    15-20k new spp. described annually (2M total)1
       •    30k nomenclatural acts (12M total) 1
       •    20k phylogenies (750k total)2
       •    31k taxa sequenced (360k taxa total)3
       •    800k BioMed papers (40M total pp. of taxonomy) 4

       •    Countless specimens, images, maps, keys and datasets




        Typically generated by small communities for
        “local” research projects




Figures from 1) Zhang, Zootaxa 2011 4, 1-4; 2) Web-of-Science; 3) Genbank and 4) PubMed.
The four nodes of data workflow


1.   We collect and   generate data


2.   We   curate, link and structure data


3.   We   analyse data


4.   We   publish data
The four nodes of data workflow
What are the
bottlenecks
in the workflow?                           Data
                                            Data
                                       collection &
                                        collection &
                                       generation
                                        generation
                                                       bottleneck



                           Data
                            Data                         Data
                                                          Data
                         publishing
                          publishing                   curation
                                                        curation



                        bottleneck
                                          Data
                                           Data
                                         analysis
                                          analysis
What we need is…
a
seamless
workflow                     Data
                              Data
                         collection &
                          collection &
                         generation
                          generation




             Data
              Data                         Data
                                            Data
           publishing
            publishing                   curation
                                          curation




                            Data
                             Data
                           analysis
                            analysis
To achieve this…


                                                This requires data, information & knowledge
      Link together
      “                                         to be…

      evolutionary                                   •Digital
      data… by developing                                  Not printed paper
                                                     •Openly accessible
      analytical tools and                                 Not behind barriers (e.g. paywalls)
      proper                                         •Linked-up
      documentation and                                     Not in silos
      then use this framework to
      conduct comparative analyses,
      studies of evolutionary process                    Global Systematics
      and biodiversity analyses”


Cyndy Parr, Rob Guralnick, Nico Cellinese and Rod Page. TREE. doi:10.1016/j.tree.2011.11.001
Scratchpads
Virtual Research Environments




      Making taxonomy digital, open & linked
so…
what are
the

Scratchpads?
What are Scratchpads?



• Hosted websites for biodiversity data

• Virtual research & publication platform

• Completely open access & open source

• Modular & flexible
What are Scratchpads?
facilitate
development of online research communities

through

standardized environment of entering and curating data

that allow
sharing and interlinking

and

dissemination of research products
The Scratchpads concept
A Scratchpad is a website that holds data for you and your community




  Your data                               External data & services
Examples of use:




                                         Taxa
(Classifications, taxon profiles, specimens, literature, images, maps, phenotypic, genotypic
                        & morphometric datasets, keys, phylogenies)




    Conservation              Projects              Regions                 Societies
Are Scratchpads sustainable?

464 Scratchpads Communities
by   6,407 active registered users
                                          In total more than
covering   52,661 taxa
in 559,488 pages.                         1,200,000 visitors
Per month unique visitors to Scratchpads sites




                                                               65000
                                                               unique visitors/month
Are Scratchpads sustainable?



2007   2011                                   2014


              ViBRANT
              Virtual Biodiversity Research

                     &                                   &

                                              Other grants in the pipeline
                                                     Proposals?
the main

features
The main features

Dynamic Biological Classifications


                                     Manually entered or imported

                                     Auto generated
The main features
Taxon pages
              Overview of data related to taxon

              Generated from tagged content
The main features
         Bibliography management




An inbuilt Bibliography manager

Faceted browsing

Taxon tagging and free keywords

Import from and export to all major formats
The main features
        Specimen/Observation data




Annotated full specimen/observation records

Linked to images and georeferenced
The main features
Distribution maps
                      Google maps based




                      Data layers

                             Occurrence data



                             Distribution data
                             TDWG regions




                             GBIF data
The main features
Character matrices – Key construction




             Quantitative or qualitative characters

             Auto generation of keys

             Taxon based matrices
             [Specimens based character matrices]
The main features

Media handling


                   Bulk upload

                   Metadata (incl. EXIF)

                   Media galleries
The main features

Generation of custom pages


                                 Tagged or not

                                 External RSS

                                 Twitter feeds

                                 Media files
The main features

Enhanced communication tools
                                 Working groups

                                 Forums

                                 Blog entries

                                 Webforms

                                 Newsletters

                                 RSS syndication

                                 Inbuilt comments
The main features

analytical
tools




OBOE service
i.a.
Ecological informatics,
Phylogenetics,
Sequence alignment
The main features
data
mobilisation




               more on the way…
The main features

The
Publication
module




              Open-access
                   journal
What will BDJ publish?
• Single taxon treatments and
  nomenclatural acts
• Local or regional checklists
• Sampling reports and occasional
  inventories
• Habitat-based checklists and inventories
• Ecological and biological observations of
  species and communities?
• Single identification keys
• biodiversity-related databases, including
  genomic, ecological and environmental
  data (data papers)
• Biodiversity-related software tools
How do

Scratchpads
and

BDJ
interact?
Working in a single environment




Allow submission of
datasets
for publication
without
reformatting and restructuring

                  based on standardised XML schema
The publication module
Data included in manuscript in a structured annotated format

Author names and affiliations
The publication module
Taxon descriptions
The publication module
Specimen data
The publication module

Author names and affiliations

          Taxon descriptions

              Specimen data

           Figures and Tables
                                 XML
                                  XML
                        Keys

                 References



                      Texts
The data workflow

                                                  XML
Community



                                               submission
                                                               PENSOFT JOURNAL SYSTEM
                            SCRATCHPADS
                                                                       (PJS 2.0)




                                                                MANUSCRIPT PUBLISHED
                                                                MANUSCRIPT PUBLISHED
                                                                    (XML, PDF)
                                                                     (XML, PDF)




       Archive   datasets    Occurrence data       Taxon treatments          Taxon names

                                                      Plazi           Wiki
The editorial workflow
Scratchpads             Penso                               Peer-review op ons
                        Journal                                 Public
                                                                         Community
                        System                                                       Closed
                        (PJS)
                                                                                                            Review



                                                Review
                                                                                       Nominated reviewers
                                                requests
                                                                                                            Review
                                   Editor
     Collabora ve                                                                        Panel reviewers
     online wri ng              Online edi ng


                                                                                                            Review

                                   Editorial
                             decision & feedback                                         Public reviewers
 Authors



                                                 Publica on &                                          All reviews assembled into a
   Online edi ng                                 dissemina on                                               single online version
                     Author’s revised
                       manuscript
Example papers via Scratchpads…
Blagoderov V, Hippa H, Nel A (2010). ZooKeys 50: 79–90.       Faulwetter S, Chatzigeorgiou G, Galil BS, Nicolaidou A,   Brake I, von Tschirnhaus M (2010). ZooKeys 50: 91–96.
             doi: 10.3897/zookeys.50.506                         Arvanitidis C (2011. ZooKeys 150: 327–345. doi:                     doi: 10.3897/zookeys.50.505
                                                                            10.3897/zookeys.150.1877




http://sciaroidea.info/node/44428                           http://polychaetes.marbigen.org/node/35                       http://milichiidae.info/node/14995

                                                          Live (updated) versions of these papers
Acknowledgements
Scratchpads technical development
 - Simon Rycroft, Ben Scott, Ed Baker, Alice Heaton & Katherine Bouton
Scratchpads outreach
 - Laurence Livermore, Isa van deVelde & Dimitris Koureas

e-Monocot
 - Paul Wilkin & the Kew team, Charles Godfray & the Oxford team

ViBRANT
 - Vince Smith, Dave Roberts & Lucy Reeve

 Pensoft
 - Lyobomir Penev and the team



Our 7000 users
Data
                     Data
                collection &
                 collection &
                generation
                 generation




  Data                            Data
                                   Data
   Data
publishing
 publishing   Thank you         curation
                                 curation




                   Data
                    Data
                  analysis
                   analysis
Authors and Contributors



                          Contributors
           (mentor, linguis c editor, copy editor,
           poten al reviewer, colleague/friend)              Con
                                                                trib
                                                                    u
                                                                        ng

                    ite
                 Inv
                                                                                 Manuscript ready to submit
                                                 Taxon treatment
            Template-
              based                              Interac ve key
            manuscript                          Checklist
                                                                     Authoring

Lead author crea on
                                                Data paper
           Inv
                 ite

                                                                     ing
                                                                  hor
                                                              Aut




                           Co-authors

Weitere ähnliche Inhalte

Ă„hnlich wie Publishing biodiversity: The interplay between Scratchpads and the new Biodiversity Data Journal

Scratchpads training course introduction
Scratchpads training course introductionScratchpads training course introduction
Scratchpads training course introductionDimitrios Koureas
 
Scientific data management from the lab to the web
Scientific data management   from the lab to the webScientific data management   from the lab to the web
Scientific data management from the lab to the webJose Manuel GĂłmez-PĂ©rez
 
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and Action
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and ActionAlbert Simard - Mobilizing Knowledge: Acquisition, Analysis, and Action
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and ActionInstitute for Knowledge Mobilization
 
Triplifier talk
Triplifier talkTriplifier talk
Triplifier talkJohn Deck
 
3 bitriplifiertalk
3 bitriplifiertalk3 bitriplifiertalk
3 bitriplifiertalkJohn Deck
 
Open@Fao presentation at the EADI Open For Development Project, 2012
Open@Fao presentation at the EADI Open For Development Project, 2012 Open@Fao presentation at the EADI Open For Development Project, 2012
Open@Fao presentation at the EADI Open For Development Project, 2012 Stephen Katz
 
D paul ecn2013
D paul ecn2013D paul ecn2013
D paul ecn2013ECNOfficer
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarshiptsbbbu
 
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...Vince Smith
 
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012Lee Dirks
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identificationguest453b14
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identificationguest453b14
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identificationguest453b14
 
Dataset citation and identification
Dataset citation and identificationDataset citation and identification
Dataset citation and identificationAdam Farquhar
 
Research Data Management: What is it and why is the Library & Archives Servic...
Research Data Management: What is it and why is the Library & Archives Servic...Research Data Management: What is it and why is the Library & Archives Servic...
Research Data Management: What is it and why is the Library & Archives Servic...GarethKnight
 
Qiagram Slides 2011 05
Qiagram Slides 2011 05Qiagram Slides 2011 05
Qiagram Slides 2011 05bhughes26
 

Ă„hnlich wie Publishing biodiversity: The interplay between Scratchpads and the new Biodiversity Data Journal (20)

Scratchpads training course introduction
Scratchpads training course introductionScratchpads training course introduction
Scratchpads training course introduction
 
Scientific data management from the lab to the web
Scientific data management   from the lab to the webScientific data management   from the lab to the web
Scientific data management from the lab to the web
 
Knowledge mobilization
Knowledge mobilization Knowledge mobilization
Knowledge mobilization
 
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and Action
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and ActionAlbert Simard - Mobilizing Knowledge: Acquisition, Analysis, and Action
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and Action
 
Triplifier talk
Triplifier talkTriplifier talk
Triplifier talk
 
3 bitriplifiertalk
3 bitriplifiertalk3 bitriplifiertalk
3 bitriplifiertalk
 
Open@Fao presentation at the EADI Open For Development Project, 2012
Open@Fao presentation at the EADI Open For Development Project, 2012 Open@Fao presentation at the EADI Open For Development Project, 2012
Open@Fao presentation at the EADI Open For Development Project, 2012
 
D paul ecn2013
D paul ecn2013D paul ecn2013
D paul ecn2013
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarship
 
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
 
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
 
Data mining
Data miningData mining
Data mining
 
STI Summit 2011 - Digital Worlds
STI Summit 2011 - Digital WorldsSTI Summit 2011 - Digital Worlds
STI Summit 2011 - Digital Worlds
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identification
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identification
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identification
 
Dataset citation and identification
Dataset citation and identificationDataset citation and identification
Dataset citation and identification
 
Research Data Management: What is it and why is the Library & Archives Servic...
Research Data Management: What is it and why is the Library & Archives Servic...Research Data Management: What is it and why is the Library & Archives Servic...
Research Data Management: What is it and why is the Library & Archives Servic...
 
Qiagram
QiagramQiagram
Qiagram
 
Qiagram Slides 2011 05
Qiagram Slides 2011 05Qiagram Slides 2011 05
Qiagram Slides 2011 05
 

KĂĽrzlich hochgeladen

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersChitralekhaTherkar
 

KĂĽrzlich hochgeladen (20)

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of Powders
 

Publishing biodiversity: The interplay between Scratchpads and the new Biodiversity Data Journal

  • 1. Publishing Biodiversity: The interplay between Scratchpads and the new Biodiversity Data Journal Koureas D.N.1, Rycroft S. 1, Baker E. 1, Livermore L. 1, Scott B. 1, Heaton A.1, Bouton K.1, Penev L.2, Roberts D.1 and Smith V.S.1 1 The Natural History Museum London 2 Pensoft Publishers
  • 2. Our current taxonomic data production • 15-20k new spp. described annually (2M total)1 • 30k nomenclatural acts (12M total) 1 • 20k phylogenies (750k total)2 • 31k taxa sequenced (360k taxa total)3 • 800k BioMed papers (40M total pp. of taxonomy) 4 • Countless specimens, images, maps, keys and datasets Typically generated by small communities for “local” research projects Figures from 1) Zhang, Zootaxa 2011 4, 1-4; 2) Web-of-Science; 3) Genbank and 4) PubMed.
  • 3. The four nodes of data workflow 1. We collect and generate data 2. We curate, link and structure data 3. We analyse data 4. We publish data
  • 4. The four nodes of data workflow What are the bottlenecks in the workflow? Data Data collection & collection & generation generation bottleneck Data Data Data Data publishing publishing curation curation bottleneck Data Data analysis analysis
  • 5. What we need is… a seamless workflow Data Data collection & collection & generation generation Data Data Data Data publishing publishing curation curation Data Data analysis analysis
  • 6. To achieve this… This requires data, information & knowledge Link together “ to be… evolutionary •Digital data… by developing Not printed paper •Openly accessible analytical tools and Not behind barriers (e.g. paywalls) proper •Linked-up documentation and Not in silos then use this framework to conduct comparative analyses, studies of evolutionary process Global Systematics and biodiversity analyses” Cyndy Parr, Rob Guralnick, Nico Cellinese and Rod Page. TREE. doi:10.1016/j.tree.2011.11.001
  • 7. Scratchpads Virtual Research Environments Making taxonomy digital, open & linked
  • 9. What are Scratchpads? • Hosted websites for biodiversity data • Virtual research & publication platform • Completely open access & open source • Modular & flexible
  • 10. What are Scratchpads? facilitate development of online research communities through standardized environment of entering and curating data that allow sharing and interlinking and dissemination of research products
  • 11. The Scratchpads concept A Scratchpad is a website that holds data for you and your community Your data External data & services
  • 12. Examples of use: Taxa (Classifications, taxon profiles, specimens, literature, images, maps, phenotypic, genotypic & morphometric datasets, keys, phylogenies) Conservation Projects Regions Societies
  • 13. Are Scratchpads sustainable? 464 Scratchpads Communities by 6,407 active registered users In total more than covering 52,661 taxa in 559,488 pages. 1,200,000 visitors Per month unique visitors to Scratchpads sites 65000 unique visitors/month
  • 14. Are Scratchpads sustainable? 2007 2011 2014 ViBRANT Virtual Biodiversity Research & & Other grants in the pipeline Proposals?
  • 16. The main features Dynamic Biological Classifications Manually entered or imported Auto generated
  • 17. The main features Taxon pages Overview of data related to taxon Generated from tagged content
  • 18. The main features Bibliography management An inbuilt Bibliography manager Faceted browsing Taxon tagging and free keywords Import from and export to all major formats
  • 19. The main features Specimen/Observation data Annotated full specimen/observation records Linked to images and georeferenced
  • 20. The main features Distribution maps Google maps based Data layers Occurrence data Distribution data TDWG regions GBIF data
  • 21. The main features Character matrices – Key construction Quantitative or qualitative characters Auto generation of keys Taxon based matrices [Specimens based character matrices]
  • 22. The main features Media handling Bulk upload Metadata (incl. EXIF) Media galleries
  • 23. The main features Generation of custom pages Tagged or not External RSS Twitter feeds Media files
  • 24. The main features Enhanced communication tools Working groups Forums Blog entries Webforms Newsletters RSS syndication Inbuilt comments
  • 25. The main features analytical tools OBOE service i.a. Ecological informatics, Phylogenetics, Sequence alignment
  • 26. The main features data mobilisation more on the way…
  • 28. What will BDJ publish? • Single taxon treatments and nomenclatural acts • Local or regional checklists • Sampling reports and occasional inventories • Habitat-based checklists and inventories • Ecological and biological observations of species and communities? • Single identification keys • biodiversity-related databases, including genomic, ecological and environmental data (data papers) • Biodiversity-related software tools
  • 30. Working in a single environment Allow submission of datasets for publication without reformatting and restructuring based on standardised XML schema
  • 31. The publication module Data included in manuscript in a structured annotated format Author names and affiliations
  • 34. The publication module Author names and affiliations Taxon descriptions Specimen data Figures and Tables XML XML Keys References Texts
  • 35. The data workflow XML Community submission PENSOFT JOURNAL SYSTEM SCRATCHPADS (PJS 2.0) MANUSCRIPT PUBLISHED MANUSCRIPT PUBLISHED (XML, PDF) (XML, PDF) Archive datasets Occurrence data Taxon treatments Taxon names Plazi Wiki
  • 36. The editorial workflow Scratchpads Penso Peer-review op ons Journal Public Community System Closed (PJS) Review Review Nominated reviewers requests Review Editor Collabora ve Panel reviewers online wri ng Online edi ng Review Editorial decision & feedback Public reviewers Authors Publica on & All reviews assembled into a Online edi ng dissemina on single online version Author’s revised manuscript
  • 37. Example papers via Scratchpads… Blagoderov V, Hippa H, Nel A (2010). ZooKeys 50: 79–90. Faulwetter S, Chatzigeorgiou G, Galil BS, Nicolaidou A, Brake I, von Tschirnhaus M (2010). ZooKeys 50: 91–96. doi: 10.3897/zookeys.50.506 Arvanitidis C (2011. ZooKeys 150: 327–345. doi: doi: 10.3897/zookeys.50.505 10.3897/zookeys.150.1877 http://sciaroidea.info/node/44428 http://polychaetes.marbigen.org/node/35 http://milichiidae.info/node/14995 Live (updated) versions of these papers
  • 38.
  • 39. Acknowledgements Scratchpads technical development - Simon Rycroft, Ben Scott, Ed Baker, Alice Heaton & Katherine Bouton Scratchpads outreach - Laurence Livermore, Isa van deVelde & Dimitris Koureas e-Monocot - Paul Wilkin & the Kew team, Charles Godfray & the Oxford team ViBRANT - Vince Smith, Dave Roberts & Lucy Reeve Pensoft - Lyobomir Penev and the team Our 7000 users
  • 40. Data Data collection & collection & generation generation Data Data Data Data publishing publishing Thank you curation curation Data Data analysis analysis
  • 41.
  • 42. Authors and Contributors Contributors (mentor, linguis c editor, copy editor, poten al reviewer, colleague/friend) Con trib u ng ite Inv Manuscript ready to submit Taxon treatment Template- based Interac ve key manuscript Checklist Authoring Lead author crea on Data paper Inv ite ing hor Aut Co-authors