Small pieces loosely joined
Towards a unified theory of
biodiversity for the web



Vincent S. Smith
Macro taxonomy
The big picture of taxonomic research


   Goal…
     • Inventory the Earth’s species
     • Document their relationships
     • “Publish” these data

   Data set…
     • 1.8 M described spp. (10M names)
     • 300M pages (over last 250 years)
     • 1.5-3B specimens

   People…
     • 4-6,000 scientists
     • 30-40,000 “pro-amateurs”
     • Many more citizen scientists?
Micro taxonomy
The practice of taxonomic research


   Sociology…
     • Parochial
     • Specialized experts
     • Fragmented & distributed

   Methodology…
     • Different (domain specific)
     • Communities of practice
     • Non-transferable skills

   Output…
     • Heterogeneous & scattered
     • High volume, low impact
     • Hard to find (use)

   How do we integrate micro & macro taxonomy for the Web?
http://Scratchpads.eu
What is a Scratchpad?
A website for you & your community




   1. Your data
   2. Uploaded & tagged
   3. Published & reviewed on your site

   Fast      Intuitive      Fit for use
What can Scratchpads do?
Import, manage, search & browse:


     • Specimens
     • DNA & Phylogenies
     • Literature
     • Images
What can Scratchpads do?
Integration & connectivity within & between sites


     • Specimens
     • DNA & Phylogenies
     • Taxonomy
     • Literature
     • Images
What can Scratchpads do?
In summary:

+Administration
 -Change your site information
 -Change your front page
 -Change your logo
 -Activity and access logs
+Backup
 -Backing up your data
 -Restoring your data
+Bibliography
 -Creating a record
 -Importing from a ref. manager
 -Exporting to a reference manager
+Blog
 -Creating and adding a blog
+Custom Content
 -Defining a CCK
 -Importing from a spreadsheet
 -Creating a custom view
+Fileshare
 -Creating and using a fileshare
+Forum
 -Altering the forum settings
 -Creating a container for a forum
 -Creating a new forum
 -Creating a new topic inside a forum
+Groups
 -Creating a group
 -Subscribing to a group
+Image
 -Uploading & basic annotation
 -Linking image & location records
 -Linking image & specimen records
 -Linking image & publication records
 -Overlay annotations on images
+Layout
 -Change your theme
 -Menus
 -Blocks and sidebars
+Locations
 -Creating a record
 -Importing from a spreadsheet
+Pages
 -Creating, editing, cloning & deleting
 -Configuring the panels template
+Panels
 -Adding & configuring content
 -Creating a new panel
 -Citing a Panels page
+Phylogeny
 -Adding a phylogenetic tree
+Specimens
 -Creating a record
 -Importing from a spreadsheet
 -Linking specimen & location records
 -Linking specimen & pub. records
+Tasks
 -Creating a tasklist
+Taxonomy
 -Importing from a spreadsheet
 -Importing from ClassificationBank
 -Starting from scratch
 -Taxonomy manager
 -Displaying a classification
 -Adding names
 -Deleting names
 -Taxonomy & panels
+Users
 -Your settings
 -Adding a new user
 -User roles and permissions
 -Adding and editing user profile fields
 -Logging in
+Webform
 -Creating and using webforms
What can Scratchpads do?
Visual taskguide
Current Scratchpads
   Sites: 70+
   Users: 850+
   Pages: 130k
   Since March 2007

     Ants
     Bees
     Beetles
     Big-headed flies
     Birds
     Blackflies
     Ciliates
     Cockroaches
     Dragon Trees
     Dung Beetles
     False Buttonweed
     Flat worms
     Flies
     Foraminifera
     Fossil Insects
     Fungus Gnats
     Holometabola
     Leaf-miner Flies
     Lice
     Lichens of Bermuda
     Malvaceae
     Megalastrum ferns
     Milichiid flies
     Mosquitoes
     Mosses
     Nannotax fossils
     Nepticuloid moths
     Palms
     Pearl oysters
     Polychaete worms
     Scaleworms
     Stick insects
     Sulawesi Ferns
     Termites
     Triticid grasses
     Weevils
     Wood Ferns
Scratchpad visitors
Tracking visitors across sites




   Key monthly statistics
        - 50,000 page views
        - 6,000 visitors
        - 8 minutes on site
        - 50% returning visits

                    (monthly averages, 2008)
Scratchpad applications
A multipurpose, flexible technology

   eBooks
     • Howard & Moore, Birds of the World, 4th Edition
       (fact checking, data compilation, 2010, funding)

   eJournals
     • European Mosquito Bulletin (ISSN 1460-6127), Phasmid Studies (ISSN 0966-0011)
       (submission, review & dissemination of articles)

   Image galleries
     • Nanno fossils, Cockroaches, Stick insects, Flatworms, Grasses, Lichens & many more…
       (rapid upload, annotation & display of images)
How do Scratchpads work?
Getting a Scratchpad


    Requirements
      • Biological focus
      • Agree to T&Cs (click-through)
      • CC license “by-nc-sa”

    Application (http://scratchpads.eu/apply)
      • Maintainer
      • Scope/Mission/API Keys
      • (Sub)domain name

    Content
      • Unrestricted (overlapping)
      • No branding (focus on authors)
      • Value added
How do Scratchpads work?
Using a Scratchpad


    Management
     • User categories (maintainer, editor, contributor)
     • Public / private content (flexible groups)
     • Admin. page (site settings & behavior)

    Data Input
     • Content types (biblio, maps, “page” etc)
     • Forms, managers, Excel, EndNote etc
     • Custom content (add or extend data types)

    Tagging (indexing)
     • Taxonomy terms (2M +)
     • Multiple classifications
     • Auto-tagging
Autotagging
Indexing data to make it findable


1. Create content (e.g. reference)
     - Journal citation mentions taxon name

2. Find terms (Autotag)
     - Matches taxonomy term (Drag & Drop)

3. Submit (Index)
     - Page tagged (indexed) with taxon name
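
The three autotagging steps above can be sketched in a few lines of Python. This is only an illustration of the idea, not the Scratchpads implementation: the function name `autotag`, the small vocabulary list and the whole-word, case-insensitive matching rule are all assumptions. The sample text is the Palma & Pilgrim citation shown on the BHL slides.

```python
import re

def autotag(text, vocabulary):
    """Return the vocabulary terms that occur in `text` as whole words.

    Hypothetical helper: matching is case-insensitive and purely lexical.
    """
    found = []
    for term in vocabulary:
        pattern = r"\b" + re.escape(term) + r"\b"
        if re.search(pattern, text, flags=re.IGNORECASE):
            found.append(term)
    return found

# Step 1: create content (here, a bibliographic reference).
citation = (
    "Palma, R.L., and R.L.C. Pilgrim. 2002. A revision of the genus "
    "Naubates (Insecta: Phthiraptera: Philopteridae). "
    "J. R. Soc. N.Z. 32:7-60."
)

# Step 2: find terms from the site's taxonomy (an assumed, tiny vocabulary).
vocabulary = ["Naubates", "Phthiraptera", "Philopteridae", "Culicidae"]
tags = autotag(citation, vocabulary)

# Step 3: submit, storing the tags alongside the content.
record = {"content": citation, "tags": tags}
print(record["tags"])
```

A real site would match against a much larger classification (the slides mention 2M+ taxonomy terms), where a simple loop like this would likely need replacing with an indexed lookup.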
How do Scratchpads work?
Indexing data to make it findable



                                    • Tagged data can be
                                      presented differently

                                    • For example as part of
                                      a traditional bibliography

                                    • Or as small windows
                                      or “panels” of data
How do Scratchpads work?
Integrating data & “publishing” in a Scratchpad

Types of Scratchpad Panel…
Built with “tagged data”
     • Common names
     • Personalized instructions
     • Bibliographic literature
     • Taxonomic hierarchies
     • Files and documents
     • Photographs & illustrations
     • Specimen records
     • Customized content
     • Phylogenetic trees
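
As a rough sketch of how panels could be built from tagged data, the Python below groups every item tagged with a taxon name into one panel per content type. The function `build_species_page`, the record layout and the sample items are hypothetical and only illustrate the grouping idea.

```python
from collections import defaultdict

def build_species_page(taxon, items):
    """Group the items tagged with `taxon` into panels keyed by content type."""
    panels = defaultdict(list)
    for item in items:
        if taxon in item["tags"]:
            panels[item["type"]].append(item["title"])
    return dict(panels)

# Hypothetical tagged content; titles and tags are made up for illustration.
items = [
    {"type": "Bibliographic literature",
     "title": "Palma & Pilgrim 2002", "tags": ["Naubates"]},
    {"type": "Photographs & illustrations",
     "title": "habitus.jpg", "tags": ["Naubates"]},
    {"type": "Specimen records",
     "title": "Specimen 001", "tags": ["Culicidae"]},
]

page = build_species_page("Naubates", items)
for panel, titles in page.items():
    print(panel, "->", titles)
```

Because the page is assembled from tags at request time, adding a newly tagged item changes the page without anyone editing the page itself.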
How do Scratchpads work?
Integrating data & “publishing” in a Scratchpad

     • Dynamically built species pages
     • Browsed through a taxonomy
     • Including 3rd party content
     • With data curation tools
     • Listing all “authors”
     • Dated, permanent & citable
How do Scratchpads work?
Adjusting the panels layout




        Choose which panels to display
How do Scratchpads work?
An example based on the Catalogue of Life classification




                          2 million taxon pages
                    Open curation at http://catlife.myspecies.info
Biodiversity on the Web
The informatics landscape
Biodiversity on the Web
Scratchpads are personalizing biodiversity science
A unified theory of biodiversity?
BHL, EOL and scholarly journals



       Biodiversity Heritage Library
         • Digitising heritage literature



       Encyclopedia of Life
         • A web page for every species



       Scholarly Journals
         • Traditional publishing
Biodiversity Heritage Library
“Digitizing biodiversity literature”


 • Biodiversity publications since 1469
    - 5.4 million books
    - 800,000 monographs
    - 40,000 periodicals

 • Held by Natural History libraries
     E.g., NHM holds more than 1M books, 250k
     monographs & periodicals, 0.5M artworks

 • BHL partnership of 10 Nat. Hist. libraries
 • Sharing the digitisation of contents
 • Focus on out of copyright materials
 • Partnership with “Internet Archive”
 • Make the contents “findable”
Biodiversity Heritage Library
“Digitizing biodiversity literature”


 1. Scan (photograph)
 2. Extract text (OCR)
 3. Find keywords
     - Taxonomic names
     - Author names
     - Citations
     - Collection data
     - Morphological data
     - Descriptions
     - Identification keys
     - Illustrations
     - Photographs




                                   1 scribe machine, 3,500 pages per shift per day
                                        34 scribe machines now in operation
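
A quick worked calculation of what these figures imply for combined scanning capacity, assuming (this is not stated on the slide) that every machine runs one shift per day:

```python
# Figures from the slide.
PAGES_PER_MACHINE_PER_SHIFT = 3_500
MACHINES = 34

# Assumption: one shift per machine per day.
SHIFTS_PER_DAY = 1

daily_capacity = PAGES_PER_MACHINE_PER_SHIFT * MACHINES * SHIFTS_PER_DAY
print(daily_capacity)  # 119000 pages per day under the one-shift assumption
```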
Biodiversity Heritage Library
“Digitizing biodiversity literature”


 1. Scan
 2. Extract text (OCR)
 3. Find keywords
     - Taxonomic names
     - Author names
     - Citations
     - Collection data
     - Morphological data
     - Descriptions
     - Identification keys
     - Illustrations
     - Photographs
 4. Index
 5. Put on the web
 6. 10M pp. to date

 Example citation:
     Palma, R.L., and R.L.C. Pilgrim. 2002. A revision of the genus Naubates
     (Insecta: Phthiraptera: Philopteridae). J. R. Soc. N.Z. 32:7-60.
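
Steps 3 and 4 above (find keywords, then index) can be illustrated with a small inverted index that maps each taxonomic name to the pages on which it appears. This is a sketch under assumptions, not BHL's actual pipeline: the function `build_index`, the page numbers and the page texts are hypothetical, and names are found by simple lookup against a known list rather than by a name-recognition service.

```python
import re
from collections import defaultdict

def build_index(pages, names):
    """Map each known name to the sorted page numbers whose text mentions it."""
    index = defaultdict(set)
    for page_no, text in pages.items():
        for name in names:
            pattern = r"\b" + re.escape(name) + r"\b"
            if re.search(pattern, text, flags=re.IGNORECASE):
                index[name].add(page_no)
    return {name: sorted(page_nos) for name, page_nos in index.items()}

# Hypothetical OCR output keyed by page number (sample text is made up).
pages = {
    7: "A revision of the genus Naubates (Insecta: Phthiraptera: Philopteridae).",
    8: "The type species of Naubates is redescribed below.",
}

index = build_index(pages, ["Naubates", "Philopteridae", "Culicidae"])
print(index)
```

Names that never occur (here `Culicidae`) get no entry, so a lookup can tell "not mentioned" apart from "mentioned on these pages".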
Scratchpads and BHL
Creating a community built virtual taxonomic library




                                 Yes      Not yet?




         Scratchpads as a tool to add articles (and markup) to BHL?
Encyclopedia of Life
“A web page for every species”

 • A web page for all 1.8M species
 • $25m funding (5 years)
   - MacArthur and Sloan Foundations

 • Multiple audiences
   - Science & outreach

 • Megascience mashup
   - Aggregating data from the web

 • 10 years to complete
   - First draft 2008, “finished” 2017!

 • Struggling to find an identity?
   - Competition, vetting, growth, credit

 • A possible publishing platform?
   - LifeDesks / Scratchpads
Journal Articles
Scholarly communication in taxonomy & systematics


 • Fragmented
 • Mostly commercial
 • Data poor
 • Fixed audience
   - Hard to repurpose

 • Possible role for EoL?
   - Web publishing platform (cf Wikipedia)

 • Zootaxa
   - 15% of new species; 50 species a week!

 • Scratchpads / EoL / Zootaxa
   - MS Word Template (markup)
   - Simultaneous publication

 [Figure label: Biodiversity Journals]
Summary
“Small pieces loosely joined”


  1. Bringing data together
    Biodiversity studies are data rich, poorly archived & ever changing


  2. Bringing people together
    Biodiversity researchers are few in number, fragmented & highly distributed


  3. Bringing science together
    Biodiversity science demands a different approach to addressing BIG questions




                              BIG IS DIFFERENT
             New opportunities & new challenges!
Thanks…


           Simon Rycroft   Dave Roberts    Kehan Harman




     Ben Scott     Edward Baker    Irina Brake   Vladimir Blagoderov
Questions?
Scratchpad management
   Scalable & sustainable technology




                    Hardware, software & user support
Virtual machine, open-source software, self-archiving, backed-up, multi-site configuration
(easy to move & upgrade, secure & reliable, citable, screencasts, low admin., low marginal costs)

Weitere ähnliche Inhalte

Andere mochten auch

Data Centric Security Strategy
Data Centric Security StrategyData Centric Security Strategy
Data Centric Security Strategy
Aleksey Lukatskiy
 
Apresentação Michael Rivers | OIS2010 | Case de inovação aberta na área da saúde
Apresentação Michael Rivers | OIS2010 | Case de inovação aberta na área da saúdeApresentação Michael Rivers | OIS2010 | Case de inovação aberta na área da saúde
Apresentação Michael Rivers | OIS2010 | Case de inovação aberta na área da saúde
Allagi Open Innovation Services
 
Diferensi sosial
Diferensi sosialDiferensi sosial
Diferensi sosial
omcivics
 
Case P&G | OIS2010 | Open innovation: como inovar e proteger a propriedade in...
Case P&G | OIS2010 | Open innovation: como inovar e proteger a propriedade in...Case P&G | OIS2010 | Open innovation: como inovar e proteger a propriedade in...
Case P&G | OIS2010 | Open innovation: como inovar e proteger a propriedade in...
Allagi Open Innovation Services
 
Apresentação Niklas Walhberg | OIS 2011 | Seminário 23/11
Apresentação Niklas Walhberg | OIS 2011 |  Seminário  23/11Apresentação Niklas Walhberg | OIS 2011 |  Seminário  23/11
Apresentação Niklas Walhberg | OIS 2011 | Seminário 23/11
Allagi Open Innovation Services
 
Apresentação Alf Martin Johansen | OIS 2011 | Painel: Desafios na construção ...
Apresentação Alf Martin Johansen | OIS 2011 | Painel: Desafios na construção ...Apresentação Alf Martin Johansen | OIS 2011 | Painel: Desafios na construção ...
Apresentação Alf Martin Johansen | OIS 2011 | Painel: Desafios na construção ...
Allagi Open Innovation Services
 
Russia Security furure regulations
Russia Security furure regulationsRussia Security furure regulations
Russia Security furure regulations
Aleksey Lukatskiy
 
Apresentação Joakim Appelquist | OIS 2011 | Palestra no Seminário
Apresentação Joakim Appelquist | OIS 2011 | Palestra no SeminárioApresentação Joakim Appelquist | OIS 2011 | Palestra no Seminário
Apresentação Joakim Appelquist | OIS 2011 | Palestra no Seminário
Allagi Open Innovation Services
 

Andere mochten auch (20)

Data Centric Security Strategy
Data Centric Security StrategyData Centric Security Strategy
Data Centric Security Strategy
 
Nindyashinta Maharani
Nindyashinta MaharaniNindyashinta Maharani
Nindyashinta Maharani
 
Apresentação Michael Rivers | OIS2010 | Case de inovação aberta na área da saúde
Apresentação Michael Rivers | OIS2010 | Case de inovação aberta na área da saúdeApresentação Michael Rivers | OIS2010 | Case de inovação aberta na área da saúde
Apresentação Michael Rivers | OIS2010 | Case de inovação aberta na área da saúde
 
Copyright Clarity: Remix and Fair USe in Education
Copyright Clarity: Remix and Fair USe in EducationCopyright Clarity: Remix and Fair USe in Education
Copyright Clarity: Remix and Fair USe in Education
 
Diferensi sosial
Diferensi sosialDiferensi sosial
Diferensi sosial
 
Tobacco Primer
Tobacco PrimerTobacco Primer
Tobacco Primer
 
Case P&G | OIS2010 | Open innovation: como inovar e proteger a propriedade in...
Case P&G | OIS2010 | Open innovation: como inovar e proteger a propriedade in...Case P&G | OIS2010 | Open innovation: como inovar e proteger a propriedade in...
Case P&G | OIS2010 | Open innovation: como inovar e proteger a propriedade in...
 
Powerful Voices for Kids: Media Literacy and Technology Integration in Urban ...
Powerful Voices for Kids: Media Literacy and Technology Integration in Urban ...Powerful Voices for Kids: Media Literacy and Technology Integration in Urban ...
Powerful Voices for Kids: Media Literacy and Technology Integration in Urban ...
 
Apresentação Niklas Walhberg | OIS 2011 | Seminário 23/11
Apresentação Niklas Walhberg | OIS 2011 |  Seminário  23/11Apresentação Niklas Walhberg | OIS 2011 |  Seminário  23/11
Apresentação Niklas Walhberg | OIS 2011 | Seminário 23/11
 
Younger Learners Get Digital and Media Literacy
Younger Learners Get Digital and Media LiteracyYounger Learners Get Digital and Media Literacy
Younger Learners Get Digital and Media Literacy
 
Apresentação Alf Martin Johansen | OIS 2011 | Painel: Desafios na construção ...
Apresentação Alf Martin Johansen | OIS 2011 | Painel: Desafios na construção ...Apresentação Alf Martin Johansen | OIS 2011 | Painel: Desafios na construção ...
Apresentação Alf Martin Johansen | OIS 2011 | Painel: Desafios na construção ...
 
URI Harrington School Executive Advisory Board Meeting
URI Harrington School Executive Advisory Board MeetingURI Harrington School Executive Advisory Board Meeting
URI Harrington School Executive Advisory Board Meeting
 
Working Together on the Web, Working Well? Innovation of a Research Work Envi...
Working Together on the Web, Working Well? Innovation of a Research Work Envi...Working Together on the Web, Working Well? Innovation of a Research Work Envi...
Working Together on the Web, Working Well? Innovation of a Research Work Envi...
 
Russia Security furure regulations
Russia Security furure regulationsRussia Security furure regulations
Russia Security furure regulations
 
Google Chronicles: Analytics And Chrome
Google Chronicles: Analytics And ChromeGoogle Chronicles: Analytics And Chrome
Google Chronicles: Analytics And Chrome
 
Plack
PlackPlack
Plack
 
No specimen (software) left behind
No specimen (software) left behindNo specimen (software) left behind
No specimen (software) left behind
 
An introduction to ViBRANT lightning talks
An introduction to ViBRANT lightning talksAn introduction to ViBRANT lightning talks
An introduction to ViBRANT lightning talks
 
Apresentação Joakim Appelquist | OIS 2011 | Palestra no Seminário
Apresentação Joakim Appelquist | OIS 2011 | Palestra no SeminárioApresentação Joakim Appelquist | OIS 2011 | Palestra no Seminário
Apresentação Joakim Appelquist | OIS 2011 | Palestra no Seminário
 
Keeping an Open Mind on Open Source
Keeping an Open Mind on Open SourceKeeping an Open Mind on Open Source
Keeping an Open Mind on Open Source
 

Ähnlich wie Small pieces loosely joined: towards a unified theory of biodiversity for the web

Looking Under the Hood: How Your Metadata Strategy Impacts Everything You Do ...
Looking Under the Hood: How Your Metadata Strategy Impacts Everything You Do ...Looking Under the Hood: How Your Metadata Strategy Impacts Everything You Do ...
Looking Under the Hood: How Your Metadata Strategy Impacts Everything You Do ...
SPTechCon
 
eMonocot Plenary 09/2011
eMonocot Plenary 09/2011eMonocot Plenary 09/2011
eMonocot Plenary 09/2011
Edward Baker
 
Scratchpads past,present,future
Scratchpads past,present,futureScratchpads past,present,future
Scratchpads past,present,future
Edward Baker
 
Harith Alani's presentation at SSSW 2011
Harith Alani's presentation at SSSW 2011Harith Alani's presentation at SSSW 2011
Harith Alani's presentation at SSSW 2011
sssw2011
 
From WWW to GGG Ignite Athens 2012
From WWW to GGG Ignite Athens 2012From WWW to GGG Ignite Athens 2012
From WWW to GGG Ignite Athens 2012
healis
 
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
ICZN
 

Ähnlich wie Small pieces loosely joined: towards a unified theory of biodiversity for the web (20)

Small pieces loosely joined: towards a unified theory of biodiversity for the...
Small pieces loosely joined: towards a unified theory of biodiversity for the...Small pieces loosely joined: towards a unified theory of biodiversity for the...
Small pieces loosely joined: towards a unified theory of biodiversity for the...
 
Small pieces loosely joined: getting louse research online.
Small pieces loosely joined: getting louse research online.Small pieces loosely joined: getting louse research online.
Small pieces loosely joined: getting louse research online.
 
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?
 
Semantics, Sensors, and the Social Web
Semantics, Sensors, and the Social WebSemantics, Sensors, and the Social Web
Semantics, Sensors, and the Social Web
 
Scratchpad training
Scratchpad trainingScratchpad training
Scratchpad training
 
Looking Under the Hood: How Your Metadata Strategy Impacts Everything You Do ...
Looking Under the Hood: How Your Metadata Strategy Impacts Everything You Do ...Looking Under the Hood: How Your Metadata Strategy Impacts Everything You Do ...
Looking Under the Hood: How Your Metadata Strategy Impacts Everything You Do ...
 
A summary of Scratchpad functionality
A summary of Scratchpad functionalityA summary of Scratchpad functionality
A summary of Scratchpad functionality
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote Goble
 
myExperiment and the Rise of Social Machines
myExperiment and the Rise of Social MachinesmyExperiment and the Rise of Social Machines
myExperiment and the Rise of Social Machines
 
eMonocot Plenary 09/2011
eMonocot Plenary 09/2011eMonocot Plenary 09/2011
eMonocot Plenary 09/2011
 
Scratchpads past,present,future
Scratchpads past,present,futureScratchpads past,present,future
Scratchpads past,present,future
 
Scratchpads training course introduction
Scratchpads training course introductionScratchpads training course introduction
Scratchpads training course introduction
 
Harith Alani's presentation at SSSW 2011
Harith Alani's presentation at SSSW 2011Harith Alani's presentation at SSSW 2011
Harith Alani's presentation at SSSW 2011
 
Lice on the Web: A workshop on the new Phthiraptera website
Lice on the Web: A workshop on the new Phthiraptera websiteLice on the Web: A workshop on the new Phthiraptera website
Lice on the Web: A workshop on the new Phthiraptera website
 
The Inside Out Library.
The Inside Out Library. The Inside Out Library.
The Inside Out Library.
 
From WWW to GGG Ignite Athens 2012
From WWW to GGG Ignite Athens 2012From WWW to GGG Ignite Athens 2012
From WWW to GGG Ignite Athens 2012
 
Where are we going and how are we going to get there?
Where are we going and how are we going to get there?Where are we going and how are we going to get there?
Where are we going and how are we going to get there?
 
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
 

Mehr von Vince Smith

Mehr von Vince Smith (20)

DiSSCo institutional benefits
DiSSCo institutional benefitsDiSSCo institutional benefits
DiSSCo institutional benefits
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-Life
 
Moving beyond the box: automating the digitisation of insect collections
Moving beyond the box: automating the digitisation of insect collectionsMoving beyond the box: automating the digitisation of insect collections
Moving beyond the box: automating the digitisation of insect collections
 
FP7 Funded RI Project experiences: some overly honest tips from a project coo...
FP7 Funded RI Project experiences: some overly honest tips from a project coo...FP7 Funded RI Project experiences: some overly honest tips from a project coo...
FP7 Funded RI Project experiences: some overly honest tips from a project coo...
 
Use it or lose it: a hybrid model for sustaining e-infrastructures
Use it or lose it: a hybrid model for sustaining e-infrastructuresUse it or lose it: a hybrid model for sustaining e-infrastructures
Use it or lose it: a hybrid model for sustaining e-infrastructures
 
No specimen left behind: Collections digitisation at the NHM, London*
No specimen left behind:  Collections digitisation at the NHM, London*No specimen left behind:  Collections digitisation at the NHM, London*
No specimen left behind: Collections digitisation at the NHM, London*
 
SYNTHESYS 3 Overview
SYNTHESYS 3 OverviewSYNTHESYS 3 Overview
SYNTHESYS 3 Overview
 
Scratchpad 2014-introduction
Scratchpad 2014-introductionScratchpad 2014-introduction
Scratchpad 2014-introduction
 
Consolidated ViBRANT Project Final Review Presentations
Consolidated ViBRANT Project Final Review PresentationsConsolidated ViBRANT Project Final Review Presentations
Consolidated ViBRANT Project Final Review Presentations
 
Assisted restructure of web content for paper-based presentation: a look at w...
Assisted restructure of web content for paper-based presentation: a look at w...Assisted restructure of web content for paper-based presentation: a look at w...
Small pieces loosely joined: towards a unified theory of biodiversity for the web

  • 1. Small pieces loosely joined Towards a unified theory of biodiversity for the web Vincent S. Smith
  • 2. Macro taxonomy The big picture of taxonomic research Goal… • Inventory the Earth’s species • Document their relationships • “Publish” these data Data set… • 1.8 M described spp. (10M names) • 300M pages (over last 250 years) • 1.5-3B specimens People… • 4-6,000 scientists • 30-40,000 “pro-amateurs” • Many more citizen scientists?
  • 3. Micro taxonomy The practice of taxonomic research Sociology… • Parochial • Specialized experts • Fragmented & distributed Methodology… • Different (domain specific) • Communities of practice • Non transferable skills Output… • Heterogeneous & scattered • High volume, low impact • Hard to find (use) How do we integrate micro & macro taxonomy for the Web?
  • 5. What is a Scratchpad? A website for you & your community 1. Your data 2. Uploaded & tagged 3. Published & reviewed on your site
  • 6. What is a Scratchpad? A website for you & your community 1. Your data 2. Uploaded & tagged 3. Published & reviewed on your site. Fast, intuitive, fit for use
  • 7. What can Scratchpads do? Import, manage, search & browse: Specimens DNA & Phylogenies Literature Images
  • 8. What can Scratchpads do? Integration & connectivity within & between sites Specimens DNA & Phylogenies Taxonomy Literature Images
  • 9. What can Scratchpads do? In summary: +Administration +Groups +Specimens -Change your site information -Creating a group -Creating a record -Change your front page -Subscribing to a group -Importing from a spreadsheet -Change your logo +Image -Linking specimen & location records -Activity and access logs -Uploading & basic annotation -Linking specimen & pub. records +Backup -Linking image & location records +Tasks -Backing up your data -Linking image & specimen records -Creating a tasklist -Restoring your data -Linking image & publication records +Taxonomy +Bibliography -Overlay annotations on images -Importing from a spreadsheet -Creating a record +Layout -Importing from ClassificationBank -Importing from a ref. manager -Change your theme -Starting from scratch -Exporting to a reference manager -Menus -Taxonomy manager +Blog -Blocks and sidebars -Displaying a classification -Creating and adding a blog +Locations -Adding names +Custom Content -Creating a record -Deleting names -Defining a CCK -Importing from a spreadsheet -Taxonomy & panels -Importing from a spreadsheet +Pages +Users -Creating a custom view -Creating, editing, cloning & deleting -Your settings +Fileshare -Configuring the panels template -Adding a new user -Creating and using a fileshare +Panels -User roles and permissions +Forum -Adding & configuring content -Adding and editing user profile fields -Altering the forum settings -Creating a new panel -Logging in -Creating a container for a forum -Citing a Panels page +Webform -Creating a new forum +Phylogeny -Creating and using webforms -Creating a new topic inside a forum -Adding a phylogenetic tree
  • 10. What can Scratchpads do? Visual taskguide
  • 11. Current Scratchpads Sites: 70+ Users: 850+ Pages: 130k Since March 2007. Ants, Bees, Beetles, Big-headed flies, Birds, Blackflies, Ciliates, Cockroaches, Dragon Trees, Dung Beetles, False Buttonweed, Flat worms, Flies, Foraminifera, Fossil Insects, Fungus Gnats, Holometabola, Leaf-miner Flies, Lice, Lichens of Bermuda, Malvaceae, Megalastrum ferns, Milichiid flies, Mosquitoes, Mosses, Nannotax fossils, Nepticuloid moths, Palms, Pearl oysters, Polychaete worms, Scaleworms, Stick insects, Sulawesi Ferns, Termites, Triticid grasses, Weevils, Wood Ferns
  • 12. Scratchpad visitors Tracking visitors across sites Key monthly statistics - 50,000 page views - 6,000 visitors - 8 minutes on site - 50% returning visits (average per month, ’08)
  • 13. Scratchpad applications A multipurpose, flexible technology eBooks 4th Edition Howard & Moore, Birds of the world (fact checking, data compilation, 2010, funding)
  • 14. Scratchpad applications A multipurpose, flexible technology eJournals European Mosquito Bulletin (ISSN 1460-6127), Phasmid Studies (ISSN 0966-0011) (submission, review, & dissemination of articles)
  • 15. Scratchpad applications A multipurpose, flexible technology Image galleries Nanno fossils, Cockroaches, Stick insects, Flatworms, Grasses, Lichens & many more… (rapid upload, annotation, & display of images)
  • 16. How do Scratchpads work? Getting a Scratchpad Requirements • Biological focus • Agree to T&C’s (click-thru) • CC license “by-nc-sa” Application http://scratchpads.eu/apply • Maintainer • Scope/Mission/API Keys • (Sub)domain name Content • Unrestricted (overlapping) • No branding (focus on authors) • Value added
  • 17. How do Scratchpads work? Using a Scratchpad Management • User categories (maintainer, ed. contrib.) • Public / private content (flexible groups) • Admin. page (site settings & behavior) Data Input • Content types (biblio, maps, “page” etc) • Forms, managers, Excel, EndNote etc • Custom content (add or extend data types) Tagging (indexing) • Taxonomy terms (2M +) • Multiple classifications • Auto-tagging
  • 18. Autotagging Indexing data to make it findable 1. Create content (e.g. reference): journal citation mentions taxon name 2. Find terms (Autotag) 3. Submit (Index)
  • 19. Autotagging Indexing data to make it findable 1. Create content (e.g. reference) 2. Find terms (Autotag): matches taxonomy term (drag & drop) 3. Submit (Index)
  • 20. Autotagging Indexing data to make it findable 1. Create content (e.g. reference) 2. Find terms (Autotag) 3. Submit (Index): page tagged (indexed) with taxon name
  • 21. How do Scratchpads work? Indexing data to make it findable • Tagged data can be presented differently • For example as part of a traditional bibliography • Or as small windows or “panels” of data
  • 22. How do Scratchpads work? Integrating data & “publishing” in a Scratchpad Types of Scratchpad Panel… Built with “tagged data” • Personalized instructions • Common names • Bibliographic literature • Taxonomic hierarchies • Files and documents • Photographs & illustrations • Specimen records • Customized content • Phylogenetic trees
  • 23. How do Scratchpads work? Integrating data & “publishing” in a Scratchpad Dynamically built species pages
  • 24. How do Scratchpads work? Integrating data & “publishing” in a Scratchpad Browsed through a taxonomy
  • 25. How do Scratchpads work? Integrating data & “publishing” in a Scratchpad Including 3rd party content
  • 26. How do Scratchpads work? Integrating data & “publishing” in a Scratchpad With data curation tools
  • 27. How do Scratchpads work? Integrating data & “publishing” in a Scratchpad Listing all “authors”
  • 28. How do Scratchpads work? Integrating data & “publishing” in a Scratchpad Dated, permanent & citable
  • 29. How do Scratchpads work? Adjusting the panels layout Choose which panels to display
  • 30. How do Scratchpads work? An example based on the Catalogue of Life classification 2 million taxon pages Open curation at http://catlife.myspecies.info
  • 31. Biodiversity on the Web The informatics landscape
  • 32. Biodiversity on the Web Scratchpads are personalizing biodiversity science
  • 33. A unified theory of biodiversity? BHL, EOL and scholarly journals Biodiversity Heritage Library • Digitising heritage literature Encyclopedia of Life • A web page for every species Scholarly Journals • Traditional publishing
  • 34. Biodiversity Heritage Library “Digitizing biodiversity literature” • Biodiversity publications since 1469 - 5.4 million books - 800,000 monographs - 40,000 periodicals • Held by Natural History libraries E.g., NHM holds more than 1M books, 250k monographs & periodicals, 0.5M artworks • BHL partnership of 10 Nat. Hist. libraries • Sharing the digitisation of contents • Focus on out of copyright materials • Partnership with “Internet Archive” • Make the contents “findable”
  • 35. Biodiversity Heritage Library “Digitizing biodiversity literature” 1. Scan (photograph) 2. Extract text (OCR) 3. Find keywords - Taxonomic names - Author names - Citations - Collection data - Morphological data - Descriptions - Identification keys - Illustrations - Photographs 1 scribe machine, 3,500 pages per shift per day 34 scribe machines now in operation
  • 36. Biodiversity Heritage Library “Digitizing biodiversity literature” 1. Scan 2. Extract text (OCR) 3. Find keywords - Taxonomic names - Author names - Citations - Collection data - Morphological data - Descriptions - Identification keys - Illustrations - Photographs Example: Palma, R.L., and R.L.C. Pilgrim. 2002. A revision of the genus Naubates (Insecta: Phthiraptera: Philopteridae). J. R. Soc. N.Z. 32:7-60.
  • 37. Biodiversity Heritage Library “Digitizing biodiversity literature” 1. Scan 2. Extract text (OCR) 3. Find keywords - Taxonomic names - Author names - Citations - Collection data - Morphological data - Descriptions - Identification keys - Illustrations - Photographs 4. Index 5. Put on the web 6. 10M pp. to date Example: Palma, R.L., and R.L.C. Pilgrim. 2002. A revision of the genus Naubates (Insecta: Phthiraptera: Philopteridae). J. R. Soc. N.Z. 32:7-60.
  • 38. Scratchpads and BHL Creating a community built virtual taxonomic library Yes / Not yet? Scratchpads as a tool to add articles (and markup) to BHL?
  • 39. Encyclopedia of Life “A web page for every species” • A web page for all 1.8M species • $25m funding (5 years) - MacArthur and Sloan Foundations • Multiple audiences - Science & outreach • Megascience mashup - Aggregating data from the web • 10 years to complete - First draft 2008, “finished” 2017! • Struggling to find an identity? - Competition, vetting, growth, credit • A possible publishing platform? - LifeDesks / Scratchpads
  • 40. Journals Articles Scholarly communication in taxonomy & systematics • Fragmented • Mostly commercial • Data poor • Fixed audience - Hard to repurpose • Possible role for EoL? - Web publishing platform (cf Wikipedia) • Zootaxa - 15% n. spp; 50 spp. a week! • Scratchpads / EoL / Zootaxa - MS Word Template (markup) - Simultaneous publication (Figure label: Biodiversity Journals)
  • 41. Summary “Small pieces loosely joined” 1. Bringing data together Biodiversity studies are data rich, poorly archived & ever changing 2. Bringing people together Biodiversity researchers are few in number, fragmented & highly distributed 3. Bringing science together Biodiversity science demands a different approach to addressing BIG questions BIG IS DIFFERENT New opportunities & new challenges!
  • 42. Thanks… Simon Rycroft Dave Roberts Kehan Harman Ben Scott Edward Baker Irina Brake Vladimir Blagoderov
  • 46. Scratchpad management Scalable & sustainable technology Hardware, software & user support Virtual machine, open-source software, self-archiving, backed-up, multi-site configuration (easy to move & upgrade, secure & reliable, citable, screencasts, low admin., low marginal costs)
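
The three autotagging steps on slides 18 to 20 (create content, find terms, submit to the index) can be illustrated with a minimal sketch. This is not the Scratchpads implementation: the function names `autotag` and `index_content`, the record id `ref-1`, and the four-term vocabulary are all hypothetical, chosen only to show the idea of matching a citation against a controlled list of taxon names. The citation text is the example shown on slide 36.

```python
import re


def autotag(text, vocabulary):
    """Return the vocabulary terms mentioned in text, ordered by first appearance."""
    hits = []
    for term in vocabulary:
        # Whole-word match, so a term is not found inside a longer word.
        match = re.search(r"\b" + re.escape(term) + r"\b", text)
        if match:
            hits.append((match.start(), term))
    return [term for _, term in sorted(hits)]


def index_content(index, content_id, tags):
    """Record content_id under every tag, so it can be looked up by taxon name."""
    for tag in tags:
        index.setdefault(tag, []).append(content_id)
    return index


# Step 1: create content (the journal citation from slide 36).
citation = (
    "Palma, R.L., and R.L.C. Pilgrim. 2002. A revision of the genus "
    "Naubates (Insecta: Phthiraptera: Philopteridae). "
    "J. R. Soc. N.Z. 32:7-60."
)

# Hypothetical site vocabulary; a real site would hold many more terms.
vocabulary = ["Phthiraptera", "Naubates", "Philopteridae", "Pediculus"]

# Step 2: find terms (autotag).
tags = autotag(citation, vocabulary)

# Step 3: submit (index) the record under the matched terms.
index = index_content({}, "ref-1", tags)

print(tags)
print(index)
```

A plain string match like this only finds exact spellings; handling synonyms, abbreviated genus names, or multiple classifications would need more than this sketch covers.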