SlideShare a Scribd company logo
1 of 64
Download to read offline
ANTABIF Training
                                   getting your data online




                           Bruno Danis, Anton Van de Putte and Nabil Youdjou




Wednesday 26 October 11
Objectives
                   • familiarize with ANTABIF
                   • learn about architecture, functionalities
                          tools and standards we offer
                   • hands on exercises with dummy and *real*
                          data
                   • collect feedback on the fitness for use for
                          this community


Wednesday 26 October 11
On the Menu Today
                   • Background about ANTABIF
                   • Technical overview
                   • Standards, tools and resources
                   • Functionalities
                   • Future directions
                   • Hands on
Wednesday 26 October 11
Background




Wednesday 26 October 11
Antarctic Treaty
                      « In order to promote international
                      cooperation in scientific investigation in
                      Antarctica, […], the Contracting Parties
                      agree that, to the greatest extent feasible and
                      practicable: […]

                      Scientific observations and results from
                      Antarctica shall be exchanged and made
                      freely available. »


Wednesday 26 October 11
SCAR-MarBIN & ANTABIF
     •      www.scarmarbin.be
     •      www.antabif.be or www.biodiversity.aq
     •      Core funding: BELSPO.be
     •      International Polar Year 2007/08
     •      Census of Antarctic Marine Life
     •      Ocean Biogeographic Information System
     •      Global Biodiversity Information Facility
Wednesday 26 October 11
General Philosophy
     • Build an electronic ecosystem
     • Offer free and open access to data and technology
     • Expose all the (biodiversity) data and metadata, in
            multiple contexts
     • Remain community-driven, and collaborative
     • Adopt strong standardization
     • Work for science, conservation, management
Wednesday 26 October 11
Wednesday 26 October 11
Achievements
  •      The first RAMS
  •      Board of 60+ editors
  •      Feeds WoRMS, CoL and EoL
  •      17,098 taxa (RAMS)
  •      Building a dynamic RAS
  •      24,248 taxa (RAS)


Wednesday 26 October 11
Achievements
  •      1,288,441 records
  •      198 datasets
  •      5,235 taxa
  •      Feeds OBIS, GBIF
  •      Downloadable
  •      WebGIS
  •      Webservices
Wednesday 26 October 11
Achievements
 •      Up since Oct 2005
 •      open access
 •      909,915 visitors
 •      8,093,774 hits
 •      51,416,196 dld records
 •      Citations: 183
 •      Cited Publications: 38
Wednesday 26 October 11
Achievements

                          Records      SMB       ANTABIF     Progress

                          Metadata     198        7.200        36,4

                    Occurrence       1.288.441   2.659.392     2,1

                      Taxonomy        17.184      30.472       1,8



Wednesday 26 October 11
Nuts and Bolts




Wednesday 26 October 11
100% Open Source
     •      Language: Ruby

     •      Framework: Rails(ActiveRecord) and YUI

     •      (smart) Search engine: Full text (Elasticsearch-Lucene)

     •      Database/GIS server/SpatialDB: PostGresql/Geoserver/PostGIS

     •      Mapping client: OpenLayers

     •      Web services: RESTish (all resources)

     •      Protocols/Standards: DIF, DwC, DwC-A, Tapir…etc

     •      GBIF tools : HIT, IPT

     •      Hosting: BeBIF (ULB/VUB joint IT Center)

     •      Metadata systems: GCMD API (DIF)


Wednesday 26 October 11
Data flow
                                     (your point of view)

  Your data                         DwC-A             IPT                   ANTABIF


                      standardize            upload               publish




                                                        publish




                                                              Data Paper




Wednesday 26 October 11
Data flow
                          (our point of view)




Wednesday 26 October 11
Standards, tools, resources




Wednesday 26 October 11
Metadata
           Information about datasets deteriorates over time!




Wednesday 26 October 11
Metadata

               • preferred MD catalogue = Antarctic Master
                      Directory (subset of GCMD)
               • standard = DIF (Data Interchange Format)
               • used by the whole SCAR community
               • crawled by Google, Scopus...

Wednesday 26 October 11
DarwinCore


                      "A vocabulary of words that biologists,
                      hackers, and citizen scientists use to broadly
                      describe the biodiversity of life on earth."




Wednesday 26 October 11
DarwinCore Archive
       • Complete package of data
             –One file
             –Multiple files
       • Text Files…
       • Self-documenting
       • Intended to be shared/distributed

Wednesday 26 October 11
DarwinCore Archive

                            Archives always have a ‘core’ data file


                                                                 My_data.txt



                              The	
  core	
  data	
  file	
  is	
  a	
  text	
  file.



Wednesday 26 October 11
DarwinCore Archive

                            Archives always have a ‘core’ data file


                                                                 My_data.txt



                              The	
  core	
  data	
  file	
  is	
  a	
  text	
  file.



Wednesday 26 October 11
DarwinCore Archive
                            Darwin Core Archive (two files)




                          meta.xml	
  describes	
  the	
  mappings	
  in	
  the
                                core	
  data	
  file	
  (species.txt)
Wednesday 26 October 11
DarwinCore Archive
         Multiple extensions are available




             Columns	
  in	
  extensions	
  are	
  mapped	
  to	
  Darwin	
  Core	
  using	
  the	
  meta.xml	
  file

Wednesday 26 October 11
DarwinCore Archive
       Many extensions are available




            h?p://rs.gbif.org/extension/

Wednesday 26 October 11
Spreadsheet templates

                   • Metadata - describe a database or other
                          data resource. 
                   • Species Occurrence - store basic species
                          collections or observational data
                   • Species Checklists – recording and storing
                          simple annotated species checklists.



Wednesday 26 October 11
Wednesday 26 October 11
Wednesday 26 October 11
Wednesday 26 October 11
Wednesday 26 October 11
Wednesday 26 October 11
Spreadsheet processor
                   • web application: Excel spreadsheet to
                          DwC-A.
                   • Excel files contain data entry and GBIF
                          metadata profile.
                   • Worksheet supports publication of primary
                          biodiversity data
                   • Processor performs data validation and
                          transformation and returns a validated
                          DwC-A
Wednesday 26 October 11
Wednesday 26 October 11
DwC-A validator


                   • tests Darwin Core Archives
                   • validates the content against the known
                          extensions and terms registered within the
                          GBIF network for sharing biodiversity data.



Wednesday 26 October 11
Wednesday 26 October 11
IPT - Integrated Publishing Toolkit

                   • Publishing primary biodiversity data
                   • Resources
                    • Metadata
                    • Source Data (text, zip, SQL)
                    • Source Mappings
                    • Visibility
                    • Published Release
Wednesday 26 October 11
The Data Paper concept
        • A scholarly journal publication whose primary purpose is to
          describe a dataset or group of datasets, rather than to report a
          research investigation.
        • Benefits of the Data Paper
              – Scholarly credit to Data Publishers
              – Describe the data in structured human readable form
              – Bring the existence of the data to the attention of the
                scholarly community



Wednesday 26 October 11
Data Paper: Incentivising Data Discovery




Wednesday 26 October 11
Reward data publishing




                          Metadata document   Data Paper




Wednesday 26 October 11
Step-by-Step
       • Complete metadata of a dataset using metadata editor in IPT
         2.0.2
       • Generate ‘Data Paper’ manuscript (menu: Manage Resource –
         RTF Download)
       • Submit the manuscript for possible publication in one of the
         PenSoft publication (ZooKeys, PhytoKeys, BioRisks, NeoBiota).
       • Revision (if any) is carried out using metadata editor in IPT 2.0.2
         and manuscript re-submitted to PenSoft Open Journal System



Wednesday 26 October 11
Once paper is accepted
        • Digital Object Identifier is assigned to the Data Paper
        • Paper is published in (a) print format, (b) PDF format, (c)
          semantically enhanced HTML, and (d) XML is archived in
          PubMedCentral
        • DoI of the Data Paper is linked with the Persistent Identifier
          of the metadata document in the GBIF Registry
        • Data Paper is indexed by Web of Knowledge (ISI),
          PubMedCentral, Scopus, Zoological Record, Google Scholar,
          CAB Abstracts, Directory of Open Access Journal (DOAJ),
          EBSCO.

Wednesday 26 October 11
Important to consider

        • Metadata is complete in all the respect
        • All the claims are adequately substantiated
        • Data described in ‘Data Paper’ is freely available at
          the time of submission of the manuscript




Wednesday 26 October 11
ORC
                   • GBIF’s Online Resource Center
                   • Provides access to documents, best
                          practices, tools and links
                   • Wide thematic scope
                   • Different ways of accessing resources
                   • Enabling community contributions
                   • Different levels of resource access
                   • Multilanguage support
Wednesday 26 October 11
Wednesday 26 October 11
Functionalities




Wednesday 26 October 11
www.biodiversity.aq
                   • general website
                   • latest news
                   • contact
                   • sponsors
                   • governance
                   • links
Wednesday 26 October 11
www. biodiversity.aq




Wednesday 26 October 11
data. biodiversity.aq
                   • find primary biodiversity data
                   • visualize occurrence data on map
                   • view taxonomic data
                   • download data
                   • view metrics
                   • send feedback
                   • access technical documentation
Wednesday 26 October 11
data. biodiversity.aq




Wednesday 26 October 11
ipt. biodiversity.aq
                   • prepare and clean your data
                   • publish primary biodiversity data
                   • publish metadata
                   • push data and metadata to ANTABIF &
                          GBIF
                   • get a Data Paper
Wednesday 26 October 11
ipt. biodiversity.aq




Wednesday 26 October 11
afg. biodiversity.aq

                   • (nice-looking) Identification aid
                   • Publication/sharing platform for customized
                          Field Guides
                   • High quality (useful) pictures
                   • Expert Descriptions
                   • Built dynamically from various sources
Wednesday 26 October 11
afg. biodiversity.aq




Wednesday 26 October 11
share. biodiversity.aq

                   • download shared resources
                   • reports, communication material
                   • original datasets, tools, resources


Wednesday 26 October 11
share. biodiversity.aq




Wednesday 26 October 11
PIC
                   • polarcommons.org
                   • Emergency solution for orphan datasets
                   • Setup of a commons
                    • IT cloud
                    • Set of norms
                   • All polar data (IPY)
                   • Simple procedure!
Wednesday 26 October 11
www.polarcommons.org




Wednesday 26 October 11
Future directions




Wednesday 26 October 11
Architecture
                   • A network of IPTs
                   • Enhanced data flow
                   • Community involved in data management
                   • Enhanced interoperability
                   • Optimization of research efforts/resources
                   • Integrative, connected science
                   • Factual, adaptative conservation
Wednesday 26 October 11
Challenges ahead
                   • Data intensive science
                   • Data deluge
                   • Digital divides
                   • Other data types and integration
                   • Orphan datasets
                   • Cultural change
Wednesday 26 October 11
Hands on now




Wednesday 26 October 11
The rest of the day
                   • Using the portals
                   • Using data tools
                    • templates
                    • data validation
                    • documentation
                    • publishing
Wednesday 26 October 11
http://share.biodiversity.aq/training/




Wednesday 26 October 11

More Related Content

Similar to Antabif training

Biodiversity Information Networks: Dataflows for interdisciplinary sciences
Biodiversity Information Networks: Dataflows for interdisciplinary sciencesBiodiversity Information Networks: Dataflows for interdisciplinary sciences
Biodiversity Information Networks: Dataflows for interdisciplinary sciencesGBIF_NPT
 
Building A Scalable Open Source Storage Solution
Building A Scalable Open Source Storage SolutionBuilding A Scalable Open Source Storage Solution
Building A Scalable Open Source Storage SolutionPhil Cryer
 
Antarctic Biodiversity Networks: new architecture, new tools
Antarctic Biodiversity Networks: new architecture, new toolsAntarctic Biodiversity Networks: new architecture, new tools
Antarctic Biodiversity Networks: new architecture, new toolsBruno Danis
 
Open Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataOpen Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataPascal-Nicolas Becker
 
Danis biosystematics2011
Danis biosystematics2011Danis biosystematics2011
Danis biosystematics2011Bruno Danis
 
20160922 Materials Data Facility TMS Webinar
20160922 Materials Data Facility TMS Webinar20160922 Materials Data Facility TMS Webinar
20160922 Materials Data Facility TMS WebinarBen Blaiszik
 
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...datascienceiqss
 
Digital library initiatives in Turkey: A brief overview
Digital library initiatives in Turkey: A brief overviewDigital library initiatives in Turkey: A brief overview
Digital library initiatives in Turkey: A brief overviewYasar Tonta
 
The National Library of Australia's New Discovery Service
The National Library of Australia's New Discovery ServiceThe National Library of Australia's New Discovery Service
The National Library of Australia's New Discovery ServiceOCLC Research
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management EcosystemJohn Kunze
 
Putting the Pieces Together: Creating a National Educational Television Catalog
Putting the Pieces Together: Creating a National Educational Television CatalogPutting the Pieces Together: Creating a National Educational Television Catalog
Putting the Pieces Together: Creating a National Educational Television CatalogWGBH Media Library and Archives
 
Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014aceas13tern
 
Establishing a UQ Research Data Management Service
Establishing a UQ Research Data Management Service Establishing a UQ Research Data Management Service
Establishing a UQ Research Data Management Service ARDC
 
Packaging computational biology tools for broad distribution and ease-of-reuse
Packaging computational biology tools for broad distribution and ease-of-reusePackaging computational biology tools for broad distribution and ease-of-reuse
Packaging computational biology tools for broad distribution and ease-of-reuseMatthew Vaughn
 
Desktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omicsDesktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omicsDavid Wallom
 
RDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemRDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemASIS&T
 

Similar to Antabif training (20)

Biodiversity Information Networks: Dataflows for interdisciplinary sciences
Biodiversity Information Networks: Dataflows for interdisciplinary sciencesBiodiversity Information Networks: Dataflows for interdisciplinary sciences
Biodiversity Information Networks: Dataflows for interdisciplinary sciences
 
Building A Scalable Open Source Storage Solution
Building A Scalable Open Source Storage SolutionBuilding A Scalable Open Source Storage Solution
Building A Scalable Open Source Storage Solution
 
Antarctic Biodiversity Networks: new architecture, new tools
Antarctic Biodiversity Networks: new architecture, new toolsAntarctic Biodiversity Networks: new architecture, new tools
Antarctic Biodiversity Networks: new architecture, new tools
 
Open Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataOpen Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked Data
 
Danis biosystematics2011
Danis biosystematics2011Danis biosystematics2011
Danis biosystematics2011
 
20160922 Materials Data Facility TMS Webinar
20160922 Materials Data Facility TMS Webinar20160922 Materials Data Facility TMS Webinar
20160922 Materials Data Facility TMS Webinar
 
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
 
Digital library initiatives in Turkey: A brief overview
Digital library initiatives in Turkey: A brief overviewDigital library initiatives in Turkey: A brief overview
Digital library initiatives in Turkey: A brief overview
 
The National Library of Australia's New Discovery Service
The National Library of Australia's New Discovery ServiceThe National Library of Australia's New Discovery Service
The National Library of Australia's New Discovery Service
 
Guy Cochrane - Core European (& Australian) barcoding data services
Guy Cochrane - Core European (& Australian) barcoding data servicesGuy Cochrane - Core European (& Australian) barcoding data services
Guy Cochrane - Core European (& Australian) barcoding data services
 
Data Publishing in Archaeozoology
Data Publishing in ArchaeozoologyData Publishing in Archaeozoology
Data Publishing in Archaeozoology
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management Ecosystem
 
Putting the Pieces Together: Creating a National Educational Television Catalog
Putting the Pieces Together: Creating a National Educational Television CatalogPutting the Pieces Together: Creating a National Educational Television Catalog
Putting the Pieces Together: Creating a National Educational Television Catalog
 
Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014
 
Establishing a UQ Research Data Management Service
Establishing a UQ Research Data Management Service Establishing a UQ Research Data Management Service
Establishing a UQ Research Data Management Service
 
130712 antabif workshop
130712 antabif workshop130712 antabif workshop
130712 antabif workshop
 
Danis xlink2011
Danis xlink2011Danis xlink2011
Danis xlink2011
 
Packaging computational biology tools for broad distribution and ease-of-reuse
Packaging computational biology tools for broad distribution and ease-of-reusePackaging computational biology tools for broad distribution and ease-of-reuse
Packaging computational biology tools for broad distribution and ease-of-reuse
 
Desktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omicsDesktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omics
 
RDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemRDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management Ecosystem
 

More from Bruno Danis

Danis Concarneau 2016
Danis Concarneau 2016Danis Concarneau 2016
Danis Concarneau 2016Bruno Danis
 
VERSO: Ecosystem Responses in the Southern Ocean
VERSO: Ecosystem Responses in the Southern OceanVERSO: Ecosystem Responses in the Southern Ocean
VERSO: Ecosystem Responses in the Southern OceanBruno Danis
 
Register of Antarctic Marine Species - AquaRES
Register of Antarctic Marine Species - AquaRESRegister of Antarctic Marine Species - AquaRES
Register of Antarctic Marine Species - AquaRESBruno Danis
 
150712 antabif data_publication_lsssg
150712 antabif data_publication_lsssg150712 antabif data_publication_lsssg
150712 antabif data_publication_lsssgBruno Danis
 
150712 antabif dry_valleys
150712 antabif dry_valleys150712 antabif dry_valleys
150712 antabif dry_valleysBruno Danis
 
ANTABIF at the BELSPO-SOA event
ANTABIF at the BELSPO-SOA eventANTABIF at the BELSPO-SOA event
ANTABIF at the BELSPO-SOA eventBruno Danis
 
Réseaux d'information sur la biodiversité - situation outils et perspectives
Réseaux d'information sur la biodiversité - situation outils et perspectivesRéseaux d'information sur la biodiversité - situation outils et perspectives
Réseaux d'information sur la biodiversité - situation outils et perspectivesBruno Danis
 
Presentation at College de Belgique
Presentation at College de BelgiquePresentation at College de Belgique
Presentation at College de BelgiqueBruno Danis
 
Danis ANTABIF update for SCADM
Danis ANTABIF update for SCADMDanis ANTABIF update for SCADM
Danis ANTABIF update for SCADMBruno Danis
 
Danis antabif kickoff
Danis antabif kickoffDanis antabif kickoff
Danis antabif kickoffBruno Danis
 

More from Bruno Danis (20)

Danis Concarneau 2016
Danis Concarneau 2016Danis Concarneau 2016
Danis Concarneau 2016
 
Scar bs2017
Scar bs2017Scar bs2017
Scar bs2017
 
VERSO: Ecosystem Responses in the Southern Ocean
VERSO: Ecosystem Responses in the Southern OceanVERSO: Ecosystem Responses in the Southern Ocean
VERSO: Ecosystem Responses in the Southern Ocean
 
Register of Antarctic Marine Species - AquaRES
Register of Antarctic Marine Species - AquaRESRegister of Antarctic Marine Species - AquaRES
Register of Antarctic Marine Species - AquaRES
 
Mars Workshop
Mars WorkshopMars Workshop
Mars Workshop
 
150712 antabif data_publication_lsssg
150712 antabif data_publication_lsssg150712 antabif data_publication_lsssg
150712 antabif data_publication_lsssg
 
150712 antabif dry_valleys
150712 antabif dry_valleys150712 antabif dry_valleys
150712 antabif dry_valleys
 
ANTABIF at the BELSPO-SOA event
ANTABIF at the BELSPO-SOA eventANTABIF at the BELSPO-SOA event
ANTABIF at the BELSPO-SOA event
 
Antabif on mars
Antabif on marsAntabif on mars
Antabif on mars
 
Mars intro
Mars introMars intro
Mars intro
 
Danis_CIBIM
Danis_CIBIMDanis_CIBIM
Danis_CIBIM
 
Réseaux d'information sur la biodiversité - situation outils et perspectives
Réseaux d'information sur la biodiversité - situation outils et perspectivesRéseaux d'information sur la biodiversité - situation outils et perspectives
Réseaux d'information sur la biodiversité - situation outils et perspectives
 
Presentation at College de Belgique
Presentation at College de BelgiquePresentation at College de Belgique
Presentation at College de Belgique
 
Danis ANTABIF update for SCADM
Danis ANTABIF update for SCADMDanis ANTABIF update for SCADM
Danis ANTABIF update for SCADM
 
Antabif general
Antabif generalAntabif general
Antabif general
 
Danis antabif kickoff
Danis antabif kickoffDanis antabif kickoff
Danis antabif kickoff
 
Danis egbamm
Danis egbammDanis egbamm
Danis egbamm
 
Danis lsssg
Danis lsssgDanis lsssg
Danis lsssg
 
Danis keynote
Danis keynoteDanis keynote
Danis keynote
 
Danis&raymond
Danis&raymondDanis&raymond
Danis&raymond
 

Recently uploaded

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Recently uploaded (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Antabif training

  • 1. ANTABIF Training getting your data online Bruno Danis, Anton Van de Putte and Nabil Youdjou Wednesday 26 October 11
  • 2. Objectives • familiarize with ANTABIF • learn about architecture, functionalities tools and standards we offer • hands on exercises with dummy and *real* data • collect feedback on the fitness for use for this community Wednesday 26 October 11
  • 3. On the Menu Today • Background about ANTABIF • Technical overview • Standards, tools and resources • Functionalities • Future directions • Hands on Wednesday 26 October 11
  • 5. Antarctic Treaty « In order to promote international cooperation in scientific investigation in Antarctica, […], the Contracting Parties agree that, to the greatest extent feasible and practicable: […] Scientific observations and results from Antarctica shall be exchanged and made freely available. » Wednesday 26 October 11
  • 6. SCAR-MarBIN & ANTABIF • www.scarmarbin.be • www.antabif.be or www.biodiversity.aq • Core funding: BELSPO.be • International Polar Year 2007/08 • Census of Antarctic Marine Life • Ocean Biogeographic Information System • Global Biodiversity Information Facility Wednesday 26 October 11
  • 7. General Philosophy • Build an electronic ecosystem • Offer free and open access to data and technology • Expose all the (biodiversity) data and metadata, in multiple contexts • Remain community-driven, and collaborative • Adopt strong standardization • Work for science, conservation, management Wednesday 26 October 11
  • 9. Achievements • The first RAMS • Board of 60+ editors • Feeds WoRMS, CoL and EoL • 17,098 taxa (RAMS) • Building a dynamic RAS • 24,248 taxa (RAS) Wednesday 26 October 11
  • 10. Achievements • 1,288,441 records • 198 datasets • 5,235 taxa • Feeds OBIS, GBIF • Downloadable • WebGIS • Webservices Wednesday 26 October 11
  • 11. Achievements • Up since Oct 2005 • open access • 909,915 visitors • 8,093,774 hits • 51,416,196 dld records • Citations: 183 • Cited Publications: 38 Wednesday 26 October 11
  • 12. Achievements Records SMB ANTABIF Progress Metadata 198 7.200 36,4 Occurrence 1.288.441 2.659.392 2,1 Taxonomy 17.184 30.472 1,8 Wednesday 26 October 11
  • 13. Nuts and Bolts Wednesday 26 October 11
  • 14. 100% Open Source • Language: Ruby • Framework: Rails(ActiveRecord) and YUI • (smart) Search engine: Full text (Elasticsearch-Lucene) • Database/GIS server/SpatialDB: PostGresql/Geoserver/PostGIS • Mapping client: OpenLayers • Web services: RESTish (all resources) • Protocols/Standards: DIF, DwC, DwC-A, Tapir…etc • GBIF tools : HIT, IPT • Hosting: BeBIF (ULB/VUB joint IT Center) • Metadata systems: GCMD API (DIF) Wednesday 26 October 11
  • 15. Data flow (your point of view) Your data DwC-A IPT ANTABIF standardize upload publish publish Data Paper Wednesday 26 October 11
  • 16. Data flow (our point of view) Wednesday 26 October 11
  • 18. Metadata Information about datasets deteriorates over time! Wednesday 26 October 11
  • 19. Metadata • preferred MD catalogue = Antarctic Master Directory (subset of GCMD) • standard = DIF (Data Interchange Format) • used by the whole SCAR community • crawled by Google, Scopus... Wednesday 26 October 11
  • 20. DarwinCore "A vocabulary of words that biologists, hackers, and citizen scientists use to broadly describe the biodiversity of life on earth." Wednesday 26 October 11
  • 21. DarwinCore Archive • Complete package of data –One file –Multiple files • Text Files… • Self-documenting • Intended to be shared/distributed Wednesday 26 October 11
  • 22. DarwinCore Archive Archives always have a ‘core’ data file My_data.txt The  core  data  file  is  a  text  file. Wednesday 26 October 11
  • 23. DarwinCore Archive Archives always have a ‘core’ data file My_data.txt The  core  data  file  is  a  text  file. Wednesday 26 October 11
  • 24. DarwinCore Archive Darwin Core Archive (two files) meta.xml  describes  the  mappings  in  the core  data  file  (species.txt) Wednesday 26 October 11
  • 25. DarwinCore Archive Multiple extensions are available Columns  in  extensions  are  mapped  to  Darwin  Core  using  the  meta.xml  file Wednesday 26 October 11
  • 26. DarwinCore Archive Many extensions are available h?p://rs.gbif.org/extension/ Wednesday 26 October 11
  • 27. Spreadsheet templates • Metadata - describe a database or other data resource.  • Species Occurrence - store basic species collections or observational data • Species Checklists – recording and storing simple annotated species checklists. Wednesday 26 October 11
  • 33. Spreadsheet processor • web application: Excel spreadsheet to DwC-A. • Excel files contain data entry and GBIF metadata profile. • Worksheet supports publication of primary biodiversity data • Processor performs data validation and transformation and returns a validated DwC-A Wednesday 26 October 11
  • 35. DwC-A validator • tests Darwin Core Archives • validates the content against the known extensions and terms registered within the GBIF network for sharing biodiversity data. Wednesday 26 October 11
  • 37. IPT - Integrated Publishing Toolkit • Publishing primary biodiversity data • Resources • Metadata • Source Data (text, zip, SQL) • Source Mappings • Visibility • Published Release Wednesday 26 October 11
  • 38. The Data Paper concept • A scholarly journal publication whose primary purpose is to describe a dataset or group of datasets, rather than to report a research investigation. • Benefits of the Data Paper – Scholarly credit to Data Publishers – Describe the data in structured human readable form – Bring the existence of the data to the attention of the scholarly community Wednesday 26 October 11
  • 39. Data Paper: Incentivising Data Discovery Wednesday 26 October 11
  • 40. Reward data publishing Metadata document Data Paper Wednesday 26 October 11
  • 41. Step-by-Step • Complete metadata of a dataset using metadata editor in IPT 2.0.2 • Generate ‘Data Paper’ manuscript (menu: Manage Resource – RTF Download) • Submit the manuscript for possible publication in one of the PenSoft publication (ZooKeys, PhytoKeys, BioRisks, NeoBiota). • Revision (if any) is carried out using metadata editor in IPT 2.0.2 and manuscript re-submitted to PenSoft Open Journal System Wednesday 26 October 11
  • 42. Once paper is accepted • Digital Object Identifier is assigned to the Data Paper • Paper is published in (a) print format, (b) PDF format, (c) semantically enhanced HTML, and (d) XML is archived in PubMedCentral • DoI of the Data Paper is linked with the Persistent Identifier of the metadata document in the GBIF Registry • Data Paper is indexed by Web of Knowledge (ISI), PubMedCentral, Scopus, Zoological Record, Google Scholar, CAB Abstracts, Directory of Open Access Journal (DOAJ), EBSCO. Wednesday 26 October 11
  • 43. Important to consider • Metadata is complete in all the respect • All the claims are adequately substantiated • Data described in ‘Data Paper’ is freely available at the time of submission of the manuscript Wednesday 26 October 11
  • 44. ORC • GBIF’s Online Resource Center • Provides access to documents, best practices, tools and links • Wide thematic scope • Different ways of accessing resources • Enabling community contributions • Different levels of resource access • Multilanguage support Wednesday 26 October 11
  • 47. www.biodiversity.aq • general website • latest news • contact • sponsors • governance • links Wednesday 26 October 11
  • 49. data. biodiversity.aq • find primary biodiversity data • visualize occurrence data on map • view taxonomic data • download data • view metrics • send feedback • access technical documentation Wednesday 26 October 11
  • 51. ipt. biodiversity.aq • prepare and clean your data • publish primary biodiversity data • publish metadata • push data and metadata to ANTABIF & GBIF • get a Data Paper Wednesday 26 October 11
  • 53. afg. biodiversity.aq • (nice-looking) Identification aid • Publication/sharing platform for customized Field Guides • High quality (useful) pictures • Expert Descriptions • Built dynamically from various sources Wednesday 26 October 11
  • 55. share. biodiversity.aq • download shared resources • reports, communication material • original datasets, tools, resources Wednesday 26 October 11
  • 57. PIC • polarcommons.org • Emergency solution for orphan datasets • Setup of a commons • IT cloud • Set of norms • All polar data (IPY) • Simple procedure! Wednesday 26 October 11
  • 60. Architecture • A network of IPTs • Enhanced data flow • Community involved in data management • Enhanced interoperability • Optimization of research efforts/resources • Integrative, connected science • Factual, adaptative conservation Wednesday 26 October 11
  • 61. Challenges ahead • Data intensive science • Data deluge • Digital divides • Other data types and integration • Orphan datasets • Cultural change Wednesday 26 October 11
  • 62. Hands on now Wednesday 26 October 11
  • 63. The rest of the day • Using the portals • Using data tools • templates • data validation • documentation • publishing Wednesday 26 October 11