3. Hardisty, Nature 502, 171 (2013)
BUT: predictive ecology has substantial data needs
Harfoot, BIH2013, Rome, 2013
The big question
What is the future of the biological world?
Imagine if we could:
…Predict community level dynamics of ecosystems at
scales from local to global, based on the ecology and
biology of all individual organisms
4. Decentralized biodiversity infrastructure
Plants
3,400 Herbaria worldwide
10,000 Associate curators and specialists
350,000,000 specimens in collections
180,000,000 specimens digitized
2,000,000,000 specimens including animals
Source: gbif.org; http://sciweb.nybg.org/science2/IndexHerbariorum.asp
5. One collection’s view of the world
Nationaal Herbarium Nederland collection on GBIF
Source: http://www.gbif.org/dataset/7b33b040-f762-11e1-a439-00145eb45e9a
6. 200,000,000+ printed pages
1,900,000 species described
20,000,000+ species treatments
17,000 new species per year
Biodiversity libraries
BUT: The data are hidden
Incomplete digitization
Publications are not semantically
enhanced
Collections are incomplete
Data is not linked
Most data are not open
7. Names as information tags in life sciences
Names
Characteristics
Publications
GenesCollections
Specimens
Distribution
8. Coordination and Policy Development in Preparation for a
European Open Biodiversity Knowledge Management
System
Supported by the European Commission through its FP7 research funding programme
(pro-)iBiosphere
10. Create digital objects
+ Identifiers and resolvers
+ Open Access
+ Legislation
+ Adequate infrastructure
+ Sustainable and permanent infrastructure
+ Reliable services for partners in research projects and society
Seamless Global Virtual Research Knowledge Management System
(European Open Biodiversity Knowledge Management System)
Biodiversity Knowledge Management System
12. free of charge online access to EU-funded research
…
essential for Europe's ability to enhance its economic
performance and improve the capacity to compete through
knowledge
http://ec.europa.eu/digital-agenda/en/open-access-scientific-information
13. Access to digital data sets resulting from federally funded research
….
will accelerate scientific breakthroughs and innovation, promote
entrepreneurship, and enhance economic growth and job
creation.
http://www.whitehouse.gov/blog/2013/02/22/expanding-public-access-results-federally-funded-research
17. … but additional messages:
Direct access to digital content
Re-acting: The biodiversity community
is re-acting together to novel
environmental, scientific and societal
challenges
18. … but additional messages:
Direct access to digital content
Re-acting: The biodiversity community is
re-acting together to novel environ-
mental, scientific and societal challenges
Third leg: digital content as commitment
19. … but additional messages:
Direct access to digital content
Re-acting: The biodiversity community is
re-acting together to novel environ-
mental, scientific and societal challenges
Third leg: digital content as commitment
Catalyst for discussions
26. Bouchout Declaration
But: Gestion Collaborative du Libre Accès aux
Connaissances sur la Biodiversité (Open Biodiversity
Knowledge Management)
En tant que signataires, nous encourageons une
approche globale de la Gestion Collaborative du Libre
Accès aux Connaissances sur la Biodiversité, basée sur les
principes fondamentaux suivants :
28. Bouchout Declaration
Licenses
Licences ou clauses de renonciation qui accordent
ou permettent à tous les utilisateurs un droit
libre, irrévocable et international de copier,
utiliser, diffuser, transmettre et rendre public le
travail effectué, ainsi que d’exploiter le travail
réalisé et de faire des travaux dérivés, sous
réserve d’une attribution conforme aux pratiques
communautaires, tout en reconnaissant que les
fournisseurs pourront développer des produits
commerciaux avec des licences plus restrictives.
29. Bouchout Declaration
Licenses ctd.
Data and research results are not copyrighted
and thus no license should be added
If possible, publications should be created as
Open Access works
31. Bouchout Declaration
Attribution
Suivi de l’utilisation des identifiants dans les liens
et les citations pour garantir que les sources et les
fournisseurs de données se voient attribuer la
reconnaissance de leurs contributions
34. Bouchout Declaration
Persistent Identifiers
Identifiants permanents pour les objets de
données et les objets physiques tels que les
spécimens, les images et les traitements
taxonomiques avec des mécanismes standard
pour diriger les utilisateurs directement vers les
contenus et les données
35. Bouchout Declaration
Linked Open Data
Liaison des données utilisant des vocabulaires
communs, à la fois au sein et au-delà de la
biodiversité, qui permettent la participation à une
infrastructure en nuage des Données Liées en
Libre Accès
Where does the data come from?
Data comes from the institutions and legacy literature
116,000,000 plant records
Collections, GBIF series
None is complete on its own, the power is in aggretation; data highly complimentary, none complete alone, power in aggregation