The biodiversity informatics landscape: a systematics perspective
Vince Smith

Biodiversity Informatics Horizons
Rome, 3-6 Sept 2013
Overview
1. Background – the biodiversity informatics domain
• The problem (i.e. why are we here)
• Representations of the domain (data, infrastructures, projects…)
• Toward an integrated view (strategy)
2. Social challenges
• Openness
• Collaboration and communities
• Standards, identifiers & protocols
3. (Big) data challenges
• Mobilizing existing data (metadata, literature, collections)
• New forms of data ([meta]genomics & observatories)
4. Synthetic challenges
• Data Aggregation & linking
• Visualisation
• Modeling
5. Next steps (data infrastructures & funding)
• Lessons learned: new informatics opportunities in H2020
1. Background
The problem – integrating biodiversity research
How do we join up these activities?
What infrastructures do we need?
(technologies, tools, standards…)
What processes do we need?
(Modelling, workflows…)
What data do we need?
(Genes, localities…)

How do we use this as a tool?
Species conservation & protected areas
Impacts of human development
Biodiversity & human health
Impacts of climate change
Food, farming & biofuels
Invasive alien species
Natural History – the foundation
Darwin’s “tangled bank”…

“It is interesting to contemplate a tangled bank, clothed with many plants of many kinds, …, so different from each other, and dependent upon each other in so complex a manner, have all been produced by laws acting around us.”
C. Darwin, “On the Origin of Species”, 1859

Systematics, a foundational “law”
Ecological interactions
A granular understanding of biodiversity

[Figure: a hierarchy from genes (GenBank) through individuals, populations and species to interactions; panel labels: local populations, global biodiversity, biological networks]
An informatician’s view of biodiversity

[Figure: diagram linking data, products and systems. Labels: GenBank, MorphBank, Interactions, Geospatial, Census, Genotype, Phenotype, Biotic Interactions, Environment, Human Effects, IUCN, Pop. data, Niche & Pop. Ecology, TreeBase, Biodiversity Loss, GBIF, Phylogenetic Trees, IPNI, Zoobank, Taxonomy, AquaMaps, Geographic Distributions, Extent of Occurrence, Range Maps, Conservation & management, Forecasts of Change. Legend: Data, Products, Systems]


Key problems
• Landscape is complex, fragmented & hard to navigate
• Many audiences (policy makers, scientists, amateurs, citizen scientists)
• Many scales (global solutions to local problems)

Figure adapted from
Peterson et al 2010
A project centric view of biodiversity
Scan / Mark-up
PLAZI
Inotaxa
BHL
eFloras

CDM
GNA (NameBank)

Phylogenetic
Tree of Life
TreeBase
CIPRES

Descriptive /
classification
EoL
Scratchpads
CATE
MorphoBank
Wikipedia

Molecular
Databases
NCBI/EMBL/DDBJ
CBoL
Barcode of Life
Initiative

Bibliographic
IPNI
Google Scholar
Connotea
ViTaL
ISI

Institutional
EMu (=MOA)
Recorder

uBio

TDWG
Checklists

Identification
Key2Nature
IdentifyLife

Inter-Institutional
Synthesis
BCI
BioCASE
GeoCASE
MaNIS

PESI:
ERMS
Fauna Europaea
Euro+Med Plantbase
ORBIS
WoRMS
Flora Europaea

Nomenclators
Index Fungorum
ZooBank
IPNI
(Kew/AUS/Harvard)
ING
AFD/APC/APUI
NZOR
CoL (Sp2000 & ITIS)
ZooRecord

LifeWatch

GBIF
Biodiversity
ALA
CONABIO
CRIA (Brazil)
IUCN
SEEK
OPAL
DAISIE
iNaturalist

A snapshot from 2009, “the dance of the initiatives”
The strategic view: community informatics challenges

GBIF GBIC Report
(Coming soon)

EU Biodiversity Strategy
(2011)

Biodiv. Inf. Challenges
(2013)

Grand Challenges for Biodiversity Informatics
(integrating activities for H2020)
2. Social challenges
- Openness
- Collaboration and communities
- Standards, identifiers & links
Openness in biodiversity informatics
“A piece of data or content is open if anyone is free to use, reuse, and redistribute it subject, at most, to the requirement to attribute and/or share-alike.” http://opendefinition.org/

• Sharing data is a foundation for our activities
• Normal practice in some communities (molecular)
• Mandated by some funders & governments
Many kinds of openness:
• Open Access
• Open Data
• Open Science
• Open Source

E. Archambault et al., Proportion of Open Access Peer-Reviewed Papers at the
European and World Levels--2004-2011, June 2013, Science-Metrix Inc.

“One-half of all papers are now freely available
within a year or two of publication”
Incentivise through credit via citation (e.g. BDJ)

Need to continue to incentivise openness
What are Scratchpads? (http://scratchpads.eu)
Collaboration & communities
Making taxonomy a team sport
e.g., Scratchpad Virtual Research Communities

Taxa, Projects, Regions, Societies
544 Scratchpad communities by 6,644 active registered users, covering 91,631 taxa in 535,317 pages.
In total more than 1,300,000 visitors.
81 paper citations in 2012
Our infrastructures need to facilitate collaboration
Standards, identifiers & protocols
Facilitating data sharing across communities
A foundation for integration
Key requirements:
• Need to be inclusive, practical & extensible
• Readable by humans & machines
• Widely used
Good examples:
• Darwin Core
• CrossRef & DataCite DOIs
• ORCID author identifiers
Gaps / Problems
• Reuse & persistence of identifiers
• Vocabularies & ontologies (time consuming / little reward)
Potential solutions
• Build them into our credit systems
• Show semantic reasoning potential (LOD & RDF demonstrators)
Standards can’t be developed in isolation – they must be used
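To make "readable by humans & machines" concrete, here is a minimal Python sketch of one specimen record keyed by Darwin Core term names. The term names used (occurrenceID, basisOfRecord, scientificName, eventDate, decimalLatitude, decimalLongitude) are Darwin Core terms; the record values, the choice of which terms to treat as required, and the function name check_record are assumptions made for illustration and are not taken from the talk or mandated by the standard.

```python
# Illustrative only: a flat record using Darwin Core term names as keys.
# Treating these three terms as required is an assumption of this sketch.
REQUIRED_TERMS = ("occurrenceID", "basisOfRecord", "scientificName")


def check_record(record):
    """Return a list of human-readable problems found in one record."""
    problems = []
    for term in REQUIRED_TERMS:
        if not str(record.get(term, "")).strip():
            problems.append(f"missing {term}")
    # Coordinates are optional here, but must be in range when present.
    for term, limit in (("decimalLatitude", 90.0), ("decimalLongitude", 180.0)):
        value = record.get(term)
        if value is None:
            continue
        try:
            number = float(value)
        except (TypeError, ValueError):
            problems.append(f"{term} is not a number")
            continue
        if abs(number) > limit:
            problems.append(f"{term} out of range")
    return problems


# Hypothetical example record; every value is invented for illustration.
example = {
    "occurrenceID": "urn:example:specimen:0001",
    "basisOfRecord": "PreservedSpecimen",
    "scientificName": "Apis mellifera Linnaeus, 1758",
    "eventDate": "1905-06-12",
    "decimalLatitude": "51.4967",
    "decimalLongitude": "-0.1764",
}

if __name__ == "__main__":
    print(check_record(example))
```

Because the keys are shared vocabulary rather than local column names, the same check could in principle run over records from any provider that publishes with these terms, which is the practical sense in which a widely used standard becomes a foundation for integration.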
3. (Big) data challenges
- Mobilising existing data
- New forms of data
Mobilising existing data
Collections, literature & metadata
How can we quickly, efficiently and cost-effectively mobilise biological data at scale?
Collections
• 1.5-3B specimens in collections worldwide
• Fragmented efforts / heterogeneity of process
• Needs ambition (NHM: 20M in 5 yrs.) & coord.
Literature
• >300M pages of biodiversity literature
• BHL (41M pp.) an example of what can be done
• Needs sustainability & article metadata

NHM
Digitisation

BHL
literature

Metadata registries
• Data about data (cheaper & scalable)
• e.g. bibliographic data, dataset portals
Informatics challenges
• Storage & persistence
• Automation & annotation
• Incentives to digitise & fitness for use

Bibliography of Life
(RefFinder & RefBank)
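As a back-of-envelope check on the collections figures on this slide, the sketch below asks how long the worldwide backlog would take at the rate implied by the NHM ambition (20M specimens in 5 years). The inputs are the slide's own numbers; the assumption that a single programme works through everything at that constant rate is ours, purely to illustrate why coordination matters.

```python
def years_needed(total_items, items_per_year):
    """Years to digitise total_items at a constant yearly rate."""
    return total_items / items_per_year


# Slide figure: NHM ambition of 20M specimens in 5 years.
nhm_rate = 20_000_000 / 5

# Slide figure: 1.5-3B specimens in collections worldwide.
low = years_needed(1_500_000_000, nhm_rate)
high = years_needed(3_000_000_000, nhm_rate)

if __name__ == "__main__":
    print(f"{nhm_rate:,.0f} specimens per year")
    print(f"{low:.0f} to {high:.0f} years at that rate")
```

Under that deliberately naive assumption the answer is measured in centuries, so even an ambitious single-institution programme only makes sense as one part of a coordinated, parallel effort.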
Mobilising & managing new forms of data
Metagenomics & ecological observatories
These new data types do not depend on
traditional taxonomy & systematics
New Molecular approaches
• Molecular detection & monitoring of organisms is routine
• Metagenomics (env. sequencing) commonplace
• Becoming the 1° route to understanding biodiversity

3-4 June 2013, NHM

Ecological observatories
• Automated biodiversity detection
• Remote sensing (e.g. satellite & acoustic data, drones, camera traps)
• Monitoring conspicuous, rare or invasive spp. (algal blooms, palms)
• Monitoring human activity
Informatics challenges
• Very large quantities of data (2.5-10TB per researcher per yr.)
• Doesn’t map well to existing data infrastructures
• Challenges current networking & storage capacity
• Digital and physical collections become equally important?
22 July, 2013
4. Synthetic challenges
- Data aggregation & linking
- Visualisation
- Modeling
Aggregation & linking
Portals bringing together distributed & diverse forms of data
Giving consistent and comprehensive access
to all biological data

Several approaches, with different advantages
• Tightly coupled to a few data sources (e.g. eMonocot, CDM)
  Selective & accurate but hard to scale
  eMonocot: 276k taxa, 8k images, 13 keys & 3 phylogenies
• Loosely coupled to many sources (e.g. BioNames, Wikipedia)
  Scalable but less accurate
  BioNames: 3M taxon names, 93k phylogenies & 28k articles
• Hybrid forms (e.g. Canadensys, EOL, GBIF)
Informatics challenges
• Portals are hard to sustain
• New methods of data discovery & access
• Create new windows (views) on content
• New data structures, new types of database
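The trade-off between loosely and tightly coupled aggregation can be sketched in a few lines of Python. This is not how BioNames or any named portal works; it is a hypothetical illustration of loose coupling by name string, where records from any number of sources are grouped under a crudely normalised taxon name. The source labels, payloads and function names are all invented.

```python
from collections import defaultdict


def normalise(name):
    """Crude matching key: lower-case and collapse whitespace."""
    return " ".join(name.lower().split())


def link_by_name(sources):
    """Group records from many sources under a shared normalised name.

    `sources` maps a source label to a list of (name, payload) pairs.
    """
    linked = defaultdict(list)
    for label, records in sources.items():
        for name, payload in records:
            linked[normalise(name)].append((label, payload))
    return dict(linked)


# Hypothetical inputs; labels and payloads are invented for illustration.
sources = {
    "articles": [("Apis mellifera", "article-1"), ("Apis  Mellifera", "article-2")],
    "trees": [("Apis mellifera", "tree-1"), ("Apis mellifica", "tree-2")],
}

if __name__ == "__main__":
    for key, hits in link_by_name(sources).items():
        print(key, hits)
```

Adding another source costs nothing more than another dictionary entry, which is why this style scales. The variant name "Apis mellifica" lands under its own key, though, and that is the accuracy cost: resolving such variants needs the curated taxonomy that tightly coupled portals invest in.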
Visualisation
Visually synthesizing large, linked biodiversity datasets
Making biodiversity data accessible &
understandable
Research opportunities
• Tools integration (e.g. GeoCat, CartoDB)
• Span multiple audiences
Outreach opportunities
• Visually compelling story telling
• Crowdsourcing tools (e.g. Notes From Nature)
Exploiting new technologies
• Touch screens
• Mobile
• Location awareness
Informatics challenges
• Very specific to individual use cases
• Sustainability issues

NHM specimen records
http://data.nhm.ac.uk/globe/
Modeling the biosphere: a (the) 30 year goal?
Reasoning across large, linked biodiversity datasets
A clear, singular, long-term vision, to which biodiversity data can contribute
Conceptually has many potential uses
• Identifying trends
• Explaining patterns
• Making predictions
• Real-time alerts (when data contradict current knowledge)

• The ultimate policy tool
Major informatics challenges
• Technically very difficult (many years off)
• Needs effective prototypes & platforms
• Some first steps e.g. OBOE, LEFT

Nature 2013, doi:10.1038/493295a
5. Next steps
Lessons learned: new opportunities in H2020
PATHWAYS TO INTEGRATION
(by addressing these social, data & synthetic challenges)

• Break out of discipline-, technical- & project-centric activities (they are unsustainable, inefficient & bad for science)

• Integrate & build on existing programmes where possible (LifeWatch is a potential umbrella for these activities)

• Bridge the disconnect between informaticians & users (make the users informaticians & the informaticians users)

• Our products are well suited to address these challenges
• Use H2020 as a mechanism to achieve integration

How do we join up these activities?
QUESTIONS
Possible biodiversity informatics design principles*
= experience from 7 years with the Scratchpads
= lessons for infrastructures in H2020?
1. Start with needs - focus on real user needs (not just the ‘official process’)
2. Do less - if someone else is doing it, link to it or use it
3. Design with data - prototype and test with real users on the live website
4. Do the hard work to make it simple - let the computer take the strain
5. Iterate. Then iterate again. - iteration reduces risk & is more sustainable
6. Build for inclusion – it’s easier in the long run
7. Understand context - we are designing for people, not a screen or a brand
8. Build digital services, not websites - there is life beyond the website
9. Be consistent, not uniform - every circumstance is different
10. Make things open: it makes things better - it’s more sustainable
*https://www.gov.uk/designprinciples
Mobilising existing data: how to prioritise
A LITTLE: Digitise a few things & invest in depth, description & promotion (content, fun, learning, outreach)
A LOT: Digitise lots of things, put little effort into description & promotion (aggregation, collections management, metadata, data mining, research)


Nick Poole, UK Collections Trust
Collaboration & communities
Making taxonomy a team sport
[Figure: average dates when increasing numbers of taxonomists were involved in describing species; panels: cone snails, birds, mammals, amphibians, spiders, plants. Joppa et al, 2011]
• Very few recent single author papers
• Most (fundable) science is cross-disciplinary
• Need to incentivise data curation & annotation
• Need mechanisms to share annotations
Our infrastructures need to facilitate collaboration
The biodiversity informatics landscape: a systematics perspective

Weitere ähnliche Inhalte

Was ist angesagt?

Scratchpads introductory presentation 45mins
Scratchpads introductory presentation   45minsScratchpads introductory presentation   45mins
Scratchpads introductory presentation 45minsDimitrios Koureas
 
BioDBCore: Current Status and Next Developments
BioDBCore: Current Status and Next DevelopmentsBioDBCore: Current Status and Next Developments
BioDBCore: Current Status and Next DevelopmentsPascale Gaudet
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18Dag Endresen
 
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyJim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyICZN
 
Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Stella Wisdom
 
#HepaticaWeek April 2016, GBIF data publishing
#HepaticaWeek April 2016, GBIF data publishing#HepaticaWeek April 2016, GBIF data publishing
#HepaticaWeek April 2016, GBIF data publishingDag Endresen
 
Per de Place Bjørn - Revolutionizing taxonomy through an open-access web-regi...
Per de Place Bjørn - Revolutionizing taxonomy through an open-access web-regi...Per de Place Bjørn - Revolutionizing taxonomy through an open-access web-regi...
Per de Place Bjørn - Revolutionizing taxonomy through an open-access web-regi...ICZN
 
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...ICZN
 
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Ross Mounce
 
Text and Data Mining explained at FTDM
Text and Data Mining explained at FTDMText and Data Mining explained at FTDM
Text and Data Mining explained at FTDMpetermurrayrust
 
GBIF data publishing. GBIF seminar in Bergen. 2016-12-14
GBIF data publishing. GBIF seminar in Bergen. 2016-12-14GBIF data publishing. GBIF seminar in Bergen. 2016-12-14
GBIF data publishing. GBIF seminar in Bergen. 2016-12-14Dag Endresen
 
Museum impact: linking-up specimens with research published on them
Museum impact: linking-up specimens with research published on themMuseum impact: linking-up specimens with research published on them
Museum impact: linking-up specimens with research published on themRoss Mounce
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureRoss Mounce
 
Content Mining at Wellcome Trust
Content Mining at Wellcome TrustContent Mining at Wellcome Trust
Content Mining at Wellcome Trustpetermurrayrust
 
20140317 pi b_nmbe_journal_club
20140317 pi b_nmbe_journal_club20140317 pi b_nmbe_journal_club
20140317 pi b_nmbe_journal_clubagosti
 
ContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UKContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UKpetermurrayrust
 
Modern Tools & Rationales for 21st Century Research
Modern Tools & Rationales  for 21st Century ResearchModern Tools & Rationales  for 21st Century Research
Modern Tools & Rationales for 21st Century ResearchRoss Mounce
 
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014Dag Endresen
 
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...Dag Endresen
 

Was ist angesagt? (20)

Scratchpads introductory presentation 45mins
Scratchpads introductory presentation   45minsScratchpads introductory presentation   45mins
Scratchpads introductory presentation 45mins
 
BioDBCore: Current Status and Next Developments
BioDBCore: Current Status and Next DevelopmentsBioDBCore: Current Status and Next Developments
BioDBCore: Current Status and Next Developments
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18
 
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyJim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
 
Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods
 
#HepaticaWeek April 2016, GBIF data publishing
#HepaticaWeek April 2016, GBIF data publishing#HepaticaWeek April 2016, GBIF data publishing
#HepaticaWeek April 2016, GBIF data publishing
 
Per de Place Bjørn - Revolutionizing taxonomy through an open-access web-regi...
Per de Place Bjørn - Revolutionizing taxonomy through an open-access web-regi...Per de Place Bjørn - Revolutionizing taxonomy through an open-access web-regi...
Per de Place Bjørn - Revolutionizing taxonomy through an open-access web-regi...
 
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
 
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
 
Text and Data Mining explained at FTDM
Text and Data Mining explained at FTDMText and Data Mining explained at FTDM
Text and Data Mining explained at FTDM
 
GBIF data publishing. GBIF seminar in Bergen. 2016-12-14
GBIF data publishing. GBIF seminar in Bergen. 2016-12-14GBIF data publishing. GBIF seminar in Bergen. 2016-12-14
GBIF data publishing. GBIF seminar in Bergen. 2016-12-14
 
Museum impact: linking-up specimens with research published on them
Museum impact: linking-up specimens with research published on themMuseum impact: linking-up specimens with research published on them
Museum impact: linking-up specimens with research published on them
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | Future
 
Content Mining at Wellcome Trust
Content Mining at Wellcome TrustContent Mining at Wellcome Trust
Content Mining at Wellcome Trust
 
20140317 pi b_nmbe_journal_club
20140317 pi b_nmbe_journal_club20140317 pi b_nmbe_journal_club
20140317 pi b_nmbe_journal_club
 
ContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UKContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UK
 
Modern Tools & Rationales for 21st Century Research
Modern Tools & Rationales  for 21st Century ResearchModern Tools & Rationales  for 21st Century Research
Modern Tools & Rationales for 21st Century Research
 
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014
 
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
 
Ifla Bhl080208cr
Ifla Bhl080208crIfla Bhl080208cr
Ifla Bhl080208cr
 

Ähnlich wie The biodiversity informatics landscape: a systematics perspective

Delivering biodiversity knowledge in the information age
Delivering biodiversity knowledge in the information ageDelivering biodiversity knowledge in the information age
Delivering biodiversity knowledge in the information ageVince Smith
 
Vince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince Smith
 
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...e-ROSA
 
Building data infrastructures for science
Building data infrastructures for scienceBuilding data infrastructures for science
Building data infrastructures for scienceVince Smith
 
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...LIBER Europe
 
Getting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessGetting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessAbby Clobridge
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2Alex Hardisty
 
Access methods for analysing sensitive data (amased)
Access methods for analysing sensitive data (amased)Access methods for analysing sensitive data (amased)
Access methods for analysing sensitive data (amased)Jisc
 
Current and emerging scientific data curation practices
Current and emerging scientific data curation practicesCurrent and emerging scientific data curation practices
Current and emerging scientific data curation practicesMichael Day
 
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.Carole Goble
 
eROSA Policy WS2: Second Stakeholder Workshop
eROSA Policy WS2: Second Stakeholder WorkshopeROSA Policy WS2: Second Stakeholder Workshop
eROSA Policy WS2: Second Stakeholder Workshope-ROSA
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 
The role of libraries and information professionals during the Big Data Era/ ...
The role of libraries and information professionals during the Big Data Era/ ...The role of libraries and information professionals during the Big Data Era/ ...
The role of libraries and information professionals during the Big Data Era/ ...African Open Science Platform
 
Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...Vince Smith
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional RepositoriesRobin Rice
 
ICDMWorkshopProposal.doc
ICDMWorkshopProposal.docICDMWorkshopProposal.doc
ICDMWorkshopProposal.docbutest
 
Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Susanna-Assunta Sansone
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...Alejandra Gonzalez-Beltran
 
Australia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityAustralia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityTERN Australia
 

Ähnlich wie The biodiversity informatics landscape: a systematics perspective (20)

Delivering biodiversity knowledge in the information age
Delivering biodiversity knowledge in the information ageDelivering biodiversity knowledge in the information age
Delivering biodiversity knowledge in the information age
 
Vince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notext
 
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
 
Building data infrastructures for science
Building data infrastructures for scienceBuilding data infrastructures for science
Building data infrastructures for science
 
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
 
Getting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessGetting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open Access
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2
 
Access methods for analysing sensitive data (amased)
Access methods for analysing sensitive data (amased)Access methods for analysing sensitive data (amased)
Access methods for analysing sensitive data (amased)
 
Current and emerging scientific data curation practices
Current and emerging scientific data curation practicesCurrent and emerging scientific data curation practices
Current and emerging scientific data curation practices
 
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
 
Open Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon HodsonOpen Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon Hodson
 
eROSA Policy WS2: Second Stakeholder Workshop
eROSA Policy WS2: Second Stakeholder WorkshopeROSA Policy WS2: Second Stakeholder Workshop
eROSA Policy WS2: Second Stakeholder Workshop
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
The role of libraries and information professionals during the Big Data Era/ ...
The role of libraries and information professionals during the Big Data Era/ ...The role of libraries and information professionals during the Big Data Era/ ...
The role of libraries and information professionals during the Big Data Era/ ...
 
Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional Repositories
 
ICDMWorkshopProposal.doc
ICDMWorkshopProposal.docICDMWorkshopProposal.doc
ICDMWorkshopProposal.doc
 
Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...
 
Australia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityAustralia's Environmental Predictive Capability
Australia's Environmental Predictive Capability
 

The biodiversity informatics landscape: a systematics perspective

  • 1. The biodiversity informatics landscape: a systematics perspective Vince Smith Biodiversity Informatics Horizons Rome, 3-6 Sept 2013
  • 2. Overview 1. Background – the biodiversity informatics domain • The problem (i.e. why are we here) • Representations of the domain (data, infrastructures, projects…) • Toward an integrated view (strategy) 2. Social challenges • Openness • Collaboration and communities • Standards, identifiers & protocols 3. (Big) data challenges • Mobilizing existing data (metadata, literature, collections) • New forms of data ([meta]genomics & observatories) 4. Synthetic challenges • Data Aggregation & linking • Visualisation • Modeling 5. Next steps (data infrastructures & funding) • Lessons learned: new informatics opportunities in H2020
  • 4. The problem – integrating biodiversity research How do we join up these activities? What infrastructures do we need? (technologies, tools, standards…) What processes do we need? (Modelling, workflows…) What data do we need? (Genes, localities…) How do we use this as a tool? Species conservation & protected areas Impacts of human development Biodiversity & human health Impacts of climate change Food, farming & biofuels Invasive alien species
  • 5. Natural History – the foundation Darwin’s “tangled bank”… "It is interesting to contemplate a tangled bank, clothed with many plants of many kinds, …, so different from each other, and dependent upon each other in so complex a manner, have all been produced by laws acting around us.” C. Darwin "On the Origin of Species”, 1859 Systematics, a foundational “law”
  • 7. A granular understanding of biodiversity [Figure: levels of biological organisation (Genes, Individuals, Populations, Species, Interactions), with labels GenBank, Local populations, Global biodiversity and Biological networks]
  • 8. An informatician's view of biodiversity GenBank MorphBank Interactions Geospatial Census Genotype Phenotype Biotic Interactions Environment Human Effects IUCN Pop. data Niche & Pop. Ecology TreeBase Biodiversity Loss GBIF Phylogenetic Trees IPNI, Zoobank Taxonomy AquaMaps Geographic Distributions Extent of Occurrence Range Maps Conservation & management AquaMaps Forecasts of Change Data Products Systems Key problems • Landscape is complex, fragmented & hard to navigate • Many audiences (policy makers, scientists, amateurs, citizen scientists) • Many scales (global solutions to local problems) Figure adapted from Peterson et al. 2010
  • 9. A project-centric view of biodiversity Scan / Mark-up PLAZI Inotaxa BHL eFloras CDM GNA (NameBank) Phylogenetic Tree of Life TreeBase CIPRES Descriptive / classification EoL Scratchpads CATE MorphoBank Wikipedia Molecular Databases NCBI/EMBL/DDBJ CBoL Barcode of Life Initiative Bibliographic IPNI Google Scholar Connotea ViTaL ISI Institutional EMu (=MOA) Recorder uBio TDWG Checklists Identification Key2Nature IdentifyLife Inter-Institutional Synthesis BCI BioCASE GeoCASE MaNIS PESI: ERMS Fauna Europaea Euro+Med PlantBase ORBIS WoRMS Flora Europaea Nomenclators Index Fungorum ZooBank IPNI (Kew/AUS/Harvard) ING AFD/APC/APUI NZOR CoL (Sp2000 & ITIS) ZooRecord LifeWatch GBIF Biodiversity ALA CONABIO CRIA (Brazil) IUCN SEEK OPAL DAISIE iNaturalist A snapshot from 2009, “the dance of the initiatives”
  • 10. The strategic view: community informatics challenges GBIF GBIC Report (Coming soon) EU Biodiversity Strategy (2011) Biodiv. Inf. Challenges (2013) Grand Challenges for Biodiversity Informatics (integrating activities for H2020)
  • 11. 2. Social challenges - Openness - Collaboration and communities - Standards, identifiers & links
  • 12. Openness in biodiversity informatics “A piece of data or content is open if anyone is free to use, reuse, and redistribute it subject, at most, to the requirement to attribute and/or share-alike.” http://opendefinition.org/ • Sharing data is a foundation for our activities • Normal practice in some communities (molecular) • Mandated by some funders & governments Many kinds of openness: • Open Access • Open Data • Open Science • Open Source E. Archambault et al., Proportion of Open Access Peer-Reviewed Papers at the European and World Levels, 2004-2011, June 2013, Science-Metrix Inc. “One-half of all papers are now freely available within a year or two of publication”
  • 13. Openness in biodiversity informatics “A piece of data or content is open if anyone is free to use, reuse, and redistribute it subject, at most, to the requirement to attribute and/or share-alike.” http://opendefinition.org/ • Sharing data is a foundation for our activities • Normal practice in some communities (molecular) • Mandated by some funders & governments Many kinds of openness: • Open Access • Open Data • Open Science • Open Source Incentivise through credit via citation (e.g. BDJ) Need to continue to incentivise openness
  • 14. What are Scratchpads? (http://scratchpads.eu) Collaboration & communities Making taxonomy a team sport e.g., Scratchpad Virtual Research Communities Taxa Projects 544 Scratchpad Communities by 6,644 active registered users covering 91,631 taxa in 535,317 pages. Regions Societies In total more than 1,300,000 visitors 81 paper citations in 2012 Our infrastructures need to facilitate collaboration
  • 15. Standards, identifiers & protocols Facilitating data sharing across communities A foundation for integration Key requirements: • Need to be inclusive, practical & extensible • Readable by humans & machines • Widely used Good examples: • Darwin Core • CrossRef & DataCite DOIs • ORCID author identifiers Gaps / Problems • Reuse & persistence of identifiers • Vocabularies & ontologies (time consuming / little reward) Potential solutions • Build them into our credit systems • Show semantic reasoning potential (LOD & RDF demonstrators) Standards can’t be developed in isolation – they must be used
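As an illustration of what "readable by humans & machines" can mean for an identifier, the sketch below validates the check character of an ORCID iD, which uses the ISO 7064 MOD 11-2 algorithm. The function names are hypothetical and not part of the talk; the sample iD is ORCID's well-known test record.

```python
def orcid_check_digit(base_digits: str) -> str:
    """Compute the ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid: str) -> bool:
    """Return True if a hyphenated ORCID iD has 16 characters and a matching checksum."""
    compact = orcid.replace("-", "")
    body = compact[:15]
    if len(compact) != 16 or not (body.isascii() and body.isdigit()):
        return False
    return orcid_check_digit(body) == compact[15].upper()


print(is_valid_orcid("0000-0002-1825-0097"))  # True
print(is_valid_orcid("0000-0002-1825-0098"))  # False (wrong check character)
```

A checksum like this lets software catch mistyped identifiers before they are stored, which is one small, practical way to support the reuse and persistence of identifiers.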
  • 16. 3. (Big) data challenges - Mobilising existing data - New forms of data
  • 17. Mobilising existing data Collections, literature & metadata How can we quickly, efficiently and cost effectively mobilise biological data at scale? Collections • 1.5-3B specimens in collections worldwide • Fragmented efforts / heterogeneity of process • Needs ambition (NHM: 20M in 5 yrs.) & coord. Literature • >300M pages of biodiversity literature • BHL (41M pp.) an example of what can be done • Needs sustainability & article metadata NHM Digitisation BHL literature Metadata registries • Data about data (cheaper & scalable) • e.g. bibliographic data, dataset portals Informatics challenges • Storage & persistence • Automation & annotation • Incentives to digitise & fitness for use Bibliography of Life (RefFinder & RefBank)
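To give a feel for the scale behind these figures, here is a rough back-of-the-envelope calculation using only the numbers on the slide. The 250 working days per year is an assumption made for illustration, and the variable names are hypothetical.

```python
# Figures taken from the slide
NHM_TARGET_SPECIMENS = 20_000_000       # NHM ambition: 20M specimens
NHM_TARGET_YEARS = 5                    # ... in 5 years
WORLD_SPECIMENS = (1_500_000_000, 3_000_000_000)  # 1.5-3B specimens worldwide
BHL_PAGES = 41_000_000                  # pages already in BHL
LITERATURE_PAGES = 300_000_000          # lower bound: ">300M pages"

# Assumption (not from the slide): 250 working days per year
WORKING_DAYS_PER_YEAR = 250

nhm_per_year = NHM_TARGET_SPECIMENS / NHM_TARGET_YEARS
nhm_per_day = nhm_per_year / WORKING_DAYS_PER_YEAR

# Years needed to digitise all collections at the NHM target rate alone
years_low = WORLD_SPECIMENS[0] / nhm_per_year
years_high = WORLD_SPECIMENS[1] / nhm_per_year

# BHL coverage is an upper bound, since the literature total is "more than" 300M pages
bhl_share = BHL_PAGES / LITERATURE_PAGES

print(f"NHM target rate: {nhm_per_year:,.0f} specimens/year, {nhm_per_day:,.0f} per working day")
print(f"World collections at that rate: {years_low:,.0f} to {years_high:,.0f} years")
print(f"BHL share of literature: at most {bhl_share:.1%}")
```

Under these assumptions a single institution working at the NHM target rate would need centuries to cover the world's collections, which is consistent with the slide's call for coordination.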
  • 18. Mobilising & managing new forms of data Metagenomics & ecological observatories These new data types do not depend on traditional taxonomy & systematics New Molecular approaches • Molecular detection & monitoring of organisms is routine • Metagenomics (env. sequencing) commonplace • Becoming the 1° route to understanding biodiversity 3-4 June 2013, NHM Ecological observatories • Automated biodiversity detection • Remote sensing (e.g. satellite & acoustic data, drones, camera traps) • Monitoring conspicuous, rare or invasive spp. (algal blooms, palms) • Monitoring human activity Informatics challenges • Very large quantities of data (2.5-10TB per researcher per yr.) • Doesn’t map well to existing data infrastructures • Challenge current networking & storage capacity • Digital and physical collections become equally important? 22 July, 2013
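The storage figure above (2.5-10TB per researcher per year) can be turned into a simple planning estimate. This is a minimal sketch: the function name and the example group of 50 researchers over 5 years are hypothetical, and it assumes data accumulates linearly with nothing deleted.

```python
def projected_storage_tb(researchers: int, years: int, tb_per_researcher_year: float) -> float:
    """Cumulative storage in TB, assuming linear growth and no deletion."""
    return researchers * years * tb_per_researcher_year


# Range quoted on the slide
LOW_TB, HIGH_TB = 2.5, 10.0

# Hypothetical group: 50 researchers over 5 years
low = projected_storage_tb(50, 5, LOW_TB)
high = projected_storage_tb(50, 5, HIGH_TB)
print(f"Estimated storage need: {low:,.0f} to {high:,.0f} TB")
```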
  • 19. 4. Synthetic challenges - Data aggregation & linking - Visualisation - Modeling
  • 20. Aggregation & linking Portals bringing together distributed & diverse forms of data Giving consistent and comprehensive access to all biological data Several approaches, with different advantages • Tightly coupled to a few data sources (e.g. eMonocot, CDM) • Loosely coupled to many sources (e.g. BioNames, Wikipedia) • Hybrid forms (e.g. Canadensys, EOL, GBIF) Informatics challenges • Portals are hard to sustain • New methods of data discovery & access • Create new windows (views) on content • New data structures, new types of database Examples • eMonocot: selective & accurate but hard to scale (276k taxa, 8k images, 13 keys & 3 phylogenies) • BioNames: scalable but less accurate (3M taxon names, 93k phylogenies & 28k articles)
  • 21. Visualisation Visually synthesizing large, linked biodiversity datasets Making biodiversity data accessible & understandable Research opportunities • Tools integration (e.g. GeoCat, CartoDB) • Span multiple audiences Outreach opportunities • Visually compelling story telling • Crowdsourcing tools (e.g. Notes From Nature) Exploiting new technologies • Touch screens • Mobile • Location awareness Informatics challenges • Very specific to individual use cases • Sustainability issues NHM specimen records http://data.nhm.ac.uk/globe/
  • 22. Modeling the biosphere: a (the) 30 year goal? Reasoning across large, linked biodiversity datasets A clear, singular, long-term vision, which biodiversity data can contribute to Conceptually has many potential uses • Identifying trends • Explaining patterns • Making predictions • Real time alerts - when data contradicts current knowledge • The ultimate policy tool Major informatics challenges • Technically very difficult (many years off) • Needs effective prototypes & platforms • Some first steps e.g. OBOE, LEFT Nature 2013, doi:10.1038/493295a
  • 24. Lessons learned: new opportunities in H2020 PATHWAYS TO INTEGRATION (by addressing these social, data & synthetic challenges) • Break out of the discipline, technical & project centric activities (it is unsustainable, inefficient & bad for science) • Integrate & build on existing programmes where possible (LifeWatch is a potential umbrella for these activities) • Bridge the disconnect between informaticians & users (make the users informaticians & informaticians users) • Our products are well suited to address these challenges • Use H2020 as a mechanism to achieve integration How do we join up these activities?
  • 26. Possible biodiversity informatics design principles* = experience from 7-years with the Scratchpads = lessons for infrastructures in H2020? 1. Start with needs - focus on real user needs (not just the ‘official process’) 2. Do less - if someone else is doing it, link to it or use it 3. Design with data - prototype and test with real users on the live website 4. Do the hard work to make it simple - let the computer take the strain 5. Iterate. Then iterate again. - iteration reduces risk & is more sustainable 6. Build for inclusion – it’s easier in the long run 7. Understand context - we are designing for people, not a screen or a brand 8. Build digital services, not websites - there is life beyond the website 9. Be consistent, not uniform - every circumstance is different 10. Make things open: it makes things better - it’s more sustainable *https://www.gov.uk/designprinciples
  • 27. Mobilising existing data: how to prioritise CONTENT FUN LEARNING OUTREACH Digitise a few things & invest in depth, description & promotion A LITTLE A LOT Digitise lots of things, put little effort into description & promotion AGGREGATION COLLECTIONS MANAGEMENT METADATA DATA MINING RESEARCH Nick Poole, UK Collections Trust
  • 28. Collaboration & communities Making taxonomy a team sport Average dates when increasing numbers of taxonomists were involved in describing species CONE SNAILS BIRDS MAMMALS AMPHIBIANS SPIDERS PLANTS Joppa et al., 2011 • Very few recent single author papers • Most (fundable) science is cross-disciplinary • Need to incentivise data curation & annotation • Need mechanisms to share annotations Our infrastructures need to facilitate collaboration