The presentation provides an overview of Advanced Ecological Knowledge and Observation System and SHaRED services by the TERN Eco-informatics to publish plot-based ecological data. The presentation was part of the Workshop on Approaches to Terrestrial Ecosystem Data Management : from collection to synthesis and beyond which was held on 9th of March 2016 in University of Queensland.
2. Data Publishing Services
Datasets
Integrated &
Federated Site-level
Data
Self-service
Dataset
Submission
Assisted Data
Submission
PortalRepository
Services
RIF-CS
EML
SPARQL
OWL
Site Maps
Dataset Download
Site Maps
Observation
Diagram
MS Access
Integrated Site
Data Download
Data In Data Management Data Out Data UsedWho
Researchers,
Governments,
Supersites,
LTERN, ATN
Universities,
Government
Databases
AusPlots. CSIRO,
AusCover
Individual data creators
Organisational data creators
Research
efficiencies
3. Products
Metadata
Data Update
ÆKOS data portal
and repositorySHaRED data
submission tool
Soils-to-Satellites
visualisation tool
Data update tool
Metadata
catalogues
Australian
Ecological
Knowledge &
Observation
System
4. ÆKOS’s Niche
N
sites, surveys, plotscomplex well-described integrated
ÆKOS Data
Primary
ecology
ÆKOS’s Niche
Key Achievement:
AEKOS is recognised globally as the first product to successfully
integrate ecological data at the plot level and deliver this with all
relevant knowledge through an open access data portal
6. Barriers to reuse
Identify
problem
Draft
approach
Search for
data
Acquire data
Assess
suitability
Modify
approach
Prepare data
Conduct
analysis
Interpret
results
Dispersed:
Data is stored in many
storage locations and formats
Source:Forestcheck:
www.dec.wa.gov.au
Complex:
Data usually needs
explanation and context before
it can be accurately used
www.nswrail.net
Diverse and fragmented:
Ecological data covers a wide range of topics and there
are many different ways of measuring, observing and
expressing different concepts
* Rapidly evolving with few measurement standards
19. ÆKOS in the data landscape
LevelofDescription
DataOne
Nature
ALA
(species
data)
No data
ANDS-
RDA,
TDDP
Vegbank
Pangaea
Other
Atlases
ÆKOS
Researcher
Datasets
(SHaRED)
ÆKOS
Integrated
Site Data
Fully Integration
Level of data integration
21. Application - Global plant community trait patterns
Who
• German Centre for Integrative Biodiversity Research (sPlot Working Group)
Analyses
• Global vegetation database (sPlot) with global TRY plant trait database
Fig. 4: Community-weighted means of specific-leaf area (SLA) in m²/kg (ln-transformed) averaged across
all plots within a grid cell (values ranging from 2.2 to 38.1 m²/kg). SLA is highest in the Temperate
midlatitudes from where it decreases towards the poles, towards the interiors of the contents and
towards the subtropics and tropics (based on sPlot 1.0 and a fully gap-filled version of TRY 2.0).
22. Thank you
Email: anita.smyth@adelaide.edu.au (Data Relationships)
craig.walker@adelaide.edu.au (Director)
Website: www.ecoinformatics.org.au
Portal: www.aekos.org.au
Data submission (DOI): www.shared.org.au
Twitter: @tern_aekos
plotcomplex well-described integrated
ÆKOS Data
Primary
ecology
23. This presentation can be:
• Distributed, remixed, and built upon, even
commercially
• Photographed, filmed or broadcasted
• Blogged, live-posted or videoed
Provided:
You attribute the work to its author and respect the
rights and licenses associated with its components.
Hinweis der Redaktion
Australian is now Advanced - Not limited to Australia or even ecology
The steps in self-service data publication involve researchers making deposits online to a repository and these are accessible via an online data portal.
Data publication for organisations is more technical due to the volume of data, its structure and complexity.
A data analyst is at the coal face assisting data managers to deposit their data so that it’s integrated and comparable across different research topics and methods.
In response to researchers wanting greater data access efficiencies and iconic research profiles, more portals I predict will shift to integrated site data in the future.
So that’s what’s broadly involved but let’s look at TERN’s SHaRED as an example to see the broad steps in data publishing.
Key message:
We built the Australian Ecological Knowledge and Observation System to support the urgent needs of the ecological science community.
Two core products, one developing and 2 core e-services.
Niche:
Primary, complex, well-described, integrated, ecological survey data – the baseline data fundamental to ecologists undertaking their profession to create new science, knowledge, policy and management of Australia’s nature and enviroments.
Lots of pubs discussing the challenges and benefits of data publication and sharing
Key Points:
Data is dispersed meaning that discovery involves having to in search multiple places
Because there are multiple custodians, multiple approaches to owners are needed and there is often ambiguity around licensing and conditions of use
Data is complex meaning that it needs explanation and context to be understood.
Nevertheless, data is often poorly described making it hard to interpret – leading to the possibility of inappropriate use
Ecological data is diverse and fragmented meaning it covers a broad range of topics. Things can be observed and measured in different ways using different models
There are however a lack of standards for methods around the way data is stored and represented
Key Points:
Aekos literally makes new research possible both by exposing data that was previously inaccessible, and also by lowering the barriers to reuse by condensing literally months work down to days by focussing on streamlining many of the more time consuming parts of the process
AEKOS has a number of unique features like extensive contextual information aimed at giving the user a good level of understanding of the nuances of the data and the ability to search the actual data at a site level
AEKOS also integrates the data where possible saving the user a lot of time trying to massage the data into a consistent format for analysis
Many online systems allow searching for data and some allow direct access BUT AEKOS does this in a way that is optimized for the target user community – which we determined by soliciting community feedback
Need to think of information as comprising several components
Data will always be important but we need to improve management of supporting details
Accessed via pickers currently
Can’t us query languages as our users cant speak them…
Initial result summaries help user determine which records are likely to be useful
Important if you have a lot of results to review
Exploring the data to help determine appropriateness
Understanding the goals of the project, methods used, sampling design etc
Conditions of use
Because there is so much variation in method, need to provided detailed information on what was done
Shows detailed method info divided under standard headings
Info on any classification systems used, measurement standards
+ Diagrams etc to aid understanding
The idea is that enough info should be conveyed for a new researcher to go out and confidently reproduce the method in the field
ÆKOS applies a flexible Knowledge Representation approach to dealing with observation data. We consider our problem as publishing a set of books discussing overlapping topics, written by different authors in different languages:
Our job is to translate a set of disparate books into a common language and style to make a “monographic series”.
For those that want to get into the actual data
The system has a rather unique way of representing data as models of reality rather than tables with rows and columns of numbers
Because different surveys observe and measure different aspect of the environment, this approach allows the graph layout to represent any sampling configuration and can be extended to easily incorporate new attributes as necessary
Integration is carried through to download
Key message:
ÆKOS spatial coverage is extensive although there are obvious gaps in the NT, WA, Victoria and Tasmania.
Tasmanian holdings will improve at the end of the month with a platypus dataset.
Victorian fire data and NT vegetation database which are promised will fill some gaps.
Key message:
ÆKOS offers the full richness of ecological data to meets user demands for ‘researcher standard’ data.
Equally ALA and ANDS offer less complexity because the public aren’t seeking research details and ANDS is a cataloguer of all data repositories in Australia – totally different purpose to other platforms.
Key message:
ALA, other Atlases and ÆKOS are leaders in integrated data.
The difference is ÆKOS supplies ‘research standard’ site data – the traditional unit of collecting ecological data.
ALA supplies integrated species data ALA which is excellent for assessing species list, species distributions and taxonomies
ÆKOS provides offers well-described ecological ‘plot’ data to support researcher needs.
ALA provides minimal description as there focus is species atlas data to meet citizen science needs.