Poster presented at AGU 2012
My web page: http://www.linkedin.com/in/ericstephan
My citations: http://scholar.google.com/citations?hl=en&user=f4bH2esAAAAJ
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Enabling Linked Science in Uncertainty Quantification Research
1. Enabling Linked Science in Global Climate
Uncertainty Quantification (UQ) Research
Eric Stephan, Todd Elsethagen
Linked Science Problem Provenance Environment (ProvEn) Services
Uncertainty quantification (UQ) studies are often difficult to share between
Portable Java-based lightweight restful ETL knowledge pipeline Climate Science for a
collaborating scientists because they consist of many different interrelated
simulation results, analytical reports, and ancillary data describing scientific Sustainable Energy Future
rationale and calibrations used for setting parameters. The Climate Science for a Sustainable Energy
Study NetCDF Future (CSSEF) is a collaborative project
Scientists producing the data need linked science to provide a consistent
means to correlate study data to background knowledge describing how the
Plan Headers Citation among Oak Ridge National Laboratory,
study was conducted. Argonne National Laboratory, Brookhaven
Contributed
Analysis Simulation Raw Native
National Laboratory, Lawrence Berkeley
Collaborating scientists using the UQ studies need this knowledge and National Laboratory, Lawrence Livermore
references to data provided in a consistent and cross referenced form. Scripts Log files Provenance
National Laboratory, Pacific Northwest National
Ancillary
Laboratory, and Sandia National Laboratories,
Info
together with the National Center for
Atmospheric Research to transform the climate
What is Data Provenance? model development and testing process and
Data provenance is historical information describing the people, institutions, thereby accelerate the development of the
software, and activities, responsible for creating or modifying data.
Community Earth System Model's sixth-
generation version, CESM3, scheduled to be
released for predictive simulation in the 5 to 10
Extract: ProvEn extracts
When is Data Provenance Generated? provenance from native year time frame.
Historical information directly using data provenance vocabularies such as W3C Extract sources and load raw
PROV-O is atypical. A better alternative is to extract from information produced native sources For more information about ProvEn Services
by scientists conducting the study, simulation log files, workflow logs, ancillary
ProvEn Services Pipeline
materials, or scripts, spreadsheets, and pictures used for analysis. We call you see here, please contact:
extracted historical information native provenance. When integrated
together these sources can provide a composite story of files in the data Eric Stephan
set origin to collaborators. Pacific Northwest National Laboratory
Transform: ProvEn maps (509) 375-6977
extracted provenance to Eric.stephan@pnl.gov
Transform scientific registered
domain ontology
Study NetCDF Native
Plan Headers Citation Provenance
Analysis Simulation
Log files Load: For cross-referencing
Scripts ProvEn aligns transformed
Load provenance with registered
Ancillary Write
foundational ontologies W3C
PROV-O, Dublin Core, and
Info Execute
FOAF.
Re
ad
Search
Resulting Triplestore Browse
Composite History of the
Scientists conducting UQ Study UQ Study
generate native provenance
URI links to files URI links to original Collaborating scientists making
in UQ Study Native provenance Inquiries about UQ Study