SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
Analysis of the
Quality Metadata in
GEOSS Clearinghouse
QUAlity aware VIsualisation for
the Global Earth Observation
system of systems
SEVILLANO Eva1, DÍAZ Paula2, NINYEROLA Miquel1,
MASÓ Joan2 , ZABALA Alaitz1, PONS Xavier1
1 UAB Universitat Autònoma de Barcelona.
2 CREAF Centre for Ecological Research and Forestry Applications.
Objectives
• To get a first analysis of the data quality in the
Clearinghouse
• Analyze the quality contained in the metadata (ISO 19115)
– Quality indicators
www.geoviqua.org
– Quality indicators
– Lineage
– Usage
• Start building components for the GEO Portal
– Quality Broker
– Quality searcher
– Quality visualization
Methodology
97203 XML
documents
CSW
GEOSS
Clearinghouse
www.geoviqua.org
• Harvest all XML documents,
ISO 19115. (October 2011)
Methodology
97203 XML
documents
Database
GestBD
Xpath
extraction
CSW
GEOSS
Clearinghouse
www.geoviqua.org
• Massive extraction of MD
quality elements
– Quality indicators
– Lineage
– Usage
Overall Results
• Total metadata records in the
Clearinghouse
– 97203
• Total number of quality
indicators
– 52187
www.geoviqua.org
– 52187
• Metadata records with quality
indicators
– 19107
• Metadata records with lineage
– 10899 (9261 process, 3771
source)
• Metadata with usage
– 1226
Quality Scope
• 19.66% Metadata records with quality indicators
– 2.7 quality indicator per metadata record
www.geoviqua.org
1. Quality indicators
• 19.66% Metadata records with quality
www.geoviqua.org
Quality indicators
• 19.66% Metadata records with quality
– 2.7 QI/MD
www.geoviqua.org
Quality indicators
www.geoviqua.org
Quality Indicators in IDEC Metadata
Quality indicators – Comparison
Clearinghouse - IDEC
www.geoviqua.org
Positional
Accuracy
95.38%
Thematic
Accuracy
2.60%
Completeness
0.46%
Temporal
Accuracy
0.06%
Logical
Consistency
0.02%
Quality Indicators in IDEC Metadata
GEOSS
IDEC
Quality indicator result
85.8%
(22275 QI)
14.18%
(3669 QI
mainly conformance to INSPIRE)
www.geoviqua.org
0.02%
(5 QI)
19115-2 Extension
for "per pixel"
quality
Quality indicator result
www.geoviqua.org
Quality indicators - Quantitative
10000
12000
14000
16000
18000
Numberofqualityelements
Quality elements - Quantitative measures
www.geoviqua.org
0
2000
4000
6000
8000
10000
Numberofqualityelements
Complete value
Declarevalue
Quantitativetype
Quality indicators - Qualitative
600
800
1000
1200
1400
Numberofqualityelements
Quality elements - Conformance measures
www.geoviqua.org
0
200
400
600
Numberofqualityelements
Conformance to specification
Declare conformance
Conformance type
Coverage result
(ISO19115-2 extension)
• Clearinghouse record ID: 273234, 273232, 273233, 273235, 273236)
• Only 5 records use this. Bad news for visualizing data + quality maps
• Title: OMNO2e:OMI Column Amount NO2:ColumnAmountNO2CS30
<gmd:DQ_QuantitativeAttributeAccuracy>
<gmd:measureDescription>
<gco:CharacterString>The 'version 003' product is the second public release. It is based on improved
radiance calibration. For details, please see document:
www.geoviqua.org
radiance calibration. For details, please see document:
http://disc.sci.gsfc.nasa.gov/Aura/OMI/OMTO3e_v003.shtml</gco:CharacterString>
</gmd:measureDescription>
<gmd:result><gmi:QE_CoverageResult>
<gmi:spatialRepresentationType><gmd:MD_SpatialRepresentationTypeCode
codeList="./resources/codeList.xml#MD_SpatialRepresentationTypeCode"
codeListValue="grid">grid</gmd:MD_SpatialRepresentationTypeCode></gmi:spatialRepresentationType>
<gmi:resultFile gco:nilReason="missing" />
<gmi:resultFormat>
<gmd:MD_Format>
<gmd:name><gco:CharacterString>CF-netCDF</gco:CharacterString></gmd:name>
</gmd:MD_Format>
</gmi:resultFormat>
</gmi:QE_CoverageResult></gmd:result>
</gmd:DQ_QuantitativeAttributeAccuracy>
2. Lineage
www.geoviqua.org
2. Lineage
www.geoviqua.org
2. Lineage
www.geoviqua.org
LI_ProcessStep with LI_Source
Example
Clearinghouse record ID 131007 (simplified)
• Compile survey input data from the best and most current survey records.
– BLM database of the index to all official (microfilm, CD, other) BLM survey records.
– USFS survey records.
– Private land surveyor records
– GCDB Data Collection Attribute Definitions Version 2.0, Appendix A, 2/14/1991. Survey records
used - source abbreviations.
• Compile listings of known locations of PLSS corners.
– USGS topographic quadrangles and other sources.
– USC&amp;GS published coordinate data.
– NGS published coordinate data.
– BLM global positioning Data.
– USFS global positioning data.
• Coordinates of control stations are entered into a control data base with associated
reliabilities.
• Topologically correct GIS coverages are modified to use FGDC compliant naming
www.geoviqua.org
• Topologically correct GIS coverages are modified to use FGDC compliant naming
conventions and then loaded into the LSI database. These layers can then be
downloaded as shapefiles through the LSI website.
• GCDB Data was downloaded for Kiowa and Cheyenne Counties, Colorado.
– C:fgis_datasandzippedkiowatwnshp.shp.xml
• Metadata imported and data was exported from regions format to shapefile format
• Dataset copied.
– C:fgis_datasanddatabasedataplssck_gcdb_region_township
• Source Contribution: Survey data in the form of official (microfilm, CD, other) survey
and BLM, abstracted into a vector digital format.online
• Source Contribution: Survey and control data from the Cartographic Feature File
(CFF) data set.disc
• Source Contribution: Digitized control data from standard topological quadrangle
sheets.disc
LI_Lineage: LI_Source
• 6.02% metadata records (5851)
contain direct list of the data
sources.
– 1.85% (1798) with temporal extent
class LI_Source_only
LI_Lineage
+ statement :CharacterString [0..1]
+ scope :DQ_Scope [0..*]
constraints
{"source" role is mandatory if LI_Lineage.statement
and "processStep" role are not documented}
Metadata Information::MD_Metadata
+resourceLineage
0..*
www.geoviqua.org
• Gives credit (attribution, and
eventually some trust on them)
• If quality indicators are not
provided for the dataset, the
quality indicators from sources
can be a clue.
LI_Source
+ description :CharacterString [0..1]
+ sourceSpatialResolution :MD_Resolution [0..1]
+ sourceReferenceSystem :MD_ReferenceSystem [0..1]
+ sourceCitation :CI_Citation [0..1]
+ sourceMetadata :CI_Citation [0..*]
+ scope :DQ_Scope [0..*]
constraints
{"description" is mandatory if "scope" is not documented}
{"scope" is mandatory if "description" is not documented}
and "processStep" role are not documented}
{"processStep" role is mandatory if
LI_Lineage.statement and "source" role are not
documented}
+source 0..*
LI_Lineage: LI_ProcessStep
• 8.26% metadata records
(8035) contain the direct
list of the processes
without sources
– 292 (0.30%) contain date
class From_LI_ProcessStep_to_LI_Source
LI_Lineage
+ statement :CharacterString [0..1]
+ scope :DQ_Scope [0..*]
constraints
Metadata Information::MD_Metadata
+resourceLineage0..*
www.geoviqua.org
• With the order of these
processes.
• If quality indicators are not
provided for the dataset, it’s
difficult to infer resource
quality with only a process
list
LI_ProcessStep
+ description :CharacterString
+ rationale :CharacterString [0..1]
+ stepDateTime :TM_Primitive [0..*]
+ processor :CI_ResponsiblePartyInfo [0..*]
+ reference :CI_Citation [0..*]
+ scope :DQ_Scope [0..*]
constraints
{"source" role is mandatory if LI_Lineage.statement
and "processStep" role are not documented}
{"processStep" role is mandatory if
LI_Lineage.statement and "source" role are not
documented}
+processStep 0..*
Complete Provenance:
MD_ProcessStep with MD_Source
• 1.26% metadata
records (1226 ) with
more complete
provenance process .
• How and when the data
sources where used
class From_LI_ProcessStep_to_LI_Source
LI_Lineage
+ statement :CharacterString [0..1]
+ scope :DQ_Scope [0..*]
constraints
{"source" role is mandatory if LI_Lineage.statement
and "processStep" role are not documented}
Metadata Information::MD_Metadata
+resourceLineage0..*
www.geoviqua.org
sources where used
• If quality indicators are
not provided for the
dataset, we can
deduce which sources
have more influence in
the quality of the final
result
LI_Source
+ description :CharacterString [0..1]
+ sourceSpatialResolution :MD_Resolution [0..1]
+ sourceReferenceSystem :MD_ReferenceSystem [0..1]
+ sourceCitation :CI_Citation [0..1]
+ sourceMetadata :CI_Citation [0..*]
+ scope :DQ_Scope [0..*]
constraints
{"description" is mandatory if "scope" is not documented}
{"scope" is mandatory if "description" is not documented}
LI_ProcessStep
+ description :CharacterString
+ rationale :CharacterString [0..1]
+ stepDateTime :TM_Primitive [0..*]
+ processor :CI_ResponsiblePartyInfo [0..*]
+ reference :CI_Citation [0..*]
+ scope :DQ_Scope [0..*]
and "processStep" role are not documented}
{"processStep" role is mandatory if
LI_Lineage.statement and "source" role are not
documented}
+processStep 0..*
+source
0..*
Complete provenance in ISO19115-2
• LI_ProcessStep includes a
LE_Processing that has a
runTimeParameters attribute that
allows us describing the exact list of
parameters used in the execution.
• There is a citation of the algorithm
used (LI_Algorithm).
class From_LE_ProcessStep_to_LE_Source
LI_Source
+ description :CharacterString [0..1]
+ sourceSpatialResolution :MD_Resolution [0..1]
+ sourceReferenceSystem :MD_ReferenceSystem [0..1]
+ sourceCitation :CI_Citation [0..1]
+ sourceMetadata :CI_Citation [0..*]
+ scope :DQ_Scope [0..*]
constraints
{"description" is mandatory if "scope" is not documented}
{"scope" is mandatory if "description" is not documented}
LI_ProcessStep
+ description :CharacterString
+ rationale :CharacterString [0..1]
+ stepDateTime :TM_Primitive [0..*]
+ processor :CI_ResponsiblePartyInfo [0..*]
+ reference :CI_Citation [0..*]
+ scope :DQ_Scope [0..*]
LI_Lineage
+ statement :CharacterString [0..1]
+ scope :DQ_Scope [0..*]
constraints
{"source" role is mandatory if LI_Lineage.statement
and "processStep" role are not documented}
{"processStep" role is mandatory if
LI_Lineage.statement and "source" role are not
documented}
Metadata Information::MD_Metadata
+processStep 0..*
+resourceLineage
0..*
www.geoviqua.org
used (LI_Algorithm).
• All these extensions were done for
the benefit of the EO gridded data,
but there are not in the
Clearinghouse.
• We can completely evaluate the
quality of the resulting product if we
know the uncertainties that sources
have in their metadata
(sourceMetadata citation in
LI_Source).
From ISO 19115-2:2009 shown for informative purposes only
Data quality information -
Imagery::
LE_ProcessStep
Data quality information - Imagery::
LE_ProcessStepReport
+ name :CharacterString
+ description :CharacterString [0..1]
+ fileType :CharacterString [0..1]
If "LE_NominalResolution.scanningResolution" is used
then "LE_Source.scaleDenominator" is required
Data quality information - Imagery::
LE_Source
+ processedLevel :MD_Identifier [0..1]
+ resolution :LE_NominalResolution [0..1]
Data quality information - Imagery::
LE_Processing
+ identifier :MD_Identifier
+ softwareReference :CI_Citation [0..1]
+ procedureDescription :CharacterString [0..1]
+ documentation :CI_Citation [0..*]
+ runTimeParameters :CharacterString [0..1]
Data quality information -
Imagery::LE_Algorithm
+ citation :CI_Citation
+ description :CharacterString
«Union»
Data quality information - Imagery::
LE_NominalResolution
+ scanningResolution :Distance
+ groundResolution :Distance
"description" is mandatory if
"sourceExtent" is not documented
"sourceExtent" is mandatory if
"description" is not documented
+report 0..*
+output
0..*
+processingInformation0..1
+algorithm 0..*
3. Usage - User feedback
www.geoviqua.org
• There is one small entry for user
feedback in the current ISO-19115:
• MD_Usage
– Brief description of ways in which the
resource is currently or has been used
• There are 1.2% (1133) entries
– SpecificUsage and
– UserContactInfo, only
• All made by the same institution!:
MD_Usage - User feedback
www.geoviqua.org
– Landesvermessung und Geobasisinformation
Brandenburg (LGB)
– Tel +49-331-8844-123, Fax. +49-331-8844-16123
– Heinrich-Mann-Allee 103, Potsdam, Brandenburg 14473,
Deutschland
– kundenservice@geobasis-bb.de
– http://www.geobasis-bb.de
Conclusions
• There are many different kinds of quality indicators
– There is a lack of a complete description of values provided (no units, missing
measure name, missing evaluation method)
• Quality coverage results (by pixel) are almost inexistent and the the link is
not there
• Lineage information is rich in many records, some with more that 100
entries in source or ProcessSteps
www.geoviqua.org
entries in source or ProcessSteps
• We have usage examples -> Feedback
• Current data is enough to demonstrate search and visualization with
some limitations. Good for GeoViQua.
• Next steps:
– Assess the Quality of Quality Metadata?
– Extend this analysis to other capacity catalogues integrated in the EuroGEOSS
Broker
Thank you!
Danke!
Grazie!
Merci!
Ευχαριστίες!
Diolch!
Bedankt!
Köszönöm!
Ačiū!
Благодарам! Спасибі!Ευχαριστίες!
Vďaka!
Tak!
Díky!
Tänan!
Kiitos!
Благодарам!
Dzięki!
Mulţumiri!
Хвала!
Tack!
Teşekkürler!
Спасибі!
Спасибо!
Obrigado!
Takk!
Gràcies!
Gracias!
QUAlity aware VIsualisation for
the Global Earth Observation
system of systems

Weitere ähnliche Inhalte

Ähnlich wie Analysis of quality metadata in the GEOSS Clearinghouse

Modularity and Domain Driven Design; a killer Combination? - Tom de Wolf & St...
Modularity and Domain Driven Design; a killer Combination? - Tom de Wolf & St...Modularity and Domain Driven Design; a killer Combination? - Tom de Wolf & St...
Modularity and Domain Driven Design; a killer Combination? - Tom de Wolf & St...NLJUG
 
Apache Carbondata: An Indexed Columnar File Format for Interactive Query with...
Apache Carbondata: An Indexed Columnar File Format for Interactive Query with...Apache Carbondata: An Indexed Columnar File Format for Interactive Query with...
Apache Carbondata: An Indexed Columnar File Format for Interactive Query with...Spark Summit
 
Machine Learning with Azure
Machine Learning with AzureMachine Learning with Azure
Machine Learning with AzureBarbara Fusinska
 
Scientific Data Cataloging Framework
Scientific Data Cataloging FrameworkScientific Data Cataloging Framework
Scientific Data Cataloging FrameworkSupun Nakandala
 
ArcGIS Add-in Development In Python and C# for LIDAR Product Quality Control
ArcGIS Add-in Development In Python and C# for LIDAR Product Quality ControlArcGIS Add-in Development In Python and C# for LIDAR Product Quality Control
ArcGIS Add-in Development In Python and C# for LIDAR Product Quality ControlDavid Eliseo Martinez Castellanos
 
How to Leverage Big Data to Deliver Smart Logistics
How to Leverage Big Data to Deliver Smart LogisticsHow to Leverage Big Data to Deliver Smart Logistics
How to Leverage Big Data to Deliver Smart LogisticsAlibaba Cloud
 
Implementing a VO archive for datacubes of galaxies
Implementing a VO archive for datacubes of galaxiesImplementing a VO archive for datacubes of galaxies
Implementing a VO archive for datacubes of galaxiesJose Enrique Ruiz
 
Apache CarbonData+Spark to realize data convergence and Unified high performa...
Apache CarbonData+Spark to realize data convergence and Unified high performa...Apache CarbonData+Spark to realize data convergence and Unified high performa...
Apache CarbonData+Spark to realize data convergence and Unified high performa...Tech Triveni
 
Optimizing Your Supply Chain with the Neo4j Graph
Optimizing Your Supply Chain with the Neo4j GraphOptimizing Your Supply Chain with the Neo4j Graph
Optimizing Your Supply Chain with the Neo4j GraphNeo4j
 
Can ISO 19157 support current NASA data quality metadata?
Can ISO 19157 support current NASA data quality metadata?Can ISO 19157 support current NASA data quality metadata?
Can ISO 19157 support current NASA data quality metadata?Ted Habermann
 
State of GeoServer 2012
State of GeoServer 2012State of GeoServer 2012
State of GeoServer 2012Jody Garnett
 
Cloudera Movies Data Science Project On Big Data
Cloudera Movies Data Science Project On Big DataCloudera Movies Data Science Project On Big Data
Cloudera Movies Data Science Project On Big DataAbhishek M Shivalingaiah
 
Predicting query performance and explaining results to assist Linked Data con...
Predicting query performance and explaining results to assist Linked Data con...Predicting query performance and explaining results to assist Linked Data con...
Predicting query performance and explaining results to assist Linked Data con...Rakebul Hasan
 
Pega Mock questions
Pega Mock questionsPega Mock questions
Pega Mock questionsAshock Roy
 
SQL Server Deep Dive, Denis Reznik
SQL Server Deep Dive, Denis ReznikSQL Server Deep Dive, Denis Reznik
SQL Server Deep Dive, Denis ReznikSigma Software
 
DSDT Meetup October 2017
DSDT Meetup October 2017DSDT Meetup October 2017
DSDT Meetup October 2017DSDT_MTL
 

Ähnlich wie Analysis of quality metadata in the GEOSS Clearinghouse (20)

Modularity and Domain Driven Design; a killer Combination? - Tom de Wolf & St...
Modularity and Domain Driven Design; a killer Combination? - Tom de Wolf & St...Modularity and Domain Driven Design; a killer Combination? - Tom de Wolf & St...
Modularity and Domain Driven Design; a killer Combination? - Tom de Wolf & St...
 
Apache Carbondata: An Indexed Columnar File Format for Interactive Query with...
Apache Carbondata: An Indexed Columnar File Format for Interactive Query with...Apache Carbondata: An Indexed Columnar File Format for Interactive Query with...
Apache Carbondata: An Indexed Columnar File Format for Interactive Query with...
 
Machine Learning with Azure
Machine Learning with AzureMachine Learning with Azure
Machine Learning with Azure
 
Scientific Data Cataloging Framework
Scientific Data Cataloging FrameworkScientific Data Cataloging Framework
Scientific Data Cataloging Framework
 
ArcGIS Add-in Development In Python and C# for LIDAR Product Quality Control
ArcGIS Add-in Development In Python and C# for LIDAR Product Quality ControlArcGIS Add-in Development In Python and C# for LIDAR Product Quality Control
ArcGIS Add-in Development In Python and C# for LIDAR Product Quality Control
 
How to Leverage Big Data to Deliver Smart Logistics
How to Leverage Big Data to Deliver Smart LogisticsHow to Leverage Big Data to Deliver Smart Logistics
How to Leverage Big Data to Deliver Smart Logistics
 
dc09ttp-2011-thesis
dc09ttp-2011-thesisdc09ttp-2011-thesis
dc09ttp-2011-thesis
 
Implementing a VO archive for datacubes of galaxies
Implementing a VO archive for datacubes of galaxiesImplementing a VO archive for datacubes of galaxies
Implementing a VO archive for datacubes of galaxies
 
Apache CarbonData+Spark to realize data convergence and Unified high performa...
Apache CarbonData+Spark to realize data convergence and Unified high performa...Apache CarbonData+Spark to realize data convergence and Unified high performa...
Apache CarbonData+Spark to realize data convergence and Unified high performa...
 
Optimizing Your Supply Chain with the Neo4j Graph
Optimizing Your Supply Chain with the Neo4j GraphOptimizing Your Supply Chain with the Neo4j Graph
Optimizing Your Supply Chain with the Neo4j Graph
 
Can ISO 19157 support current NASA data quality metadata?
Can ISO 19157 support current NASA data quality metadata?Can ISO 19157 support current NASA data quality metadata?
Can ISO 19157 support current NASA data quality metadata?
 
State of GeoServer 2012
State of GeoServer 2012State of GeoServer 2012
State of GeoServer 2012
 
Cloudera Movies Data Science Project On Big Data
Cloudera Movies Data Science Project On Big DataCloudera Movies Data Science Project On Big Data
Cloudera Movies Data Science Project On Big Data
 
Predicting query performance and explaining results to assist Linked Data con...
Predicting query performance and explaining results to assist Linked Data con...Predicting query performance and explaining results to assist Linked Data con...
Predicting query performance and explaining results to assist Linked Data con...
 
query_tuning.pdf
query_tuning.pdfquery_tuning.pdf
query_tuning.pdf
 
Pega Mock questions
Pega Mock questionsPega Mock questions
Pega Mock questions
 
SQL Server Deep Dive, Denis Reznik
SQL Server Deep Dive, Denis ReznikSQL Server Deep Dive, Denis Reznik
SQL Server Deep Dive, Denis Reznik
 
DSDT Meetup October 2017
DSDT Meetup October 2017DSDT Meetup October 2017
DSDT Meetup October 2017
 
Dsdt meetup oct24
Dsdt meetup oct24Dsdt meetup oct24
Dsdt meetup oct24
 
Dsdt meetup oct24
Dsdt meetup oct24Dsdt meetup oct24
Dsdt meetup oct24
 

Mehr von Paula Díaz

Workshop: «Trade-offs der Schweizer Energiewende» ETH Zürich 2017
Workshop: «Trade-offs der Schweizer Energiewende» ETH Zürich 2017Workshop: «Trade-offs der Schweizer Energiewende» ETH Zürich 2017
Workshop: «Trade-offs der Schweizer Energiewende» ETH Zürich 2017Paula Díaz
 
Poster in the 18th Swiss Global Change Day
Poster in the 18th Swiss Global Change DayPoster in the 18th Swiss Global Change Day
Poster in the 18th Swiss Global Change DayPaula Díaz
 
Q method conference 2016: Do stakeholders' perspectives pose a risk to energy...
Q method conference 2016: Do stakeholders' perspectives pose a risk to energy...Q method conference 2016: Do stakeholders' perspectives pose a risk to energy...
Q method conference 2016: Do stakeholders' perspectives pose a risk to energy...Paula Díaz
 
IDRIM Conference 2016
IDRIM Conference 2016IDRIM Conference 2016
IDRIM Conference 2016Paula Díaz
 
Modeling the energy future of Switzerland after the phase out of nuclear powe...
Modeling the energy future of Switzerland after the phase out of nuclear powe...Modeling the energy future of Switzerland after the phase out of nuclear powe...
Modeling the energy future of Switzerland after the phase out of nuclear powe...Paula Díaz
 
Exchanging the Status between Clients of Geospatial Web Services and GIS appl...
Exchanging the Status between Clients of Geospatial Web Services and GIS appl...Exchanging the Status between Clients of Geospatial Web Services and GIS appl...
Exchanging the Status between Clients of Geospatial Web Services and GIS appl...Paula Díaz
 
Análisis crítico de los metadatos distribuidos por la IDEC presentacion
Análisis crítico de los metadatos distribuidos por la IDEC presentacionAnálisis crítico de los metadatos distribuidos por la IDEC presentacion
Análisis crítico de los metadatos distribuidos por la IDEC presentacionPaula Díaz
 
Impact of user concurrency in commonly used OGC map server implementations
Impact of user concurrency in commonly used OGC map server implementationsImpact of user concurrency in commonly used OGC map server implementations
Impact of user concurrency in commonly used OGC map server implementationsPaula Díaz
 
Performance of standardized web map servers for remote sensing Imagery
Performance of standardized web map servers for remote sensing ImageryPerformance of standardized web map servers for remote sensing Imagery
Performance of standardized web map servers for remote sensing ImageryPaula Díaz
 
Analysis of quality metadata in the GEOSS Clearinghouse - Poster
Analysis of quality metadata in the GEOSS Clearinghouse - PosterAnalysis of quality metadata in the GEOSS Clearinghouse - Poster
Analysis of quality metadata in the GEOSS Clearinghouse - PosterPaula Díaz
 
The importance of geospatial data to calculate the optimal distribution of re...
The importance of geospatial data to calculate the optimal distribution of re...The importance of geospatial data to calculate the optimal distribution of re...
The importance of geospatial data to calculate the optimal distribution of re...Paula Díaz
 
Mapping the evolution of renewable resources and their relation with EROI and...
Mapping the evolution of renewable resources and their relation with EROI and...Mapping the evolution of renewable resources and their relation with EROI and...
Mapping the evolution of renewable resources and their relation with EROI and...Paula Díaz
 

Mehr von Paula Díaz (12)

Workshop: «Trade-offs der Schweizer Energiewende» ETH Zürich 2017
Workshop: «Trade-offs der Schweizer Energiewende» ETH Zürich 2017Workshop: «Trade-offs der Schweizer Energiewende» ETH Zürich 2017
Workshop: «Trade-offs der Schweizer Energiewende» ETH Zürich 2017
 
Poster in the 18th Swiss Global Change Day
Poster in the 18th Swiss Global Change DayPoster in the 18th Swiss Global Change Day
Poster in the 18th Swiss Global Change Day
 
Q method conference 2016: Do stakeholders' perspectives pose a risk to energy...
Q method conference 2016: Do stakeholders' perspectives pose a risk to energy...Q method conference 2016: Do stakeholders' perspectives pose a risk to energy...
Q method conference 2016: Do stakeholders' perspectives pose a risk to energy...
 
IDRIM Conference 2016
IDRIM Conference 2016IDRIM Conference 2016
IDRIM Conference 2016
 
Modeling the energy future of Switzerland after the phase out of nuclear powe...
Modeling the energy future of Switzerland after the phase out of nuclear powe...Modeling the energy future of Switzerland after the phase out of nuclear powe...
Modeling the energy future of Switzerland after the phase out of nuclear powe...
 
Exchanging the Status between Clients of Geospatial Web Services and GIS appl...
Exchanging the Status between Clients of Geospatial Web Services and GIS appl...Exchanging the Status between Clients of Geospatial Web Services and GIS appl...
Exchanging the Status between Clients of Geospatial Web Services and GIS appl...
 
Análisis crítico de los metadatos distribuidos por la IDEC presentacion
Análisis crítico de los metadatos distribuidos por la IDEC presentacionAnálisis crítico de los metadatos distribuidos por la IDEC presentacion
Análisis crítico de los metadatos distribuidos por la IDEC presentacion
 
Impact of user concurrency in commonly used OGC map server implementations
Impact of user concurrency in commonly used OGC map server implementationsImpact of user concurrency in commonly used OGC map server implementations
Impact of user concurrency in commonly used OGC map server implementations
 
Performance of standardized web map servers for remote sensing Imagery
Performance of standardized web map servers for remote sensing ImageryPerformance of standardized web map servers for remote sensing Imagery
Performance of standardized web map servers for remote sensing Imagery
 
Analysis of quality metadata in the GEOSS Clearinghouse - Poster
Analysis of quality metadata in the GEOSS Clearinghouse - PosterAnalysis of quality metadata in the GEOSS Clearinghouse - Poster
Analysis of quality metadata in the GEOSS Clearinghouse - Poster
 
The importance of geospatial data to calculate the optimal distribution of re...
The importance of geospatial data to calculate the optimal distribution of re...The importance of geospatial data to calculate the optimal distribution of re...
The importance of geospatial data to calculate the optimal distribution of re...
 
Mapping the evolution of renewable resources and their relation with EROI and...
Mapping the evolution of renewable resources and their relation with EROI and...Mapping the evolution of renewable resources and their relation with EROI and...
Mapping the evolution of renewable resources and their relation with EROI and...
 

Kürzlich hochgeladen

Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...KarteekMane1
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 

Kürzlich hochgeladen (20)

Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 

Analysis of quality metadata in the GEOSS Clearinghouse

  • 1. Analysis of the Quality Metadata in GEOSS Clearinghouse QUAlity aware VIsualisation for the Global Earth Observation system of systems SEVILLANO Eva1, DÍAZ Paula2, NINYEROLA Miquel1, MASÓ Joan2 , ZABALA Alaitz1, PONS Xavier1 1 UAB Universitat Autònoma de Barcelona. 2 CREAF Centre for Ecological Research and Forestry Applications.
  • 2. Objectives • To get a first analysis of the data quality in the Clearinghouse • Analyze the quality contained in the metadata (ISO 19115) – Quality indicators www.geoviqua.org – Quality indicators – Lineage – Usage • Start building components for the GEO Portal – Quality Broker – Quality searcher – Quality visualization
  • 4. Methodology 97203 XML documents Database GestBD Xpath extraction CSW GEOSS Clearinghouse www.geoviqua.org • Massive extraction of MD quality elements – Quality indicators – Lineage – Usage
  • 5. Overall Results • Total metadata records in the Clearinghouse – 97203 • Total number of quality indicators – 52187 www.geoviqua.org – 52187 • Metadata records with quality indicators – 19107 • Metadata records with lineage – 10899 (9261 process, 3771 source) • Metadata with usage – 1226
  • 6. Quality Scope • 19.66% Metadata records with quality indicators – 2.7 quality indicator per metadata record www.geoviqua.org
  • 7. 1. Quality indicators • 19.66% Metadata records with quality www.geoviqua.org
  • 8. Quality indicators • 19.66% Metadata records with quality – 2.7 QI/MD www.geoviqua.org
  • 10. Quality Indicators in IDEC Metadata Quality indicators – Comparison Clearinghouse - IDEC www.geoviqua.org Positional Accuracy 95.38% Thematic Accuracy 2.60% Completeness 0.46% Temporal Accuracy 0.06% Logical Consistency 0.02% Quality Indicators in IDEC Metadata GEOSS IDEC
  • 11. Quality indicator result 85.8% (22275 QI) 14.18% (3669 QI mainly conformance to INSPIRE) www.geoviqua.org 0.02% (5 QI) 19115-2 Extension for "per pixel" quality
  • 13. Quality indicators - Quantitative 10000 12000 14000 16000 18000 Numberofqualityelements Quality elements - Quantitative measures www.geoviqua.org 0 2000 4000 6000 8000 10000 Numberofqualityelements Complete value Declarevalue Quantitativetype
  • 14. Quality indicators - Qualitative 600 800 1000 1200 1400 Numberofqualityelements Quality elements - Conformance measures www.geoviqua.org 0 200 400 600 Numberofqualityelements Conformance to specification Declare conformance Conformance type
  • 15. Coverage result (ISO19115-2 extension) • Clearinghouse record ID: 273234, 273232, 273233, 273235, 273236) • Only 5 records use this. Bad news for visualizing data + quality maps • Title: OMNO2e:OMI Column Amount NO2:ColumnAmountNO2CS30 <gmd:DQ_QuantitativeAttributeAccuracy> <gmd:measureDescription> <gco:CharacterString>The 'version 003' product is the second public release. It is based on improved radiance calibration. For details, please see document: www.geoviqua.org radiance calibration. For details, please see document: http://disc.sci.gsfc.nasa.gov/Aura/OMI/OMTO3e_v003.shtml</gco:CharacterString> </gmd:measureDescription> <gmd:result><gmi:QE_CoverageResult> <gmi:spatialRepresentationType><gmd:MD_SpatialRepresentationTypeCode codeList="./resources/codeList.xml#MD_SpatialRepresentationTypeCode" codeListValue="grid">grid</gmd:MD_SpatialRepresentationTypeCode></gmi:spatialRepresentationType> <gmi:resultFile gco:nilReason="missing" /> <gmi:resultFormat> <gmd:MD_Format> <gmd:name><gco:CharacterString>CF-netCDF</gco:CharacterString></gmd:name> </gmd:MD_Format> </gmi:resultFormat> </gmi:QE_CoverageResult></gmd:result> </gmd:DQ_QuantitativeAttributeAccuracy>
  • 19. LI_ProcessStep with LI_Source Example Clearinghouse record ID 131007 (simplified) • Compile survey input data from the best and most current survey records. – BLM database of the index to all official (microfilm, CD, other) BLM survey records. – USFS survey records. – Private land surveyor records – GCDB Data Collection Attribute Definitions Version 2.0, Appendix A, 2/14/1991. Survey records used - source abbreviations. • Compile listings of known locations of PLSS corners. – USGS topographic quadrangles and other sources. – USC&amp;GS published coordinate data. – NGS published coordinate data. – BLM global positioning Data. – USFS global positioning data. • Coordinates of control stations are entered into a control data base with associated reliabilities. • Topologically correct GIS coverages are modified to use FGDC compliant naming www.geoviqua.org • Topologically correct GIS coverages are modified to use FGDC compliant naming conventions and then loaded into the LSI database. These layers can then be downloaded as shapefiles through the LSI website. • GCDB Data was downloaded for Kiowa and Cheyenne Counties, Colorado. – C:fgis_datasandzippedkiowatwnshp.shp.xml • Metadata imported and data was exported from regions format to shapefile format • Dataset copied. – C:fgis_datasanddatabasedataplssck_gcdb_region_township • Source Contribution: Survey data in the form of official (microfilm, CD, other) survey and BLM, abstracted into a vector digital format.online • Source Contribution: Survey and control data from the Cartographic Feature File (CFF) data set.disc • Source Contribution: Digitized control data from standard topological quadrangle sheets.disc
  • 20. LI_Lineage: LI_Source • 6.02% metadata records (5851) contain direct list of the data sources. – 1.85% (1798) with temporal extent class LI_Source_only LI_Lineage + statement :CharacterString [0..1] + scope :DQ_Scope [0..*] constraints {"source" role is mandatory if LI_Lineage.statement and "processStep" role are not documented} Metadata Information::MD_Metadata +resourceLineage 0..* www.geoviqua.org • Gives credit (attribution, and eventually some trust on them) • If quality indicators are not provided for the dataset, the quality indicators from sources can be a clue. LI_Source + description :CharacterString [0..1] + sourceSpatialResolution :MD_Resolution [0..1] + sourceReferenceSystem :MD_ReferenceSystem [0..1] + sourceCitation :CI_Citation [0..1] + sourceMetadata :CI_Citation [0..*] + scope :DQ_Scope [0..*] constraints {"description" is mandatory if "scope" is not documented} {"scope" is mandatory if "description" is not documented} and "processStep" role are not documented} {"processStep" role is mandatory if LI_Lineage.statement and "source" role are not documented} +source 0..*
  • 21. LI_Lineage: LI_ProcessStep • 8.26% metadata records (8035) contain the direct list of the processes without sources – 292 (0.30%) contain date class From_LI_ProcessStep_to_LI_Source LI_Lineage + statement :CharacterString [0..1] + scope :DQ_Scope [0..*] constraints Metadata Information::MD_Metadata +resourceLineage0..* www.geoviqua.org • With the order of these processes. • If quality indicators are not provided for the dataset, it’s difficult to infer resource quality with only a process list LI_ProcessStep + description :CharacterString + rationale :CharacterString [0..1] + stepDateTime :TM_Primitive [0..*] + processor :CI_ResponsiblePartyInfo [0..*] + reference :CI_Citation [0..*] + scope :DQ_Scope [0..*] constraints {"source" role is mandatory if LI_Lineage.statement and "processStep" role are not documented} {"processStep" role is mandatory if LI_Lineage.statement and "source" role are not documented} +processStep 0..*
  • 22. Complete Provenance: MD_ProcessStep with MD_Source • 1.26% metadata records (1226 ) with more complete provenance process . • How and when the data sources where used class From_LI_ProcessStep_to_LI_Source LI_Lineage + statement :CharacterString [0..1] + scope :DQ_Scope [0..*] constraints {"source" role is mandatory if LI_Lineage.statement and "processStep" role are not documented} Metadata Information::MD_Metadata +resourceLineage0..* www.geoviqua.org sources where used • If quality indicators are not provided for the dataset, we can deduce which sources have more influence in the quality of the final result LI_Source + description :CharacterString [0..1] + sourceSpatialResolution :MD_Resolution [0..1] + sourceReferenceSystem :MD_ReferenceSystem [0..1] + sourceCitation :CI_Citation [0..1] + sourceMetadata :CI_Citation [0..*] + scope :DQ_Scope [0..*] constraints {"description" is mandatory if "scope" is not documented} {"scope" is mandatory if "description" is not documented} LI_ProcessStep + description :CharacterString + rationale :CharacterString [0..1] + stepDateTime :TM_Primitive [0..*] + processor :CI_ResponsiblePartyInfo [0..*] + reference :CI_Citation [0..*] + scope :DQ_Scope [0..*] and "processStep" role are not documented} {"processStep" role is mandatory if LI_Lineage.statement and "source" role are not documented} +processStep 0..* +source 0..*
  • 23. Complete provenance in ISO19115-2 • LI_ProcessStep includes a LE_Processing that has a runTimeParameters attribute that allows us describing the exact list of parameters used in the execution. • There is a citation of the algorithm used (LI_Algorithm). class From_LE_ProcessStep_to_LE_Source LI_Source + description :CharacterString [0..1] + sourceSpatialResolution :MD_Resolution [0..1] + sourceReferenceSystem :MD_ReferenceSystem [0..1] + sourceCitation :CI_Citation [0..1] + sourceMetadata :CI_Citation [0..*] + scope :DQ_Scope [0..*] constraints {"description" is mandatory if "scope" is not documented} {"scope" is mandatory if "description" is not documented} LI_ProcessStep + description :CharacterString + rationale :CharacterString [0..1] + stepDateTime :TM_Primitive [0..*] + processor :CI_ResponsiblePartyInfo [0..*] + reference :CI_Citation [0..*] + scope :DQ_Scope [0..*] LI_Lineage + statement :CharacterString [0..1] + scope :DQ_Scope [0..*] constraints {"source" role is mandatory if LI_Lineage.statement and "processStep" role are not documented} {"processStep" role is mandatory if LI_Lineage.statement and "source" role are not documented} Metadata Information::MD_Metadata +processStep 0..* +resourceLineage 0..* www.geoviqua.org used (LI_Algorithm). • All these extensions were done for the benefit of the EO gridded data, but there are not in the Clearinghouse. • We can completely evaluate the quality of the resulting product if we know the uncertainties that sources have in their metadata (sourceMetadata citation in LI_Source). From ISO 19115-2:2009 shown for informative purposes only Data quality information - Imagery:: LE_ProcessStep Data quality information - Imagery:: LE_ProcessStepReport + name :CharacterString + description :CharacterString [0..1] + fileType :CharacterString [0..1] If "LE_NominalResolution.scanningResolution" is used then "LE_Source.scaleDenominator" is required Data quality information - Imagery:: LE_Source + processedLevel :MD_Identifier [0..1] + resolution :LE_NominalResolution [0..1] Data quality information - Imagery:: LE_Processing + identifier :MD_Identifier + softwareReference :CI_Citation [0..1] + procedureDescription :CharacterString [0..1] + documentation :CI_Citation [0..*] + runTimeParameters :CharacterString [0..1] Data quality information - Imagery::LE_Algorithm + citation :CI_Citation + description :CharacterString «Union» Data quality information - Imagery:: LE_NominalResolution + scanningResolution :Distance + groundResolution :Distance "description" is mandatory if "sourceExtent" is not documented "sourceExtent" is mandatory if "description" is not documented +report 0..* +output 0..* +processingInformation0..1 +algorithm 0..*
  • 24. 3. Usage - User feedback www.geoviqua.org • There is one small entry for user feedback in the current ISO-19115: • MD_Usage – Brief description of ways in which the resource is currently or has been used
  • 25. • There are 1.2% (1133) entries – SpecificUsage and – UserContactInfo, only • All made by the same institution!: MD_Usage - User feedback www.geoviqua.org – Landesvermessung und Geobasisinformation Brandenburg (LGB) – Tel +49-331-8844-123, Fax. +49-331-8844-16123 – Heinrich-Mann-Allee 103, Potsdam, Brandenburg 14473, Deutschland – kundenservice@geobasis-bb.de – http://www.geobasis-bb.de
  • 26. Conclusions • There are many different kinds of quality indicators – There is a lack of a complete description of values provided (no units, missing measure name, missing evaluation method) • Quality coverage results (by pixel) are almost inexistent and the the link is not there • Lineage information is rich in many records, some with more that 100 entries in source or ProcessSteps www.geoviqua.org entries in source or ProcessSteps • We have usage examples -> Feedback • Current data is enough to demonstrate search and visualization with some limitations. Good for GeoViQua. • Next steps: – Assess the Quality of Quality Metadata? – Extend this analysis to other capacity catalogues integrated in the EuroGEOSS Broker