Cameron Kiddle
Research Fellow, Grid Research Centre
University of Calgary
Presented at the Cybera/CANARIE National Summit 2009, as part of the session "New Frontiers in Data Integration." This session showcased a selection of leading-edge initiatives that are breaking new ground and setting new precedents around the collection and integration of data.
GRC Integration And Management Of Diverse Environmental Data Sets
1. Integration and Management
of Diverse Environmental
Data Sets
Cameron Kiddle
Research Fellow
Grid Research Centre
University of Calgary
2. Outline
!! Data Challenges
!! GeoChronos (Spectral Libraries)
!! Spectral Library Demonstration
!! Cloud Services for Water Management
!! Cloud Services for Water Management
Demonstration
Summit 09 2
Oct. 14, 2009
3. Data Challenges - Acquisition
!! Many different data sources
!! Different regulations/mechanisms for
accessing data
!! Lack of automation
!! Finding the right data
Summit 09 3
Oct. 14, 2009
4. Data Challenges - Management
!! Scattered and unorganized data
!! Inadequate tools for recording/
maintaining metadata
"! Data without metadata is meaningless
"! Lack of suitable metadata standards
"! Validation of metadata
!! Tracking provenance of data
Summit 09 4
Oct. 14, 2009
5. Data Challenges – Pre-processing
!! Raw data typically cannot be directly
analyzed
!! Significant amount of time spent
preparing data for analysis
!! Lack of automation
Summit 09 5
Oct. 14, 2009
6. GeoChronos
!! Partners
"! CANARIE (NEP-1)
"! Center for Earth Observation Sciences,
University of Alberta
"! Cybera
"! Grid Research Centre, University of Calgary
Summit 09 6
Oct. 14, 2009
7. GeoChronos
!! An on-line platform (http://geochronos.org/)
"! For:
!! Earth Observation Scientists
"! Facilitating:
!! Collaboration between scientists
!! Application access, management and sharing
!! Data access, management and sharing
"! Leveraging:
!! Web 2.0 and social networking technologies
!! Cloud computing technologies
!! Semantic Web technologies
Summit 09 7
Oct. 14, 2009
8. GeoChronos
!! Data Solutions - Spectral Libraries
"! Store, share and browse spectral data
"! View spectral plots, metadata, ancillary data and maps
"! Manage and generate metadata for spectra
"! Create and share metadata schemas
!! Technology
"! iRODS (http://www.irods.org/) for data storage/management
"! Semantic Web technologies such as RDF (Resource
Description Framework) to link/relate data
!! Next Steps
"! Generalization of spectral library solution - acquire, store,
manage, browse and share other types of data (i.e., satellite,
flux, phenology, meteorological, etc.)
"! Automate data workflows (i.e., mosaic, reproject and subset
MODIS data) using cloud-based services
Summit 09 8
Oct. 14, 2009
13. Cloud Services for Water Management
!! Partners
"! Alberta Advanced Education and Technology
"! Alberta WaterSMART
"! Cybera
"! Geosensor Web Lab, University of Calgary
"! Grid Research Centre, University of Calgary
"! Tesera Systems Inc
Summit 09 13
Oct. 14, 2009
14. Cloud Services for Water Management
!! In preliminary stages of project
!! Explore use of cloud services to store,
manipulate and expose data related to water
management
!! Link and correlate a wide variety of data
from a large number of sources
!! Cloud-based analysis and visualization tools
!! Investigate integration with existing Alberta
WaterPortal (http://www.albertawater.com/)
Summit 09 14
Oct. 14, 2009
15. Cloud Services for Water Management -
Demonstration
Summit 09 15
Oct. 14, 2009
16. Data Types and Locations Displayed on
Google Earth
Summit 09 16
Oct. 14, 2009
17. Water Flow Data Shown using Google
Visualization API
Summit 09 17
Oct. 14, 2009
18. Integrating and Visualizing Different Data Sets
From Calgary’s Flood in 2005 on Google Earth
Summit 09 18
Oct. 14, 2009