SlideShare ist ein Scribd-Unternehmen logo
1 von 15
Semantic Data Integration I6 Core Group Nic Bertrand Herbert Schentz LTER-Europe Conference, Mallorca, Dec. 2008
Overview ,[object Object],[object Object],[object Object],[object Object]
Architecture Goal: Enable seamless access to distributed data  Allow  local  data analysis for all members  with their own tools Distributed Socio-Ecological  Data See all data as if it came from ONE Data Source Distributed  Data mining with local tools Portal
Longer term vision Distributed Applications Extend seamless access to distributed services (SOA) Allow  local  data analysis for all members  with their own tools and common services See all data as if data came from ONE Data Source processed within ONE application Distributed  Socio-ecological  Data Distributed  Data Mining With local tools
Role of Ontology  Distributed Data Mining with local tools Distributed Socio-Ecological Data SERONTO SERONTO: basis to discover, retrieve and integrate distributed heterogenous data  common concepts and structures Portal
Testing... Why? ,[object Object],[object Object],[object Object]
Proof of concept: Acceptance Criteria  •  The databases must have different structures and must have been developed independently of SERONTO; •  The databases must feature reference lists (e.g. species lists); •  The database structures must not be altered as a result of the integration work; •  New concepts may be imported into SERONTO as and when required; •  The databases must contain data relevant to Long Term Ecological Research (e.g. vegetation surveys, records of species occurrences, measurement of biotic and abiotic components).
Testing: Connecting 5 databases JOKL cultural landscapes JODI vegetation 2835 floodplain ECN Summary Database More about the databases: Independently developed,  Not developed for the purpose of data integration Different data models  Different languages Similar data types collected in ALTER-Net,  Some obvious integration points  (e.g. Vegetation) Pythia vegetation SERONTO
Data Integration using SERONTO Import Ontology Connect Databases Query SERONTO Results
Getting value sets back SERONTO parameter_method parameter method Value_sets Unit Scale
Data Integration Results ,[object Object],SERONTO
Data Integration Results import diverse ecological databases JOKL cultural landscapes JODI vegetation 2835 floodplain ECN Summary Database Pythia vegetation
Data Integration Results Extend SERONTO Classes Using the content of the databases (SERONTO Core does not  contain domain specific concepts) Map databases to SERONTO (Simple and complex mappings) Query individual databases directly Query multiple databases from the SERONTO (Simple and Complex queries) Map once, reuse data many times, querying does not require knowledge of the structures of the databases Semantic data integration is possible
Open Questions ,[object Object],[object Object],[object Object],[object Object],[object Object],SERONTO Core domain ontologies ? <?xml version=&quot;1.0&quot; encoding=&quot;UTF-8&quot;?> <flg:flogic xmlns:flg=&quot; http://www.wsmo.org/2004/d16/d16.2/v0.1/ &quot;> <!-- Test data to test the WSML F-Logic XML syntax --> <!-- The following <rule></rule> encodes this fact (taken from the F-Logic JACM paper, page 7): bob[name -> &quot;Bob&quot;; age -> 40; affiliation -> cs1[dname -> &quot;CS&quot;; mngr -> bob; assistents -> {john, sally}] this encoding writes only elementary molecules --> <rule> <head> <molecule> <object> <constant name=&quot;bob&quot;/> </object> <superclass isaType=&quot;:&quot;> <class> <constant name=&quot;empl&quot;/> </class> </superclass> <methodSpec arrow=&quot;->&quot;> <name> <constant name=&quot;name&quot;/> </name> <result> <oid> <constant name=&quot;&quot;Bob&quot;&quot;/> </oid> </result> Portal Query Databases Performance
Possible uses for LTER Europe Distributed Data Mining with local tools Distributed Socio-Ecological Data SERONTO &   Domain Ontologies common concepts and domain knowledge Portal Seamless access... Ready for use now

Weitere ähnliche Inhalte

Was ist angesagt?

Geant4 Model Testing Framework: From PAW to ROOT
Geant4 Model Testing Framework:  From PAW to ROOTGeant4 Model Testing Framework:  From PAW to ROOT
Geant4 Model Testing Framework: From PAW to ROOT
Roman Atachiants
 
TreeBASE CIPRES
TreeBASE CIPRESTreeBASE CIPRES
TreeBASE CIPRES
Rutger Vos
 

Was ist angesagt? (20)

Why ICT Fails in Healthcare: Software Maintenance and Maintainability
Why ICT Fails in Healthcare: Software Maintenance and MaintainabilityWhy ICT Fails in Healthcare: Software Maintenance and Maintainability
Why ICT Fails in Healthcare: Software Maintenance and Maintainability
 
Benchmarking Versioning for Big Linked Data
Benchmarking Versioning for Big Linked DataBenchmarking Versioning for Big Linked Data
Benchmarking Versioning for Big Linked Data
 
Information Extraction from EuroParliament and UK Parliament data
Information Extraction from EuroParliament and UK Parliament dataInformation Extraction from EuroParliament and UK Parliament data
Information Extraction from EuroParliament and UK Parliament data
 
Information Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative CampInformation Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative Camp
 
ELIXIR-UK and the ELIXIR Interoperability Platform
ELIXIR-UK and the ELIXIR Interoperability PlatformELIXIR-UK and the ELIXIR Interoperability Platform
ELIXIR-UK and the ELIXIR Interoperability Platform
 
1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe
1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe
1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe
 
Holistic Benchmarking of Big Linked Data: HOBBIT
Holistic Benchmarking of Big Linked Data: HOBBITHolistic Benchmarking of Big Linked Data: HOBBIT
Holistic Benchmarking of Big Linked Data: HOBBIT
 
Aggregating Research papers from Publishers' Systems to Support Text and Data...
Aggregating Research papers from Publishers' Systems to Support Text and Data...Aggregating Research papers from Publishers' Systems to Support Text and Data...
Aggregating Research papers from Publishers' Systems to Support Text and Data...
 
Clinical modelling with openEHR Archetypes
Clinical modelling with openEHR ArchetypesClinical modelling with openEHR Archetypes
Clinical modelling with openEHR Archetypes
 
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
 
Geant4 Model Testing Framework: From PAW to ROOT
Geant4 Model Testing Framework:  From PAW to ROOTGeant4 Model Testing Framework:  From PAW to ROOT
Geant4 Model Testing Framework: From PAW to ROOT
 
Standards and tools for model management in biomedical research
Standards and tools for model management in biomedical researchStandards and tools for model management in biomedical research
Standards and tools for model management in biomedical research
 
Presentation of HOBBIT's versioning benchmark at Graph-TA
Presentation of HOBBIT's versioning benchmark at Graph-TAPresentation of HOBBIT's versioning benchmark at Graph-TA
Presentation of HOBBIT's versioning benchmark at Graph-TA
 
DIACHRON Project Overview
DIACHRON Project OverviewDIACHRON Project Overview
DIACHRON Project Overview
 
EnCore & EnVision
EnCore & EnVisionEnCore & EnVision
EnCore & EnVision
 
GeoChronos - SpecNet Workshop 2009 Presentation
GeoChronos - SpecNet Workshop 2009 PresentationGeoChronos - SpecNet Workshop 2009 Presentation
GeoChronos - SpecNet Workshop 2009 Presentation
 
The Chemtools LaBLog
The Chemtools LaBLogThe Chemtools LaBLog
The Chemtools LaBLog
 
Short introduction to SED-ML
Short introduction to SED-MLShort introduction to SED-ML
Short introduction to SED-ML
 
TreeBASE CIPRES
TreeBASE CIPRESTreeBASE CIPRES
TreeBASE CIPRES
 
Data and Model Management for Systems Biology
Data and Model Management  for Systems BiologyData and Model Management  for Systems Biology
Data and Model Management for Systems Biology
 

Andere mochten auch

Poster Semantic data integration proof of concept
Poster Semantic data integration proof of conceptPoster Semantic data integration proof of concept
Poster Semantic data integration proof of concept
Nicolas Bertrand
 
Seronto Domain Ontology Framework
Seronto Domain Ontology FrameworkSeronto Domain Ontology Framework
Seronto Domain Ontology Framework
Nicolas Bertrand
 
Proof of Concept for Hadoop: storage and analytics of electrical time-series
Proof of Concept for Hadoop: storage and analytics of electrical time-seriesProof of Concept for Hadoop: storage and analytics of electrical time-series
Proof of Concept for Hadoop: storage and analytics of electrical time-series
DataWorks Summit
 

Andere mochten auch (20)

Wegman Cincinnati
Wegman CincinnatiWegman Cincinnati
Wegman Cincinnati
 
Poster Semantic data integration proof of concept
Poster Semantic data integration proof of conceptPoster Semantic data integration proof of concept
Poster Semantic data integration proof of concept
 
Seronto Process
Seronto ProcessSeronto Process
Seronto Process
 
Semantic Data Integration of Biodiversity Data with the SERONTO Ontology
Semantic Data Integration of Biodiversity  Data with the SERONTO OntologySemantic Data Integration of Biodiversity  Data with the SERONTO Ontology
Semantic Data Integration of Biodiversity Data with the SERONTO Ontology
 
Seronto Domain Ontology Framework
Seronto Domain Ontology FrameworkSeronto Domain Ontology Framework
Seronto Domain Ontology Framework
 
Eln Jisc Mrc 18dec07 Nsb
Eln Jisc Mrc 18dec07 NsbEln Jisc Mrc 18dec07 Nsb
Eln Jisc Mrc 18dec07 Nsb
 
Ceh Conference Nsb
Ceh Conference NsbCeh Conference Nsb
Ceh Conference Nsb
 
HPE | Network Virtualization | POC
HPE | Network Virtualization | POCHPE | Network Virtualization | POC
HPE | Network Virtualization | POC
 
Infosys Ltd: Performance Tuning - A Key to Successful Cassandra Migration
Infosys Ltd: Performance Tuning - A Key to Successful Cassandra MigrationInfosys Ltd: Performance Tuning - A Key to Successful Cassandra Migration
Infosys Ltd: Performance Tuning - A Key to Successful Cassandra Migration
 
Optimizing Preclinical Proof of Concept
Optimizing Preclinical Proof of ConceptOptimizing Preclinical Proof of Concept
Optimizing Preclinical Proof of Concept
 
c-quilibrium R forecasting integration
c-quilibrium R forecasting integrationc-quilibrium R forecasting integration
c-quilibrium R forecasting integration
 
OTN tour 2015 Oracle Enterprise Manager 12c – Proof of Concept
OTN tour 2015 Oracle Enterprise Manager 12c – Proof of ConceptOTN tour 2015 Oracle Enterprise Manager 12c – Proof of Concept
OTN tour 2015 Oracle Enterprise Manager 12c – Proof of Concept
 
Architect Concept - .NET Interview Senior / Mid Level
Architect Concept - .NET Interview Senior / Mid Level Architect Concept - .NET Interview Senior / Mid Level
Architect Concept - .NET Interview Senior / Mid Level
 
Music recommendations API with Neo4j
Music recommendations API with Neo4jMusic recommendations API with Neo4j
Music recommendations API with Neo4j
 
Proof of Concept with Real Application Testing 12c
Proof of Concept with Real Application Testing 12cProof of Concept with Real Application Testing 12c
Proof of Concept with Real Application Testing 12c
 
Presenting a Technical Proof of Concept to Customers
Presenting a Technical Proof of Concept to CustomersPresenting a Technical Proof of Concept to Customers
Presenting a Technical Proof of Concept to Customers
 
Cloud Migration
Cloud MigrationCloud Migration
Cloud Migration
 
Proof of Concept for Hadoop: storage and analytics of electrical time-series
Proof of Concept for Hadoop: storage and analytics of electrical time-seriesProof of Concept for Hadoop: storage and analytics of electrical time-series
Proof of Concept for Hadoop: storage and analytics of electrical time-series
 
How to Build a Proof of Concept
How to Build a Proof of Concept How to Build a Proof of Concept
How to Build a Proof of Concept
 
Big Data Proof of Concept
Big Data Proof of ConceptBig Data Proof of Concept
Big Data Proof of Concept
 

Ähnlich wie Semantic data integration proof of concept

Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
OSTHUS
 
Tim Malthus_Towards standards for the exchange of field spectral datasets
Tim Malthus_Towards standards for the exchange of field spectral datasetsTim Malthus_Towards standards for the exchange of field spectral datasets
Tim Malthus_Towards standards for the exchange of field spectral datasets
TERN Australia
 
Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...
Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...
Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...
IJwest
 
Oxford University Geospatial Metadata Workshop 20110415
Oxford University Geospatial Metadata Workshop 20110415Oxford University Geospatial Metadata Workshop 20110415
Oxford University Geospatial Metadata Workshop 20110415
EDINA, University of Edinburgh
 

Ähnlich wie Semantic data integration proof of concept (20)

From allotrope to reference master data management
From allotrope to reference master data management From allotrope to reference master data management
From allotrope to reference master data management
 
Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
 
Next-Generation Search Engines for Information Retrieval
Next-Generation Search Engines for Information RetrievalNext-Generation Search Engines for Information Retrieval
Next-Generation Search Engines for Information Retrieval
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysis
 
OSFair2017 Workshop | EPOS: European Plate Observing System
OSFair2017 Workshop | EPOS: European Plate Observing SystemOSFair2017 Workshop | EPOS: European Plate Observing System
OSFair2017 Workshop | EPOS: European Plate Observing System
 
Geospatial Metadata and Spatial Data: It's all Greek to me!
Geospatial Metadata and Spatial Data: It's all Greek to me!Geospatial Metadata and Spatial Data: It's all Greek to me!
Geospatial Metadata and Spatial Data: It's all Greek to me!
 
Tim Malthus_Towards standards for the exchange of field spectral datasets
Tim Malthus_Towards standards for the exchange of field spectral datasetsTim Malthus_Towards standards for the exchange of field spectral datasets
Tim Malthus_Towards standards for the exchange of field spectral datasets
 
The CIARD RINGValeri
The CIARD RINGValeriThe CIARD RINGValeri
The CIARD RINGValeri
 
A Finnish perspective on FAIRsFAIR outputs
A Finnish perspective on FAIRsFAIR outputsA Finnish perspective on FAIRsFAIR outputs
A Finnish perspective on FAIRsFAIR outputs
 
Resource Description Framework Approach to Data Publication and Federation
Resource Description Framework Approach to Data Publication and FederationResource Description Framework Approach to Data Publication and Federation
Resource Description Framework Approach to Data Publication and Federation
 
EUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan BroederEUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan Broeder
 
A Software Infrastructure for Multidimensional Data Analysis: A Data Modellin...
A Software Infrastructure for Multidimensional Data Analysis: A Data Modellin...A Software Infrastructure for Multidimensional Data Analysis: A Data Modellin...
A Software Infrastructure for Multidimensional Data Analysis: A Data Modellin...
 
AUTOMATIC CONVERSION OF RELATIONAL DATABASES INTO ONTOLOGIES: A COMPARATIVE A...
AUTOMATIC CONVERSION OF RELATIONAL DATABASES INTO ONTOLOGIES: A COMPARATIVE A...AUTOMATIC CONVERSION OF RELATIONAL DATABASES INTO ONTOLOGIES: A COMPARATIVE A...
AUTOMATIC CONVERSION OF RELATIONAL DATABASES INTO ONTOLOGIES: A COMPARATIVE A...
 
Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matrices
 
Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...
Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...
Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...
 
Northumbria University Geospatial Metadata Workshop 20110505
Northumbria University Geospatial Metadata Workshop 20110505Northumbria University Geospatial Metadata Workshop 20110505
Northumbria University Geospatial Metadata Workshop 20110505
 
Oxford University Geospatial Metadata Workshop 20110415
Oxford University Geospatial Metadata Workshop 20110415Oxford University Geospatial Metadata Workshop 20110415
Oxford University Geospatial Metadata Workshop 20110415
 
IRJET - Voice based Natural Language Query Processing
IRJET -  	  Voice based Natural Language Query ProcessingIRJET -  	  Voice based Natural Language Query Processing
IRJET - Voice based Natural Language Query Processing
 
Planetdata simpda
Planetdata simpdaPlanetdata simpda
Planetdata simpda
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 

Semantic data integration proof of concept

  • 1. Semantic Data Integration I6 Core Group Nic Bertrand Herbert Schentz LTER-Europe Conference, Mallorca, Dec. 2008
  • 2.
  • 3. Architecture Goal: Enable seamless access to distributed data Allow local data analysis for all members with their own tools Distributed Socio-Ecological Data See all data as if it came from ONE Data Source Distributed Data mining with local tools Portal
  • 4. Longer term vision Distributed Applications Extend seamless access to distributed services (SOA) Allow local data analysis for all members with their own tools and common services See all data as if data came from ONE Data Source processed within ONE application Distributed Socio-ecological Data Distributed Data Mining With local tools
  • 5. Role of Ontology Distributed Data Mining with local tools Distributed Socio-Ecological Data SERONTO SERONTO: basis to discover, retrieve and integrate distributed heterogenous data common concepts and structures Portal
  • 6.
  • 7. Proof of concept: Acceptance Criteria • The databases must have different structures and must have been developed independently of SERONTO; • The databases must feature reference lists (e.g. species lists); • The database structures must not be altered as a result of the integration work; • New concepts may be imported into SERONTO as and when required; • The databases must contain data relevant to Long Term Ecological Research (e.g. vegetation surveys, records of species occurrences, measurement of biotic and abiotic components).
  • 8. Testing: Connecting 5 databases JOKL cultural landscapes JODI vegetation 2835 floodplain ECN Summary Database More about the databases: Independently developed, Not developed for the purpose of data integration Different data models Different languages Similar data types collected in ALTER-Net, Some obvious integration points (e.g. Vegetation) Pythia vegetation SERONTO
  • 9. Data Integration using SERONTO Import Ontology Connect Databases Query SERONTO Results
  • 10. Getting value sets back SERONTO parameter_method parameter method Value_sets Unit Scale
  • 11.
  • 12. Data Integration Results import diverse ecological databases JOKL cultural landscapes JODI vegetation 2835 floodplain ECN Summary Database Pythia vegetation
  • 13. Data Integration Results Extend SERONTO Classes Using the content of the databases (SERONTO Core does not contain domain specific concepts) Map databases to SERONTO (Simple and complex mappings) Query individual databases directly Query multiple databases from the SERONTO (Simple and Complex queries) Map once, reuse data many times, querying does not require knowledge of the structures of the databases Semantic data integration is possible
  • 14.
  • 15. Possible uses for LTER Europe Distributed Data Mining with local tools Distributed Socio-Ecological Data SERONTO & Domain Ontologies common concepts and domain knowledge Portal Seamless access... Ready for use now