SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Some Thoughts Juan Esteva, Ph. D. . 751 Malena Dr., Ann Arbor, MI 48103 Tel:  734-786-0233  Cell  734-277-4962 Fax  734-821-0235 SkypeDrEsteva juan.esteva@Ajatella.com Ontology Data Integration For Competitive Decision Making
Not Just The Facts 3/4/2010 Juan Esteva, Ph. D. 2 “Good decisions are based on information that is analyzed and transformed into usable knowledge” Eileen Feretic
Information at the point of impact 3/4/2010 Juan Esteva, Ph. D. 3 “Information needs to be at the point of impact—at the front lines where people are making decisions. The right analysis needs to be done at the right place. It’s important for organizations to treat information as a strategic asset in order to optimize every decision, every process, everything they do.” AmbujGoyal,
Data in Silos 3/4/2010 Juan Esteva, Ph. D. 4 “One of the biggest challenges organizations face is the amount of data sitting in silos, too often, valuable data simply isn’t accessible or available.” Boris Evelson
Business Decisions for Competitive Advantage 3/4/2010 Juan Esteva, Ph. D. 5 “In today’s troubled economy and competitive business environment, making good decisions is a matter of survival. But good decisions aren’t based on gut feeling alone. They should be based on information gathered from multiple sources, which is then synthesized and analyzed to generate a road map of options and possible outcomes that transform data into usable knowledge” Eileen Feretic
Business Intelligence 3/4/2010 Juan Esteva, Ph. D. 6 Business Intelligence and now Business Analytics systems come into play  [However,] it is hard to assemble [heterogeneous data and] disparate pieces of information in a way that provides the intelligence and insight needed to make good business decisions. Eileen Feretic Alas enter Ontology Data Integration.
Data Integration 3/4/2010 Juan Esteva, Ph. D. 7 Data integration provides the ability to manipulate data transparently across multiple data sources. Based on the architecture there are 2 systems: Central Data Integration A central data integration system usually has a global schema, which provides the user with a uniform interface to access information stored in the data sources Peer-2-peer In contrast, in a peer-to-peer data integration system, there are no global points of control on the data sources (or peers). Instead, any peer can accept user queries for the information distributed in the whole system.
Common Approaches for Data Integration 3/4/2010 Juan Esteva, Ph. D. 8 Global-as-View In the GaV approach, every entity in the global schema is associated with a view over the source local schema. Therefore querying strategies are simple, but the evolution of the local source schemas is not easily supported. Local-as-View On the contrary, the LaV approach permits changes to source schemas without affecting the global schema, since the local schemas are defined as views over the global schema, but query processing can be complex.
Data Heterogeneity 3/4/2010 Juan Esteva, Ph. D. 9 Data sources can be heterogeneous in: Syntax Syntactic heterogeneity is caused by the use of different models or languages. Schema Schematic heterogeneity results from structural differences. Semantics Semantic heterogeneity is caused by different meanings or interpretations of data in various contexts To achieve data interoperability, the issues posed by data heterogeneity need to be eliminated
Possible Solutions 3/4/2010 Juan Esteva, Ph. D. 10 The advent of XML has created a syntactic platform for Web data standardization and exchange. However, schematic data heterogeneity may persist, depending on the XML schemas used (e.g., nesting hierarchies). Likewise, semantic heterogeneity may persist even if both syntactic and schematic heterogeneities do not occur (e.g., naming concepts differently). We should be concerned with solving all three kinds of heterogeneities by bridging syntactic, schematic, and semantic heterogeneities across different sources.
Semantic Data Integration Using Ontologies 3/4/2010 Juan Esteva, Ph. D. 11 We call semantic data integration the process of using a conceptual representation of the data and of their relationships to eliminate possible heterogeneities. At the heart of semantic data integration is the concept of ontology, which is an explicit specification of a shared conceptualization
Ontology & Data Integration 3/4/2010 Juan Esteva, Ph. D. 12 Metadata Representation. Metadata (i.e., source schemas) in each data source can be explicitly represented by a local ontology, using a single language. Global Conceptualization. The global ontology provides a conceptual view over the schematically-heterogeneous source schemas. Support for High-level Queries. Given a high-level view of the sources, as provided by a global ontology, the user can formulate a query without specific knowledge of the different data sources. The query is then rewritten into queries over the sources, based on the semantic mappings between the global and local ontologies. Declarative Mediation. Query processing in a hybrid peer-to-peer system uses the global ontology as a declarative mediator for query rewriting between peers. Mapping Support. A thesaurus, formalized in terms of an ontology, can be used for the mapping process to facilitate its automation.
What do we need? 3/4/2010 Juan Esteva, Ph. D. 13 Increase search capabilities From discovery to reasoning Increasing metadata  as to provide strong semantics From glossaries to ontologies Consequently, moving from syntactic interoperability to structural interoperability and finally to semantic interoperability
Graphically the model progression will be [2]  3/4/2010 Juan Esteva, Ph. D. 14 The point of this graph is that Increasing Metadata (from glossaries to ontologies) is highly correlated with Increasing Search Capability (from discovery to reasoning).
Juan Esteva, Ph. D. 3/4/2010 15 References
References 3/4/2010 Juan Esteva, Ph. D. 16 Applying 4D ontologies to Enterprise Architecture, Matthew West,  Shell Corp. FHA Data Architecture Working Group: SICoP DRM 2.0 Pilot, 2005 The Role of Ontologies in Data Integration, Isabel F. Cruz Huiyong Xiao
Topic Maps 3/4/2010 Juan Esteva, Ph. D. 17 Topic Maps is a standard for the representation and interchange of knowledge, with an emphasis on the findability of information. The ISO standard is formally known as ISO/IEC 13250:2003. A topic map represents information using topics (representing any concept, from people, countries, and organizations to software modules, individual files, and events), associations (representing the relationships between topics), and occurrences (representing information resources relevant to a particular topic).
SKOS 3/4/2010 Juan Esteva, Ph. D. 18 Simple Knowledge Organization System (SKOS)  SKOS is a common data model for sharing and linking knowledge organization systems via the Web.
RDF 3/4/2010 Juan Esteva, Ph. D. 19 Resource Description Language RDF RDF is a standard model for data interchange on the Web. RDF has features that facilitate data merging even if the underlying schemas differ, and it specifically supports the evolution of schemas over time without requiring all the data consumers to be changed.
OWL 3/4/2010 Juan Esteva, Ph. D. 20 Web Ontology Language OWL is a Semantic Web language designed to represent rich and complex knowledge about things, groups of things, and relations between things. OWL is a computational logic-based language such that knowledge expressed in OWL can be reasoned with by computer programs either to verify the consistency of that knowledge or to make implicit knowledge explicit. OWL documents, known as ontologies, can be published in the World Wide Web and may refer to or be referred from other OWL ontologies. OWL is part of the W3C’s Semantic Web technology stack, which includes RDF, RDFS, SPARQL, etc.

Weitere ähnliche Inhalte

Was ist angesagt?

Using linguistic analysis to translate
Using linguistic analysis to translateUsing linguistic analysis to translate
Using linguistic analysis to translate
IJwest
 
Improve information retrieval and e learning using
Improve information retrieval and e learning usingImprove information retrieval and e learning using
Improve information retrieval and e learning using
IJwest
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mapping
butest
 

Was ist angesagt? (20)

Using linguistic analysis to translate
Using linguistic analysis to translateUsing linguistic analysis to translate
Using linguistic analysis to translate
 
Data Integration Ontology Mapping
Data Integration Ontology MappingData Integration Ontology Mapping
Data Integration Ontology Mapping
 
Enhancing Semantic Mining
Enhancing Semantic MiningEnhancing Semantic Mining
Enhancing Semantic Mining
 
Improve information retrieval and e learning using
Improve information retrieval and e learning usingImprove information retrieval and e learning using
Improve information retrieval and e learning using
 
The Standardization of Semantic Web Ontology
The Standardization of Semantic Web OntologyThe Standardization of Semantic Web Ontology
The Standardization of Semantic Web Ontology
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mapping
 
Heterogeneous fuzzy xml data integration based on structrual and semantic sim...
Heterogeneous fuzzy xml data integration based on structrual and semantic sim...Heterogeneous fuzzy xml data integration based on structrual and semantic sim...
Heterogeneous fuzzy xml data integration based on structrual and semantic sim...
 
USING RELATIONAL MODEL TO STORE OWL ONTOLOGIES AND FACTS
USING RELATIONAL MODEL TO STORE OWL ONTOLOGIES AND FACTSUSING RELATIONAL MODEL TO STORE OWL ONTOLOGIES AND FACTS
USING RELATIONAL MODEL TO STORE OWL ONTOLOGIES AND FACTS
 
Translating Ontologies in Real-World Settings
Translating Ontologies in Real-World SettingsTranslating Ontologies in Real-World Settings
Translating Ontologies in Real-World Settings
 
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...
 
Semantic Web, Ontology, and Ontology Learning: Introduction
Semantic Web, Ontology, and Ontology Learning: IntroductionSemantic Web, Ontology, and Ontology Learning: Introduction
Semantic Web, Ontology, and Ontology Learning: Introduction
 
Ontologies for big data
Ontologies for big dataOntologies for big data
Ontologies for big data
 
4 semantic web and ontology
4 semantic web and ontology4 semantic web and ontology
4 semantic web and ontology
 
Are Data Models Superfluous Nov2003
Are Data Models Superfluous Nov2003Are Data Models Superfluous Nov2003
Are Data Models Superfluous Nov2003
 
Learning ontologies
Learning ontologiesLearning ontologies
Learning ontologies
 
Ontology
OntologyOntology
Ontology
 
Towards a Query Rewriting Algorithm Over Proteomics XML Resources
Towards a Query Rewriting Algorithm Over Proteomics XML ResourcesTowards a Query Rewriting Algorithm Over Proteomics XML Resources
Towards a Query Rewriting Algorithm Over Proteomics XML Resources
 
mlss
mlssmlss
mlss
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mapping
 
Swoogle: Showcasing the Significance of Semantic Search
Swoogle: Showcasing the Significance of Semantic SearchSwoogle: Showcasing the Significance of Semantic Search
Swoogle: Showcasing the Significance of Semantic Search
 

Andere mochten auch (6)

X Som Graduation Presentation
X Som   Graduation PresentationX Som   Graduation Presentation
X Som Graduation Presentation
 
Horizontal Integration of Big Intelligence Data
Horizontal Integration of Big Intelligence DataHorizontal Integration of Big Intelligence Data
Horizontal Integration of Big Intelligence Data
 
The Role Of Ontology In Modern Expert Systems Dallas 2008
The Role Of Ontology In Modern Expert Systems   Dallas   2008The Role Of Ontology In Modern Expert Systems   Dallas   2008
The Role Of Ontology In Modern Expert Systems Dallas 2008
 
8 ontology integration and interoperability (onto i op)
8 ontology integration and interoperability (onto i op)8 ontology integration and interoperability (onto i op)
8 ontology integration and interoperability (onto i op)
 
Enterprise and Data Mining Ontology Integration to Extract Actionable Knowled...
Enterprise and Data Mining Ontology Integration to Extract Actionable Knowled...Enterprise and Data Mining Ontology Integration to Extract Actionable Knowled...
Enterprise and Data Mining Ontology Integration to Extract Actionable Knowled...
 
Management Gurus
Management GurusManagement Gurus
Management Gurus
 

Ähnlich wie Ontology For Data Integration

A semantic framework and software design to enable the transparent integratio...
A semantic framework and software design to enable the transparent integratio...A semantic framework and software design to enable the transparent integratio...
A semantic framework and software design to enable the transparent integratio...
Patricia Tavares Boralli
 
An improved technique for ranking semantic associationst07
An improved technique for ranking semantic associationst07An improved technique for ranking semantic associationst07
An improved technique for ranking semantic associationst07
IJwest
 
Association Rule Mining Based Extraction of Semantic Relations Using Markov L...
Association Rule Mining Based Extraction of Semantic Relations Using Markov L...Association Rule Mining Based Extraction of Semantic Relations Using Markov L...
Association Rule Mining Based Extraction of Semantic Relations Using Markov L...
IJwest
 
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATIONONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
IJwest
 
Concept integration using edit distance and n gram match
Concept integration using edit distance and n gram match Concept integration using edit distance and n gram match
Concept integration using edit distance and n gram match
ijdms
 
Poster Abstracts
Poster AbstractsPoster Abstracts
Poster Abstracts
butest
 

Ähnlich wie Ontology For Data Integration (20)

An Incremental Method For Meaning Elicitation Of A Domain Ontology
An Incremental Method For Meaning Elicitation Of A Domain OntologyAn Incremental Method For Meaning Elicitation Of A Domain Ontology
An Incremental Method For Meaning Elicitation Of A Domain Ontology
 
A semantic framework and software design to enable the transparent integratio...
A semantic framework and software design to enable the transparent integratio...A semantic framework and software design to enable the transparent integratio...
A semantic framework and software design to enable the transparent integratio...
 
An improved technique for ranking semantic associationst07
An improved technique for ranking semantic associationst07An improved technique for ranking semantic associationst07
An improved technique for ranking semantic associationst07
 
Open Government Data on the Web - A Semantic Approach
Open Government Data on the Web - A Semantic ApproachOpen Government Data on the Web - A Semantic Approach
Open Government Data on the Web - A Semantic Approach
 
Association Rule Mining Based Extraction of Semantic Relations Using Markov ...
Association Rule Mining Based Extraction of  Semantic Relations Using Markov ...Association Rule Mining Based Extraction of  Semantic Relations Using Markov ...
Association Rule Mining Based Extraction of Semantic Relations Using Markov ...
 
Semantics in Financial Services -David Newman
Semantics in Financial Services -David NewmanSemantics in Financial Services -David Newman
Semantics in Financial Services -David Newman
 
Association Rule Mining Based Extraction of Semantic Relations Using Markov L...
Association Rule Mining Based Extraction of Semantic Relations Using Markov L...Association Rule Mining Based Extraction of Semantic Relations Using Markov L...
Association Rule Mining Based Extraction of Semantic Relations Using Markov L...
 
The basics of ontologies
The basics of ontologiesThe basics of ontologies
The basics of ontologies
 
ONTOLOGY-DRIVEN INFORMATION RETRIEVAL FOR HEALTHCARE INFORMATION SYSTEM : A C...
ONTOLOGY-DRIVEN INFORMATION RETRIEVAL FOR HEALTHCARE INFORMATION SYSTEM : A C...ONTOLOGY-DRIVEN INFORMATION RETRIEVAL FOR HEALTHCARE INFORMATION SYSTEM : A C...
ONTOLOGY-DRIVEN INFORMATION RETRIEVAL FOR HEALTHCARE INFORMATION SYSTEM : A C...
 
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATIONONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
 
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR  ONTOLOGY APPLICATION ONTOLOGY SERVICE CENTER: A DATAHUB FOR  ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
 
Semantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-WorldSemantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-World
 
The impact of standardized terminologies and domain-ontologies in multilingua...
The impact of standardized terminologies and domain-ontologies in multilingua...The impact of standardized terminologies and domain-ontologies in multilingua...
The impact of standardized terminologies and domain-ontologies in multilingua...
 
A Survey On Ontology Agent Based Distributed Data Mining
A Survey On Ontology Agent Based Distributed Data MiningA Survey On Ontology Agent Based Distributed Data Mining
A Survey On Ontology Agent Based Distributed Data Mining
 
Concept integration using edit distance and n gram match
Concept integration using edit distance and n gram match Concept integration using edit distance and n gram match
Concept integration using edit distance and n gram match
 
Neuroinformatics_Databses_Ontologies_Federated Database.pptx
Neuroinformatics_Databses_Ontologies_Federated Database.pptxNeuroinformatics_Databses_Ontologies_Federated Database.pptx
Neuroinformatics_Databses_Ontologies_Federated Database.pptx
 
Neuroinformatics Databases Ontologies Federated Database.pptx
Neuroinformatics Databases Ontologies Federated Database.pptxNeuroinformatics Databases Ontologies Federated Database.pptx
Neuroinformatics Databases Ontologies Federated Database.pptx
 
Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...
Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...
Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...
 
Poster Abstracts
Poster AbstractsPoster Abstracts
Poster Abstracts
 
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTION
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTIONA NAIVE METHOD FOR ONTOLOGY CONSTRUCTION
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTION
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 

Ontology For Data Integration

  • 1. Some Thoughts Juan Esteva, Ph. D. . 751 Malena Dr., Ann Arbor, MI 48103 Tel: 734-786-0233 Cell 734-277-4962 Fax 734-821-0235 SkypeDrEsteva juan.esteva@Ajatella.com Ontology Data Integration For Competitive Decision Making
  • 2. Not Just The Facts 3/4/2010 Juan Esteva, Ph. D. 2 “Good decisions are based on information that is analyzed and transformed into usable knowledge” Eileen Feretic
  • 3. Information at the point of impact 3/4/2010 Juan Esteva, Ph. D. 3 “Information needs to be at the point of impact—at the front lines where people are making decisions. The right analysis needs to be done at the right place. It’s important for organizations to treat information as a strategic asset in order to optimize every decision, every process, everything they do.” AmbujGoyal,
  • 4. Data in Silos 3/4/2010 Juan Esteva, Ph. D. 4 “One of the biggest challenges organizations face is the amount of data sitting in silos, too often, valuable data simply isn’t accessible or available.” Boris Evelson
  • 5. Business Decisions for Competitive Advantage 3/4/2010 Juan Esteva, Ph. D. 5 “In today’s troubled economy and competitive business environment, making good decisions is a matter of survival. But good decisions aren’t based on gut feeling alone. They should be based on information gathered from multiple sources, which is then synthesized and analyzed to generate a road map of options and possible outcomes that transform data into usable knowledge” Eileen Feretic
  • 6. Business Intelligence 3/4/2010 Juan Esteva, Ph. D. 6 Business Intelligence and now Business Analytics systems come into play [However,] it is hard to assemble [heterogeneous data and] disparate pieces of information in a way that provides the intelligence and insight needed to make good business decisions. Eileen Feretic Alas enter Ontology Data Integration.
  • 7. Data Integration 3/4/2010 Juan Esteva, Ph. D. 7 Data integration provides the ability to manipulate data transparently across multiple data sources. Based on the architecture there are 2 systems: Central Data Integration A central data integration system usually has a global schema, which provides the user with a uniform interface to access information stored in the data sources Peer-2-peer In contrast, in a peer-to-peer data integration system, there are no global points of control on the data sources (or peers). Instead, any peer can accept user queries for the information distributed in the whole system.
  • 8. Common Approaches for Data Integration 3/4/2010 Juan Esteva, Ph. D. 8 Global-as-View In the GaV approach, every entity in the global schema is associated with a view over the source local schema. Therefore querying strategies are simple, but the evolution of the local source schemas is not easily supported. Local-as-View On the contrary, the LaV approach permits changes to source schemas without affecting the global schema, since the local schemas are defined as views over the global schema, but query processing can be complex.
  • 9. Data Heterogeneity 3/4/2010 Juan Esteva, Ph. D. 9 Data sources can be heterogeneous in: Syntax Syntactic heterogeneity is caused by the use of different models or languages. Schema Schematic heterogeneity results from structural differences. Semantics Semantic heterogeneity is caused by different meanings or interpretations of data in various contexts To achieve data interoperability, the issues posed by data heterogeneity need to be eliminated
  • 10. Possible Solutions 3/4/2010 Juan Esteva, Ph. D. 10 The advent of XML has created a syntactic platform for Web data standardization and exchange. However, schematic data heterogeneity may persist, depending on the XML schemas used (e.g., nesting hierarchies). Likewise, semantic heterogeneity may persist even if both syntactic and schematic heterogeneities do not occur (e.g., naming concepts differently). We should be concerned with solving all three kinds of heterogeneities by bridging syntactic, schematic, and semantic heterogeneities across different sources.
  • 11. Semantic Data Integration Using Ontologies 3/4/2010 Juan Esteva, Ph. D. 11 We call semantic data integration the process of using a conceptual representation of the data and of their relationships to eliminate possible heterogeneities. At the heart of semantic data integration is the concept of ontology, which is an explicit specification of a shared conceptualization
  • 12. Ontology & Data Integration 3/4/2010 Juan Esteva, Ph. D. 12 Metadata Representation. Metadata (i.e., source schemas) in each data source can be explicitly represented by a local ontology, using a single language. Global Conceptualization. The global ontology provides a conceptual view over the schematically-heterogeneous source schemas. Support for High-level Queries. Given a high-level view of the sources, as provided by a global ontology, the user can formulate a query without specific knowledge of the different data sources. The query is then rewritten into queries over the sources, based on the semantic mappings between the global and local ontologies. Declarative Mediation. Query processing in a hybrid peer-to-peer system uses the global ontology as a declarative mediator for query rewriting between peers. Mapping Support. A thesaurus, formalized in terms of an ontology, can be used for the mapping process to facilitate its automation.
  • 13. What do we need? 3/4/2010 Juan Esteva, Ph. D. 13 Increase search capabilities From discovery to reasoning Increasing metadata as to provide strong semantics From glossaries to ontologies Consequently, moving from syntactic interoperability to structural interoperability and finally to semantic interoperability
  • 14. Graphically the model progression will be [2] 3/4/2010 Juan Esteva, Ph. D. 14 The point of this graph is that Increasing Metadata (from glossaries to ontologies) is highly correlated with Increasing Search Capability (from discovery to reasoning).
  • 15. Juan Esteva, Ph. D. 3/4/2010 15 References
  • 16. References 3/4/2010 Juan Esteva, Ph. D. 16 Applying 4D ontologies to Enterprise Architecture, Matthew West, Shell Corp. FHA Data Architecture Working Group: SICoP DRM 2.0 Pilot, 2005 The Role of Ontologies in Data Integration, Isabel F. Cruz Huiyong Xiao
  • 17. Topic Maps 3/4/2010 Juan Esteva, Ph. D. 17 Topic Maps is a standard for the representation and interchange of knowledge, with an emphasis on the findability of information. The ISO standard is formally known as ISO/IEC 13250:2003. A topic map represents information using topics (representing any concept, from people, countries, and organizations to software modules, individual files, and events), associations (representing the relationships between topics), and occurrences (representing information resources relevant to a particular topic).
  • 18. SKOS 3/4/2010 Juan Esteva, Ph. D. 18 Simple Knowledge Organization System (SKOS) SKOS is a common data model for sharing and linking knowledge organization systems via the Web.
  • 19. RDF 3/4/2010 Juan Esteva, Ph. D. 19 Resource Description Language RDF RDF is a standard model for data interchange on the Web. RDF has features that facilitate data merging even if the underlying schemas differ, and it specifically supports the evolution of schemas over time without requiring all the data consumers to be changed.
  • 20. OWL 3/4/2010 Juan Esteva, Ph. D. 20 Web Ontology Language OWL is a Semantic Web language designed to represent rich and complex knowledge about things, groups of things, and relations between things. OWL is a computational logic-based language such that knowledge expressed in OWL can be reasoned with by computer programs either to verify the consistency of that knowledge or to make implicit knowledge explicit. OWL documents, known as ontologies, can be published in the World Wide Web and may refer to or be referred from other OWL ontologies. OWL is part of the W3C’s Semantic Web technology stack, which includes RDF, RDFS, SPARQL, etc.