Suche senden
Hochladen
Database , 4 Data Integration
ā¢
Als PPTX, PDF herunterladen
ā¢
3 gefƤllt mir
ā¢
4,557 views
A
Ali Usman
Folgen
Technologie
Melden
Teilen
Melden
Teilen
1 von 31
Jetzt herunterladen
Empfohlen
Introduction to distributed database
Introduction to distributed database
Sonia Panesar
Ā
DDBMS Paper with Solution
DDBMS Paper with Solution
Gyanmanjari Institute Of Technology
Ā
Distributed DBMS - Unit 6 - Query Processing
Distributed DBMS - Unit 6 - Query Processing
Gyanmanjari Institute Of Technology
Ā
Distributed Database System
Distributed Database System
Sulemang
Ā
Distributed Database Management System
Distributed Database Management System
AAKANKSHA JAIN
Ā
Distributed DBMS - Unit 3 - Distributed DBMS Architecture
Distributed DBMS - Unit 3 - Distributed DBMS Architecture
Gyanmanjari Institute Of Technology
Ā
The CAP Theorem
The CAP Theorem
Aleksandar Bradic
Ā
Query trees
Query trees
Shefa Idrees
Ā
Empfohlen
Introduction to distributed database
Introduction to distributed database
Sonia Panesar
Ā
DDBMS Paper with Solution
DDBMS Paper with Solution
Gyanmanjari Institute Of Technology
Ā
Distributed DBMS - Unit 6 - Query Processing
Distributed DBMS - Unit 6 - Query Processing
Gyanmanjari Institute Of Technology
Ā
Distributed Database System
Distributed Database System
Sulemang
Ā
Distributed Database Management System
Distributed Database Management System
AAKANKSHA JAIN
Ā
Distributed DBMS - Unit 3 - Distributed DBMS Architecture
Distributed DBMS - Unit 3 - Distributed DBMS Architecture
Gyanmanjari Institute Of Technology
Ā
The CAP Theorem
The CAP Theorem
Aleksandar Bradic
Ā
Query trees
Query trees
Shefa Idrees
Ā
Distributed DBMS - Unit 8 - Distributed Transaction Management & Concurrency ...
Distributed DBMS - Unit 8 - Distributed Transaction Management & Concurrency ...
Gyanmanjari Institute Of Technology
Ā
Lec 7 query processing
Lec 7 query processing
Md. Mashiur Rahman
Ā
Query Decomposition and data localization
Query Decomposition and data localization
Hafiz faiz
Ā
Lecture 1 ddbms
Lecture 1 ddbms
Mangesh Wanjari
Ā
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
Ramakant Soni
Ā
Distributed database
Distributed database
ReachLocal Services India
Ā
Distributed file system
Distributed file system
Anamika Singh
Ā
Database , 8 Query Optimization
Database , 8 Query Optimization
Ali Usman
Ā
Ddbms1
Ddbms1
pranjal_das
Ā
Unit 3
Unit 3
Ravi Kumar
Ā
Distributed DBMS - Unit 1 - Introduction
Distributed DBMS - Unit 1 - Introduction
Gyanmanjari Institute Of Technology
Ā
management of distributed transactions
management of distributed transactions
Nilu Desai
Ā
Concurrency Control in Distributed Database.
Concurrency Control in Distributed Database.
Meghaj Mallick
Ā
Data preparation
Data preparation
Tony Nguyen
Ā
Ddb 1.6-design issues
Ddb 1.6-design issues
Esar Qasmi
Ā
Distributed Query Processing
Distributed Query Processing
Mythili Kannan
Ā
Object Oriented Database Management System
Object Oriented Database Management System
Ajay Jha
Ā
Query processing and optimization (updated)
Query processing and optimization (updated)
Ravinder Kamboj
Ā
Object oriented databases
Object oriented databases
Sajith Ekanayaka
Ā
20. Parallel Databases in DBMS
20. Parallel Databases in DBMS
koolkampus
Ā
Jarrar: Data Schema Integration
Jarrar: Data Schema Integration
Mustafa Jarrar
Ā
Data integration
Data integration
Umar Alharaky
Ā
Weitere Ƥhnliche Inhalte
Was ist angesagt?
Distributed DBMS - Unit 8 - Distributed Transaction Management & Concurrency ...
Distributed DBMS - Unit 8 - Distributed Transaction Management & Concurrency ...
Gyanmanjari Institute Of Technology
Ā
Lec 7 query processing
Lec 7 query processing
Md. Mashiur Rahman
Ā
Query Decomposition and data localization
Query Decomposition and data localization
Hafiz faiz
Ā
Lecture 1 ddbms
Lecture 1 ddbms
Mangesh Wanjari
Ā
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
Ramakant Soni
Ā
Distributed database
Distributed database
ReachLocal Services India
Ā
Distributed file system
Distributed file system
Anamika Singh
Ā
Database , 8 Query Optimization
Database , 8 Query Optimization
Ali Usman
Ā
Ddbms1
Ddbms1
pranjal_das
Ā
Unit 3
Unit 3
Ravi Kumar
Ā
Distributed DBMS - Unit 1 - Introduction
Distributed DBMS - Unit 1 - Introduction
Gyanmanjari Institute Of Technology
Ā
management of distributed transactions
management of distributed transactions
Nilu Desai
Ā
Concurrency Control in Distributed Database.
Concurrency Control in Distributed Database.
Meghaj Mallick
Ā
Data preparation
Data preparation
Tony Nguyen
Ā
Ddb 1.6-design issues
Ddb 1.6-design issues
Esar Qasmi
Ā
Distributed Query Processing
Distributed Query Processing
Mythili Kannan
Ā
Object Oriented Database Management System
Object Oriented Database Management System
Ajay Jha
Ā
Query processing and optimization (updated)
Query processing and optimization (updated)
Ravinder Kamboj
Ā
Object oriented databases
Object oriented databases
Sajith Ekanayaka
Ā
20. Parallel Databases in DBMS
20. Parallel Databases in DBMS
koolkampus
Ā
Was ist angesagt?
(20)
Distributed DBMS - Unit 8 - Distributed Transaction Management & Concurrency ...
Distributed DBMS - Unit 8 - Distributed Transaction Management & Concurrency ...
Ā
Lec 7 query processing
Lec 7 query processing
Ā
Query Decomposition and data localization
Query Decomposition and data localization
Ā
Lecture 1 ddbms
Lecture 1 ddbms
Ā
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
Ā
Distributed database
Distributed database
Ā
Distributed file system
Distributed file system
Ā
Database , 8 Query Optimization
Database , 8 Query Optimization
Ā
Ddbms1
Ddbms1
Ā
Unit 3
Unit 3
Ā
Distributed DBMS - Unit 1 - Introduction
Distributed DBMS - Unit 1 - Introduction
Ā
management of distributed transactions
management of distributed transactions
Ā
Concurrency Control in Distributed Database.
Concurrency Control in Distributed Database.
Ā
Data preparation
Data preparation
Ā
Ddb 1.6-design issues
Ddb 1.6-design issues
Ā
Distributed Query Processing
Distributed Query Processing
Ā
Object Oriented Database Management System
Object Oriented Database Management System
Ā
Query processing and optimization (updated)
Query processing and optimization (updated)
Ā
Object oriented databases
Object oriented databases
Ā
20. Parallel Databases in DBMS
20. Parallel Databases in DBMS
Ā
Andere mochten auch
Jarrar: Data Schema Integration
Jarrar: Data Schema Integration
Mustafa Jarrar
Ā
Data integration
Data integration
Umar Alharaky
Ā
Introduction to ETL and Data Integration
Introduction to ETL and Data Integration
CloverDX (formerly known as CloverETL)
Ā
Data Integration (ETL)
Data Integration (ETL)
easysoft
Ā
DBMS Canonical cover
DBMS Canonical cover
Saurabh Tandel
Ā
Data integration ppt-bhawani nandan prasad - iim calcutta
Data integration ppt-bhawani nandan prasad - iim calcutta
Bhawani N Prasad
Ā
Database ,7 query localization
Database ,7 query localization
Ali Usman
Ā
Database, 3 Distribution Design
Database, 3 Distribution Design
Ali Usman
Ā
Database ,11 Concurrency Control
Database ,11 Concurrency Control
Ali Usman
Ā
Database , 15 Object DBMS
Database , 15 Object DBMS
Ali Usman
Ā
Database ,18 Current Issues
Database ,18 Current Issues
Ali Usman
Ā
Database ,2 Background
Database ,2 Background
Ali Usman
Ā
Database , 6 Query Introduction
Database , 6 Query Introduction
Ali Usman
Ā
Pal gov.tutorial2.session13 1.data schema integration
Pal gov.tutorial2.session13 1.data schema integration
Mustafa Jarrar
Ā
test
test
eduard_c
Ā
Modul 04 ta1_ metodologi penelitian
Modul 04 ta1_ metodologi penelitian
Fokgusta
Ā
Media ajarelektronik
Media ajarelektronik
Fokgusta
Ā
Processor Specifications
Processor Specifications
Ali Usman
Ā
SysML as a Common Integration Platform for Co-Simulations ā Example of a Cybe...
SysML as a Common Integration Platform for Co-Simulations ā Example of a Cybe...
Andrey Sadovykh
Ā
Pal gov.tutorial2.session15 1.linkeddata
Pal gov.tutorial2.session15 1.linkeddata
Mustafa Jarrar
Ā
Andere mochten auch
(20)
Jarrar: Data Schema Integration
Jarrar: Data Schema Integration
Ā
Data integration
Data integration
Ā
Introduction to ETL and Data Integration
Introduction to ETL and Data Integration
Ā
Data Integration (ETL)
Data Integration (ETL)
Ā
DBMS Canonical cover
DBMS Canonical cover
Ā
Data integration ppt-bhawani nandan prasad - iim calcutta
Data integration ppt-bhawani nandan prasad - iim calcutta
Ā
Database ,7 query localization
Database ,7 query localization
Ā
Database, 3 Distribution Design
Database, 3 Distribution Design
Ā
Database ,11 Concurrency Control
Database ,11 Concurrency Control
Ā
Database , 15 Object DBMS
Database , 15 Object DBMS
Ā
Database ,18 Current Issues
Database ,18 Current Issues
Ā
Database ,2 Background
Database ,2 Background
Ā
Database , 6 Query Introduction
Database , 6 Query Introduction
Ā
Pal gov.tutorial2.session13 1.data schema integration
Pal gov.tutorial2.session13 1.data schema integration
Ā
test
test
Ā
Modul 04 ta1_ metodologi penelitian
Modul 04 ta1_ metodologi penelitian
Ā
Media ajarelektronik
Media ajarelektronik
Ā
Processor Specifications
Processor Specifications
Ā
SysML as a Common Integration Platform for Co-Simulations ā Example of a Cybe...
SysML as a Common Integration Platform for Co-Simulations ā Example of a Cybe...
Ā
Pal gov.tutorial2.session15 1.linkeddata
Pal gov.tutorial2.session15 1.linkeddata
Ā
Ćhnlich wie Database , 4 Data Integration
Database ,16 P2P
Database ,16 P2P
Ali Usman
Ā
Database , 17 Web
Database , 17 Web
Ali Usman
Ā
1 introduction
1 introduction
Amrit Kaur
Ā
6-Query_Intro (5).pdf
6-Query_Intro (5).pdf
JaveriaShoaib4
Ā
Nosql
Nosql
Roxana Tadayon
Ā
Nosql
Nosql
ROXTAD71
Ā
[Mas 500] Data Basics
[Mas 500] Data Basics
rahulbot
Ā
1 introduction DDBS
1 introduction DDBS
naimanighat
Ā
Database , 1 Introduction
Database , 1 Introduction
Ali Usman
Ā
DDBS PPT (1).pptx
DDBS PPT (1).pptx
HarshitSingh334328
Ā
Dunsire roadmap meeting proposal
Dunsire roadmap meeting proposal
National Information Standards Organization (NISO)
Ā
Top 5-nosql
Top 5-nosql
Mehul Jariwala
Ā
Selecting the right database type for your knowledge management needs.
Selecting the right database type for your knowledge management needs.
Synaptica, LLC
Ā
DBMS outline.pptx
DBMS outline.pptx
DrThenmozhiKarunanit
Ā
NoSql
NoSql
AnitaSenthilkumar
Ā
01-Database Administration and Management.pdf
01-Database Administration and Management.pdf
TOUSEEQHAIDER14
Ā
OpenLSH - a framework for locality sensitive hashing
OpenLSH - a framework for locality sensitive hashing
J Singh
Ā
1 introduction ddbms
1 introduction ddbms
amna izzat
Ā
Nosql
Nosql
Muluken Sholaye Tesfaye
Ā
Info systems databases
Info systems databases
MR Z
Ā
Ćhnlich wie Database , 4 Data Integration
(20)
Database ,16 P2P
Database ,16 P2P
Ā
Database , 17 Web
Database , 17 Web
Ā
1 introduction
1 introduction
Ā
6-Query_Intro (5).pdf
6-Query_Intro (5).pdf
Ā
Nosql
Nosql
Ā
Nosql
Nosql
Ā
[Mas 500] Data Basics
[Mas 500] Data Basics
Ā
1 introduction DDBS
1 introduction DDBS
Ā
Database , 1 Introduction
Database , 1 Introduction
Ā
DDBS PPT (1).pptx
DDBS PPT (1).pptx
Ā
Dunsire roadmap meeting proposal
Dunsire roadmap meeting proposal
Ā
Top 5-nosql
Top 5-nosql
Ā
Selecting the right database type for your knowledge management needs.
Selecting the right database type for your knowledge management needs.
Ā
DBMS outline.pptx
DBMS outline.pptx
Ā
NoSql
NoSql
Ā
01-Database Administration and Management.pdf
01-Database Administration and Management.pdf
Ā
OpenLSH - a framework for locality sensitive hashing
OpenLSH - a framework for locality sensitive hashing
Ā
1 introduction ddbms
1 introduction ddbms
Ā
Nosql
Nosql
Ā
Info systems databases
Info systems databases
Ā
Mehr von Ali Usman
Cisco Packet Tracer Overview
Cisco Packet Tracer Overview
Ali Usman
Ā
Islamic Arts and Architecture
Islamic Arts and Architecture
Ali Usman
Ā
Database ,14 Parallel DBMS
Database ,14 Parallel DBMS
Ali Usman
Ā
Database , 13 Replication
Database , 13 Replication
Ali Usman
Ā
Database , 12 Reliability
Database , 12 Reliability
Ali Usman
Ā
Database ,10 Transactions
Database ,10 Transactions
Ali Usman
Ā
Database , 5 Semantic
Database , 5 Semantic
Ali Usman
Ā
Processor Specifications
Processor Specifications
Ali Usman
Ā
Fifty Year Of Microprocessor
Fifty Year Of Microprocessor
Ali Usman
Ā
Discrete Structures lecture 2
Discrete Structures lecture 2
Ali Usman
Ā
Discrete Structures. Lecture 1
Discrete Structures. Lecture 1
Ali Usman
Ā
Muslim Contributions in Medicine-Geography-Astronomy
Muslim Contributions in Medicine-Geography-Astronomy
Ali Usman
Ā
Muslim Contributions in Geography
Muslim Contributions in Geography
Ali Usman
Ā
Muslim Contributions in Astronomy
Muslim Contributions in Astronomy
Ali Usman
Ā
Ptcl modem (user manual)
Ptcl modem (user manual)
Ali Usman
Ā
Nimat-ul-ALLAH shah wali
Nimat-ul-ALLAH shah wali
Ali Usman
Ā
Muslim Contributions in Mathematics
Muslim Contributions in Mathematics
Ali Usman
Ā
Osi protocols
Osi protocols
Ali Usman
Ā
Mehr von Ali Usman
(18)
Cisco Packet Tracer Overview
Cisco Packet Tracer Overview
Ā
Islamic Arts and Architecture
Islamic Arts and Architecture
Ā
Database ,14 Parallel DBMS
Database ,14 Parallel DBMS
Ā
Database , 13 Replication
Database , 13 Replication
Ā
Database , 12 Reliability
Database , 12 Reliability
Ā
Database ,10 Transactions
Database ,10 Transactions
Ā
Database , 5 Semantic
Database , 5 Semantic
Ā
Processor Specifications
Processor Specifications
Ā
Fifty Year Of Microprocessor
Fifty Year Of Microprocessor
Ā
Discrete Structures lecture 2
Discrete Structures lecture 2
Ā
Discrete Structures. Lecture 1
Discrete Structures. Lecture 1
Ā
Muslim Contributions in Medicine-Geography-Astronomy
Muslim Contributions in Medicine-Geography-Astronomy
Ā
Muslim Contributions in Geography
Muslim Contributions in Geography
Ā
Muslim Contributions in Astronomy
Muslim Contributions in Astronomy
Ā
Ptcl modem (user manual)
Ptcl modem (user manual)
Ā
Nimat-ul-ALLAH shah wali
Nimat-ul-ALLAH shah wali
Ā
Muslim Contributions in Mathematics
Muslim Contributions in Mathematics
Ā
Osi protocols
Osi protocols
Ā
KĆ¼rzlich hochgeladen
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
The Digital Insurer
Ā
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
Ā
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
V3cube
Ā
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
hans926745
Ā
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
Results
Ā
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
Paola De la Torre
Ā
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc
Ā
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
Delhi Call girls
Ā
š¬ The future of MySQL is Postgres š
š¬ The future of MySQL is Postgres š
RTylerCroy
Ā
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Neo4j
Ā
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
Puma Security, LLC
Ā
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
Malak Abu Hammad
Ā
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
Principled Technologies
Ā
WhatsApp 9892124323 āCall Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 āCall Girls In Kalyan ( Mumbai ) secure service
Pooja Nehwal
Ā
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
Anna Loughnan Colquhoun
Ā
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
naman860154
Ā
How to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
naman860154
Ā
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
Delhi Call girls
Ā
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
Ā
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
The Digital Insurer
Ā
KĆ¼rzlich hochgeladen
(20)
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
Ā
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
Ā
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
Ā
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
Ā
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
Ā
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
Ā
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
Ā
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
Ā
š¬ The future of MySQL is Postgres š
š¬ The future of MySQL is Postgres š
Ā
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Ā
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
Ā
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
Ā
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
Ā
WhatsApp 9892124323 āCall Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 āCall Girls In Kalyan ( Mumbai ) secure service
Ā
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
Ā
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
Ā
How to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
Ā
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
Ā
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Ā
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Ā
Database , 4 Data Integration
1.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/1 Outline ā¢ Introduction ā¢ Background ā¢ Distributed Database Design ā¢ Database Integration ā” Schema Matching ā” Schema Mapping ā¢ Semantic Data Control ā¢ Distributed Query Processing ā¢ Multimedia Query Processing ā¢ Distributed Transaction Management ā¢ Data Replication ā¢ Parallel Database Systems ā¢ Distributed Object DBMS ā¢ Peer-to-Peer Data Management ā¢ Web Data Management ā¢ Current Issues
2.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/2 Problem Definition ā¢ Given existing databases with their Local Conceptual Schemas (LCSs), how to integrate the LCSs into a Global Conceptual Schema (GCS) ā” GCS is also called mediated schema ā¢ Bottom-up design process
3.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/3 Integration Alternatives ā¢ Physical integration ā” Source databases integrated and the integrated database is materialized ā” Data warehouses ā¢ Logical integration ā” Global conceptual schema is virtual and not materialized ā” Enterprise Information Integration (EII)
4.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/4 Data Warehouse Approach
5.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/5 Bottom-up Design ā¢ GCS (also called mediated schema) is defined first ā” Map LCSs to this schema ā” As in data warehouses ā¢ GCS is defined as an integration of parts of LCSs ā” Generate GCS and map LCSs to this GCS
6.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/6 GCS/LCS Relationship ā¢ Local-as-view ā” The GCS definition is assumed to exist, and each LCS is treated as a view definition over it ā¢ Global-as-view ā” The GCS is defined as a set of views over the LCSs
7.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/7 Database Integration Process
8.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/8 Recall Access Architecture
9.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/9 Database Integration Issues ā¢ Schema translation ā” Component database schemas translated to a common intermediate canonical representation ā¢ Schema generation ā” Intermediate schemas are used to create a global conceptual schema
10.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/10 Schema Translation ā¢ What is the canonical data model? ā” Relational ā” Entity-relationship ā¦ DIKE ā” Object-oriented ā¦ ARTEMIS ā” Graph-oriented ā¦ DIPE, TranScm, COMA, Cupid ā¦ Preferable with emergence of XML ā¦ No common graph formalism ā¢ Mapping algorithms ā” These are well-known
11.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/11 Schema Generation ā¢ Schema matching ā” Finding the correspondences between multiple schemas ā¢ Schema integration ā” Creation of the GCS (or mediated schema) using the correspondences ā¢ Schema mapping ā” How to map data from local databases to the GCS ā¢ Important: sometimes the GCS is defined first and schema matching and schema mapping is done against this target GCS
12.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/12 Running Example EMP(ENO, ENAME, TITLE) PROJ(PNO, PNAME, BUDGET, LOC, CNAME) ASG(ENO, PNO, RESP, DUR) PAY(TITLE, SAL) Relational E-R Model
13.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/13 Schema Matching ā¢ Schema heterogeneity ā” Structural heterogeneity ā¦ Type conflicts ā¦ Dependency conflicts ā¦ Key conflicts ā¦ Behavioral conflicts ā” Semantic heterogeneity ā¦ More important and harder to deal with ā¦ Synonyms, homonyms, hypernyms ā¦ Different ontology ā¦ Imprecise wording
14.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/14 Schema Matching (contād) ā¢ Other complications ā” Insufficient schema and instance information ā” Unavailability of schema documentation ā” Subjectivity of matching ā¢ Issues that affect schema matching ā” Schema versus instance matching ā” Element versus structure level matching ā” Matching cardinality
15.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/15 Schema Matching Approaches
16.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/16 Linguistic Schema Matching ā¢ Use element names and other textual information (textual descriptions, annotations) ā¢ May use external sources (e.g., Thesauri) ā¢ ćSC1.element-1 ā SC2.element-2, p,sć ā” Element-1 in schema SC1 is similar to element-2 in schema SC2 if predicate p holds with a similarity value of s ā¢ Schema level ā” Deal with names of schema elements ā” Handle cases such as synonyms, homonyms, hypernyms, data type similarities ā¢ Instance level ā” Focus on information retrieval techniques (e.g., word frequencies, key terms) ā” āDeduceā similarities from these
17.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/17 Linguistic Matchers ā¢ Use a set of linguistic (terminological) rules ā¢ Basic rules can be hand-crafted or may be discovered from outside sources (e.g., WordNet) ā¢ Predicate p and similarity value s ā” hand-crafted ā specified, ā” discovered ā may be computed or specified by an expert after discovery ā¢ Examples ā” ćuppercase names ā lower case names, true, 1.0ć ā” ćuppercase names ā capitalized names, true, 1.0ć ā” ćcapitalized names ā lower case names, true, 1.0ć ā” ćDB1.ASG ā DB2.WORKS_IN, true, 0.8ć
18.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/18 Automatic Discovery of Name Similarities ā¢ Affixes ā” Common prefixes and suffixes between two element name strings ā¢ N-grams ā” Comparing how many substrings of length n are common between the two name strings ā¢ Edit distance ā” Number of character modifications (additions, deletions, insertions) that needs to be performed to convert one string into the other ā¢ Soundex code ā” Phonetic similarity between names based on their soundex codes ā¢ Also look at data types ā” Data type similarity may suggest stronger relationship than the computed similarity using these methods or to differentiate between multiple strings with same value
19.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/19 N-gram Example ā¢ 3-grams of string āResponsibilityā are the following: ļ¬Res ļ¬ sib ļ¬ibi ļ¬ esp ļ¬bip ļ¬ spo ļ¬ili ļ¬ pon ļ¬lit ļ¬ ons ļ¬ity ļ¬ nsi ā¢ 3-grams of string āRespā are ā” Res ā” esp ā¢ 3-gram similarity: 2/12 = 0.17
20.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/20 Edit Distance Example ā¢ Again consider āResponsibilityā and āRespā ā¢ To convert āResponsibilityā to āRespā ā” Delete characters āoā, ānā, āsā, āiā, ābā, āiā, ālā, āiā, ātā, āyā ā¢ To convert āRespā to āResponsibilityā ā” Add characters āoā, ānā, āsā, āiā, ābā, āiā, ālā, āiā, ātā, āyā ā¢ The number of edit operations required is 10 ā¢ Similarity is 1 ā (10/14) = 0.29
21.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/21 Constraint-based Matchers ā¢ Data always have constraints ā use them ā” Data type information ā” Value ranges ā” ā¦ ā¢ Examples ā” RESP and RESPONSIBILITY: n-gram similarity = 0.17, edit distance similarity = 0.19 (low) ā” If they come from the same domain, this may increase their similarity value ā” ENO in relational, WORKER.NUMBER and PROJECT.NUMBER in E-R ā” ENO and WORKER.NUMBER may have type INTEGER while PROJECT.NUMBER may have STRING
22.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/22 Constraint-based Structural Matching ā¢ If two schema elements are structurally similar, then there is a higher likelihood that they represent the same concept ā¢ Structural similarity: ā” Same properties (attributes) ā” āNeighborhoodā similarity ā¦ Using graph representation ā¦ The set of nodes that can be reached within a particular path length from a node are the neighbors of that node ā¦ If two concepts (nodes) have similar set of neighbors, they are likely to represent the same concept
23.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/23 Learning-based Schema Matching ā¢ Use machine learning techniques to determine schema matches ā¢ Classification problem: classify concepts from various schemas into classes according to their similarity. Those that fall into the same class represent similar concepts ā¢ Similarity is defined according to features of data instances ā¢ Classification is ālearnedā from a training set
24.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/24 Learning-based Schema Matching
25.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/25 Combined Schema Matching Approaches ā¢ Use multiple matchers ā” Each matcher focuses on one area (name, etc) ā¢ Meta-matcher integrates these into one prediction ā¢ Integration may be simple (take average of similarity values) or more complex (see Faginās work)
26.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/26 Schema Integration ā¢ Use the correspondences to create a GCS ā¢ Mainly a manual process, although rules can help
27.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/27 Binary Integration Methods
28.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/28 N-ary Integration Methods
29.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/29 Schema Mapping ā¢ Mapping data from each local database (source) to GCS (target) while preserving semantic consistency as defined in both source and target. ā¢ Data warehouses ā actual translation ā¢ Data integration systems ā discover mappings that can be used in the query processing phase ā¢ Mapping creation ā¢ Mapping maintenance
30.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/30 Mapping Creation Given ā” A source LCS ā” A target GCS ā” A set of value correspondences discovered during schema matching phase Produce a set of queries that, when executed, will create GCS data instances from the source data. We are looking, for each Tk, a query Qk that is defined on a (possibly proper) subset of the relations in S such that, when executed, will generate data for Ti from the source relations
31.
Distributed DBMS Ā©
M. T. Ćzsu & P. Valduriez Ch.4/31 Mapping Creation Algorithm General idea: ā¢ Consider each Tk in turn. Divide Vk into subsets such that each specifies one possible way that values of Tk can be computed. ā¢ Each can be mapped to a query that, when executed, would generate some of Tkās data. ā¢ Union of these queries gives
Jetzt herunterladen