Distributed database

ReachLocal Services India
ReachLocal Services IndiaWeb Analyst at ReachLocal Services um ReachLocal Services India
DISTRIBUTED DATABASE




           By-Bharat P. Patil
              Bihag Mehta
Distributed Database
 Database:- Logical interrelated
 collection of shared data, along with
 description of data, physically
 distributed over a computer network.
What is Distributed Database?
• A distributed database (DDB) is a
collection of multiple, logically
interrelated databases distributed over a
computer network.

• A distributed database management
system (DDBMS) is the software that
manages the DDB and provides an access
mechanism that makes this distribution
transparent to the users
Distributed database
 A DDBMS mainly classified into two
 types:

 ◦ Homogeneous Distributed database
   management systems

 ◦ Heterogeneous Distributed database
   management systems
Characteristics
 All sites are interconnected.
 Fragments can be replicated.
 Logically related shared data can be
  collected.
 Data at each and every site is controlled
  by the DBMS.
 Each Distributed Database Management
  System takes part in at least one global
  application.
Functionality

 Security
 Keeping track of data
 Replicated data management
 System catalog management
 Distributed transaction management
 Distributed database recovery
Homogeneous DDBMS
 In a homogeneous distributed database all
  sites have identical software and are
  aware of each other and agree to
  cooperate in processing user requests.
 The homogeneous system is much easier
  to design and manage
 The operating system used, at each
  location must be same or compatible.
 The database application (or DBMS) used
  at each location must be same or
  compatible.
Heterogeneous DDBMS

 In a heterogeneous distributed database
  different sites may use different schema
  and software.
 In heterogeneous systems, different nodes
  may have different hardware & software
  and data structures at various nodes or
  locations are also incompatible.
 Different computers and operating
  systems, database applications or data
  models may be used at each of the
  locations.
Heterogeneous DDBMS (contd..)
 On heterogeneous system, translations
  are required to allow communication
  between different sites (or DBMS).
 The heterogeneous system is often not
  technically or economically feasible. In
  this system, a user at one location may
  be able to read but not update the data
  at another location.
Advantages
 Less danger of a single-point failure.
  When one of the computers fails, the
  workload is picked up by other
  workstations.
 Data are also distributed at multiple
  sites.
 The end user is able to access any
  available copy of the data, and an end
  user's request is processed by any
  processor at the data location.
Advantages (contd..)
 Improved communications. Because local
  sites are smaller and located closer to
  customers.
 Reduced operating costs. It is more cost-
  effective to add workstations to a network
  than to update a mainframe system.
 Faster data access, faster data processing.
 A distributed database system spreads out
  the systems workload by processing data
  at several sites.
Disadvantages

 Complexity of management and control.
 Applications must recognize data
  location, and they must be able to stitch
  together data from various sites.
 Security.
Disadvantages (contd..)
 Increased storage and infrastructure
  requirements.
 Multiple copies of data has to be at
  different sites, thus an additional disk
  storage space will be required.
 The probability of security lapses
  increases when data are located at
  multiple sites.
What is Parallel database...??
   A parallel database system is to improve
    performance through parallelization of various
    operations, such as loading data, building
    indexes and evaluating queries.
   The distribution is solely done on the bases of
    performance.
   Parallel databases improve processing and
    input/output speeds by using multiple CPUs
    and disks in parallel.
   Many operations are performed
    simultaneously
   Data may be stored in a distributed fashion.
Difference b/w Distributed
 Database and Parallel Database
Characteristics      Parallel Database          Distributed database
  Definition      It is a software system      It is a software system that
                  where multiple               manages multiple logically
                  processors or machines are   interrelated databases
                  used to                      distributed over a computer
                  execute and run queries in   network.
                  parallel.
 Geographical     The nodes are located at     The nodes are usually
   Location       geographically               located at geographically
                  same location.               different locations.

   Execution      Quicker                      Slower
     Speed


   Overhead       Less                         More
Node types    Compulsorily Homogeneous Need not be homogeneous


Performance   Lower reliability &          Higher reliability &
              availability.                availability.


  Scope of    Difficult to expand          Easier to expand
 Expansion


  Backup      Backup at one site only      Backup at multiple sites

Consistency   Maintaining consistency is   Maintaining consistency is
              easier                       difficult.
Data fragmentation
 Fragmentation is a process of division or the
  mapping of the tables based on the columns
  and rows of data into the smallest unit of data.
 Data that has broken down is still possible to
  be combined again with the intention to
  complete the data collection using
  fragmentation.
 Fragmentation is a database server feature that
  allows you to control where data is stored at
  the table level.
 Fragmentation enables you to define groups of
  rows or index keys within a table.
Replication
   Replication is that we store several copies
    of a relation or relation fragment. An
    entire relation can be replicated at one or
    more sites.

   Similarly, one or more fragments of a
    relation can be replicated at other sites.

    For example, if a relation R is fragmented
    into R1,R2, and R3, there might be just one
    copy of R1, whereas R2 is replicated at two
    other sites and R3 is replicated at all sites.
Two Fold Replication
The motivation for replication is twofold:
1. Increased Availability of Data: If a site
   that contains a replica goes down, we can
   find the same data at other sites.
   Similarly, if local copies of remote
   relations are available, we are less
   vulnerable to failure of communication
   links.
2. Faster Query Evaluation: Queries can
   execute faster by using a local copy of a
   relation instead of going to a remote site.
Distributed Transaction
 In a distributed DBMS, a given transaction
  is submitted at some one site, but it can
  access data at other sites as well.
 When a transaction is submitted at some
  site, the transaction manager at that site
  breaks it up into a collection of one or
  more sub-transactions that execute at
  different sites, submits them to
  transaction managers at the other sites,
  and coordinates their activity.
   Distributed Concurrency Control: How can
    locks for objects stored across several sites
    be managed?

   Distributed Recovery: Transaction
    atomicity must be ensured when a
    transaction commits, all its actions, across
    all the sites at which it executes, must
    persist. Similarly, when a transaction
    aborts, none of its actions must be allowed
    to persist.
Distributed Concurrency Control
   The choice of technique determines
    which objects are to be locked. When
    locks are obtained and released is
    determined by the concurrency control
    protocol. We now consider how lock
    and unlock requests are implemented
    in a distributed environment. Lock
    management can be distributed across
    sites in many ways:
 Centralized : A single site is in charge of
  handling lock and unlock requests for all
  objects.
 Primary Copy: One copy of each object is
  designated the primary copy. All requests to
  lock or unlock a copy of this object are
  handled by the lock manager at the site
  where the primary copy is stored,
  regardless of where the copy itself is stored.
 Fully Distributed : Requests to lock or
  unlock a copy of an object stored at a site
  are handled by the lock manager at the site
  where the copy is stored.
DISTRIBUTED RECOVERY
   Recovery in a distributed DBMS is more
    complicated than in a centralized DBMS
    for the following reasons:
    ◦ New kinds of failure can arise : Failure of
      communication links and failure of a remote
      site at which a sub-transaction is executing.

    ◦ Either all sub-transactions of a given
      transaction must commit or none must
      commit, and this property must be guaranteed
      despite any combination of site and link
      failures. This guarantee is achieved using a
      commit protocol.
Concepts Of Locks
 A lock is used when multiple users need to
  access a database concurrently. This prevents
  data from being corrupted or invalidated when
  multiple users try to write to the database.
 Any single user can only modify those database
  records (that is, items in the database) to which
  they have applied a lock that gives them exclusive
  access to the record until the lock is released.
  Locking not only provides exclusivity to write but
  also prevents (or controls) reading of unfinished
  modifications.
Distributed database
1 von 27

Recomendados

Relational algebra ppt von
Relational algebra pptRelational algebra ppt
Relational algebra pptGirdharRatne
19.6K views17 Folien
Concurrency Control in Database Management System von
Concurrency Control in Database Management SystemConcurrency Control in Database Management System
Concurrency Control in Database Management SystemJanki Shah
14.4K views21 Folien
Distributed database management system von
Distributed database management  systemDistributed database management  system
Distributed database management systemPooja Dixit
4.8K views15 Folien
Dbms relational model von
Dbms relational modelDbms relational model
Dbms relational modelChirag vasava
9.7K views24 Folien
Normalization in DBMS von
Normalization in DBMSNormalization in DBMS
Normalization in DBMSPrateek Parimal
89.3K views23 Folien
Dbms Introduction and Basics von
Dbms Introduction and BasicsDbms Introduction and Basics
Dbms Introduction and BasicsSHIKHA GAUTAM
25.8K views26 Folien

Más contenido relacionado

Was ist angesagt?

Relational model von
Relational modelRelational model
Relational modelDabbal Singh Mahara
2.8K views57 Folien
Query processing and optimization (updated) von
Query processing and optimization (updated)Query processing and optimization (updated)
Query processing and optimization (updated)Ravinder Kamboj
25.9K views31 Folien
Concurrency control von
Concurrency controlConcurrency control
Concurrency controlSubhasish Pati
10.2K views29 Folien
Transaction management DBMS von
Transaction  management DBMSTransaction  management DBMS
Transaction management DBMSMegha Patel
16.2K views20 Folien
Query Decomposition and data localization von
Query Decomposition and data localization Query Decomposition and data localization
Query Decomposition and data localization Hafiz faiz
9.6K views42 Folien
12. Indexing and Hashing in DBMS von
12. Indexing and Hashing in DBMS12. Indexing and Hashing in DBMS
12. Indexing and Hashing in DBMSkoolkampus
66.8K views75 Folien

Was ist angesagt?(20)

Query processing and optimization (updated) von Ravinder Kamboj
Query processing and optimization (updated)Query processing and optimization (updated)
Query processing and optimization (updated)
Ravinder Kamboj25.9K views
Transaction management DBMS von Megha Patel
Transaction  management DBMSTransaction  management DBMS
Transaction management DBMS
Megha Patel16.2K views
Query Decomposition and data localization von Hafiz faiz
Query Decomposition and data localization Query Decomposition and data localization
Query Decomposition and data localization
Hafiz faiz9.6K views
12. Indexing and Hashing in DBMS von koolkampus
12. Indexing and Hashing in DBMS12. Indexing and Hashing in DBMS
12. Indexing and Hashing in DBMS
koolkampus66.8K views
15. Transactions in DBMS von koolkampus
15. Transactions in DBMS15. Transactions in DBMS
15. Transactions in DBMS
koolkampus80.4K views
Integrity Constraints von madhav bansal
Integrity ConstraintsIntegrity Constraints
Integrity Constraints
madhav bansal17.1K views
Relational Database Design von Archit Saxena
Relational Database DesignRelational Database Design
Relational Database Design
Archit Saxena15K views
Object oriented database concepts von Temesgenthanks
Object oriented database conceptsObject oriented database concepts
Object oriented database concepts
Temesgenthanks2.2K views
Database Design von learnt
Database DesignDatabase Design
Database Design
learnt14.9K views
Distributed dbms architectures von Pooja Dixit
Distributed dbms architecturesDistributed dbms architectures
Distributed dbms architectures
Pooja Dixit10K views
17. Recovery System in DBMS von koolkampus
17. Recovery System in DBMS17. Recovery System in DBMS
17. Recovery System in DBMS
koolkampus40.5K views

Destacado

Distributed Multimedia Systems(DMMS) von
Distributed Multimedia Systems(DMMS)Distributed Multimedia Systems(DMMS)
Distributed Multimedia Systems(DMMS)Nidhi Baranwal
22.9K views18 Folien
Distributed Database Management System von
Distributed Database Management SystemDistributed Database Management System
Distributed Database Management SystemHardik Patil
33.7K views87 Folien
Multimedia Database von
Multimedia Database Multimedia Database
Multimedia Database Avnish Patel
48K views29 Folien
Heterogeneous vs. Homogenous product ecosystems von
Heterogeneous vs. Homogenous product ecosystemsHeterogeneous vs. Homogenous product ecosystems
Heterogeneous vs. Homogenous product ecosystemscrayg
4.2K views11 Folien
Heterogeneous Or Homogeneous Classrooms Jane von
Heterogeneous Or Homogeneous Classrooms   JaneHeterogeneous Or Homogeneous Classrooms   Jane
Heterogeneous Or Homogeneous Classrooms JaneKevin Hodgson
2.4K views8 Folien
Difference between Homogeneous and Heterogeneous von
Difference between Homogeneous  and    HeterogeneousDifference between Homogeneous  and    Heterogeneous
Difference between Homogeneous and HeterogeneousFaraz Qaisrani
50.4K views13 Folien

Destacado(9)

Distributed Multimedia Systems(DMMS) von Nidhi Baranwal
Distributed Multimedia Systems(DMMS)Distributed Multimedia Systems(DMMS)
Distributed Multimedia Systems(DMMS)
Nidhi Baranwal22.9K views
Distributed Database Management System von Hardik Patil
Distributed Database Management SystemDistributed Database Management System
Distributed Database Management System
Hardik Patil33.7K views
Multimedia Database von Avnish Patel
Multimedia Database Multimedia Database
Multimedia Database
Avnish Patel48K views
Heterogeneous vs. Homogenous product ecosystems von crayg
Heterogeneous vs. Homogenous product ecosystemsHeterogeneous vs. Homogenous product ecosystems
Heterogeneous vs. Homogenous product ecosystems
crayg4.2K views
Heterogeneous Or Homogeneous Classrooms Jane von Kevin Hodgson
Heterogeneous Or Homogeneous Classrooms   JaneHeterogeneous Or Homogeneous Classrooms   Jane
Heterogeneous Or Homogeneous Classrooms Jane
Kevin Hodgson2.4K views
Difference between Homogeneous and Heterogeneous von Faraz Qaisrani
Difference between Homogeneous  and    HeterogeneousDifference between Homogeneous  and    Heterogeneous
Difference between Homogeneous and Heterogeneous
Faraz Qaisrani50.4K views
Hetrogenous Vs Homogenous von trevor.fockler
Hetrogenous Vs HomogenousHetrogenous Vs Homogenous
Hetrogenous Vs Homogenous
trevor.fockler30.6K views
Fragmentation and types of fragmentation in Distributed Database von Abhilasha Lahigude
Fragmentation and types of fragmentation in Distributed DatabaseFragmentation and types of fragmentation in Distributed Database
Fragmentation and types of fragmentation in Distributed Database
Abhilasha Lahigude71.2K views
Distributed database management systems von Dhani Ahmad
Distributed database management systemsDistributed database management systems
Distributed database management systems
Dhani Ahmad6.4K views

Similar a Distributed database

Distributed database von
Distributed databaseDistributed database
Distributed databasesanjay joshi
686 views27 Folien
Distributed database von
Distributed databaseDistributed database
Distributed databasesanjay joshi
433 views27 Folien
nnnn.pptx von
nnnn.pptxnnnn.pptx
nnnn.pptxNANDHINIS900805
9 views26 Folien
DBMS.pptx von
DBMS.pptxDBMS.pptx
DBMS.pptxNANDHINIS900805
3 views26 Folien
lecture-13.pptx von
lecture-13.pptxlecture-13.pptx
lecture-13.pptxlaiba29012
6 views21 Folien

Similar a Distributed database(20)

Introduction to distributed database von Sonia Panesar
Introduction to distributed databaseIntroduction to distributed database
Introduction to distributed database
Sonia Panesar1.9K views
Distribution transparency and Distributed transaction von shraddha mane
Distribution transparency and Distributed transactionDistribution transparency and Distributed transaction
Distribution transparency and Distributed transaction
shraddha mane4.5K views
Distributed D B von guest8fdbdd
Distributed  D BDistributed  D B
Distributed D B
guest8fdbdd1.6K views
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx von SANTOSH KUMAR MEHER
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptxSantosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
Software Architecture In Banking von Ana Shah
Software Architecture In BankingSoftware Architecture In Banking
Software Architecture In Banking
Ana Shah4 views
ditributed databases von Hira Awan
ditributed databasesditributed databases
ditributed databases
Hira Awan546 views

Más de ReachLocal Services India

Excel ppt von
Excel pptExcel ppt
Excel pptReachLocal Services India
42.8K views14 Folien
Virtual reality von
Virtual realityVirtual reality
Virtual realityReachLocal Services India
2.3K views34 Folien
Digital signatures von
Digital signaturesDigital signatures
Digital signaturesReachLocal Services India
5.9K views21 Folien
System security von
System securitySystem security
System securityReachLocal Services India
3.7K views29 Folien
Artificial intelligence von
Artificial intelligenceArtificial intelligence
Artificial intelligenceReachLocal Services India
1.3K views20 Folien
Loop invariant computation von
Loop invariant computationLoop invariant computation
Loop invariant computationReachLocal Services India
7.7K views14 Folien

Último

ANATOMY AND PHYSIOLOGY UNIT 1 { PART-1} von
ANATOMY AND PHYSIOLOGY UNIT 1 { PART-1}ANATOMY AND PHYSIOLOGY UNIT 1 { PART-1}
ANATOMY AND PHYSIOLOGY UNIT 1 { PART-1}DR .PALLAVI PATHANIA
234 views195 Folien
Drama KS5 Breakdown von
Drama KS5 BreakdownDrama KS5 Breakdown
Drama KS5 BreakdownWestHatch
64 views2 Folien
Structure and Functions of Cell.pdf von
Structure and Functions of Cell.pdfStructure and Functions of Cell.pdf
Structure and Functions of Cell.pdfNithya Murugan
317 views10 Folien
GSoC 2024 von
GSoC 2024GSoC 2024
GSoC 2024DeveloperStudentClub10
63 views15 Folien
STERILITY TEST.pptx von
STERILITY TEST.pptxSTERILITY TEST.pptx
STERILITY TEST.pptxAnupkumar Sharma
114 views9 Folien
Community-led Open Access Publishing webinar.pptx von
Community-led Open Access Publishing webinar.pptxCommunity-led Open Access Publishing webinar.pptx
Community-led Open Access Publishing webinar.pptxJisc
69 views9 Folien

Último(20)

Drama KS5 Breakdown von WestHatch
Drama KS5 BreakdownDrama KS5 Breakdown
Drama KS5 Breakdown
WestHatch64 views
Structure and Functions of Cell.pdf von Nithya Murugan
Structure and Functions of Cell.pdfStructure and Functions of Cell.pdf
Structure and Functions of Cell.pdf
Nithya Murugan317 views
Community-led Open Access Publishing webinar.pptx von Jisc
Community-led Open Access Publishing webinar.pptxCommunity-led Open Access Publishing webinar.pptx
Community-led Open Access Publishing webinar.pptx
Jisc69 views
Are we onboard yet University of Sussex.pptx von Jisc
Are we onboard yet University of Sussex.pptxAre we onboard yet University of Sussex.pptx
Are we onboard yet University of Sussex.pptx
Jisc71 views
Lecture: Open Innovation von Michal Hron
Lecture: Open InnovationLecture: Open Innovation
Lecture: Open Innovation
Michal Hron95 views
11.28.23 Social Capital and Social Exclusion.pptx von mary850239
11.28.23 Social Capital and Social Exclusion.pptx11.28.23 Social Capital and Social Exclusion.pptx
11.28.23 Social Capital and Social Exclusion.pptx
mary850239112 views
The Open Access Community Framework (OACF) 2023 (1).pptx von Jisc
The Open Access Community Framework (OACF) 2023 (1).pptxThe Open Access Community Framework (OACF) 2023 (1).pptx
The Open Access Community Framework (OACF) 2023 (1).pptx
Jisc77 views
ISO/IEC 27001 and ISO/IEC 27005: Managing AI Risks Effectively von PECB
ISO/IEC 27001 and ISO/IEC 27005: Managing AI Risks EffectivelyISO/IEC 27001 and ISO/IEC 27005: Managing AI Risks Effectively
ISO/IEC 27001 and ISO/IEC 27005: Managing AI Risks Effectively
PECB 457 views
Plastic waste.pdf von alqaseedae
Plastic waste.pdfPlastic waste.pdf
Plastic waste.pdf
alqaseedae110 views
OEB 2023 Co-learning To Speed Up AI Implementation in Courses.pptx von Inge de Waard
OEB 2023 Co-learning To Speed Up AI Implementation in Courses.pptxOEB 2023 Co-learning To Speed Up AI Implementation in Courses.pptx
OEB 2023 Co-learning To Speed Up AI Implementation in Courses.pptx
Inge de Waard165 views
AI Tools for Business and Startups von Svetlin Nakov
AI Tools for Business and StartupsAI Tools for Business and Startups
AI Tools for Business and Startups
Svetlin Nakov89 views
The basics - information, data, technology and systems.pdf von JonathanCovena1
The basics - information, data, technology and systems.pdfThe basics - information, data, technology and systems.pdf
The basics - information, data, technology and systems.pdf
JonathanCovena177 views
Psychology KS4 von WestHatch
Psychology KS4Psychology KS4
Psychology KS4
WestHatch62 views
Class 10 English notes 23-24.pptx von TARIQ KHAN
Class 10 English notes 23-24.pptxClass 10 English notes 23-24.pptx
Class 10 English notes 23-24.pptx
TARIQ KHAN95 views
Scope of Biochemistry.pptx von shoba shoba
Scope of Biochemistry.pptxScope of Biochemistry.pptx
Scope of Biochemistry.pptx
shoba shoba121 views

Distributed database

  • 1. DISTRIBUTED DATABASE By-Bharat P. Patil Bihag Mehta
  • 2. Distributed Database  Database:- Logical interrelated collection of shared data, along with description of data, physically distributed over a computer network.
  • 3. What is Distributed Database? • A distributed database (DDB) is a collection of multiple, logically interrelated databases distributed over a computer network. • A distributed database management system (DDBMS) is the software that manages the DDB and provides an access mechanism that makes this distribution transparent to the users
  • 5.  A DDBMS mainly classified into two types: ◦ Homogeneous Distributed database management systems ◦ Heterogeneous Distributed database management systems
  • 6. Characteristics  All sites are interconnected.  Fragments can be replicated.  Logically related shared data can be collected.  Data at each and every site is controlled by the DBMS.  Each Distributed Database Management System takes part in at least one global application.
  • 7. Functionality  Security  Keeping track of data  Replicated data management  System catalog management  Distributed transaction management  Distributed database recovery
  • 8. Homogeneous DDBMS  In a homogeneous distributed database all sites have identical software and are aware of each other and agree to cooperate in processing user requests.  The homogeneous system is much easier to design and manage  The operating system used, at each location must be same or compatible.  The database application (or DBMS) used at each location must be same or compatible.
  • 9. Heterogeneous DDBMS  In a heterogeneous distributed database different sites may use different schema and software.  In heterogeneous systems, different nodes may have different hardware & software and data structures at various nodes or locations are also incompatible.  Different computers and operating systems, database applications or data models may be used at each of the locations.
  • 10. Heterogeneous DDBMS (contd..)  On heterogeneous system, translations are required to allow communication between different sites (or DBMS).  The heterogeneous system is often not technically or economically feasible. In this system, a user at one location may be able to read but not update the data at another location.
  • 11. Advantages  Less danger of a single-point failure. When one of the computers fails, the workload is picked up by other workstations.  Data are also distributed at multiple sites.  The end user is able to access any available copy of the data, and an end user's request is processed by any processor at the data location.
  • 12. Advantages (contd..)  Improved communications. Because local sites are smaller and located closer to customers.  Reduced operating costs. It is more cost- effective to add workstations to a network than to update a mainframe system.  Faster data access, faster data processing.  A distributed database system spreads out the systems workload by processing data at several sites.
  • 13. Disadvantages  Complexity of management and control.  Applications must recognize data location, and they must be able to stitch together data from various sites.  Security.
  • 14. Disadvantages (contd..)  Increased storage and infrastructure requirements.  Multiple copies of data has to be at different sites, thus an additional disk storage space will be required.  The probability of security lapses increases when data are located at multiple sites.
  • 15. What is Parallel database...??  A parallel database system is to improve performance through parallelization of various operations, such as loading data, building indexes and evaluating queries.  The distribution is solely done on the bases of performance.  Parallel databases improve processing and input/output speeds by using multiple CPUs and disks in parallel.  Many operations are performed simultaneously  Data may be stored in a distributed fashion.
  • 16. Difference b/w Distributed Database and Parallel Database Characteristics Parallel Database Distributed database Definition It is a software system It is a software system that where multiple manages multiple logically processors or machines are interrelated databases used to distributed over a computer execute and run queries in network. parallel. Geographical The nodes are located at The nodes are usually Location geographically located at geographically same location. different locations. Execution Quicker Slower Speed Overhead Less More
  • 17. Node types Compulsorily Homogeneous Need not be homogeneous Performance Lower reliability & Higher reliability & availability. availability. Scope of Difficult to expand Easier to expand Expansion Backup Backup at one site only Backup at multiple sites Consistency Maintaining consistency is Maintaining consistency is easier difficult.
  • 18. Data fragmentation  Fragmentation is a process of division or the mapping of the tables based on the columns and rows of data into the smallest unit of data.  Data that has broken down is still possible to be combined again with the intention to complete the data collection using fragmentation.  Fragmentation is a database server feature that allows you to control where data is stored at the table level.  Fragmentation enables you to define groups of rows or index keys within a table.
  • 19. Replication  Replication is that we store several copies of a relation or relation fragment. An entire relation can be replicated at one or more sites.  Similarly, one or more fragments of a relation can be replicated at other sites.  For example, if a relation R is fragmented into R1,R2, and R3, there might be just one copy of R1, whereas R2 is replicated at two other sites and R3 is replicated at all sites.
  • 20. Two Fold Replication The motivation for replication is twofold: 1. Increased Availability of Data: If a site that contains a replica goes down, we can find the same data at other sites. Similarly, if local copies of remote relations are available, we are less vulnerable to failure of communication links. 2. Faster Query Evaluation: Queries can execute faster by using a local copy of a relation instead of going to a remote site.
  • 21. Distributed Transaction  In a distributed DBMS, a given transaction is submitted at some one site, but it can access data at other sites as well.  When a transaction is submitted at some site, the transaction manager at that site breaks it up into a collection of one or more sub-transactions that execute at different sites, submits them to transaction managers at the other sites, and coordinates their activity.
  • 22. Distributed Concurrency Control: How can locks for objects stored across several sites be managed?  Distributed Recovery: Transaction atomicity must be ensured when a transaction commits, all its actions, across all the sites at which it executes, must persist. Similarly, when a transaction aborts, none of its actions must be allowed to persist.
  • 23. Distributed Concurrency Control  The choice of technique determines which objects are to be locked. When locks are obtained and released is determined by the concurrency control protocol. We now consider how lock and unlock requests are implemented in a distributed environment. Lock management can be distributed across sites in many ways:
  • 24.  Centralized : A single site is in charge of handling lock and unlock requests for all objects.  Primary Copy: One copy of each object is designated the primary copy. All requests to lock or unlock a copy of this object are handled by the lock manager at the site where the primary copy is stored, regardless of where the copy itself is stored.  Fully Distributed : Requests to lock or unlock a copy of an object stored at a site are handled by the lock manager at the site where the copy is stored.
  • 25. DISTRIBUTED RECOVERY  Recovery in a distributed DBMS is more complicated than in a centralized DBMS for the following reasons: ◦ New kinds of failure can arise : Failure of communication links and failure of a remote site at which a sub-transaction is executing. ◦ Either all sub-transactions of a given transaction must commit or none must commit, and this property must be guaranteed despite any combination of site and link failures. This guarantee is achieved using a commit protocol.
  • 26. Concepts Of Locks  A lock is used when multiple users need to access a database concurrently. This prevents data from being corrupted or invalidated when multiple users try to write to the database.  Any single user can only modify those database records (that is, items in the database) to which they have applied a lock that gives them exclusive access to the record until the lock is released. Locking not only provides exclusivity to write but also prevents (or controls) reading of unfinished modifications.