SlideShare ist ein Scribd-Unternehmen logo
1 von 45
Databases
    Alan Medlar

amedlar@cs.ucl.ac.uk
Schedule

• Today: Introduction
• Monday 2 Feb: Networking
           nd


• Monday 2 Feb: Principles of Transactions
           nd


• Tuesday 3 Feb: Concurrent Transactions
           rd


• To be decided: Distributed Transactions
Introduction
Introduction
•   Why do we care about databases?
Introduction
•   Why do we care about databases?

    •   Abstraction
Introduction
•   Why do we care about databases?

    •   Abstraction

        •   Details of storage and access are irrelevant
Introduction
•   Why do we care about databases?

    •   Abstraction

        •   Details of storage and access are irrelevant

        •   Concurrency
Introduction
•   Why do we care about databases?

    •   Abstraction

        •   Details of storage and access are irrelevant

        •   Concurrency

        •   Crash recovery
Introduction
•   Why do we care about databases?

    •   Abstraction

        •   Details of storage and access are irrelevant

        •   Concurrency

        •   Crash recovery

    •   Integrity and Security (privacy)
Introduction
•   Why do we care about databases?

    •   Abstraction

        •   Details of storage and access are irrelevant

        •   Concurrency

        •   Crash recovery

    •   Integrity and Security (privacy)

    •   Multi-user
Centralised Databases
Centralised Databases


                       Bottleneck!

Communication
  overhead!

                 Single point
                  of failure!
Centralised Databases
•   Bad news...
    •   Performance
        •   Processing
        •   I/O
    •   Distributed nature of data (departments,
        companies, “mashups”)
    •   Availability (single point of failure)
Distributed Databases
Distributed Databases
    Distributed
    Processing
                      Distributed




       {
                       Storage

Localised
 Traffic
Distributed Databases


• Definition: A single DBMS running across
  multiple CPUs, disks and/or networks,
  designed to permit safe parallel access.
Concepts
•   Distributed Processing

    •   Central database, distributed processing
Distributed Processing

• Multiple processors or cores
 • Same memory (multi-core)
 • Same disk (networked storage)
• Concurrency Control provided by
  transactions
Concepts
•   Distributed Processing

    •   Central database, distributed processing

•   Ad-hoc Distributed Database

    •   Database physically distributed over
        network
Distributed Databases
•   Fragmentation

    •   Large dataset broken up into smaller components
Distributed Databases
•   Fragmentation

    •   Large dataset broken up into smaller components

•   Allocation

    •   Fragments should be stored according to usage
Distributed Databases
•   Fragmentation

    •   Large dataset broken up into smaller components

•   Allocation

    •   Fragments should be stored according to usage

•   Replication

    •   Copy maintained at multiple sites to take
        advantage of additional processing power or
        decreased latency
Distributed Databases
•                                      Might be implicit!
    Fragmentation

    •   Large dataset broken up into smaller components

•   Allocation

    •   Fragments should be stored according to usage

•   Replication

    •   Copy maintained at multiple sites to take
        advantage of additional processing power or
        decreased latency
Distributed Databases (2)
•   How do we decide how to fragment a
    database?

    •   Data, application and usage dependant!
Distributed Databases (2)
•   How do we decide how to fragment a
    database?

    •   Data, application and usage dependant!

•   Goals:

    •   Locality

    •   Minimal Communication

    •   Balance storage, processing, monetary costs
Concepts
•   Distributed Processing

    •   Central database, distributed processing

•   Ad-hoc Distributed Database

    •   Database physically distributed over
        network

•   Distributed DBMS (DDBMS)

    •   Software that makes distribution transparent
Distributed DBMS
• Key Concept: Transparency
Distributed DBMS
• Key Concept: Transparency
 • Transparent Distribution
Distributed DBMS
• Key Concept: Transparency
 • Transparent Distribution
   • One centralised database from
      perspective of user (programmer)
Distributed DBMS
• Key Concept: Transparency
 • Transparent Distribution
   • One centralised database from
      perspective of user (programmer)
 • Transparent Transactions
Distributed DBMS
• Key Concept: Transparency
 • Transparent Distribution
   • One centralised database from
      perspective of user (programmer)
 • Transparent Transactions
  • Integrity of data maintained across
      multiple databases
Distributed DBMS (2)

• Requires advanced:
 • Recovery Services
 • Concurrency control
• Focus of this course
Summary
•   Advantages
Summary
•   Advantages

    •   Performance
Summary
•   Advantages

    •   Performance

    •   Reflect organisational structure
Summary
•   Advantages

    •   Performance

    •   Reflect organisational structure

    •   Economics
Summary
•   Advantages

    •   Performance

    •   Reflect organisational structure

    •   Economics

    •   Modular Growth
Summary
•   Advantages

    •   Performance

    •   Reflect organisational structure

    •   Economics

    •   Modular Growth

    •   Availability
Summary
•   Advantages

    •   Performance

    •   Reflect organisational structure

    •   Economics

    •   Modular Growth

    •   Availability

    •   Reliability
Summary (2)
•   Disadvantages
Summary (2)
•   Disadvantages
    •   Complexity (design, transparency, integrity)
Summary (2)
•   Disadvantages
    •   Complexity (design, transparency, integrity)
    •   Security more of an issue
Summary (2)
•   Disadvantages
    •   Complexity (design, transparency, integrity)
    •   Security more of an issue
    •   Maintenance Costs
Summary (2)
•   Disadvantages
    •   Complexity (design, transparency, integrity)
    •   Security more of an issue
    •   Maintenance Costs
    •   Lack of standards (so much dependant on
        data, application, usage, etc)
Next: Transactions...

Weitere ähnliche Inhalte

Ähnlich wie Introduction to Distributed Databases

Brian Oliver Pimp My Data Grid
Brian Oliver  Pimp My Data GridBrian Oliver  Pimp My Data Grid
Brian Oliver Pimp My Data Griddeimos
 
Greatdebate Postgres vs Mysql
Greatdebate Postgres vs MysqlGreatdebate Postgres vs Mysql
Greatdebate Postgres vs MysqlKrishna Infosoft
 
The Great Debate: PostgreSQL vs MySQL
The Great Debate: PostgreSQL vs MySQLThe Great Debate: PostgreSQL vs MySQL
The Great Debate: PostgreSQL vs MySQLEDB
 
Branch Office Infrastructure
Branch Office InfrastructureBranch Office Infrastructure
Branch Office InfrastructureAidan Finn
 
Next Generation Datacenter Oracle - Alan Hartwell
Next Generation Datacenter Oracle - Alan HartwellNext Generation Datacenter Oracle - Alan Hartwell
Next Generation Datacenter Oracle - Alan HartwellHPDutchWorld
 
Oracle - Next Generation Datacenter - Alan Hartwell
Oracle - Next Generation Datacenter - Alan HartwellOracle - Next Generation Datacenter - Alan Hartwell
Oracle - Next Generation Datacenter - Alan HartwellHPDutchWorld
 
Randy Shoup eBays Architectural Principles
Randy Shoup eBays Architectural PrinciplesRandy Shoup eBays Architectural Principles
Randy Shoup eBays Architectural Principlesdeimos
 
Challenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data GenomicsChallenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data GenomicsYasin Memari
 
Four Assumptions Killing Backup Storage Webinar
Four Assumptions Killing Backup Storage WebinarFour Assumptions Killing Backup Storage Webinar
Four Assumptions Killing Backup Storage WebinarStorage Switzerland
 
Storage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems PresentationStorage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems Presentationandyman3000
 
How to build a state-of-the-art rails cluster
How to build a state-of-the-art rails clusterHow to build a state-of-the-art rails cluster
How to build a state-of-the-art rails clusterTim Lossen
 
Distributed systems - A Primer
Distributed systems - A PrimerDistributed systems - A Primer
Distributed systems - A PrimerMD Sayem Ahmed
 
Bit Level Preservation
Bit Level PreservationBit Level Preservation
Bit Level PreservationMicah Altman
 
Scalabe MySQL Infrastructure
Scalabe MySQL InfrastructureScalabe MySQL Infrastructure
Scalabe MySQL InfrastructureBalazs Pocze
 
Netcetera Proactive Management Service
Netcetera Proactive Management ServiceNetcetera Proactive Management Service
Netcetera Proactive Management ServicePeter Skelton
 
Evolution Of Dedupe
Evolution Of DedupeEvolution Of Dedupe
Evolution Of Deduperammotive
 
Solving the Database Problem
Solving the Database ProblemSolving the Database Problem
Solving the Database ProblemJay Gordon
 
Stopping Storage Hardware Sprawl
Stopping Storage Hardware SprawlStopping Storage Hardware Sprawl
Stopping Storage Hardware SprawlStorage Switzerland
 

Ähnlich wie Introduction to Distributed Databases (20)

Advanced Deployment
Advanced DeploymentAdvanced Deployment
Advanced Deployment
 
Brian Oliver Pimp My Data Grid
Brian Oliver  Pimp My Data GridBrian Oliver  Pimp My Data Grid
Brian Oliver Pimp My Data Grid
 
Greatdebate Postgres vs Mysql
Greatdebate Postgres vs MysqlGreatdebate Postgres vs Mysql
Greatdebate Postgres vs Mysql
 
The Great Debate: PostgreSQL vs MySQL
The Great Debate: PostgreSQL vs MySQLThe Great Debate: PostgreSQL vs MySQL
The Great Debate: PostgreSQL vs MySQL
 
Branch Office Infrastructure
Branch Office InfrastructureBranch Office Infrastructure
Branch Office Infrastructure
 
Next Generation Datacenter Oracle - Alan Hartwell
Next Generation Datacenter Oracle - Alan HartwellNext Generation Datacenter Oracle - Alan Hartwell
Next Generation Datacenter Oracle - Alan Hartwell
 
Oracle - Next Generation Datacenter - Alan Hartwell
Oracle - Next Generation Datacenter - Alan HartwellOracle - Next Generation Datacenter - Alan Hartwell
Oracle - Next Generation Datacenter - Alan Hartwell
 
Randy Shoup eBays Architectural Principles
Randy Shoup eBays Architectural PrinciplesRandy Shoup eBays Architectural Principles
Randy Shoup eBays Architectural Principles
 
Challenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data GenomicsChallenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data Genomics
 
Four Assumptions Killing Backup Storage Webinar
Four Assumptions Killing Backup Storage WebinarFour Assumptions Killing Backup Storage Webinar
Four Assumptions Killing Backup Storage Webinar
 
Storage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems PresentationStorage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems Presentation
 
How to build a state-of-the-art rails cluster
How to build a state-of-the-art rails clusterHow to build a state-of-the-art rails cluster
How to build a state-of-the-art rails cluster
 
Distributed systems - A Primer
Distributed systems - A PrimerDistributed systems - A Primer
Distributed systems - A Primer
 
Bit Level Preservation
Bit Level PreservationBit Level Preservation
Bit Level Preservation
 
Big data pipelines
Big data pipelinesBig data pipelines
Big data pipelines
 
Scalabe MySQL Infrastructure
Scalabe MySQL InfrastructureScalabe MySQL Infrastructure
Scalabe MySQL Infrastructure
 
Netcetera Proactive Management Service
Netcetera Proactive Management ServiceNetcetera Proactive Management Service
Netcetera Proactive Management Service
 
Evolution Of Dedupe
Evolution Of DedupeEvolution Of Dedupe
Evolution Of Dedupe
 
Solving the Database Problem
Solving the Database ProblemSolving the Database Problem
Solving the Database Problem
 
Stopping Storage Hardware Sprawl
Stopping Storage Hardware SprawlStopping Storage Hardware Sprawl
Stopping Storage Hardware Sprawl
 

Kürzlich hochgeladen

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 

Kürzlich hochgeladen (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 

Introduction to Distributed Databases

  • 1. Databases Alan Medlar amedlar@cs.ucl.ac.uk
  • 2. Schedule • Today: Introduction • Monday 2 Feb: Networking nd • Monday 2 Feb: Principles of Transactions nd • Tuesday 3 Feb: Concurrent Transactions rd • To be decided: Distributed Transactions
  • 4. Introduction • Why do we care about databases?
  • 5. Introduction • Why do we care about databases? • Abstraction
  • 6. Introduction • Why do we care about databases? • Abstraction • Details of storage and access are irrelevant
  • 7. Introduction • Why do we care about databases? • Abstraction • Details of storage and access are irrelevant • Concurrency
  • 8. Introduction • Why do we care about databases? • Abstraction • Details of storage and access are irrelevant • Concurrency • Crash recovery
  • 9. Introduction • Why do we care about databases? • Abstraction • Details of storage and access are irrelevant • Concurrency • Crash recovery • Integrity and Security (privacy)
  • 10. Introduction • Why do we care about databases? • Abstraction • Details of storage and access are irrelevant • Concurrency • Crash recovery • Integrity and Security (privacy) • Multi-user
  • 12. Centralised Databases Bottleneck! Communication overhead! Single point of failure!
  • 13. Centralised Databases • Bad news... • Performance • Processing • I/O • Distributed nature of data (departments, companies, “mashups”) • Availability (single point of failure)
  • 15. Distributed Databases Distributed Processing Distributed { Storage Localised Traffic
  • 16. Distributed Databases • Definition: A single DBMS running across multiple CPUs, disks and/or networks, designed to permit safe parallel access.
  • 17. Concepts • Distributed Processing • Central database, distributed processing
  • 18. Distributed Processing • Multiple processors or cores • Same memory (multi-core) • Same disk (networked storage) • Concurrency Control provided by transactions
  • 19. Concepts • Distributed Processing • Central database, distributed processing • Ad-hoc Distributed Database • Database physically distributed over network
  • 20. Distributed Databases • Fragmentation • Large dataset broken up into smaller components
  • 21. Distributed Databases • Fragmentation • Large dataset broken up into smaller components • Allocation • Fragments should be stored according to usage
  • 22. Distributed Databases • Fragmentation • Large dataset broken up into smaller components • Allocation • Fragments should be stored according to usage • Replication • Copy maintained at multiple sites to take advantage of additional processing power or decreased latency
  • 23. Distributed Databases • Might be implicit! Fragmentation • Large dataset broken up into smaller components • Allocation • Fragments should be stored according to usage • Replication • Copy maintained at multiple sites to take advantage of additional processing power or decreased latency
  • 24. Distributed Databases (2) • How do we decide how to fragment a database? • Data, application and usage dependant!
  • 25. Distributed Databases (2) • How do we decide how to fragment a database? • Data, application and usage dependant! • Goals: • Locality • Minimal Communication • Balance storage, processing, monetary costs
  • 26. Concepts • Distributed Processing • Central database, distributed processing • Ad-hoc Distributed Database • Database physically distributed over network • Distributed DBMS (DDBMS) • Software that makes distribution transparent
  • 27. Distributed DBMS • Key Concept: Transparency
  • 28. Distributed DBMS • Key Concept: Transparency • Transparent Distribution
  • 29. Distributed DBMS • Key Concept: Transparency • Transparent Distribution • One centralised database from perspective of user (programmer)
  • 30. Distributed DBMS • Key Concept: Transparency • Transparent Distribution • One centralised database from perspective of user (programmer) • Transparent Transactions
  • 31. Distributed DBMS • Key Concept: Transparency • Transparent Distribution • One centralised database from perspective of user (programmer) • Transparent Transactions • Integrity of data maintained across multiple databases
  • 32. Distributed DBMS (2) • Requires advanced: • Recovery Services • Concurrency control • Focus of this course
  • 33. Summary • Advantages
  • 34. Summary • Advantages • Performance
  • 35. Summary • Advantages • Performance • Reflect organisational structure
  • 36. Summary • Advantages • Performance • Reflect organisational structure • Economics
  • 37. Summary • Advantages • Performance • Reflect organisational structure • Economics • Modular Growth
  • 38. Summary • Advantages • Performance • Reflect organisational structure • Economics • Modular Growth • Availability
  • 39. Summary • Advantages • Performance • Reflect organisational structure • Economics • Modular Growth • Availability • Reliability
  • 40. Summary (2) • Disadvantages
  • 41. Summary (2) • Disadvantages • Complexity (design, transparency, integrity)
  • 42. Summary (2) • Disadvantages • Complexity (design, transparency, integrity) • Security more of an issue
  • 43. Summary (2) • Disadvantages • Complexity (design, transparency, integrity) • Security more of an issue • Maintenance Costs
  • 44. Summary (2) • Disadvantages • Complexity (design, transparency, integrity) • Security more of an issue • Maintenance Costs • Lack of standards (so much dependant on data, application, usage, etc)

Hinweis der Redaktion