Database Scalability Patterns

•

24 gefällt mir•11,330 views

** Talk Given at OSCon 2010 ** We often have clients approach us looking for help in scaling their systems, and all too often their long term vision is a mixed reality based on the approaches read about on popular blogs trying to solve very different problems. Hey, scaling your database can be difficult enough by itself, you don't want to get tripped up by not understanding where you're going. In Database Scalability Patterns we will attempt to distill all of the information/hype/discussions around scaling databases, and break down the common patterns we've seen dealing with scaling databases. "Buzzwords" we'll cover (and hopefully debuzz) include: Vertical Scaling Horizontal Partitioning Horizontal Scaling Read Slaves Multi-Master Vertical Partitioning Federated Data Storage More important than just describing what these things are (although that's a good first step), we'll also discuss along the way different points in the life-cycle of your database when you need to be thinking about the different options in front of you. We'll factor in the types of application that your working on (think OLTP vs OLAP, or Social Networking vs. Corporate Application), the environment you'll be working on (Scaling "in the cloud" is very different from DIY in the datacenter), and we will talk about the types of tools you'll need to accomplish these goals (All replication systems are not the same, and some won't help at all).

Technologie

Database Scalability
{Patterns}

/ Robert Treat

Wednesday, July 21, 2010

robert treat
omniti

Wednesday, July 21, 2010

robert treat
omniti
postgres

Wednesday, July 21, 2010

robert treat
omniti
postgres
oracle - mysql

Wednesday, July 21, 2010

robert treat
omniti
postgres
oracle - mysql
mssql - sqlite - nosql

Wednesday, July 21, 2010

What are “Database Scalability Patterns?”

Wednesday, July 21, 2010

Part Design Patterns

Part Application Life-Cycle

Wednesday, July 21, 2010

MyFirstDatabase

Wednesday, July 21, 2010

Vertical Partitioning

Wednesday, July 21, 2010

Vertical Scaling

Wednesday, July 21, 2010

Federated Data Storage

“sharding” Horizontal Partitioning

Read Slaves

Multi-Master

Horizontal Scaling

Wednesday, July 21, 2010

Read Slaves / Master - Slave
Scale Read Load

Wednesday, July 21, 2010

Read Slaves / Master - Slave
Scale Read Load

writes go here!

Wednesday, July 21, 2010

Read Slaves / Master - Slave
Scale Read Load

writes go here!

reads go here (or here) (or here)
Wednesday, July 21, 2010

Read Slaves / Master - Slave
Scale Read Load

database writes
data to slaves writes go here!

slave db slave db slave db

reads go here (or here) (or here)
Wednesday, July 21, 2010

Read Slaves / Master - Slave
Scale Read Load

app writes data
everywhere writes go here!

memcached memcached memcached

reads go here (or here) (or here)
Wednesday, July 21, 2010

Read Slaves / Master - Slave
Scale Read Load

• Typically
• Full Copy of Data
On Each Node
• Asynchronous

Wednesday, July 21, 2010

Read Slaves / Master - Slave
Scale Read Load

• Typically • Consider
• Full Copy of Data • Partial Copy
On Each Node • Synchronous
• Asynchronous • Don’t use a RDBMS?

Wednesday, July 21, 2010

Scaling Writes

Wednesday, July 21, 2010

Scaling Writes

“not easy”

Wednesday, July 21, 2010

Multi-Master

many different ways to implement this,
few that actually work in production

Wednesday, July 21, 2010

Multi-Master

many different ways to implement this,
few that actually work in production

write to any node, database syncs data

Wednesday, July 21, 2010

Multi-Master

many different ways to implement this,
few that actually work in production

write to any node, database syncs data
can reduce cpu, doesn’t reduce i/o

Wednesday, July 21, 2010

Horizontal Partitioning

“sharding” “SOA”

Wednesday, July 21, 2010

Horizontal Partitioning

• Divide schema by job operations
• Move each piece to own server
• Duplicate some data as needed

Wednesday, July 21, 2010

Horizontal Partitioning

items
• Divide schema by job operations
• Move each piece to own server
• Duplicate some data as needed

forums

users

Wednesday, July 21, 2010

Horizontal Partitioning

items
• Divide schema by job operations
• Move each piece to own server
• Duplicate some data as needed

forums
• You must separate dependencies
in the app code ﬁrst!
users

Wednesday, July 21, 2010

Horizontal Scaling
• data split across servers based on algorithm
• data dropped into buckets (multiple?)
app

magic hash algorithm

Wednesday, July 21, 2010

Horizontal Scaling
• data split across servers based on algorithm
• data dropped into buckets (multiple?)

magic hash algorithm app

Wednesday, July 21, 2010

Horizontal Scaling
• data split across servers based on algorithm
• data dropped into buckets (multiple?)
•someone must keep track of data, and provide
lookup services

magic hash algorithm app

Wednesday, July 21, 2010

Universal Truths of Scaling Databases

Vertical Scalability is Helpful for Every Pattern

Even in a horizontally scaled, fully distributed
database, the number of nodes needed is
affected by vertical scalability

Wednesday, July 21, 2010

Universal Truths of Scaling Databases

New Nodes Are Never Free

• Add points of failure
• Add management costs
• Add complexity to architecture
• Add complexity to your app code

Wednesday, July 21, 2010

MyFirstDB
V
Vertical Partitioning
V
Vertical Scaling
V
Read Slaves
V
Horizontal Partitioning

Wednesday, July 21, 2010

tips

plan for layered data sources

Wednesday, July 21, 2010

tips

plan for layered data sources

read / write connections in code

Wednesday, July 21, 2010

tips

plan for layered data sources

read / write connections in code

use schemas to separate services

Wednesday, July 21, 2010

THANKS!

more:
xzilla.net
@robtreat2

omniti.com/surge
(scalability conference)

Wednesday, July 21, 2010

Empfohlen

Database Scalability - The Shard ConflictScaleBase

Microsoft SQL High Availability and ScalingJustin Whyte

Scalability, Availability & Stability PatternsJonas Bonér

Scalability Design Principles - Internal SessionSachin Sancheti - Microsoft Azure Architect

Challenges in Querying a Distributed Relational DatabaseScaleBase

MongoDB at eBayMongoDB

Replication Election and Consensus Algorithm Refinements for MongoDB 3.2MongoDB

Database index by Reema GajjarReema Gajjar

Empfohlen

Database Scalability - The Shard ConflictScaleBase

Microsoft SQL High Availability and ScalingJustin Whyte

Scalability, Availability & Stability PatternsJonas Bonér

Scalability Design Principles - Internal SessionSachin Sancheti - Microsoft Azure Architect

Challenges in Querying a Distributed Relational DatabaseScaleBase

MongoDB at eBayMongoDB

Replication Election and Consensus Algorithm Refinements for MongoDB 3.2MongoDB

Database index by Reema GajjarReema Gajjar

Db performance optimization with indexingRajeev Kumar

Database Architecture & Scaling Strategies, in the Cloud & on the Rack Clustrix

Database indexRiteshkiit

Data indexing presentationgmbmanikandan

What Every Developer Should Know About Database Scalabilityjbellis

Banco de Dados Não Relacionais vs Banco de Dados Relacionaisalexculpado

Scaling PinterestC4Media

Relational Database DesignArchit Saxena

Sql vs NoSQLRTigger

Advanced Int->Bigint ConversionsRobert Treat

Explaining ExplainRobert Treat

the-lost-art-of-plpgsqlRobert Treat

Managing Chaos In Production: Testing vs MonitoringRobert Treat

Managing Databases In A DevOps Environment 2016Robert Treat

Less Alarming Alerts - SRECon 2016 Robert Treat

What Ops Can Learn From DesignRobert Treat

Postgres 9.4 First LookRobert Treat

Less Alarming Alerts!Robert Treat

Past, Present, and Pachyderm - All Things Open - 2013Robert Treat

Big Bad "Upgraded" PostgresRobert Treat

Managing Databases In A DevOps EnvironmentRobert Treat

The Essential PostgreSQL.confRobert Treat

Weitere ähnliche Inhalte

Andere mochten auch

Db performance optimization with indexingRajeev Kumar

Database Architecture & Scaling Strategies, in the Cloud & on the Rack Clustrix

Database indexRiteshkiit

Data indexing presentationgmbmanikandan

What Every Developer Should Know About Database Scalabilityjbellis

Banco de Dados Não Relacionais vs Banco de Dados Relacionaisalexculpado

Scaling PinterestC4Media

Relational Database DesignArchit Saxena

Sql vs NoSQLRTigger

Andere mochten auch (9)

Db performance optimization with indexing

Database Architecture & Scaling Strategies, in the Cloud & on the Rack

Database index

Data indexing presentation

What Every Developer Should Know About Database Scalability

Banco de Dados Não Relacionais vs Banco de Dados Relacionais

Scaling Pinterest

Relational Database Design

Sql vs NoSQL

Mehr von Robert Treat

Advanced Int->Bigint ConversionsRobert Treat

Explaining ExplainRobert Treat

the-lost-art-of-plpgsqlRobert Treat

Managing Chaos In Production: Testing vs MonitoringRobert Treat

Managing Databases In A DevOps Environment 2016Robert Treat

Less Alarming Alerts - SRECon 2016 Robert Treat

What Ops Can Learn From DesignRobert Treat

Postgres 9.4 First LookRobert Treat

Less Alarming Alerts!Robert Treat

Past, Present, and Pachyderm - All Things Open - 2013Robert Treat

Big Bad "Upgraded" PostgresRobert Treat

Managing Databases In A DevOps EnvironmentRobert Treat

The Essential PostgreSQL.confRobert Treat

Pro Postgres 9Robert Treat

Advanced WAL File Management With OmniPITRRobert Treat

Scaling with Postgres (Highload++ 2010)Robert Treat

Intro to Postgres 9 TutorialRobert Treat

Check Please!Robert Treat

Database Scalability PatternsRobert Treat

A Guide To PostgreSQL 9.0Robert Treat

Mehr von Robert Treat (20)

Advanced Int->Bigint Conversions

Explaining Explain

the-lost-art-of-plpgsql

Managing Chaos In Production: Testing vs Monitoring

Managing Databases In A DevOps Environment 2016

Less Alarming Alerts - SRECon 2016

What Ops Can Learn From Design

Postgres 9.4 First Look

Less Alarming Alerts!

Past, Present, and Pachyderm - All Things Open - 2013

Big Bad "Upgraded" Postgres

Managing Databases In A DevOps Environment

The Essential PostgreSQL.conf

Pro Postgres 9

Advanced WAL File Management With OmniPITR

Scaling with Postgres (Highload++ 2010)

Intro to Postgres 9 Tutorial

Check Please!

Database Scalability Patterns

A Guide To PostgreSQL 9.0

Kürzlich hochgeladen

Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent

Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko

Salesforce Community Group Quito, Salesforce 101Paola De la Torre

Slack Application Development 101 Slidespraypatel2

08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700

My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j

IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge

Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix

How to convert PDF to text with Nanonetsnaman860154

Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung

Kürzlich hochgeladen (20)

Unblocking The Main Thread Solving ANRs and Frozen Frames

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...

Handwritten Text Recognition for manuscripts and early printed texts

Salesforce Community Group Quito, Salesforce 101

Slack Application Development 101 Slides

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men

Presentation on how to chat with PDF using ChatGPT code interpreter

Maximizing Board Effectiveness 2024 Webinar.pptx

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...

My Hashitalk Indonesia April 2024 Presentation

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...

IAC 2024 - IA Fast Track to Search Focused AI Solutions

Swan(sea) Song – personal research during my six years at Swansea ... and bey...

How to convert PDF to text with Nanonets

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Injustice - Developers Among Us (SciFiDevCon 2024)

Breaking the Kubernetes Kill Chain: Host Path Mount

How to Troubleshoot Apps for the Modern Connected Worker

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Database Scalability Patterns

1. Database Scalability {Patterns} / Robert Treat Wednesday, July 21, 2010

2. robert treat omniti Wednesday, July 21, 2010

3. robert treat omniti postgres Wednesday, July 21, 2010

4. robert treat omniti postgres oracle - mysql Wednesday, July 21, 2010

5. robert treat omniti postgres oracle - mysql mssql - sqlite - nosql Wednesday, July 21, 2010

6. What are “Database Scalability Patterns?” Wednesday, July 21, 2010

7. Part Design Patterns Part Application Life-Cycle Wednesday, July 21, 2010

8. Part Design Patterns Part Application Life-Cycle Wednesday, July 21, 2010

9. Part Design Patterns Part Application Life-Cycle Wednesday, July 21, 2010

10. Part Design Patterns Part Application Life-Cycle Wednesday, July 21, 2010

11. MyFirstDatabase Wednesday, July 21, 2010

12. Vertical Partitioning Wednesday, July 21, 2010

13. Vertical Scaling Wednesday, July 21, 2010

14. Vertical Scaling Wednesday, July 21, 2010

15. Vertical Scaling Wednesday, July 21, 2010

16. Federated Data Storage “sharding” Horizontal Partitioning Read Slaves Multi-Master Horizontal Scaling Wednesday, July 21, 2010

17. Read Slaves / Master - Slave Scale Read Load Wednesday, July 21, 2010

18. Read Slaves / Master - Slave Scale Read Load writes go here! Wednesday, July 21, 2010

19. Read Slaves / Master - Slave Scale Read Load writes go here! reads go here (or here) (or here) Wednesday, July 21, 2010

20. Read Slaves / Master - Slave Scale Read Load database writes data to slaves writes go here! slave db slave db slave db reads go here (or here) (or here) Wednesday, July 21, 2010

21. Read Slaves / Master - Slave Scale Read Load app writes data everywhere writes go here! memcached memcached memcached reads go here (or here) (or here) Wednesday, July 21, 2010

22. Read Slaves / Master - Slave Scale Read Load • Typically • Full Copy of Data On Each Node • Asynchronous Wednesday, July 21, 2010

23. Read Slaves / Master - Slave Scale Read Load • Typically • Consider • Full Copy of Data • Partial Copy On Each Node • Synchronous • Asynchronous • Don’t use a RDBMS? Wednesday, July 21, 2010

24. Read Slaves / Master - Slave Scale Read Load • Typically • Consider • Full Copy of Data • Partial Copy On Each Node • Synchronous • Asynchronous • Don’t use a RDBMS? Requires Application Changes Wednesday, July 21, 2010

25. Read Slaves / Master - Slave Scale Read Load • Typically • Consider • Full Copy of Data • Partial Copy On Each Node • Synchronous • Asynchronous • Don’t use a RDBMS? Requires Application Changes “easy” Wednesday, July 21, 2010

26. Scaling Writes Wednesday, July 21, 2010

27. Scaling Writes “not easy” Wednesday, July 21, 2010

28. Multi-Master many different ways to implement this, few that actually work in production Wednesday, July 21, 2010

29. Multi-Master many different ways to implement this, few that actually work in production write to any node, database syncs data Wednesday, July 21, 2010

30. Multi-Master many different ways to implement this, few that actually work in production write to any node, database syncs data can reduce cpu, doesn’t reduce i/o Wednesday, July 21, 2010

31. Multi-Master many different ways to implement this, few that actually work in production write to any node, database syncs data can reduce cpu, doesn’t reduce i/o failover solution not a scalability solution Wednesday, July 21, 2010

32. Horizontal Partitioning “sharding” “SOA” Wednesday, July 21, 2010

33. Horizontal Partitioning “sharding” “SOA” Wednesday, July 21, 2010

34. Horizontal Partitioning • Divide schema by job operations • Move each piece to own server • Duplicate some data as needed Wednesday, July 21, 2010

35. Horizontal Partitioning items • Divide schema by job operations • Move each piece to own server • Duplicate some data as needed forums users Wednesday, July 21, 2010

36. Horizontal Partitioning items • Divide schema by job operations • Move each piece to own server • Duplicate some data as needed forums • You must separate dependencies in the app code ﬁrst! users Wednesday, July 21, 2010

37. Horizontal Partitioning items • Divide schema by job operations • Move each piece to own server • Duplicate some data as needed forums • You must separate dependencies in the app code ﬁrst! users Each node is a new instance of vertical scaling Wednesday, July 21, 2010

38. Horizontal Scaling • data split across servers based on algorithm • data dropped into buckets (multiple?) app magic hash algorithm Wednesday, July 21, 2010

39. Horizontal Scaling • data split across servers based on algorithm • data dropped into buckets (multiple?) magic hash algorithm app Wednesday, July 21, 2010

40. Horizontal Scaling • data split across servers based on algorithm • data dropped into buckets (multiple?) •someone must keep track of data, and provide lookup services magic hash algorithm app Wednesday, July 21, 2010

41. Universal Truths of Scaling Databases Vertical Scalability is Helpful for Every Pattern Even in a horizontally scaled, fully distributed database, the number of nodes needed is affected by vertical scalability Wednesday, July 21, 2010

42. Universal Truths of Scaling Databases New Nodes Are Never Free • Add points of failure • Add management costs • Add complexity to architecture • Add complexity to your app code Wednesday, July 21, 2010

43. MyFirstDB V Vertical Partitioning V Vertical Scaling V Read Slaves V Horizontal Partitioning Wednesday, July 21, 2010

44. tips Wednesday, July 21, 2010

45. tips plan for layered data sources Wednesday, July 21, 2010

46. tips plan for layered data sources read / write connections in code Wednesday, July 21, 2010

47. tips plan for layered data sources read / write connections in code use schemas to separate services Wednesday, July 21, 2010

48. THANKS! more: xzilla.net @robtreat2 omniti.com/surge (scalability conference) Wednesday, July 21, 2010