MongoDB Deployment Tips

Jared Rosoff - @forjared

Agenda
  Sizing Machines
    Understanding working set
    Sizing RAM
    Sizing Disk
  Configuring Replica Sets
    Understanding failover
    Avoiding single points of failure
    Minimizing recovery time

Overview
  [Diagram: a sharded cluster with 2 routers, 3 config servers, and 3 shards (Shard 1, Shard 2, Shard 3), each shard a replica set with one primary and two secondaries.]

Servers and Hardware

[Diagram, built up over several slides: Collection 1 and Index 1 are mapped into Virtual Address Space 1.]
  The mapped region is your virtual memory size (mapped).
  The part of it held in physical RAM is your resident memory size.
  The collection and index files live on disk; a second mapping, Virtual Address Space 2, also appears in the diagram.
  The last slide annotates two access costs: 100 ns and 10,000 ns.

Disk configurations
  Single disk: ~200 seeks / second
  RAID 0: three disks, ~200 seeks / second each
  RAID 10: three mirrored pairs, ~400 seeks / second each

SSDs??
  Seek time of 0.1 ms vs 5 ms (200 seeks / second => 10,000 seeks / second)
  But expensive
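The seek figures above follow from simple arithmetic: seeks per second is roughly one second divided by the average seek time. The sketch below reproduces the slide's numbers. The function name is made up for illustration, and the doubling for a mirrored pair is an assumption (reads served by either disk of the pair), not a measured figure.

```python
def seeks_per_second(seek_time_ms: float) -> float:
    """Approximate random seeks one device can serve per second."""
    return 1000.0 / seek_time_ms


# Seek times taken from the slide: ~5 ms for a spinning disk, ~0.1 ms for an SSD.
hdd = seeks_per_second(5.0)
ssd = seeks_per_second(0.1)

# Assumption: a mirrored pair can serve reads from either disk,
# so its read seek capacity is roughly double that of one disk.
mirrored_pair = 2 * hdd

print(f"single disk:   ~{hdd:.0f} seeks / second")
print(f"mirrored pair: ~{mirrored_pair:.0f} seeks / second")
print(f"SSD:           ~{ssd:.0f} seeks / second")
```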

Tips for sizing hardware
  Know how important page faults are
    If you want low latency, avoid page faults
  Size memory appropriately
    To avoid page faults, fit everything in RAM
    Collection data + index data
  Provision disk appropriately
    RAID 10 is recommended
    SSDs are fast, if you can afford them
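The memory rule above (collection data plus index data should fit in RAM) can be checked with a few lines of arithmetic. This is a minimal sketch, not a sizing tool. The function name and the `headroom` parameter are assumptions added for illustration; `headroom` reserves a share of RAM for the operating system and other processes. The byte counts would come from your own deployment, for instance from the `dataSize` and `indexSize` fields reported by the `dbStats` command.

```python
GIB = 1024 ** 3


def fits_in_ram(data_bytes: int, index_bytes: int, ram_bytes: int,
                headroom: float = 0.8) -> bool:
    """Return True if data + indexes fit in the usable share of RAM.

    headroom is a hypothetical safety factor: only this fraction of
    physical RAM is assumed to be available for MongoDB's data.
    """
    needed = data_bytes + index_bytes
    return needed <= ram_bytes * headroom


# Made-up example sizes: 40 GiB of collection data and 8 GiB of indexes.
print(fits_in_ram(40 * GIB, 8 * GIB, 64 * GIB))  # enough RAM
print(fits_in_ram(40 * GIB, 8 * GIB, 32 * GIB))  # expect page faults
```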

Replica Sets

Understanding automatic failover

Primary Election
  As long as a partition can see a majority (>50%) of the cluster, it will elect a primary.
  [Diagram: Secondary, Primary, Secondary]
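The majority rule can be written as a one-line predicate. This is a sketch of the rule as stated on the slide, assuming every member has one vote; the function name is made up for illustration.

```python
def can_elect_primary(visible: int, total: int) -> bool:
    """True if a partition seeing `visible` of `total` voting members
    holds a strict majority (>50%) and can therefore elect a primary."""
    return 2 * visible > total


# Three-member set: two visible members can elect, one cannot.
print(can_elect_primary(2, 3))
print(can_elect_primary(1, 3))
```

Using `2 * visible > total` rather than a division avoids floating-point comparisons and makes the strict "more than half" requirement explicit.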

Simple Failure
  One of three nodes failed: 66% of cluster visible. Primary is elected.
  Two of three nodes failed: 33% of cluster visible. Read only mode.

Network Partition
  [Diagram: Secondary, Primary, Secondary, split by a network partition]
  Side that sees two of three nodes: 66% of cluster visible. Primary is elected.
  Side that sees one of three nodes: 33% of cluster visible. Read only mode.

Even Cluster Size
  [Diagram: four nodes, one Primary and three Secondaries, split two and two]
  Each side: 50% of cluster visible. Read only mode.
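One way to see why an even cluster size buys nothing: count how many members can fail before the survivors no longer form a majority. The sketch below assumes one vote per member; the function names are made up for illustration.

```python
def majority(total: int) -> int:
    """Smallest number of members that is strictly more than half."""
    return total // 2 + 1


def tolerated_failures(total: int) -> int:
    """Members that can be lost while a majority is still visible."""
    return total - majority(total)


for size in range(3, 8):
    print(f"{size} members: majority {majority(size)}, "
          f"tolerates {tolerated_failures(size)} failure(s)")
```

Under these assumptions a four-member set tolerates one failure, the same as a three-member set, and a two-and-two split leaves neither side with a majority.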

Avoiding single points of failure

Avoid single points of failure
  [Diagram: Secondary, Primary, Secondary]
  Risks: top of rack switch, rack falls over

Better
  [Diagram: Secondary, Primary, Secondary]
  Risks: loss of internet, building burns down

Better yet
  San Francisco: Primary, Secondary
  Dallas: Secondary

Priorities
  San Francisco: Primary (priority 1), Secondary (priority 1)
  Dallas: Secondary (priority 0)
    Disaster recovery data center. Will never become primary automatically.
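The priority layout above maps onto the `members` array of a replica set configuration document, where each member has an `_id`, a `host`, and an optional `priority`. The sketch below writes that document as a Python dict and lists which members may become primary. The set name and hostnames are hypothetical placeholders, and the helper function is made up for illustration; it only inspects the dict and does not talk to a server.

```python
# Hypothetical replica set name and hostnames, for illustration only.
replica_set_config = {
    "_id": "rs0",
    "members": [
        {"_id": 0, "host": "sf-1.example.net:27017", "priority": 1},
        {"_id": 1, "host": "sf-2.example.net:27017", "priority": 1},
        # Priority 0: the Dallas member replicates data but is never
        # elected primary automatically.
        {"_id": 2, "host": "dallas-1.example.net:27017", "priority": 0},
    ],
}


def electable_hosts(config: dict) -> list:
    """Hosts that are allowed to become primary (priority above 0).

    A member without an explicit priority is treated as priority 1,
    which is MongoDB's default.
    """
    return [m["host"] for m in config["members"] if m.get("priority", 1) > 0]


print(electable_hosts(replica_set_config))
```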

Even Better
  Members in San Francisco, New York, and Dallas (one Primary, two Secondaries)
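A quick way to compare the two-site and three-site layouts is to ask whether the set still has a majority after any single data center is lost. This is a minimal sketch assuming one vote per member; the function name is made up for illustration, and the site lists mirror the slides.

```python
from collections import Counter


def survives_any_site_loss(member_sites: list) -> bool:
    """True if losing any one site still leaves a strict majority
    of the members visible to each other."""
    total = len(member_sites)
    members_per_site = Counter(member_sites)
    return all(2 * (total - lost) > total
               for lost in members_per_site.values())


two_sites = ["San Francisco", "San Francisco", "Dallas"]
three_sites = ["San Francisco", "New York", "Dallas"]

print(survives_any_site_loss(two_sites))    # losing San Francisco leaves 1 of 3
print(survives_any_site_loss(three_sites))  # any loss leaves 2 of 3
```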

2 Replicas + Arbiter??
  Is this a good idea?
  [Diagram, panels 1 to 3: Arbiter, Primary, Secondary; in panel 3 a Secondary is brought up with a Full Sync.]
  Uh oh. Full Sync is going to use a lot of resources on the primary, so I may have downtime or degraded performance.

With 3 replicas
  [Diagram, panels 1 to 3: Primary, Secondary, Secondary; in panel 3 a Secondary is brought up with a Full Sync.]
  Sync can happen from a secondary, which will not impact traffic on the primary.

Tips for choosing replica set topology
  Avoid single points of failure
    Separate racks
    Separate data centers
  Avoid long recovery downtime
    Use journaling
    Use 3+ replicas
  Keep your actives close
    Use priority to control where failovers happen

Summary
  Sizing a machine
    Know your working set size
    Size RAM appropriately
    Provision sufficient disks
  Designing a replica set
    Know how failover happens
    Design for failure
    Design for fast recovery
