9. Culprit: B+Tree index
• Good at sequential inserts
  • e.g. ObjectId, sequence number, timestamp
• Poor at random inserts
  • e.g. indexes on randomly-distributed data
10. Sequential vs. Random insert
[Diagram: two B+Trees receiving 12 inserts each. The sequential tree gets keys in order (1, 2, 3, …, 12), so only its rightmost pages are hot; the random tree gets keys like 55, 75, 78, 1, 99, 36, …, so pages all over the tree are touched.]
• Sequential insert ➔ small working set ➔ fits in RAM ➔ sequential I/O (bandwidth-bound)
• Random insert ➔ large working set ➔ cannot fit in RAM ➔ random I/O (IOPS-bound)
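The working-set difference can be illustrated with a toy model of a B+Tree: treat each run of `PAGE_SIZE` consecutive key values as one leaf page and count how many distinct pages the most recent inserts touch. The page size and window here are made-up parameters, not MongoDB internals.

```python
import random

PAGE_SIZE = 100   # keys per leaf page (toy model, not a real B+Tree constant)
WINDOW = 1_000    # how many recent inserts we treat as the "working set"

def working_set_pages(keys, window=WINDOW):
    """Count distinct leaf pages touched by the last `window` inserts."""
    recent = keys[-window:]
    return len({k // PAGE_SIZE for k in recent})

N = 100_000
sequential = list(range(N))                        # monotonic keys (ObjectId-like)
rng = random.Random(42)
randomized = [rng.randrange(N) for _ in range(N)]  # randomly-distributed keys

seq_pages = working_set_pages(sequential)
rnd_pages = working_set_pages(randomized)
print(seq_pages, rnd_pages)  # sequential inserts touch far fewer pages
```

Sequential inserts concentrate on the rightmost pages of the tree, so the hot set stays tiny and cacheable; random inserts scatter across hundreds of pages, which is what turns the workload IOPS-bound once the tree outgrows RAM.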
35. But what if...
[Diagram: instead of one B+Tree with a large working set, the data is partitioned by month into 201208_*, 201209_*, 201210_* — only the current month's B+Tree stays hot.]
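One way to realize the monthly partitioning sketched above is to route each write to a per-month collection. This is only a sketch of the naming scheme; `collection_for` and the `events` base name are hypothetical, and in practice you would pass the result to your driver's collection accessor.

```python
from datetime import datetime, timezone

def collection_for(ts: datetime, base: str = "events") -> str:
    """Route a write to a per-month collection (hypothetical helper).

    Each month gets its own collection and thus its own B+Tree
    indexes; only the current month's tree stays hot, so the
    active working set remains small enough to fit in RAM.
    """
    return f"{base}_{ts:%Y%m}"

print(collection_for(datetime(2012, 8, 15, tzinfo=timezone.utc)))  # events_201208
```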
36. Sequential + hash key
• Can you predict the data growth rate?
• The balancer is not clever enough
  • It only considers the # of chunks
• Migration is slow during heavy writes
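The "sequential + hash" approach combines a time-ordered prefix with a hash bucket. A minimal sketch of what such a compound key value could look like, assuming a hypothetical `shard_key` helper and MD5 as the bucketing hash (MongoDB itself would declare a compound key via `shardCollection` rather than compute it client-side):

```python
import hashlib

def shard_key(month: str, doc_id: str, buckets: int = 256) -> tuple:
    """Build a (sequential prefix, hash bucket) compound key value.

    The month prefix keeps inserts clustered in time, while the
    hash bucket spreads each month's writes across `buckets` ranges.
    Hypothetical helper for illustration only.
    """
    bucket = int(hashlib.md5(doc_id.encode()).hexdigest(), 16) % buckets
    return (month, bucket)

print(shard_key("201210", "user-42"))
```

The slide's caveats still apply: the balancer only counts chunks, so it cannot tell that all current writes land in the newest month's ranges, and migrating those hot chunks during heavy writes is slow.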
38. Low-cardinality hash key
• Shard key range: A ~ D
  • e.g. A~Z, 00~FF
• Alleviates the B+Tree problem
  • Sequential access on a fixed # of parts, each with its own local B+Tree region
[Diagram: inserts A A A B B B C C C clustered by key value within the tree.]
40. Low-cardinality hash key
• Limits the # of possible chunks
  • e.g. 00 ~ FF ➔ at most 256 chunks
• A chunk can grow past 64 MB (the default max chunk size) with no remaining key value to split on
• Balancing becomes difficult
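The chunk-count ceiling follows directly from the key's cardinality: a two-hex-digit prefix has only 256 possible values, so no matter how many documents exist, there can never be more than 256 chunks. A small sketch (the `low_card_prefix` helper and MD5 choice are illustrative assumptions):

```python
import hashlib

def low_card_prefix(doc_id: str) -> str:
    """Two-hex-digit hash prefix: only 256 possible values (00..FF)."""
    return hashlib.md5(doc_id.encode()).hexdigest()[:2]

# Even 100,000 documents collapse onto at most 256 shard-key values,
# so chunk splitting runs out of split points at 256 chunks.
prefixes = {low_card_prefix(f"doc-{i}") for i in range(100_000)}
print(len(prefixes))  # bounded by 256
```

Once every chunk holds a single key value, a chunk that outgrows the maximum size cannot split further, which is why balancing degrades.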
44. Lessons learned
• Know the performance impact of secondary indexes
• Choose the right shard key
• Test with large data sets
• Linear scalability is hard
  • If you really need it, consider HBase or Cassandra
  • SSDs help with random, IOPS-bound I/O