MongoDB Basic Concepts

Agenda

• Overview
• Replication
• Scalability
• Consistency & Durability
• Flexibility / Developer Experience

Norberto Leite
Senior Solutions Architect
@nleite / norberto@10gen.com
Barcelona

Love MongoDB and others ...

Fundamentals

• Document Oriented
{
  name: 'Norberto Leite',
  position: 'SA',
  nick: 'WingMan',
  based: ['Barcelona', 'London']
}
• High Performance
• Fully Consistent
• Horizontal Scalability

[Diagram: an Application connected to four mongoDB nodes]

Why do we need Replication?

• Failover
• Backups
• Secondary Batch Jobs
• High Availability

Outages

• Planned
– Hardware upgrade
– OS or file-system tuning
– Software upgrade
– Relocation of data to new file-system / storage
• Unplanned
– Human Error
– Hardware Failure
– Data Center / Region Outage
– Application Corruption

Replica Sets

• Data Protection
– Multiple copies of data
– Data spread across data centers, availability zones, etc.
• High Availability
– Automated Failover
– Automated Recovery

Asynchronous Replication

[Diagram: the App writes to the Primary and reads from it by default; two Secondaries can optionally serve reads.]

Failover

[Diagram: the same replica set: App, Primary (write, default read), and two Secondaries (optional read).]

Automatic Failover: Primary Election

[Diagram: App, the former Primary, a newly elected Primary (write, default read), and one Secondary (optional read).]

Automatic Recovery

[Diagram: App, a Recovery node rejoining as a Secondary (optional read), the Primary (write, default read), and one Secondary (optional read).]
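
The failover and election slides rest on a majority rule: a replica set can elect a primary only while a strict majority of its voting members can reach one another. The sketch below shows only that arithmetic. The function names are hypothetical, and this is not MongoDB's election code.

```javascript
// Hypothetical helper: smallest number of votes that forms a strict majority.
function majority(votingMembers) {
  return Math.floor(votingMembers / 2) + 1;
}

// True when enough members are reachable to elect a primary.
function canElectPrimary(votingMembers, reachableMembers) {
  return reachableMembers >= majority(votingMembers);
}

// Three-member set as in the diagrams: losing the primary leaves 2 of 3.
console.log(canElectPrimary(3, 2)); // true
console.log(canElectPrimary(3, 1)); // false
```

With three members, one node can fail and the remaining two still form a majority, which is why the diagrams show one primary and two secondaries.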

Sharding

• Data Location Transparent to Code

• Data Distribution is Automatic
– as well as re-distribution

• Aggregates system resources horizontally

• No CODE Changes!!!

Range Distribution

sh.shardCollection("test.tweets", {_id: 1}, false)

shard01: a-i
shard02: j-m
shard03: n-z
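
To make range distribution concrete, here is a minimal sketch of looking up the shard that owns a key. The `chunks` table and `shardFor` function are hypothetical. Real chunks are bounded by full shard key values with an inclusive lower bound and an exclusive upper bound, so the first-letter comparison only mirrors the slide's labels.

```javascript
// Chunk ranges taken from the slide; each range is inclusive on the first letter.
const chunks = [
  { shard: "shard01", min: "a", max: "i" },
  { shard: "shard02", min: "j", max: "m" },
  { shard: "shard03", min: "n", max: "z" },
];

// Hypothetical router: find the shard whose range covers the key's first letter.
function shardFor(id) {
  const first = id[0].toLowerCase();
  const chunk = chunks.find((c) => first >= c.min && first <= c.max);
  return chunk ? chunk.shard : null;
}

console.log(shardFor("norberto")); // shard03
console.log(shardFor("alice"));    // shard01
```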

Chunk Split

shard01: a-i
shard02: j-m, split into ja-jz and k-m; k-m split again into ka-kj and ki-m
shard03: n-z
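
A chunk split can be pictured as cutting one key range in two at a split point, with both halves staying on the same shard until the balancer moves them. The sketch below assumes half-open ranges, and `splitChunk` is a hypothetical name rather than a MongoDB API.

```javascript
// Hypothetical chunk split: cut one key range into two at a split point.
// Ranges use an inclusive lower bound and an exclusive upper bound.
function splitChunk(chunk, splitPoint) {
  if (!(splitPoint > chunk.min && splitPoint < chunk.max)) {
    throw new RangeError("split point must fall strictly inside the chunk");
  }
  return [
    { shard: chunk.shard, min: chunk.min, max: splitPoint },
    { shard: chunk.shard, min: splitPoint, max: chunk.max },
  ];
}

// The j-m chunk on shard02, written as the half-open range ["j", "n").
const [left, right] = splitChunk({ shard: "shard02", min: "j", max: "n" }, "k");
console.log(left, right);
```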

Auto Balancing

[Diagram: the chunks a-i, ja-jz, ka-kj, ki-m, and n-z are spread across shard01, shard02, and shard03.]
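
The idea behind auto balancing can be sketched as moving chunks from the fullest shard to the emptiest until the chunk counts are nearly even. This is an illustration only: the `balance` function is hypothetical, and MongoDB's balancer applies its own thresholds and migration rules.

```javascript
// Hypothetical balancer: move one chunk at a time from the fullest shard
// to the emptiest until their chunk counts differ by at most one.
function balance(shards) {
  const result = {};
  for (const name of Object.keys(shards)) result[name] = [...shards[name]];
  const names = Object.keys(result);
  for (;;) {
    names.sort((a, b) => result[a].length - result[b].length);
    const emptiest = names[0];
    const fullest = names[names.length - 1];
    if (result[fullest].length - result[emptiest].length <= 1) return result;
    result[emptiest].push(result[fullest].pop());
  }
}

// Chunk labels from the slide, with all of the j-m splits still on shard02.
const before = {
  shard01: ["a-i"],
  shard02: ["ja-jz", "ka-kj", "ki-m"],
  shard03: ["n-z"],
};
console.log(balance(before));
```

The input object is copied rather than mutated, so the state before and after balancing can be compared.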

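Routed Queries

db.tweets.find({_id: 'norberto'})

[Diagram: shard01, shard02, and shard03 holding the chunks a-i, ja-jz, ka-kj, ki-m, and n-z; the query on the shard key _id is routed to a single shard.]

Scatter Gather

db.tweets.find({email: 'norberto@10gen'})

[Diagram: the same shards and chunks; the query does not include the shard key, so it is sent to every shard.]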
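
The difference between a routed query and a scatter-gather query comes down to whether the query contains the shard key. The sketch below assumes the initial a-i / j-m / n-z distribution; `targetShards` and `shardForKey` are hypothetical names, not part of the MongoDB API.

```javascript
const SHARDS = ["shard01", "shard02", "shard03"];
const SHARD_KEY = "_id"; // shard key from sh.shardCollection("test.tweets", {_id: 1})

// Hypothetical range lookup on the first letter, mirroring the a-i / j-m / n-z labels.
function shardForKey(value) {
  const c = value[0].toLowerCase();
  if (c >= "a" && c <= "i") return "shard01";
  if (c >= "j" && c <= "m") return "shard02";
  return "shard03";
}

// Decide which shards a query must visit.
function targetShards(query) {
  if (typeof query[SHARD_KEY] === "string") {
    return [shardForKey(query[SHARD_KEY])]; // routed: one shard
  }
  return [...SHARDS]; // scatter gather: every shard
}

console.log(targetShards({ _id: "norberto" }));         // one shard
console.log(targetShards({ email: "norberto@10gen" })); // all three shards
```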

Caching

• shard01: 96 GB Mem, 300 GB Data (a-i, j-r, n-z)
• 3:1 Data/Mem

Horizontal Distribution

• shard01: 96 GB Mem, 100 GB Data (a-i), 1:1 Data/Mem
• shard02: 96 GB Mem, 100 GB Data (j-r), 1:1 Data/Mem
• shard03: 96 GB Mem, 100 GB Data (n-z), 1:1 Data/Mem
• 300 GB Data in total
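
The ratios on these two slides follow from simple arithmetic, worked through below using the slide's figures. The function name is hypothetical, and the even spread of data across servers is an assumption.

```javascript
// Figures from the slides: 300 GB of data, 96 GB of memory per server.
const DATA_GB = 300;
const MEM_PER_SERVER_GB = 96;

// Data-to-memory ratio per server when data is spread evenly over `servers`.
function dataToMemRatio(servers) {
  return DATA_GB / servers / MEM_PER_SERVER_GB;
}

console.log(dataToMemRatio(1).toFixed(1)); // 3.1, roughly 3:1
console.log(dataToMemRatio(3).toFixed(1)); // 1.0, roughly 1:1
```

At roughly 1:1, each shard's share of the data is close to the size of its memory, which is the point of the Horizontal Distribution slide.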

Consistency

• Eventual Consistency
– Allow updates when a system has been partitioned
– Resolve conflicts later
– Example: Cassandra, CouchDB

• Immediate Consistency
– Single Master
– Avoids conflicts
– Example: MongoDB

Durability

• For how long is my data available?
• When do I know my data is safe?!
• Where is it safe?

• MongoDB style:
– Fire and Forget
– Get Last Error
– Journal Sync
– Replica Safe

Durability

[Chart: the durability levels Memory, Journal, Secondary Nodes, and Multiple Data Centers, compared for RDBMS and the MongoDB settings Async, w=1 (default), j=true, w=majority, and w="tag".]
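
The durability levels above correspond to write concern options. The sketch below maps them to write concern documents. `w` and `j` are MongoDB write concern fields, while the level names, the helper `writeConcernFor`, and the tag name "multiDC" are illustrative assumptions.

```javascript
// Hypothetical mapping from the slide's durability levels to write concern
// documents. The keys and the tag name "multiDC" are made up for illustration.
const WRITE_CONCERNS = {
  fireAndForget: { w: 0 },        // do not wait for acknowledgement
  acknowledged: { w: 1 },         // the primary has acknowledged the write
  journaled: { w: 1, j: true },   // the primary has written it to the journal
  replicaSafe: { w: "majority" }, // a majority of members have the write
  tagged: { w: "multiDC" },       // custom write concern defined in the replica set config
};

// Return a copy so callers cannot change the shared table.
function writeConcernFor(level) {
  const wc = WRITE_CONCERNS[level];
  if (!wc) throw new Error(`unknown durability level: ${level}`);
  return { ...wc };
}

console.log(writeConcernFor("journaled"));
```

A driver call would pass one of these documents as its write concern option. Stronger levels wait for more acknowledgement before returning, so the choice trades latency for durability.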

Data Model

• Why JSON?

– Well understood data format

– Maps simply to objects

– Linking & Embedding to describe relationships

JSON

place1 = {
  name : "10gen HQ",
  address : "578 Broadway 7th Floor",
  city : "New York",
  zip : "10011",
  tags : [ "business", "tech" ]
}

JSON & Scale Out

• Embedding removes the need for:

– Distributed Joins

– Two Phase Commit

• Enables data to be distributed across many nodes without penalty
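
Embedding and linking can be contrasted with a small sketch. The nested `address` sub-document, the `checkins` collection, and all ids are hypothetical; only the place fields come from the JSON slide.

```javascript
// Embedding: related data lives inside the parent document.
const placeEmbedded = {
  name: "10gen HQ",
  address: { street: "578 Broadway 7th Floor", city: "New York", zip: "10011" },
  tags: ["business", "tech"],
};

// Linking: documents refer to each other by _id (the ids are made up).
const places = [{ _id: 1, name: "10gen HQ", city: "New York" }];
const checkins = [{ _id: 100, placeId: 1, user: "nleite" }];

// Following a link takes a second lookup; embedded data needs none.
function placeOfCheckin(checkin) {
  return places.find((p) => p._id === checkin.placeId) || null;
}

console.log(placeEmbedded.address.city);       // New York
console.log(placeOfCheckin(checkins[0]).name); // 10gen HQ
```

The embedded document can be read with a single fetch from one node, which is why embedding avoids distributed joins. The linked form needs a second lookup that may land on a different node.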
