Using Mongo At Shopwiki

•Download as KEY, PDF•

3 likes•1,461 views

Avery Rosen

Presentation by Avery Rosen, CTO of ShopWiki.com, on how MongoDB is being used all over their enterprise.

Technology

MongoDB
@ShopWiki.com
our swiss-army datastore

Overview

Introductions
Uses at ShopWiki
Beneﬁts and Tradeoffs
Gotchas

Uses at ShopWiki

Site Visit Analytics
Datafeeds

Uses at ShopWiki

Site Visit Analytics
Datafeeds
Site Browsers

Uses at ShopWiki

Site Visit Analytics
Datafeeds
Site Browsers
Image/Thumbnail Server

Uses at ShopWiki

Site Visit Analytics
Datafeeds
Site Browsers
Image/Thumbnail Server
One-offs of All Kinds

Visit Analytics - contents
Data Size
Total On Disk: 869GB
Largest collection:
count : 88729347 items
size : 165GB
totalIndexSize : 18GB

Visit Analytics - usage
Typical
inserts/s query/s update/s delete/s getmore/s locked % conn

222 133 284 0 2 11% 738

Use spike
inserts/s query/s update/s delete/s getmore/s locked % conn

710 420 654 0 9 10% 650

Image/Thumbnail Server

Before: custom append-only datastore
After: MongoDB all the way!

Beneﬁts

Prototype to Production, always extensible

Beneﬁts

Prototype to Production, always extensible
JSON objects > ORM

Beneﬁts

Prototype to Production, always extensible
JSON objects > ORM
No joins in code

Beneﬁts

Prototype to Production, always extensible
JSON objects > ORM
No joins in code
One-Button Replication

Tradeoffs

No “DESCRIBE” (use indices instead)
Denormalization: Storage and Replication
Date handling
Typos mean schema corruption

Many-to-many
NodeID Color Shape ProductID Feel Temp
890 Purple Round 98 Soft 50
1039 Brown Square 202 Hard 98
6029 Brown Triangle 451 Squishy 102

NodeID ProductID
890 202
890 98
6029 451
1039 451

Inverted List Pairs
{ NodeID : 890, Products : [ 202, 98 ], Color : "Purple",
Shape : "Round" },
{ NodeID : 6029, Products : [ 451 ], Color : "Brown", Shape :
"Triangle" },
etc...

Inverted List Pairs
{ NodeID : 890, Products : [ 202, 98 ], Color : "Purple",
Shape : "Round" },
{ NodeID : 6029, Products : [ 451 ], Color : "Brown", Shape :
"Triangle" },
etc...

YOUR CODE HERE

Inverted List Pairs
{ NodeID : 890, Products : [ 202, 98 ], Color : "Purple",
Shape : "Round" },
{ NodeID : 6029, Products : [ 451 ], Color : "Brown", Shape :
"Triangle" },
etc...

YOUR CODE HERE

{ ProductID : 451, BrowseNodes : [ 6029, 1039 ], Feel :
"Squishy", Temp : 102 },
{ ProductID : 202, BrowseNodes : [ 890 ], Feel : "Hard",
Temp : 98 },
etc...

Datafeed alerting, RDB

No “INTERVAL 1 DAY”
Feed status
feed offer_count site_id date

Alerts
feed offer_count site_id date ack ﬁx_target

$Datafeed alerting, Mongo No joins, selects... index sub-objects { feed, offer_count, site_id, date, alert : [ status, time, etc...], last_good_count }$

Gotchas

Prototype to Production: ensureIndex() is cheap

$Gotchas Prototype to Production: ensureIndex() is cheap ext3 -- banished from the land oplog size for replication {number_of_times_the_user_clicked : 1}$

AFTER PARTY @SLATE
SPECIAL THANKS TO GILT FOR SPONSORING
54 WEST 21st STREET

Similar to Using Mongo At Shopwiki

Keynote: New in MongoDB: Atlas, Charts, and StitchMongoDB

Super spikeMichael Falanga

Java/Scala Lab: Борис Трофимов - Обжигающая Big Data.GeeksLab Odessa

MongoDB at ZPUGDCMike Dirolf

Benefits of using MongoDB: Reduce Complexity & Adapt to ChangesAlex Nguyen

tranSMART Community Meeting 5-7 Nov 13 - Session 2: MongoDB: What, Why And WhenDavid Peyruc

MongoDB, E-commerce and TransactionsSteven Francia

MongoDB World 2018: KeynoteMongoDB

Modeling JSON data for NoSQL document databasesRyan CrawCour

MongoDB for AnalyticsMongoDB

WSO2 Analytics Platform: The one stop shop for all your data needsSriskandarajah Suhothayan

Simplifying & accelerating application development with MongoDB's intelligent...Maxime Beugnet

Freeing Yourself from an RDBMS ArchitectureDavid Hoerster

Applying NLP to product comparison at visual metaRoss Turner

Agile Database Development with JSONChris Saxon

Webinar: Position and Trade Management with MongoDBMongoDB

Webinar: Best Practices for Getting Started with MongoDBMongoDB

MongoDB Best PracticesLewis Lin 🦊

Webinar: Scaling MongoDBMongoDB

Autogenerate Awesome GraphQL Documentation with SpectaQLNordic APIs

Similar to Using Mongo At Shopwiki (20)

Keynote: New in MongoDB: Atlas, Charts, and Stitch

Super spike

Java/Scala Lab: Борис Трофимов - Обжигающая Big Data.

MongoDB at ZPUGDC

Benefits of using MongoDB: Reduce Complexity & Adapt to Changes

tranSMART Community Meeting 5-7 Nov 13 - Session 2: MongoDB: What, Why And When

MongoDB, E-commerce and Transactions

MongoDB World 2018: Keynote

Modeling JSON data for NoSQL document databases

MongoDB for Analytics

WSO2 Analytics Platform: The one stop shop for all your data needs

Simplifying & accelerating application development with MongoDB's intelligent...

Freeing Yourself from an RDBMS Architecture

Applying NLP to product comparison at visual meta

Agile Database Development with JSON

Webinar: Position and Trade Management with MongoDB

Webinar: Best Practices for Getting Started with MongoDB

MongoDB Best Practices

Webinar: Scaling MongoDB

Autogenerate Awesome GraphQL Documentation with SpectaQL

Recently uploaded

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge

[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745

Finology Group – Insurtech Innovation Award 2024The Digital Insurer

A Call to Action for Generative AI in 2024Results

CNv6 Instructor Chapter 6 Quality of Servicegiselly40

How to convert PDF to text with Nanonetsnaman860154

Developing An App To Navigate The Roads of BrazilV3cube

08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls

Partners Life - Insurer Innovation Award 2024The Digital Insurer

Scaling API-first – The story of a global engineering organizationRadu Cotescu

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG

🐬 The future of MySQL is Postgres 🐘RTylerCroy

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls

Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge

Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2

Recently uploaded (20)

Presentation on how to chat with PDF using ChatGPT code interpreter

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf

[2024]Digital Global Overview Report 2024 Meltwater.pdf

Finology Group – Insurtech Innovation Award 2024

A Call to Action for Generative AI in 2024

CNv6 Instructor Chapter 6 Quality of Service

How to convert PDF to text with Nanonets

Developing An App To Navigate The Roads of Brazil

08448380779 Call Girls In Civil Lines Women Seeking Men

Partners Life - Insurer Innovation Award 2024

Scaling API-first – The story of a global engineering organization

How to Troubleshoot Apps for the Modern Connected Worker

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx

🐬 The future of MySQL is Postgres 🐘

08448380779 Call Girls In Friends Colony Women Seeking Men

08448380779 Call Girls In Greater Kailash - I Women Seeking Men

Driving Behavioral Change for Information Management through Data-Driven Gree...

Exploring the Future Potential of AI-Enabled Smartphone Processors

Using Mongo At Shopwiki

1. MongoDB @ShopWiki.com our swiss-army datastore

2. Overview Introductions Uses at ShopWiki Beneﬁts and Tradeoffs Gotchas

3. ShopWiki - what we do

4. MongoDB Founder Pedigree

5. Uses at ShopWiki

6. Uses at ShopWiki Site Visit Analytics

7. Uses at ShopWiki Site Visit Analytics Datafeeds

8. Uses at ShopWiki Site Visit Analytics Datafeeds Site Browsers

9. Uses at ShopWiki Site Visit Analytics Datafeeds Site Browsers Image/Thumbnail Server

10. Uses at ShopWiki Site Visit Analytics Datafeeds Site Browsers Image/Thumbnail Server One-offs of All Kinds

11. Visit Analytics - contents Data Size Total On Disk: 869GB Largest collection: count : 88729347 items size : 165GB totalIndexSize : 18GB

12. Visit Analytics - usage Typical inserts/s query/s update/s delete/s getmore/s locked % conn 222 133 284 0 2 11% 738 Use spike inserts/s query/s update/s delete/s getmore/s locked % conn 710 420 654 0 9 10% 650

13. Datafeeds { ProductID : 2309, Title : “Elephant Leash”, Brand : “Acme”, Price : 49.99, Breadcrumbs : [ “Pets”, “Exotic”, “Accessories” ], Description : “Horton will love this stylish and functional leash, and you won’t violate any local statutes when you walk around with the Acme Elephant Leash!” }

14. Site-Browsing Datastore

15. Image/Thumbnail Server Before: custom append-only datastore After: MongoDB all the way!

16. Beneﬁts

17. Beneﬁts Prototype to Production, always extensible

18. Beneﬁts Prototype to Production, always extensible JSON objects > ORM

19. Beneﬁts Prototype to Production, always extensible JSON objects > ORM No joins in code

20. Beneﬁts Prototype to Production, always extensible JSON objects > ORM No joins in code One-Button Replication

21. Tradeoffs No “DESCRIBE” (use indices instead) Denormalization: Storage and Replication Date handling Typos mean schema corruption

22. Many-to-many NodeID Color Shape ProductID Feel Temp 890 Purple Round 98 Soft 50 1039 Brown Square 202 Hard 98 6029 Brown Triangle 451 Squishy 102 NodeID ProductID 890 202 890 98 6029 451 1039 451

23. Inverted List Pairs { NodeID : 890, Products : [ 202, 98 ], Color : "Purple", Shape : "Round" }, { NodeID : 6029, Products : [ 451 ], Color : "Brown", Shape : "Triangle" }, etc...

24. Inverted List Pairs { NodeID : 890, Products : [ 202, 98 ], Color : "Purple", Shape : "Round" }, { NodeID : 6029, Products : [ 451 ], Color : "Brown", Shape : "Triangle" }, etc... YOUR CODE HERE

25. Inverted List Pairs { NodeID : 890, Products : [ 202, 98 ], Color : "Purple", Shape : "Round" }, { NodeID : 6029, Products : [ 451 ], Color : "Brown", Shape : "Triangle" }, etc... YOUR CODE HERE { ProductID : 451, BrowseNodes : [ 6029, 1039 ], Feel : "Squishy", Temp : 102 }, { ProductID : 202, BrowseNodes : [ 890 ], Feel : "Hard", Temp : 98 }, etc...

26. Datafeed alerting, RDB No “INTERVAL 1 DAY” Feed status feed offer_count site_id date Alerts feed offer_count site_id date ack ﬁx_target

27. Datafeed alerting, Mongo No joins, selects... index sub-objects { feed, offer_count, site_id, date, alert : [ status, time, etc...], last_good_count }

28. Gotchas

29. Gotchas Prototype to Production: ensureIndex() is cheap

30. Gotchas Prototype to Production: ensureIndex() is cheap ext3 -- banished from the land

31. Gotchas Prototype to Production: ensureIndex() is cheap ext3 -- banished from the land oplog size for replication

32. Gotchas Prototype to Production: ensureIndex() is cheap ext3 -- banished from the land oplog size for replication {number_of_times_the_user_clicked : 1}

33. AFTER PARTY @SLATE SPECIAL THANKS TO GILT FOR SPONSORING 54 WEST 21st STREET

Editor's Notes

Shopping search engine; crawl the web using AI to aggregate; add data feeds; in-memory search; web front-end
relationship with founders, opportunity, Eliot: final project together, I was playing with QT he wrote a DB and network protocol. Dwight wrote the adserver, code I became highly familiar with on the adserver team.
Largest, write-only
highly utilized
perfect for document oriented architecture, same format as we use to eventually index
browse structure for consumers and SEO, daily updates, live access, cached in front-end
historical note: doubleclick&#x2019;s imageserver. no brainer to convert backend to avoid maintenance overhead
Prototype: schema extensible, no need for table alters, as in visit table; JSON instead of ORM; joins can be ugly and unpredictable
Prototype: schema extensible, no need for table alters, as in visit table; JSON instead of ORM; joins can be ugly and unpredictable
Prototype: schema extensible, no need for table alters, as in visit table; JSON instead of ORM; joins can be ugly and unpredictable
Prototype: schema extensible, no need for table alters, as in visit table; JSON instead of ORM; joins can be ugly and unpredictable
Many to many joins are missing, but you might not miss them. Storage is cheap, although has consequences for replication; correct for typos with testing
denormalization, but storage is cheap
denormalization, but storage is cheap
date functions missing
with document, can key on alerting, no hunting for last_good_count
it&#x2019;s easy to roll out code without indices; ext3 is just terrible; big data, 10% of empty too much, custom oplog size, too small; some people using false-ORM to minify attribute labels
it&#x2019;s easy to roll out code without indices; ext3 is just terrible; big data, 10% of empty too much, custom oplog size, too small; some people using false-ORM to minify attribute labels
it&#x2019;s easy to roll out code without indices; ext3 is just terrible; big data, 10% of empty too much, custom oplog size, too small; some people using false-ORM to minify attribute labels
it&#x2019;s easy to roll out code without indices; ext3 is just terrible; big data, 10% of empty too much, custom oplog size, too small; some people using false-ORM to minify attribute labels

Using Mongo At Shopwiki

Recommended

Recommended

More Related Content

Similar to Using Mongo At Shopwiki

Similar to Using Mongo At Shopwiki (20)

Recently uploaded

Recently uploaded (20)

Using Mongo At Shopwiki

Editor's Notes