SlideShare ist ein Scribd-Unternehmen logo
1 von 12
Large Data
Largest Living creation:
1 California Redwood tree named the General Sherman
PROBLEM
Highly Interactive
2 Starlings forming fascinating formations over Tøndermarsken,
south-west Jutland, Denmark
PROBLEM
Huge user base
5 World’s largest gathering: Kumbh Mela 2007
PROBLEM
3
4
6 Brad Fitzpatrick, LiveJournal
SOUNDS FAMILIAR!
WHO ELSE HAS FACED THIS BEFORE?
Secondary Storage: Low access speed, high capacity
Processor: Small size, high speed
Network: Blazing fast
Primary Memory: Fast, cheap
Machine: Low reliability
COMPUTING HARDWARE
Pre-Generate Static Pages
• Fast Read
• Dynamic context aware content
• Popularity prediction in advance
Web Server Farm
• More concurrent web requests served
• Dynamic Content delivery
DB Replication
• Data Replicas for Load Balancing
• Fast Read
• Slow Writes for ACID conformance
ALTERNATIVES
DB Sharding
• Fast Read due to Horizontal partitioning
• Limited scalability
• Reliance on file system performance
MEMCACHED
Distributed object caching
Global distributed two layered object caching
MEMCACHED
Distributed object caching
Right granularity at right Layer
hashtableon RAM
Cross-Platform
Application namespace isolation
Network independence
Caches Objects, not raw data
Dynamic server cluster reconfiguration
Compression
No authentication Highly volatile
Servers
1
2
4 3
6
5
10
11
7
8
Clients on application Servers Data Stores
12
9
Cross-Platform
Application namespace isolation
Network independence
Caches Objects, not raw data
Dynamic server cluster reconfiguration
Compression
No authentication Highly volatile
MEMCACHED
Distributed object caching
Associative Array: Meta information
Runtime Cache: Single node applications
Database Cache: No large data, rarely updated, similar requests, object
creation not costly
Memcached: Large scale distributed applications with frequently
updated data and involving complex objects
File Cache: Long term, Large Objects
DB Read: Small scale needs
8 Cheetah, fastest land animal: 120km/h
7,9 Slug, slowest animal: 0.03mph
8
9
P
E
R
F
O
R
M
A
N
C
E
MEMCACHED
Distributed object caching
REFERENCES
1. http://www.mnn.com/sites/default/files/imagecache/node-gallery-display/general%20sherman.jpg
2. http://en.wikipedia.org/wiki/File:Sort_sol_pdfnet.jpg
3. http://odeworld.wordpress.com/2007/05/04/size-matters/
4. http://www.kumbhamela.net/
5. http://en.wikipedia.org/wiki/List_of_largest_peaceful_gatherings_in_history
6. http://picasaweb.google.com/dolboeb/BradFitzInSF#5176116242455706722
7. http://lilomag.com/2010/07/29/the-7-worlds-slowest-animal/
8. http://en.wikipedia.org/wiki/Cheetah
9. http://en.wikipedia.org/wiki/File:Slugs_1896.png
Memcached: http://benrobb.com/wp-content/uploads/2009/01/memcached.pdf
By: Jeremy Leishman, Ben Robison, Josh Taylor

Weitere ähnliche Inhalte

Ähnlich wie High scalability | Memcached - Distributed object caching

Big Data and Hadoop Ecosystem
Big Data and Hadoop EcosystemBig Data and Hadoop Ecosystem
Big Data and Hadoop EcosystemRajkumar Singh
 
Keith Norbie Flash Storage decision methodology - mnvmug
Keith Norbie Flash Storage decision methodology - mnvmugKeith Norbie Flash Storage decision methodology - mnvmug
Keith Norbie Flash Storage decision methodology - mnvmugKeith Norbie
 
Gluster open stack dev summit 042011
Gluster open stack dev summit 042011Gluster open stack dev summit 042011
Gluster open stack dev summit 042011Open Stack
 
Future of cloud storage
Future of cloud storageFuture of cloud storage
Future of cloud storageGlusterFS
 
Elastic storage in the cloud session 5224 final v2
Elastic storage in the cloud session 5224 final v2Elastic storage in the cloud session 5224 final v2
Elastic storage in the cloud session 5224 final v2BradDesAulniers2
 
Borthakur hadoop univ-research
Borthakur hadoop univ-researchBorthakur hadoop univ-research
Borthakur hadoop univ-researchsaintdevil163
 
WANdisco Non-Stop Hadoop: PHXDataConference Presentation Oct 2014
WANdisco Non-Stop Hadoop: PHXDataConference Presentation Oct 2014 WANdisco Non-Stop Hadoop: PHXDataConference Presentation Oct 2014
WANdisco Non-Stop Hadoop: PHXDataConference Presentation Oct 2014 Chris Almond
 
Константин Швачко, Yahoo!, - Scaling Storage and Computation with Hadoop
Константин Швачко, Yahoo!, - Scaling Storage and Computation with HadoopКонстантин Швачко, Yahoo!, - Scaling Storage and Computation with Hadoop
Константин Швачко, Yahoo!, - Scaling Storage and Computation with HadoopMedia Gorod
 
Nimble storage investor_deck_public
Nimble storage investor_deck_publicNimble storage investor_deck_public
Nimble storage investor_deck_publicSequoia Capital
 
Scaling HDFS with a Strongly Consistent Relational Model for Metadata
Scaling HDFS with a Strongly Consistent Relational Model for MetadataScaling HDFS with a Strongly Consistent Relational Model for Metadata
Scaling HDFS with a Strongly Consistent Relational Model for MetadataHooman Peiro Sajjad
 
Big Data Architecture Workshop - Vahid Amiri
Big Data Architecture Workshop -  Vahid AmiriBig Data Architecture Workshop -  Vahid Amiri
Big Data Architecture Workshop - Vahid Amiridatastack
 
high performance databases
high performance databaseshigh performance databases
high performance databasesmahdi_92
 
SD Big Data Monthly Meetup #4 - Session 2 - WANDisco
SD Big Data Monthly Meetup #4 - Session 2 - WANDiscoSD Big Data Monthly Meetup #4 - Session 2 - WANDisco
SD Big Data Monthly Meetup #4 - Session 2 - WANDiscoBig Data Joe™ Rossi
 
Big data and hadoop overvew
Big data and hadoop overvewBig data and hadoop overvew
Big data and hadoop overvewKunal Khanna
 
Hadoop for Bioinformatics: Building a Scalable Variant Store
Hadoop for Bioinformatics: Building a Scalable Variant StoreHadoop for Bioinformatics: Building a Scalable Variant Store
Hadoop for Bioinformatics: Building a Scalable Variant StoreUri Laserson
 

Ähnlich wie High scalability | Memcached - Distributed object caching (20)

Big Data and Hadoop Ecosystem
Big Data and Hadoop EcosystemBig Data and Hadoop Ecosystem
Big Data and Hadoop Ecosystem
 
Keith Norbie Flash Storage decision methodology - mnvmug
Keith Norbie Flash Storage decision methodology - mnvmugKeith Norbie Flash Storage decision methodology - mnvmug
Keith Norbie Flash Storage decision methodology - mnvmug
 
Gluster open stack dev summit 042011
Gluster open stack dev summit 042011Gluster open stack dev summit 042011
Gluster open stack dev summit 042011
 
Hadoop training in bangalore
Hadoop training in bangaloreHadoop training in bangalore
Hadoop training in bangalore
 
Future of cloud storage
Future of cloud storageFuture of cloud storage
Future of cloud storage
 
Elastic storage in the cloud session 5224 final v2
Elastic storage in the cloud session 5224 final v2Elastic storage in the cloud session 5224 final v2
Elastic storage in the cloud session 5224 final v2
 
Borthakur hadoop univ-research
Borthakur hadoop univ-researchBorthakur hadoop univ-research
Borthakur hadoop univ-research
 
WANdisco Non-Stop Hadoop: PHXDataConference Presentation Oct 2014
WANdisco Non-Stop Hadoop: PHXDataConference Presentation Oct 2014 WANdisco Non-Stop Hadoop: PHXDataConference Presentation Oct 2014
WANdisco Non-Stop Hadoop: PHXDataConference Presentation Oct 2014
 
Константин Швачко, Yahoo!, - Scaling Storage and Computation with Hadoop
Константин Швачко, Yahoo!, - Scaling Storage and Computation with HadoopКонстантин Швачко, Yahoo!, - Scaling Storage and Computation with Hadoop
Константин Швачко, Yahoo!, - Scaling Storage and Computation with Hadoop
 
Nimble storage investor_deck_public
Nimble storage investor_deck_publicNimble storage investor_deck_public
Nimble storage investor_deck_public
 
Spectra Logic
Spectra LogicSpectra Logic
Spectra Logic
 
Scaling HDFS with a Strongly Consistent Relational Model for Metadata
Scaling HDFS with a Strongly Consistent Relational Model for MetadataScaling HDFS with a Strongly Consistent Relational Model for Metadata
Scaling HDFS with a Strongly Consistent Relational Model for Metadata
 
Giraffa - November 2014
Giraffa - November 2014Giraffa - November 2014
Giraffa - November 2014
 
BIG DATA Session 6
BIG DATA Session 6BIG DATA Session 6
BIG DATA Session 6
 
Big Data Architecture Workshop - Vahid Amiri
Big Data Architecture Workshop -  Vahid AmiriBig Data Architecture Workshop -  Vahid Amiri
Big Data Architecture Workshop - Vahid Amiri
 
high performance databases
high performance databaseshigh performance databases
high performance databases
 
SD Big Data Monthly Meetup #4 - Session 2 - WANDisco
SD Big Data Monthly Meetup #4 - Session 2 - WANDiscoSD Big Data Monthly Meetup #4 - Session 2 - WANDisco
SD Big Data Monthly Meetup #4 - Session 2 - WANDisco
 
Bigdata
BigdataBigdata
Bigdata
 
Big data and hadoop overvew
Big data and hadoop overvewBig data and hadoop overvew
Big data and hadoop overvew
 
Hadoop for Bioinformatics: Building a Scalable Variant Store
Hadoop for Bioinformatics: Building a Scalable Variant StoreHadoop for Bioinformatics: Building a Scalable Variant Store
Hadoop for Bioinformatics: Building a Scalable Variant Store
 

Kürzlich hochgeladen

Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 

Kürzlich hochgeladen (20)

Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 

High scalability | Memcached - Distributed object caching

  • 1.
  • 2. Large Data Largest Living creation: 1 California Redwood tree named the General Sherman PROBLEM
  • 3. Highly Interactive 2 Starlings forming fascinating formations over Tøndermarsken, south-west Jutland, Denmark PROBLEM
  • 4. Huge user base 5 World’s largest gathering: Kumbh Mela 2007 PROBLEM 3 4
  • 5. 6 Brad Fitzpatrick, LiveJournal SOUNDS FAMILIAR! WHO ELSE HAS FACED THIS BEFORE?
  • 6. Secondary Storage: Low access speed, high capacity Processor: Small size, high speed Network: Blazing fast Primary Memory: Fast, cheap Machine: Low reliability COMPUTING HARDWARE
  • 7. Pre-Generate Static Pages • Fast Read • Dynamic context aware content • Popularity prediction in advance Web Server Farm • More concurrent web requests served • Dynamic Content delivery DB Replication • Data Replicas for Load Balancing • Fast Read • Slow Writes for ACID conformance ALTERNATIVES DB Sharding • Fast Read due to Horizontal partitioning • Limited scalability • Reliance on file system performance
  • 9. Global distributed two layered object caching MEMCACHED Distributed object caching Right granularity at right Layer hashtableon RAM Cross-Platform Application namespace isolation Network independence Caches Objects, not raw data Dynamic server cluster reconfiguration Compression No authentication Highly volatile
  • 10. Servers 1 2 4 3 6 5 10 11 7 8 Clients on application Servers Data Stores 12 9 Cross-Platform Application namespace isolation Network independence Caches Objects, not raw data Dynamic server cluster reconfiguration Compression No authentication Highly volatile MEMCACHED Distributed object caching
  • 11. Associative Array: Meta information Runtime Cache: Single node applications Database Cache: No large data, rarely updated, similar requests, object creation not costly Memcached: Large scale distributed applications with frequently updated data and involving complex objects File Cache: Long term, Large Objects DB Read: Small scale needs 8 Cheetah, fastest land animal: 120km/h 7,9 Slug, slowest animal: 0.03mph 8 9 P E R F O R M A N C E MEMCACHED Distributed object caching
  • 12. REFERENCES 1. http://www.mnn.com/sites/default/files/imagecache/node-gallery-display/general%20sherman.jpg 2. http://en.wikipedia.org/wiki/File:Sort_sol_pdfnet.jpg 3. http://odeworld.wordpress.com/2007/05/04/size-matters/ 4. http://www.kumbhamela.net/ 5. http://en.wikipedia.org/wiki/List_of_largest_peaceful_gatherings_in_history 6. http://picasaweb.google.com/dolboeb/BradFitzInSF#5176116242455706722 7. http://lilomag.com/2010/07/29/the-7-worlds-slowest-animal/ 8. http://en.wikipedia.org/wiki/Cheetah 9. http://en.wikipedia.org/wiki/File:Slugs_1896.png Memcached: http://benrobb.com/wp-content/uploads/2009/01/memcached.pdf By: Jeremy Leishman, Ben Robison, Josh Taylor

Hinweis der Redaktion

  1. DB Replication Image Reference : http://technet.microsoft.com/en-us/library/ms152567.aspx
  2. Storage keys evenly spread across servers, application uses Hash Table to determine which server to go to Two layers: 1: Server where key is stored 2: Actual Serialized object Since Caching: Collisions result in loss or wrong data being read : Application responsibility to avoid collision Blocking vs Non-Blocking Thread invoking an I/O function, like read value, does not have to wait on any previous operation before executing Objects have multiple versions and are reference counted
  3. Server Instance Listens on a specified IP address and Port Multiple instances on same machine, where server total memory greater than the amount that the kernel makes available to a single process Client Instance Read: Hash(Object Key) if Memcached has return else fetch from DB and Put Write: No transactions, Pull from DB -> Update Object -> Save to DB -> Save to Cache Interdependence: Data lost but rest of the machines in farm function normally, subsequent requests can be routed accordingly Expiration: LRU