SlideShare ist ein Scribd-Unternehmen logo
1 von 1
Downloaden Sie, um offline zu lesen
PlanetenWachHundNetz: Locality-Aware MapReduce for Peer-to-Peer
Robust Systems Group
                                                                                                          Vitus Lorenz-Meyer, Eric Freudenthal                                                                                                                                     University of
                                                                                                                                                                                                                                                                                   Texas at El Paso


                                                                                                                                                                                                                                                              Other Approaches
                                                                                                                                                                                                                                                           From distributed databases
                                                                                                                                                                                                                                                         Key-based Tree
                                                                                                                                                                                                                                                         Routing based on node keys
                                                                                                                                                                                                                                                         • Children of node n with key k are
                                                                                                                                                                                                                                                         nodes with keys closest to k with n-1
                                                                                                                                                                                                                                                         prefix bits in common with k
                                                                                                                                                                                                                                                         • Node n is its own child!


                                                                                                                                                                                                                                                           Aggregation tree            Stochastically
                                                                                                                                                                                                                                                                                       balanced




                        Motivation                                                                                                                                                                                                                          Prefix tree

     • Efficient reduction of data from self-
       organized (P2P) sources
                         Problem
     • Dynamic membership
         • Static tree (for parallel prefix)                                                                                                                                         Non-existent node 001 (A)
           inappropriate                                                                                                                                                                                                                                              •Not locality aware!
                                                                                                                                                                                                                                                                      •Only 1 global tree.
         Problems w/ Known Techniques
                                                                                                                                                                                                                                                         Finger-table based tree
     •    Do not localize network traffic                                                                                                                                            000 takes the role                                                  • Tree edges selected from nodes
     •    Branching factor not controlled                                                                                                                                            of its parent 001                                                   finger tables (towards query root).
     •    State lost when parent nodes fail/leave
                                                                                                                                                                                                                                                           Paths to 111              Non-optimized
     •    Details on right side of figure                                                                                                                                                                                                                                              fan-out!
                                                                                                                                                                                      Message arrives
                        Our Goals                                                                                                                                                     at closest node 000 (B)

     • Exploit locality, minimize remote
       communication                                                                                                                                                                                                                                       Resulting Aggre-
                                                                                                                                                                                                                                                           gation tree
     • Efficient parallelism: Control of                                                                                                                                              Message ends up at self:
       branching factor                                                                                                                                                               No more nodes -> discard

     • Minimal state-loss when nodes
       enter/leave
     • Self-healing continuous queries

                     Implementation                                                                                                                                                                                                                            •Unique tree for every query
                                                                                                                                                                                                                                                               •Locality awareness from DHT
     • Left-tree
                                                                                                                                                                                                                                                               •but does not force locality
         • Node role determined by key                                                                                                                                                                                                                         •Challenge: what to do if nodes
     • Locality/clustering: Coral or Oasis                                                                                                                                                                                                                     enter/leave


                                                                                                                                                                                  Goal: Efficient Aggregation Tree
                                                                                                                                                                             Our Solution: Key-based MapReduce (KMR)
                                                                                                                                          Overview                                                                                  Finding nodes
                                                               Locality-aware Clustering
                      MapReduce (Google)
                                                                                                          • Associate aggregation tree with a operation-specific root key.                                              n = key of some node
                                                                                                          Characteristics                                                                                               t = root-id of aggregation tree
                     Generalized Parallel Prefix                  Coral (locality-aware DsHT)
                                                                                                                                                                                                                        π = n’s position among leaves (from left)
                                                                                                          • Rooted at host whose key is closest to root key
                 •   Associative & communitive                •                                                                                                                                                           π=n⊕t
                                                                   Coral partitions its members in             • One tree per query, distributes load over all nodes for multiple
                     operations mapped to an                                                                                                                                                                              key of n’s parent at height h: π / 2h ⊕ t
                                                                   clusters based on connection                    queries at a time
                     aggregation tree                              latency                                     • Nodes that are roots of sub-trees are their own left-children all the
                                                                                                                                                                                                                        To find parent, n searches for smallest height such
                                                                                                                   way down to their appropriate place among leaves.
                                                                                                                                                                                                                        that parent is in the tree.
                 •   Google‘s map-reduce framework                 Aggregation tree at each                    • Each node is leaf and root of the sub-tree at depth d if it has the d’th
                    Data is mapped to low-dimensional
                                                                                                                                                                                                                                    Collecting data
                                                                                                                   bit of root-ID flipped; node may also substitute for missing parents.
                                                                            cluster
                     tuples                                                                               • Set of nodes and MR-specific root-key fully defines a unique tree (complete
                                                                                                                                                                                                 •   Each node reduces messages from children, sends result(s) to parent
                                                              •
                    Tuples are recursively combined               Build cluster aggregation trees          knowledge not required: lookup parent, sibling, children)
                                                              •
                     using an associative reduction                A single member of a cluster-tree
                                                                                                                                                                                                 Challenges                                     Approach
                     algorithm that emits a (summary)              serves as aggregation
                     tuple                                         representative                                                                                                                •   Find children                           DHT lookup
                                                                                                                                                                                                 •   Find parent                             DHT lookup
                                  Example:                                  Challenges                                                                                                           •   Establishing new tree at exit/departure DHT lookup (key-space determines topology)
                 •   Counting occurrence of a particular
                                                              •    Choosing sub-cluster
                     word in a document:
                                                                                                                                     Building the Tree
                                                                   representatives
                         Provide map algorithm that
                                                              •    Constructing aggregation tree for
                            emits the tuple ‘(1)’ for every                                                  Send build-message to all siblings down the tree
                                                                   sub-cluster
                                                                                                                                                                                                                                Continuous Queries
                            occurrence                                                                       Each node forwards build-message to all its siblings all the way down
                                                              •    Avoiding inefficient setup broadcast
                         Provide a reduce algorithm                                                         • Recall that node is its own child: siblings include its children!
                                                                                                                                                                                                                  Maintaining tree consistency under churn
                            that sums these tuples
                                                                                                             Non-existing parent
                                                                                                                                                                                                 •   Each node periodically checks roles of self and immediate relatives
                                                                                                             • Closest node fulfills its role
                                                                                                                                                                                                 •   Churn (membership change) must result in tree-node role change
                                                                                                             • Therefore, search for parent will discover this node
                                                                                                                                                                                                       • Change propagated by DHT finger tables
                                                                                                             Non-existing child
                                                                                                             • If child not in DHT, then node is a leaf
                                                                                                                                                                                                 Challenges                             Approach
                                                                                                             Finding children
                                                                                                             • Use DHT                                                                           •   What roles to check when            periodically check (lookup) self, children,
                                                                                                                                                                                                                                         parent only when its assuming role
                                                                                                                                                                                                                                         parent → child, old → new
                                                                                                                                                                                                 •   Who notifies who

Weitere ähnliche Inhalte

Kürzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Kürzlich hochgeladen (20)

WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 

Empfohlen

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Empfohlen (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Poster for NSDI

  • 1. PlanetenWachHundNetz: Locality-Aware MapReduce for Peer-to-Peer Robust Systems Group Vitus Lorenz-Meyer, Eric Freudenthal University of Texas at El Paso Other Approaches From distributed databases Key-based Tree Routing based on node keys • Children of node n with key k are nodes with keys closest to k with n-1 prefix bits in common with k • Node n is its own child! Aggregation tree Stochastically balanced Motivation Prefix tree • Efficient reduction of data from self- organized (P2P) sources Problem • Dynamic membership • Static tree (for parallel prefix) Non-existent node 001 (A) inappropriate •Not locality aware! •Only 1 global tree. Problems w/ Known Techniques Finger-table based tree • Do not localize network traffic 000 takes the role • Tree edges selected from nodes • Branching factor not controlled of its parent 001 finger tables (towards query root). • State lost when parent nodes fail/leave Paths to 111 Non-optimized • Details on right side of figure fan-out! Message arrives Our Goals at closest node 000 (B) • Exploit locality, minimize remote communication Resulting Aggre- gation tree • Efficient parallelism: Control of Message ends up at self: branching factor No more nodes -> discard • Minimal state-loss when nodes enter/leave • Self-healing continuous queries Implementation •Unique tree for every query •Locality awareness from DHT • Left-tree •but does not force locality • Node role determined by key •Challenge: what to do if nodes • Locality/clustering: Coral or Oasis enter/leave Goal: Efficient Aggregation Tree Our Solution: Key-based MapReduce (KMR) Overview Finding nodes Locality-aware Clustering MapReduce (Google) • Associate aggregation tree with a operation-specific root key. n = key of some node Characteristics t = root-id of aggregation tree Generalized Parallel Prefix Coral (locality-aware DsHT) π = n’s position among leaves (from left) • Rooted at host whose key is closest to root key • Associative & communitive • π=n⊕t Coral partitions its members in • One tree per query, distributes load over all nodes for multiple operations mapped to an key of n’s parent at height h: π / 2h ⊕ t clusters based on connection queries at a time aggregation tree latency • Nodes that are roots of sub-trees are their own left-children all the To find parent, n searches for smallest height such way down to their appropriate place among leaves. that parent is in the tree. • Google‘s map-reduce framework Aggregation tree at each • Each node is leaf and root of the sub-tree at depth d if it has the d’th  Data is mapped to low-dimensional Collecting data bit of root-ID flipped; node may also substitute for missing parents. cluster tuples • Set of nodes and MR-specific root-key fully defines a unique tree (complete • Each node reduces messages from children, sends result(s) to parent •  Tuples are recursively combined Build cluster aggregation trees knowledge not required: lookup parent, sibling, children) • using an associative reduction A single member of a cluster-tree Challenges Approach algorithm that emits a (summary) serves as aggregation tuple representative • Find children DHT lookup • Find parent DHT lookup Example: Challenges • Establishing new tree at exit/departure DHT lookup (key-space determines topology) • Counting occurrence of a particular • Choosing sub-cluster word in a document: Building the Tree representatives  Provide map algorithm that • Constructing aggregation tree for emits the tuple ‘(1)’ for every Send build-message to all siblings down the tree sub-cluster Continuous Queries occurrence Each node forwards build-message to all its siblings all the way down • Avoiding inefficient setup broadcast  Provide a reduce algorithm • Recall that node is its own child: siblings include its children! Maintaining tree consistency under churn that sums these tuples Non-existing parent • Each node periodically checks roles of self and immediate relatives • Closest node fulfills its role • Churn (membership change) must result in tree-node role change • Therefore, search for parent will discover this node • Change propagated by DHT finger tables Non-existing child • If child not in DHT, then node is a leaf Challenges Approach Finding children • Use DHT • What roles to check when periodically check (lookup) self, children, parent only when its assuming role parent → child, old → new • Who notifies who