SlideShare ist ein Scribd-Unternehmen logo
1 von 32
Phoenix: A Weight-Based
Network Coordinate System
 Using Matrix Factorization

           Yang Chen
 Department of Computer Science
         Duke University
       ychen@cs.duke.edu
Outline
•   Background
•   System Design
•   Evaluation
•   Perspective Future Work




                              2
BACKGROUND


             3
Internet Distance
        What?

        • Round-trip propagation / transmission delay between two
          Internet nodes

        Why?

        • Strong indicator of network proximity
        • Relatively stable

        How?

        • Measurement tool “Ping” is with major operating systems


                               50ms
Alice                                                         Bob
                                                                    4
Use Cases
• Knowledge of Internet distance is useful
  for…
  – P2P content delivery (file sharing/streaming)
  – Online/mobile games
  – Overlay routing
  – Server selection in P2P/Cloud
  – Network monitoring



                                                    5
Scalability

• Huge number of end-to-end paths in large
  scale systems




          N nodes                      N ´ N measurements

    SLOW and COSTLY when the system becomes large!
                                                            6
Network Coordinate (NC) Systems
  (5, 10, 2)                                (-3, 4, -2)
Alice
                                                    Bob


                   Distance Function




                         22ms


  • Scalable measurement: N2  NK (K << N)
  • Every node is assigned with coordinates
  • Distance function: compute the distance between
    two nodes without explicit measurement
                                                          7
               [Ng et al, INFOCOM’02]
Deployments




     They are all using
Network Coordinate Systems!
                              8
Basic models
• Euclidean Distance-based NC (ENC)
  – Modeling the Internet as a Euclidean space
  – Systems: Vivaldi [Dabek et al., SIGCOMM’04], GNP [Ng et al,
    INFOCOM’02], NPS [Ng et al., USENIX ATC’04], PIC [Costa et al.,

    ICDCS’04]…

• Matrix Factorization-based NC (MFNC)
  – Factorizing an Internet distance matrix as the
    product of two smaller matrices
  – Systems: IDES [Mao et al., JSAC’06], Phoenix, …
                                                                      9
Modeling the Internet as
           a Euclidean space
                             d=3

• In a d-dimensional
  Euclidean space, each
  node will be mapped to
  a position
• Compute distances
  based on coordinates
  using Euclidean distance




                                    10
Triangle Inequality Violation
                                         29.9 > 5.6+3.6
                                                                Czech
                                                               Republic
                                             5.6 ms

                                                                       29.9 ms
                                        Slovakia
                                                   3.6 ms
                                                               Hungary

                                       A Triangle Inequality Violation (TIV)
Predicted distances in                 example in GEANT network
Euclidean space must
   satisfy triangle
      inequality                             Lots of TIVs in the Internet
                                              due sub-optimal routing!!


                                                                            11
                         [Zheng et al, PAM’05]
Correlation in Internet Distance Matrices
                    Distance measurement using PlanetLab nodes


             Duke       UNC     Yale    Aachen   Oxford   Toronto   THU    NUS


Duke          -          3       24      107      122       37      219    252


UNC           3          -       24      106      109       38      219    253


                       Internet paths with nearby
                      end nodes are often overlap!!

  Rows in different Internet distance matrices are large correlated (low
  effective rank)
  [Tang et al, IMC’03], [Lim et al, ToN’05], [Liao et al, CoNEXT’11]
                                                                             12
Factorization of an Internet Distance Matrix

             
             N columns




         {
                                 d columns
                                  


                           »
    N rows




                                             ´



          M                         X                Y   T

          
                                          
                                                
                             X7 = [ 1 0 3 ],Y2 = [ 2 0 5 ]
  M ij » Xi ×Yj                       
                                          
                              M 72 » X7 ×Y2 =1´ 2 + 0 + 3´ 5 =17

                     [Mao et al., JSAC’06]                         13
Matrix Factorization-Based NC
                 N columns
                                                        
                                                        X2



          {
                                          d columns
                                                               
                                                                 
                                                                Y2

                                   »
     N rows




                                                    ´



                    M                        X                  Y   T

• Each node i has an outgoing vector Xi and an
  incoming vector Yi
• Distance function is the dot product.
                                                                        14
              No triangle inequality constrain in this model!
SYSTEM DESIGN


                15
Goals
• Substantial improvement in prediction
  accuracy
• Decentralized and scalable
• Robust to dynamic Internet




                                          16
Workflow of Phoenix



   System          Peer        Scalable    Coordinates
Initialization   Discovery   Measurement   Calculation




                                                         17
System Initialization
              Measured Distance
              Predicted Distance
                                                   (X1,Y1)
                                                                                   (X2,Y2)
 H1                                            H1
                                        H2                                            H2



                                H4                                            H4
H3                                            H3
                                                                         (X4,Y4)
                                                 (X3,Y3)
 •   Early nodes (N<K): Full-mesh measurement
 •   Compute coordinates of early nodes by minimizing the overall discrepancy
     between predicted distances and measured distances

Nonnegative matrix factorization: [D. D. Lee and H. S. Seung, Nature, 401(6755):788–791,
                                                                                       18
1999.]
Dynamic Peer Discovery

                               Tracker

  H2     H3     H5                        H3    H4    H6



    H1                                               H2

H2 H3 H4 H5 H6                           H1    H3    H4    H5   H6


                     Gossip among nodes


              • N>K, all nodes become ordinary nodes
                                                                     19
Reference Node Selection




• Every new node randomly selects K existing nodes as
  reference nodes                                       20
Measurement and
      Bootstrap Coordinates Calculation
       Measured Distance
       Predicted Distance
                                            (X2,Y2)           (XK,YK)
                             (X1,Y1)
 R1      R2          RK              R1        R2              RK



                                                      H new
           H new
                                             (Xnew,Ynew)


• Node Hnew computes its own coordinates by
  minimizing the overall discrepancy between predicted
  distances and measured distances (Non-negative
  least squares)                                                        21
Accuracy of Reference Coordinates

Node N
                                                      (XA,YA)


    …
                                                      Node A

Node 3                                             Predicted Distance
                                                   Measured distance
Node 2


Node 1


         0         50           100          150
    Distance between Node A and every other node
                                                                  22
Accuracy of Reference Coordinates (cont.)

Node N
                                                           (XB,YB)


    …
                                                           Node B
                    Misleading the nodes
Node 3              referring to Node B!!
                                                        Predicted Distance
                                                        Measured Distance
Node 2


Node 1


         0   20    40     60      80        100   120
    Distance between Node B and every other node
                                                                       23
Referring to Inaccurate
                Coordinates
               (X2,Y2)       (XK,YK)
(X1,Y1)
          R1         R2          RK
                                                      Error Propagation:
                                                      Hnew may mislead
                                                       nodes refer to it

           H new
                                        (Xnew,Ynew)



                                            Give preference to
     Minimize
                                            accurate reference
    the impact
                                               coordinates
       of RK

                                                                           24
Heuristic Weight Assignment
RK                                                       Predicted Distance

                                                         Measured distance
…


R3                                                    Enhanced Coordinates
                                                       Bootstrap Coordinates
R2


R1                                                           H new
                                                         Updating coordinates
      0         50         100        150        200           regularly
     Distance between Hnew and every reference node
                                                                         25
EVALUATION


             26
Evaluation Setup
• Data sets
  – PL: 169 PlanetLab nodes
  – King: 1740 Internet DNS servers
• Metric
  – Relative Error (RE)

           MeasuredDist - PredictedDist
     RE =
          min(MeasuredDist, PredictedDist)

                                             27
Evaluation: Relative Error


                          90th Percentile
                          Relative Error

        Phoenix   Phoenix        Vivaldi    IDES
                  (Simple)
         0.63      0.91           0.83      0.89




                                                   28
Evaluation (cont.)
• Other findings through evaluation
  – Robust to node churn
  – Fast convergence
  – Robust to measurement anomalies
  – Robust to distance variation




                                      29
FUTURE WORK


              30
Perspective Topics
• NC systems in mobile-centric environment
  – Access latency, host mobility, host churn
• Scalable Prediction of other important
  network parameters
  – Available bandwidth, shortest-path distance in
    social graph




                                                 31
Software
• NCSim
  – Simulator of Decentralized Network
    Coordinate Algorithms
  – http://code.google.com/p/ncsim/
• Phoenix
  – Original Phoenix simulator in IEEE TNSM
    paper
  – http://www.cs.duke.edu/~ychen/Phoenix_TNS
    M_2011.zip
                                            32

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Convolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetConvolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNet
 
CNN
CNNCNN
CNN
 
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image SegmentationDeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
 
Image Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A surveyImage Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A survey
 
Review on cs231 part-2
Review on cs231 part-2Review on cs231 part-2
Review on cs231 part-2
 
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
 
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
 
Cnn method
Cnn methodCnn method
Cnn method
 
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
 
Offline Character Recognition Using Monte Carlo Method and Neural Network
Offline Character Recognition Using Monte Carlo Method and Neural NetworkOffline Character Recognition Using Monte Carlo Method and Neural Network
Offline Character Recognition Using Monte Carlo Method and Neural Network
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
 
Convolutional neural networks deepa
Convolutional neural networks deepaConvolutional neural networks deepa
Convolutional neural networks deepa
 
LeNet to ResNet
LeNet to ResNetLeNet to ResNet
LeNet to ResNet
 
Mnist report
Mnist reportMnist report
Mnist report
 
AI&BigData Lab 2016. Александр Баев: Transfer learning - зачем, как и где.
AI&BigData Lab 2016. Александр Баев: Transfer learning - зачем, как и где.AI&BigData Lab 2016. Александр Баев: Transfer learning - зачем, как и где.
AI&BigData Lab 2016. Александр Баев: Transfer learning - зачем, как и где.
 
Learning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for GraphsLearning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for Graphs
 
HardNet: Convolutional Network for Local Image Description
HardNet: Convolutional Network for Local Image DescriptionHardNet: Convolutional Network for Local Image Description
HardNet: Convolutional Network for Local Image Description
 
Deep learning for image super resolution
Deep learning for image super resolutionDeep learning for image super resolution
Deep learning for image super resolution
 
From neural networks to deep learning
From neural networks to deep learningFrom neural networks to deep learning
From neural networks to deep learning
 
RNN and its applications
RNN and its applicationsRNN and its applications
RNN and its applications
 

Ähnlich wie Phoenix: A Weight-based Network Coordinate System Using Matrix Factorization

FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
Hiroki Nakahara
 
intro-to-cnn-April_2020.pptx
intro-to-cnn-April_2020.pptxintro-to-cnn-April_2020.pptx
intro-to-cnn-April_2020.pptx
ssuser3aa461
 
Large Scale Parallel FDTD Simulation of Full 3D Photonic Crystal Structures
Large Scale Parallel FDTD Simulation of Full 3D Photonic Crystal StructuresLarge Scale Parallel FDTD Simulation of Full 3D Photonic Crystal Structures
Large Scale Parallel FDTD Simulation of Full 3D Photonic Crystal Structures
ayubimoak
 
Multiuser MIMO Vector Perturbation Precoding
Multiuser MIMO Vector Perturbation PrecodingMultiuser MIMO Vector Perturbation Precoding
Multiuser MIMO Vector Perturbation Precoding
adeelrazi
 
Fixed-Point Code Synthesis for Neural Networks
Fixed-Point Code Synthesis for Neural NetworksFixed-Point Code Synthesis for Neural Networks
Fixed-Point Code Synthesis for Neural Networks
gerogepatton
 

Ähnlich wie Phoenix: A Weight-based Network Coordinate System Using Matrix Factorization (20)

Exploring Randomly Wired Neural Networks for Image Recognition
Exploring Randomly Wired Neural Networks for Image RecognitionExploring Randomly Wired Neural Networks for Image Recognition
Exploring Randomly Wired Neural Networks for Image Recognition
 
PR-155: Exploring Randomly Wired Neural Networks for Image Recognition
PR-155: Exploring Randomly Wired Neural Networks for Image RecognitionPR-155: Exploring Randomly Wired Neural Networks for Image Recognition
PR-155: Exploring Randomly Wired Neural Networks for Image Recognition
 
Ricci Curvature of Internet Topology
Ricci Curvature of Internet TopologyRicci Curvature of Internet Topology
Ricci Curvature of Internet Topology
 
Modelling Probabilistic Wireless Networks (Extended Abstract)
Modelling Probabilistic Wireless Networks (Extended Abstract)Modelling Probabilistic Wireless Networks (Extended Abstract)
Modelling Probabilistic Wireless Networks (Extended Abstract)
 
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
 
intro-to-cnn-April_2020.pptx
intro-to-cnn-April_2020.pptxintro-to-cnn-April_2020.pptx
intro-to-cnn-April_2020.pptx
 
Large Scale Parallel FDTD Simulation of Full 3D Photonic Crystal Structures
Large Scale Parallel FDTD Simulation of Full 3D Photonic Crystal StructuresLarge Scale Parallel FDTD Simulation of Full 3D Photonic Crystal Structures
Large Scale Parallel FDTD Simulation of Full 3D Photonic Crystal Structures
 
Parn pyramidal+affine+regression+networks+for+dense+semantic+correspondence
Parn pyramidal+affine+regression+networks+for+dense+semantic+correspondenceParn pyramidal+affine+regression+networks+for+dense+semantic+correspondence
Parn pyramidal+affine+regression+networks+for+dense+semantic+correspondence
 
network mining and representation learning
network mining and representation learningnetwork mining and representation learning
network mining and representation learning
 
Multiuser MIMO Vector Perturbation Precoding
Multiuser MIMO Vector Perturbation PrecodingMultiuser MIMO Vector Perturbation Precoding
Multiuser MIMO Vector Perturbation Precoding
 
NS-CUK Seminar: S.T.Nguyen, Review on "Improving Graph Neural Network Express...
NS-CUK Seminar: S.T.Nguyen, Review on "Improving Graph Neural Network Express...NS-CUK Seminar: S.T.Nguyen, Review on "Improving Graph Neural Network Express...
NS-CUK Seminar: S.T.Nguyen, Review on "Improving Graph Neural Network Express...
 
Fixed-Point Code Synthesis for Neural Networks
Fixed-Point Code Synthesis for Neural NetworksFixed-Point Code Synthesis for Neural Networks
Fixed-Point Code Synthesis for Neural Networks
 
Fixed-Point Code Synthesis for Neural Networks
Fixed-Point Code Synthesis for Neural NetworksFixed-Point Code Synthesis for Neural Networks
Fixed-Point Code Synthesis for Neural Networks
 
On-line Fault diagnosis of Arbitrary Connected Networks
On-line Fault diagnosis of Arbitrary Connected NetworksOn-line Fault diagnosis of Arbitrary Connected Networks
On-line Fault diagnosis of Arbitrary Connected Networks
 
201907 AutoML and Neural Architecture Search
201907 AutoML and Neural Architecture Search201907 AutoML and Neural Architecture Search
201907 AutoML and Neural Architecture Search
 
Spectral cnn
Spectral cnnSpectral cnn
Spectral cnn
 
IMPROVED EXTENDED XY ON-CHIP ROUTING IN DIAMETRICAL 2D MESH NOC
IMPROVED EXTENDED XY ON-CHIP ROUTING IN DIAMETRICAL 2D MESH NOCIMPROVED EXTENDED XY ON-CHIP ROUTING IN DIAMETRICAL 2D MESH NOC
IMPROVED EXTENDED XY ON-CHIP ROUTING IN DIAMETRICAL 2D MESH NOC
 
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural NetworkTraffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
 
EDGE-Net: Efficient Deep-learning Gradients Extraction Network
EDGE-Net: Efficient Deep-learning Gradients Extraction NetworkEDGE-Net: Efficient Deep-learning Gradients Extraction Network
EDGE-Net: Efficient Deep-learning Gradients Extraction Network
 
EDGE-Net: Efficient Deep-learning Gradients Extraction Network
EDGE-Net: Efficient Deep-learning Gradients Extraction NetworkEDGE-Net: Efficient Deep-learning Gradients Extraction Network
EDGE-Net: Efficient Deep-learning Gradients Extraction Network
 

Kürzlich hochgeladen

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Kürzlich hochgeladen (20)

Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 

Phoenix: A Weight-based Network Coordinate System Using Matrix Factorization

  • 1. Phoenix: A Weight-Based Network Coordinate System Using Matrix Factorization Yang Chen Department of Computer Science Duke University ychen@cs.duke.edu
  • 2. Outline • Background • System Design • Evaluation • Perspective Future Work 2
  • 4. Internet Distance What? • Round-trip propagation / transmission delay between two Internet nodes Why? • Strong indicator of network proximity • Relatively stable How? • Measurement tool “Ping” is with major operating systems 50ms Alice Bob 4
  • 5. Use Cases • Knowledge of Internet distance is useful for… – P2P content delivery (file sharing/streaming) – Online/mobile games – Overlay routing – Server selection in P2P/Cloud – Network monitoring 5
  • 6. Scalability • Huge number of end-to-end paths in large scale systems N nodes N ´ N measurements SLOW and COSTLY when the system becomes large! 6
  • 7. Network Coordinate (NC) Systems (5, 10, 2) (-3, 4, -2) Alice Bob Distance Function 22ms • Scalable measurement: N2  NK (K << N) • Every node is assigned with coordinates • Distance function: compute the distance between two nodes without explicit measurement 7 [Ng et al, INFOCOM’02]
  • 8. Deployments They are all using Network Coordinate Systems! 8
  • 9. Basic models • Euclidean Distance-based NC (ENC) – Modeling the Internet as a Euclidean space – Systems: Vivaldi [Dabek et al., SIGCOMM’04], GNP [Ng et al, INFOCOM’02], NPS [Ng et al., USENIX ATC’04], PIC [Costa et al., ICDCS’04]… • Matrix Factorization-based NC (MFNC) – Factorizing an Internet distance matrix as the product of two smaller matrices – Systems: IDES [Mao et al., JSAC’06], Phoenix, … 9
  • 10. Modeling the Internet as a Euclidean space d=3 • In a d-dimensional Euclidean space, each node will be mapped to a position • Compute distances based on coordinates using Euclidean distance 10
  • 11. Triangle Inequality Violation 29.9 > 5.6+3.6 Czech Republic 5.6 ms 29.9 ms Slovakia 3.6 ms Hungary A Triangle Inequality Violation (TIV) Predicted distances in example in GEANT network Euclidean space must satisfy triangle inequality Lots of TIVs in the Internet due sub-optimal routing!! 11 [Zheng et al, PAM’05]
  • 12. Correlation in Internet Distance Matrices Distance measurement using PlanetLab nodes Duke UNC Yale Aachen Oxford Toronto THU NUS Duke - 3 24 107 122 37 219 252 UNC 3 - 24 106 109 38 219 253 Internet paths with nearby end nodes are often overlap!! Rows in different Internet distance matrices are large correlated (low effective rank) [Tang et al, IMC’03], [Lim et al, ToN’05], [Liao et al, CoNEXT’11] 12
  • 13. Factorization of an Internet Distance Matrix  N columns { d columns  » N rows ´ M X Y T        X7 = [ 1 0 3 ],Y2 = [ 2 0 5 ] M ij » Xi ×Yj    M 72 » X7 ×Y2 =1´ 2 + 0 + 3´ 5 =17 [Mao et al., JSAC’06] 13
  • 14. Matrix Factorization-Based NC  N columns  X2 { d columns    Y2 » N rows ´ M X Y T • Each node i has an outgoing vector Xi and an incoming vector Yi • Distance function is the dot product. 14 No triangle inequality constrain in this model!
  • 16. Goals • Substantial improvement in prediction accuracy • Decentralized and scalable • Robust to dynamic Internet 16
  • 17. Workflow of Phoenix System Peer Scalable Coordinates Initialization Discovery Measurement Calculation 17
  • 18. System Initialization Measured Distance Predicted Distance (X1,Y1) (X2,Y2) H1 H1 H2 H2 H4 H4 H3 H3 (X4,Y4) (X3,Y3) • Early nodes (N<K): Full-mesh measurement • Compute coordinates of early nodes by minimizing the overall discrepancy between predicted distances and measured distances Nonnegative matrix factorization: [D. D. Lee and H. S. Seung, Nature, 401(6755):788–791, 18 1999.]
  • 19. Dynamic Peer Discovery Tracker H2 H3 H5 H3 H4 H6 H1 H2 H2 H3 H4 H5 H6 H1 H3 H4 H5 H6 Gossip among nodes • N>K, all nodes become ordinary nodes 19
  • 20. Reference Node Selection • Every new node randomly selects K existing nodes as reference nodes 20
  • 21. Measurement and Bootstrap Coordinates Calculation Measured Distance Predicted Distance (X2,Y2) (XK,YK) (X1,Y1) R1 R2  RK R1 R2  RK H new H new (Xnew,Ynew) • Node Hnew computes its own coordinates by minimizing the overall discrepancy between predicted distances and measured distances (Non-negative least squares) 21
  • 22. Accuracy of Reference Coordinates Node N (XA,YA) … Node A Node 3 Predicted Distance Measured distance Node 2 Node 1 0 50 100 150 Distance between Node A and every other node 22
  • 23. Accuracy of Reference Coordinates (cont.) Node N (XB,YB) … Node B Misleading the nodes Node 3 referring to Node B!! Predicted Distance Measured Distance Node 2 Node 1 0 20 40 60 80 100 120 Distance between Node B and every other node 23
  • 24. Referring to Inaccurate Coordinates (X2,Y2) (XK,YK) (X1,Y1) R1 R2  RK Error Propagation: Hnew may mislead nodes refer to it H new (Xnew,Ynew) Give preference to Minimize accurate reference the impact coordinates of RK 24
  • 25. Heuristic Weight Assignment RK Predicted Distance Measured distance … R3 Enhanced Coordinates Bootstrap Coordinates R2 R1 H new Updating coordinates 0 50 100 150 200 regularly Distance between Hnew and every reference node 25
  • 27. Evaluation Setup • Data sets – PL: 169 PlanetLab nodes – King: 1740 Internet DNS servers • Metric – Relative Error (RE) MeasuredDist - PredictedDist RE = min(MeasuredDist, PredictedDist) 27
  • 28. Evaluation: Relative Error 90th Percentile Relative Error Phoenix Phoenix Vivaldi IDES (Simple) 0.63 0.91 0.83 0.89 28
  • 29. Evaluation (cont.) • Other findings through evaluation – Robust to node churn – Fast convergence – Robust to measurement anomalies – Robust to distance variation 29
  • 31. Perspective Topics • NC systems in mobile-centric environment – Access latency, host mobility, host churn • Scalable Prediction of other important network parameters – Available bandwidth, shortest-path distance in social graph 31
  • 32. Software • NCSim – Simulator of Decentralized Network Coordinate Algorithms – http://code.google.com/p/ncsim/ • Phoenix – Original Phoenix simulator in IEEE TNSM paper – http://www.cs.duke.edu/~ychen/Phoenix_TNS M_2011.zip 32

Hinweis der Redaktion

  1. \\