SlideShare ist ein Scribd-Unternehmen logo
1 von 30
Downloaden Sie, um offline zu lesen
Using Vector Clocks to Visualize
Communication Flow



Martin Harrigan

Complex & Adaptive Systems Laboratory (CASL)
University College Dublin
Introduction
    Communication Flow
    Vector Clocks


The Methodology


Experiments


Conclusions & Future Work
Introduction
A metaphor for communication: flow in conduits

   ‘slow and fast conduits’       (Kossinets et al., ’08)

   ‘transatlantic flows of communication’                    (Leskovec
   & Horvitz, ’08)

   ‘a series of tubes’   (Stevens, ’06)


It is a tempting metaphor when visualizing communication flow.

                                      A dynamic weighted directed graph?
                                              people ↔ vertices
                                              communications ↔ (weighted)
                                              directed edges
                                              time ↔ animation
Introduction
    Is the graphical design grounded in the substance to be
    communicated (Brandes, ’99)?
    Is there a discrepancy between the visualization and the
    analysis of communication flow?

Visualization                      Analysis

   crossings                           synchronization
   area                                information latency
   slopes                              temporal distance
   symmetries                          network backbones
   mental map                          periodicity

                   Can we integrate the two?
Introduction



Vector clocks are a useful tool when analyzing
communication flow...
    Introduced by Fidge (’88) and Mattern (’89).
    Aid with the causal ordering of events in
    distributed systems.
    Provide greater insight into communication flow
    than dynamic weighted directed graphs (Kossinets et al.,
    ’08).


We can use them to visualize communication flow.
Introduction


A suggests to B,
C, and D that
they meet at the
City Hotel
Odense.
Introduction




C and D agree
to meet at the
Cab Inn.
Introduction




               B and D agree
               to meet at the
               Radisson Blu.
Introduction


A checks with B
and C where they
should meet.




D is no longer
reachable but they
know to meet him
at the Radisson
Blu.
Introduction




Vector clocks maintain, for each actor, the time of their most
recent communication, either directly or indirectly, from every
                         other actor.
The Methodology




           visualizeVectorClocks
The Methodology
The Methodology




  During a time interval [0, T ], we have a set of vertices V and
  a set of time-attributed directed edges E.
  The instantaneous graph Gt = (V, Et ) at time-slice t is the
  graph with vertex set V and edge set
  Et = {(u, v)|(u, v, t) ∈ E}.

         Input: G0 , . . . , GT (instantaneous graphs)
The Methodology




For each instantaneous graph:
  1   Each vertex has a corresponding vector clock which represents
      a point in a high-dimensional space.
  2   Each time-attributed directed edge (communication) updates
      the vector clock of the target (receiving) vertex.
  3   We compute the distances between all pairs of vector clocks
      using an appropriate metric.
  4   We construct a dissimilarity matrix from these distances.
  5   We use multidimensional scaling (MDS) to produce an 2-d
      visualization of the data points.
The Methodology




 Output: C0 , . . . , CT (2-d coordinates for each time-slice)


t=0                    t=1                      t=2       t=3




 Greene et al., ’10, Lancichinetti et al., 08
The Methodology




For each instantaneous graph:
The Methodology




For each instantaneous graph:
  1   Each vertex has a corresponding vector clock which represents
      a point in a high-dimensional space.

                       
                   t1
                                         Initialization
                  t2   
        φu,t = 
                       
                    .
                    .                   Individual increment at
                   .   
                                         each time-slice
                   tn
The Methodology




For each instantaneous graph:
  2    Each time-attributed directed edge (communication) updates
       the vector clock of the target (receiving) vertex.

                                                                     
                 t1                  s1                   max(t1 , s1 )
                t2                s2                 max(t2 , s2 )   
      φu,t =          , φv,t =          =⇒ φv,t = 
                                                                     
                  .
                  .                   .
                                      .                       .
                                                              .           
                 .                 .                     .           
                 tn                  sn                   max(tn , sn )
The Methodology




For each instantaneous graph:
  3   We compute the distances between all pairs of vector clocks
      using an appropriate metric.



      d(φu,t , φv,t ) =   (t1 − s1 )2 + (t2 − s2 )2 + · · · + (tn − sn )2

                               Which metric?
The Methodology




For each instantaneous graph:
  4   We construct a dissimilarity matrix from these distances.

                                                                                  
                    0            d(φu1 ,t , φu2 ,t ) · · ·   d(φu1 ,t , φun ,t )
            d(φu2 ,t , φu1 ,t )        0                                          
  Mt = 
                                                                                  
                     .
                     .                               ..                            
                    .                                   .                         
             d(φun ,t , φu1 ,t )                                     0
The Methodology




For each instantaneous graph:
  5   We use multidimensional scaling (MDS) to produce an 2-d
      visualization of the data points.




                                       Dynamic MDS?
                                       Procrustes Analysis?
Experiments



  Four artificial datasets comprising temporal sequences of
  communications between 100 actors during a time interval
  [0, 99].
  The datasets were generated by fixing the set of possible
  communications and then selecting a communication from the
  set of possible communications at time-slice t with probability
  p = 0.005.
  Each dataset had a distinct underlying communication
  pattern.
Experiments



        DS1: All communications were possible.


 25th           50th             75th            100th
Experiments


DS2: Communications were possible between every pair of actors
 in only one direction such that there were no directed cycles of
                         communication.


  25th              50th               75th               100th
Experiments


DS3: The actors were partitioned into four equal subsets and all
         intra-subset communications were possible.


 25th               50th               75th              100th
Experiments

  DS4: The actors were partitioned as in DS3 and intra-subset
communications were possible between every pair of actors in only
    one direction such that there were no directed cycles of
                        communication.


  25th               50th              75th              100th
Experiments

  We also visualized a VAST 2008 challenge dataset   (Grinstein et al.,

  ’08).

  This dataset comprises mobile phone call records over a 10
  day period between 400 unique mobile phones.
  We set each time-slice equal to one hour.


 25th             50th               75th                100th
Conclusions & Future Work



  A novel methodology for visualizing communication flow.
      temporal sequence of communications → vector clocks
                     distance metric
      vector clocks − − − − − → dissimilarity matrix
                     −−−−−
                            MDS
      dissimilarity matrix −→ 2-d visualizations
                            −
  Actors who have received the same or
  communicatively-equivalent communications are placed close
  together whereas actors that have received largely different
  communications are placed far apart.
Conclusions & Future Work


There is much future work:
    Both the distance metric (Bellman, ’61) and the choice of MDS
    algorithm need investigation.
    Can we extend vector clocks to model the attenuation of
    information, the bounded capacity of communication
    channels, etc.? Can we visualize synchronicity, network
    backbones, periodicity?
    Scalability (maintenance of the vector clocks and the
    computation of the dissimilarity matrix).
Thank You

Weitere ähnliche Inhalte

Was ist angesagt?

ABC and empirical likelihood
ABC and empirical likelihoodABC and empirical likelihood
ABC and empirical likelihoodChristian Robert
 
Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...Gautier Marti
 
Some contributions to the clustering of financial time series - Applications ...
Some contributions to the clustering of financial time series - Applications ...Some contributions to the clustering of financial time series - Applications ...
Some contributions to the clustering of financial time series - Applications ...Gautier Marti
 
QRC-ESPRIT Method for Wideband Signals
QRC-ESPRIT Method for Wideband SignalsQRC-ESPRIT Method for Wideband Signals
QRC-ESPRIT Method for Wideband SignalsIDES Editor
 
Rabbit challenge 5_dnn3
Rabbit challenge 5_dnn3Rabbit challenge 5_dnn3
Rabbit challenge 5_dnn3TOMMYLINK1
 
Pricing interest rate derivatives (ext)
Pricing interest rate derivatives (ext)Pricing interest rate derivatives (ext)
Pricing interest rate derivatives (ext)Swati Mital
 
Summary of "A Universally-Truthful Approximation Scheme for Multi-unit Auction"
Summary of "A Universally-Truthful Approximation Scheme for Multi-unit Auction"Summary of "A Universally-Truthful Approximation Scheme for Multi-unit Auction"
Summary of "A Universally-Truthful Approximation Scheme for Multi-unit Auction"Thatchaphol Saranurak
 
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018Universitat Politècnica de Catalunya
 
A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...
A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...
A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...graphhoc
 
Generative Adversarial Networks GAN - Santiago Pascual - UPC Barcelona 2018
Generative Adversarial Networks GAN - Santiago Pascual - UPC Barcelona 2018Generative Adversarial Networks GAN - Santiago Pascual - UPC Barcelona 2018
Generative Adversarial Networks GAN - Santiago Pascual - UPC Barcelona 2018Universitat Politècnica de Catalunya
 
Rabbit challenge 3 DNN Day2
Rabbit challenge 3 DNN Day2Rabbit challenge 3 DNN Day2
Rabbit challenge 3 DNN Day2TOMMYLINK1
 
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
(DL輪読)Variational Dropout Sparsifies Deep Neural NetworksMasahiro Suzuki
 
Digital Signal Processing[ECEG-3171]-Ch1_L07
Digital Signal Processing[ECEG-3171]-Ch1_L07Digital Signal Processing[ECEG-3171]-Ch1_L07
Digital Signal Processing[ECEG-3171]-Ch1_L07Rediet Moges
 
(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning
(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning
(研究会輪読) Facial Landmark Detection by Deep Multi-task LearningMasahiro Suzuki
 

Was ist angesagt? (18)

ABC and empirical likelihood
ABC and empirical likelihoodABC and empirical likelihood
ABC and empirical likelihood
 
2012 cvpr gtw
2012 cvpr gtw2012 cvpr gtw
2012 cvpr gtw
 
Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...
 
SSRN-id2718694
SSRN-id2718694SSRN-id2718694
SSRN-id2718694
 
Some contributions to the clustering of financial time series - Applications ...
Some contributions to the clustering of financial time series - Applications ...Some contributions to the clustering of financial time series - Applications ...
Some contributions to the clustering of financial time series - Applications ...
 
QRC-ESPRIT Method for Wideband Signals
QRC-ESPRIT Method for Wideband SignalsQRC-ESPRIT Method for Wideband Signals
QRC-ESPRIT Method for Wideband Signals
 
40120140501004
4012014050100440120140501004
40120140501004
 
Rabbit challenge 5_dnn3
Rabbit challenge 5_dnn3Rabbit challenge 5_dnn3
Rabbit challenge 5_dnn3
 
Pricing interest rate derivatives (ext)
Pricing interest rate derivatives (ext)Pricing interest rate derivatives (ext)
Pricing interest rate derivatives (ext)
 
Summary of "A Universally-Truthful Approximation Scheme for Multi-unit Auction"
Summary of "A Universally-Truthful Approximation Scheme for Multi-unit Auction"Summary of "A Universally-Truthful Approximation Scheme for Multi-unit Auction"
Summary of "A Universally-Truthful Approximation Scheme for Multi-unit Auction"
 
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
 
A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...
A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...
A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...
 
Generative Adversarial Networks GAN - Santiago Pascual - UPC Barcelona 2018
Generative Adversarial Networks GAN - Santiago Pascual - UPC Barcelona 2018Generative Adversarial Networks GAN - Santiago Pascual - UPC Barcelona 2018
Generative Adversarial Networks GAN - Santiago Pascual - UPC Barcelona 2018
 
Rabbit challenge 3 DNN Day2
Rabbit challenge 3 DNN Day2Rabbit challenge 3 DNN Day2
Rabbit challenge 3 DNN Day2
 
zeropadding
zeropaddingzeropadding
zeropadding
 
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
 
Digital Signal Processing[ECEG-3171]-Ch1_L07
Digital Signal Processing[ECEG-3171]-Ch1_L07Digital Signal Processing[ECEG-3171]-Ch1_L07
Digital Signal Processing[ECEG-3171]-Ch1_L07
 
(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning
(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning
(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning
 

Ähnlich wie Using Vector Clocks to Visualize Communication Flow

Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)Frank Nielsen
 
Distributed Beamforming in Sensor Networks
Distributed Beamforming in Sensor NetworksDistributed Beamforming in Sensor Networks
Distributed Beamforming in Sensor NetworksDaniel Tai
 
11.the comparative study of finite difference method and monte carlo method f...
11.the comparative study of finite difference method and monte carlo method f...11.the comparative study of finite difference method and monte carlo method f...
11.the comparative study of finite difference method and monte carlo method f...Alexander Decker
 
The comparative study of finite difference method and monte carlo method for ...
The comparative study of finite difference method and monte carlo method for ...The comparative study of finite difference method and monte carlo method for ...
The comparative study of finite difference method and monte carlo method for ...Alexander Decker
 
Project_report_BSS
Project_report_BSSProject_report_BSS
Project_report_BSSKamal Bhagat
 
Anomaly Detection in Sequences of Short Text Using Iterative Language Models
Anomaly Detection in Sequences of Short Text Using Iterative Language ModelsAnomaly Detection in Sequences of Short Text Using Iterative Language Models
Anomaly Detection in Sequences of Short Text Using Iterative Language ModelsCynthia Freeman
 
Continuum Modeling and Control of Large Nonuniform Networks
Continuum Modeling and Control of Large Nonuniform NetworksContinuum Modeling and Control of Large Nonuniform Networks
Continuum Modeling and Control of Large Nonuniform NetworksYang Zhang
 
Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...
Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...
Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...sipij
 
Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...
Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...
Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...sipij
 
Traveling Salesman Problem in Distributed Environment
Traveling Salesman Problem in Distributed EnvironmentTraveling Salesman Problem in Distributed Environment
Traveling Salesman Problem in Distributed Environmentcsandit
 
TRAVELING SALESMAN PROBLEM IN DISTRIBUTED ENVIRONMENT
TRAVELING SALESMAN PROBLEM IN DISTRIBUTED ENVIRONMENTTRAVELING SALESMAN PROBLEM IN DISTRIBUTED ENVIRONMENT
TRAVELING SALESMAN PROBLEM IN DISTRIBUTED ENVIRONMENTcscpconf
 
BLIND SIGNATURE SCHEME BASED ON CHEBYSHEV POLYNOMIALS
BLIND SIGNATURE SCHEME BASED ON CHEBYSHEV POLYNOMIALSBLIND SIGNATURE SCHEME BASED ON CHEBYSHEV POLYNOMIALS
BLIND SIGNATURE SCHEME BASED ON CHEBYSHEV POLYNOMIALSIJNSA Journal
 
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...Tanjarul Islam Mishu
 
Investigation of repeated blasts at Aitik mine using waveform cross correlation
Investigation of repeated blasts at Aitik mine using waveform cross correlationInvestigation of repeated blasts at Aitik mine using waveform cross correlation
Investigation of repeated blasts at Aitik mine using waveform cross correlationIvan Kitov
 
Fractals in Small-World Networks With Time Delay
Fractals in Small-World Networks With Time DelayFractals in Small-World Networks With Time Delay
Fractals in Small-World Networks With Time DelayXin-She Yang
 

Ähnlich wie Using Vector Clocks to Visualize Communication Flow (20)

Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
 
Where Next
Where NextWhere Next
Where Next
 
Distributed Beamforming in Sensor Networks
Distributed Beamforming in Sensor NetworksDistributed Beamforming in Sensor Networks
Distributed Beamforming in Sensor Networks
 
11.the comparative study of finite difference method and monte carlo method f...
11.the comparative study of finite difference method and monte carlo method f...11.the comparative study of finite difference method and monte carlo method f...
11.the comparative study of finite difference method and monte carlo method f...
 
The comparative study of finite difference method and monte carlo method for ...
The comparative study of finite difference method and monte carlo method for ...The comparative study of finite difference method and monte carlo method for ...
The comparative study of finite difference method and monte carlo method for ...
 
Project_report_BSS
Project_report_BSSProject_report_BSS
Project_report_BSS
 
Anomaly Detection in Sequences of Short Text Using Iterative Language Models
Anomaly Detection in Sequences of Short Text Using Iterative Language ModelsAnomaly Detection in Sequences of Short Text Using Iterative Language Models
Anomaly Detection in Sequences of Short Text Using Iterative Language Models
 
Lab manual
Lab manualLab manual
Lab manual
 
Continuum Modeling and Control of Large Nonuniform Networks
Continuum Modeling and Control of Large Nonuniform NetworksContinuum Modeling and Control of Large Nonuniform Networks
Continuum Modeling and Control of Large Nonuniform Networks
 
Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...
Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...
Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...
 
Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...
Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...
Sensing Method for Two-Target Detection in Time-Constrained Vector Poisson Ch...
 
Traveling Salesman Problem in Distributed Environment
Traveling Salesman Problem in Distributed EnvironmentTraveling Salesman Problem in Distributed Environment
Traveling Salesman Problem in Distributed Environment
 
TRAVELING SALESMAN PROBLEM IN DISTRIBUTED ENVIRONMENT
TRAVELING SALESMAN PROBLEM IN DISTRIBUTED ENVIRONMENTTRAVELING SALESMAN PROBLEM IN DISTRIBUTED ENVIRONMENT
TRAVELING SALESMAN PROBLEM IN DISTRIBUTED ENVIRONMENT
 
BLIND SIGNATURE SCHEME BASED ON CHEBYSHEV POLYNOMIALS
BLIND SIGNATURE SCHEME BASED ON CHEBYSHEV POLYNOMIALSBLIND SIGNATURE SCHEME BASED ON CHEBYSHEV POLYNOMIALS
BLIND SIGNATURE SCHEME BASED ON CHEBYSHEV POLYNOMIALS
 
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
 
Cdc18 dg lee
Cdc18 dg leeCdc18 dg lee
Cdc18 dg lee
 
Investigation of repeated blasts at Aitik mine using waveform cross correlation
Investigation of repeated blasts at Aitik mine using waveform cross correlationInvestigation of repeated blasts at Aitik mine using waveform cross correlation
Investigation of repeated blasts at Aitik mine using waveform cross correlation
 
40120140501004
4012014050100440120140501004
40120140501004
 
Fractals in Small-World Networks With Time Delay
Fractals in Small-World Networks With Time DelayFractals in Small-World Networks With Time Delay
Fractals in Small-World Networks With Time Delay
 
Journal_IJABME
Journal_IJABMEJournal_IJABME
Journal_IJABME
 

Using Vector Clocks to Visualize Communication Flow

  • 1. Using Vector Clocks to Visualize Communication Flow Martin Harrigan Complex & Adaptive Systems Laboratory (CASL) University College Dublin
  • 2. Introduction Communication Flow Vector Clocks The Methodology Experiments Conclusions & Future Work
  • 3. Introduction A metaphor for communication: flow in conduits ‘slow and fast conduits’ (Kossinets et al., ’08) ‘transatlantic flows of communication’ (Leskovec & Horvitz, ’08) ‘a series of tubes’ (Stevens, ’06) It is a tempting metaphor when visualizing communication flow. A dynamic weighted directed graph? people ↔ vertices communications ↔ (weighted) directed edges time ↔ animation
  • 4. Introduction Is the graphical design grounded in the substance to be communicated (Brandes, ’99)? Is there a discrepancy between the visualization and the analysis of communication flow? Visualization Analysis crossings synchronization area information latency slopes temporal distance symmetries network backbones mental map periodicity Can we integrate the two?
  • 5. Introduction Vector clocks are a useful tool when analyzing communication flow... Introduced by Fidge (’88) and Mattern (’89). Aid with the causal ordering of events in distributed systems. Provide greater insight into communication flow than dynamic weighted directed graphs (Kossinets et al., ’08). We can use them to visualize communication flow.
  • 6. Introduction A suggests to B, C, and D that they meet at the City Hotel Odense.
  • 7. Introduction C and D agree to meet at the Cab Inn.
  • 8. Introduction B and D agree to meet at the Radisson Blu.
  • 9. Introduction A checks with B and C where they should meet. D is no longer reachable but they know to meet him at the Radisson Blu.
  • 10. Introduction Vector clocks maintain, for each actor, the time of their most recent communication, either directly or indirectly, from every other actor.
  • 11. The Methodology visualizeVectorClocks
  • 13. The Methodology During a time interval [0, T ], we have a set of vertices V and a set of time-attributed directed edges E. The instantaneous graph Gt = (V, Et ) at time-slice t is the graph with vertex set V and edge set Et = {(u, v)|(u, v, t) ∈ E}. Input: G0 , . . . , GT (instantaneous graphs)
  • 14. The Methodology For each instantaneous graph: 1 Each vertex has a corresponding vector clock which represents a point in a high-dimensional space. 2 Each time-attributed directed edge (communication) updates the vector clock of the target (receiving) vertex. 3 We compute the distances between all pairs of vector clocks using an appropriate metric. 4 We construct a dissimilarity matrix from these distances. 5 We use multidimensional scaling (MDS) to produce an 2-d visualization of the data points.
  • 15. The Methodology Output: C0 , . . . , CT (2-d coordinates for each time-slice) t=0 t=1 t=2 t=3 Greene et al., ’10, Lancichinetti et al., 08
  • 16. The Methodology For each instantaneous graph:
  • 17. The Methodology For each instantaneous graph: 1 Each vertex has a corresponding vector clock which represents a point in a high-dimensional space.   t1 Initialization  t2  φu,t =    . .  Individual increment at  .  each time-slice tn
  • 18. The Methodology For each instantaneous graph: 2 Each time-attributed directed edge (communication) updates the vector clock of the target (receiving) vertex.       t1 s1 max(t1 , s1 )  t2   s2   max(t2 , s2 )  φu,t =   , φv,t =   =⇒ φv,t =        . . . . . .   .   .   .  tn sn max(tn , sn )
  • 19. The Methodology For each instantaneous graph: 3 We compute the distances between all pairs of vector clocks using an appropriate metric. d(φu,t , φv,t ) = (t1 − s1 )2 + (t2 − s2 )2 + · · · + (tn − sn )2 Which metric?
  • 20. The Methodology For each instantaneous graph: 4 We construct a dissimilarity matrix from these distances.   0 d(φu1 ,t , φu2 ,t ) · · · d(φu1 ,t , φun ,t )  d(φu2 ,t , φu1 ,t ) 0  Mt =    . . ..   . .  d(φun ,t , φu1 ,t ) 0
  • 21. The Methodology For each instantaneous graph: 5 We use multidimensional scaling (MDS) to produce an 2-d visualization of the data points. Dynamic MDS? Procrustes Analysis?
  • 22. Experiments Four artificial datasets comprising temporal sequences of communications between 100 actors during a time interval [0, 99]. The datasets were generated by fixing the set of possible communications and then selecting a communication from the set of possible communications at time-slice t with probability p = 0.005. Each dataset had a distinct underlying communication pattern.
  • 23. Experiments DS1: All communications were possible. 25th 50th 75th 100th
  • 24. Experiments DS2: Communications were possible between every pair of actors in only one direction such that there were no directed cycles of communication. 25th 50th 75th 100th
  • 25. Experiments DS3: The actors were partitioned into four equal subsets and all intra-subset communications were possible. 25th 50th 75th 100th
  • 26. Experiments DS4: The actors were partitioned as in DS3 and intra-subset communications were possible between every pair of actors in only one direction such that there were no directed cycles of communication. 25th 50th 75th 100th
  • 27. Experiments We also visualized a VAST 2008 challenge dataset (Grinstein et al., ’08). This dataset comprises mobile phone call records over a 10 day period between 400 unique mobile phones. We set each time-slice equal to one hour. 25th 50th 75th 100th
  • 28. Conclusions & Future Work A novel methodology for visualizing communication flow. temporal sequence of communications → vector clocks distance metric vector clocks − − − − − → dissimilarity matrix −−−−− MDS dissimilarity matrix −→ 2-d visualizations − Actors who have received the same or communicatively-equivalent communications are placed close together whereas actors that have received largely different communications are placed far apart.
  • 29. Conclusions & Future Work There is much future work: Both the distance metric (Bellman, ’61) and the choice of MDS algorithm need investigation. Can we extend vector clocks to model the attenuation of information, the bounded capacity of communication channels, etc.? Can we visualize synchronicity, network backbones, periodicity? Scalability (maintenance of the vector clocks and the computation of the dissimilarity matrix).