SlideShare a Scribd company logo
1 of 27
Download to read offline
Analysis of Streaming Social Networks and
Graphs on Multicore Architectures
Jason Riedy, Henning Meyerhenke (KIT), David A. Bader,
David Ediger, Timothy G. Mattson (Intel MPRL)

                                         29 March, 2012
Outline


       Motivation
         Graphs are pervasive!
         Why analyze data streams?
         Overall streaming approach

       Technical
          STING and STINGER
          Clustering coefficients
          Connected components

       Conclusions




ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   2/26
Exascale Data Analysis


              Health care Finding outbreaks, population epidemiology
        Social networks Advertising, searching, grouping
               Intelligence Decisions at scale, regulating algorithms
       Systems biology Understanding interactions, drug design
               Power grid Disruptions, conservation
               Simulation Discrete events, cracking meshes

                     The data is full of semantically rich relationships.
                                  Graphs! Graphs! Graphs!



ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   3/26
These are not easy graphs.
                                Yifan Hu’s (AT&T) visualization of the Livejournal data set




ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   4/26
But no shortage of structure...




           Protein interactions, Giot et al., “A Protein
           Interaction Map of Drosophila melanogaster”,
                                                                        Jason’s network via LinkedIn Labs
           Science 302, 1722-1736, 2003.



           • Globally, there rarely are good, balanced separators in the
               scientific computing sense.
           • Locally, there are clusters or communities and many levels of
               detail.


ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy         5/26
Also no shortage of data...
       Existing (some out-of-date) data volumes
                 NYSE 1.5 TB generated daily into a maintained 8 PB archive
                Google “Several dozen” 1PB data sets (CACM, Jan 2010)
                    LHC 15 PB per year (avg. 21 TB daily)
                        http://public.web.cern.ch/public/en/lhc/
                        Computing-en.html
            Wal-Mart 536 TB, 1B entries daily (2006)
                  EBay 2 PB, traditional DB, and 6.5PB streaming, 17 trillion
                       records, 1.5B records/day, each web click is 50-150
                       details. http://www.dbms2.com/2009/04/30/
                       ebays-two-enormous-data-warehouses/
            Faceboot 845 M users... and growing.

           • All data is rich and semantic (graphs!) and changing.
           • Base data rates include items and not relationships.
ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   6/26
General approaches

           • High-performance static graph analysis
               • Develop techniques that apply to unchanging massive graphs.
               • Provides useful after-the-fact information, starting points.
               • Serves many existing applications well: market research, much
                 bioinformatics, ...
           • High-performance streaming graph analysis
               • Focus on the dynamic changes within massive graphs.
               • Find trends or new information as they appear.
               • Serves upcoming applications: fault or threat detection, trend
                 analysis, ...


                     Both very important to different areas.
                        Remaining focus is on streaming.
          Note: Not CS theory streaming, but analysis of streaming data.


ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   7/26
Why analyze data streams?

                                                     Data transfer
                                                        • 1 Gb Ethernet: 8.7TB daily at
    Data volumes
                                                            100%, 5-6TB daily realistic
              NYSE 1.5TB daily
                                                        • Multi-TB storage on 10GE: 300TB
                LHC 41TB daily                              daily read, 90TB daily write
        Facebook Who knows?                             • CPU ↔ Memory: QPI,HT:
                                                            2PB/day@100%

    Data growth
                                                     Speed growth
        • Facebook: > 2×/yr
                                                        • Ethernet/IB/etc.: 4× in next 2
        • Twitter: > 10×/yr
                                                            years. Maybe.
        • Growing sources:
                                                        • Flash storage, direct: 10× write,
           Bioinformatics,                                  4× read. Relatively huge cost.
           µsensors, security

ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   8/26
Overall streaming approach




           Protein interactions, Giot et al., “A Protein
           Interaction Map of Drosophila melanogaster”,
                                                                        Jason’s network via LinkedIn Labs
           Science 302, 1722-1736, 2003.



       Assumptions
           • A graph represents some real-world phenomenon.
               • But not necessarily exactly!
               • Noise comes from lost updates, partial information, ...



ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy         9/26
Overall streaming approach




           Protein interactions, Giot et al., “A Protein
           Interaction Map of Drosophila melanogaster”,
                                                                        Jason’s network via LinkedIn Labs
           Science 302, 1722-1736, 2003.



       Assumptions
           • We target massive, “social network” graphs.
              • Small diameter, power-law degrees
              • Small changes in massive graphs often are unrelated.



ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy         10/26
Overall streaming approach




           Protein interactions, Giot et al., “A Protein
           Interaction Map of Drosophila melanogaster”,
                                                                        Jason’s network via LinkedIn Labs
           Science 302, 1722-1736, 2003.



       Assumptions
           • The graph changes, but we don’t need a continuous view.
               • We can accumulate changes into batches...
               • But not so many that it impedes responsiveness.



ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy         11/26
Difficulties for performance

          • What partitioning
             methods apply?
                 • Geometric? Nope.
                 • Balanced? Nope.
                 • Is there a single, useful
                     decomposition?
                         Not likely.
          • Some partitions exist, but
             they don’t often help
             with balanced bisection or
             memory locality.
          • Performance needs new
             approaches, not just
             standard scientific                                         Jason’s network via LinkedIn Labs
             computing methods.

ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy         12/26
STING’s focus

                            Control
             action                                prediction


                                                                                summary
          Source              data          Simulation / query                                        Viz


           • STING manages queries against changing graph data.
                • Visualization and control often are application specific.
           • Ideal: Maintain many persistent graph analysis kernels.
                • Keep one current snapshot of the graph resident.
                • Let kernels maintain smaller histories.
                • Also (a harder goal), coordinate the kernels’ cooperation.
           • Gather data into a typed graph structure, STINGER.



ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy         13/26
STINGER
       STING Extensible Representation:




           • Rule #1: No explicit locking.
               • Rely on atomic operations.
           • Massive graph: Scattered updates, scattered reads rarely
               conflict.
           • Time stamps: provide some thresholds for time.


ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   14/26
Initial results


       Prototype STING and STINGER
       Monitoring the following properties:
           1   clustering coefficients,
           2   connected components, and
           3   community structure (in progress).

       High-level
           • Support high rates of change, over 10k updates per second.
           • Performance scales somewhat with available processing.

                         http://www.cc.gatech.edu/stinger/



ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   15/26
Experimental setup
       Unless otherwise noted
                     Line             Model           Speed (GHz)             Sockets         Cores
                 Nehalem              X5570                 2.93                   2              4
                 Westmere            E7-8870                2.40                   4             10

           • Westmere loaned by Intel (thank you!)
           • All memory: 1067MHz DDR3, installed appropriately
           • Implementations: OpenMP, gcc 4.6.1, Linux ≈ 3.0 kernel
           • Artificial graph and edge stream generated by R-MAT
               [Chakrabarti, Zhan, & Faloutsos].
                   • Scale x, edge factor f ⇒ 2x vertices, ≈ f · 2x edges.
                   • Edge actions: 7/8th insertions, 1/8th deletions
                   • Results over five batches of edge actions.
           • Caveat: No vector instructions, low-level optimizations yet.

ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   16/26
Clustering coefficients
        • Used to measure “small-world-ness”
           [Watts & Strogatz] and potential                                            i                  m
           community structure
        • Larger clustering coefficient ⇒ more                                                          v
           inter-connected
                                                                                       j                  n
        • Roughly the ratio of the number of actual
           to potential triangles

           • Defined in terms of triplets.
           • i – v – j is a closed triplet (triangle).
           • m – v – n is an open triplet.
           • Clustering coefficient:
                                    # of closed triplets / total # of triplets
           • Locally around v or globally for entire graph.

ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy           17/26
Updating triangle counts

                  Given Edge {u, v } to be inserted (+) or deleted (-)
            Approach Search for vertices adjacent to both u and v , update
                     counts on those and u and v

       Three methods
         Brute force Intersect neighbors of u and v by iterating over each,
                     O(du dv ) time.
           Sorted list Sort u’s neighbors. For each neighbor of v , check if in
                       the sorted list.
       Compressed bits Summarize u’s neighbors in a bit array. Reduces
                   check for v ’s neighbors to O(1) time each.
                   Approximate with Bloom filters. [MTAAP10]
       All rely on atomic addition.


ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   18/26
Batches of 10k actions

                                                                                Brute force                              Bloom filter                                   Sorted list
                                                                 1.7e+04                  3.4e+04         8.8e+04                   1.2e+05        8.8e+04                        1.4e+05
                                                        105.5
                                                                 1.5e+04                                  1.3e+05                                  1.2e+05
                                                                5.1e+03                                  2.4e+04                                  2.2e+04
         Updates per seconds, both metric and STINGER




                                                                3.9e+03                                  2.0e+04                                  1.7e+04                         q        q
                                                                                                                                         q                                        q    q
                                                                                                                    q                                                             q
                                                                                                                                                                                  q
                                                                                                                                                                                  q
                                                            5                                                           q                                                q
                                                                                                                    q                                               q
                                                         10                                                    q                                           q
                                                                                                                                                               qq
                                                                                                                                                                q
                                                                                                                q                                               q
                                                                                                               q                                           q

                                                                                                               q
                                                                                                                                                       q

                                                                                                                                                   q                                            Machine
                                                        104.5                                                                                       q                                           a   4 x E7−8870

                                                                           q                                                                       q
                                                                                                                                                                                                a   2 x X5570

                                                                                                                                                   q
                                                                                                                                                   q


                                                                     q q
                                                                     q

                                                         104




                                                        103.5

                                                                 0         20      40         60    80     0        20       40     60       80    0            20           40       60   80
                                                                                                                 Threads
                                                                                                   Graph size: scale 22, edge factor 16




ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy                                                                                                               19/26
Different batch sizes

                                                                                Brute force                                     Bloom filter                                    Sorted list
                                                        105.5

                                                         105                                                      q
                                                                                                                  q
                                                                                                                                                                    q
                                                                                                                                                                    q
                                                                                                                                                                  q q

                                                                                              q                            q                                            q                         q
         Updates per seconds, both metric and STINGER




                                                                                              q                                        q                                                  q




                                                                                                                                                                                                           100
                                                          4.5                                                                                    q
                                                        10                                                   q                                                                       q
                                                                      q                                                                     q                                             q
                                                                                                                                   q        q
                                                                q q
                                                         104     q

                                                                qq                                                     q
                                                          3.5                                 q   q
                                                        10
                                                        105.5

                                                         105                                                      q
                                                                                                                  q
                                                                                                                   q
                                                                                                                                                                  q                           q
                                                                                                                 q                                   q   q                                                         Machine




                                                                                                                                                                                                           1000
                                                        104.5                                                 q
                                                                                                                                                              q
                                                                                                                                                                                                                      4 x E7−8870
                                                                                                             q                                                q
                                                                                                                                                              q
                                                                                                                                                                                                                      2 x X5570
                                                         104    q
                                                                  q
                                                                q
                                                                q
                                                                q
                                                                q

                                                        103.5
                                                        105.5
                                                                                                                                                     q                                        q   q   q
                                                                                                                           q                                                                  q
                                                                                                                                                                                              q
                                                         105                                                      qq
                                                                                                                  q
                                                                                                                           q   q
                                                                                                                                                                    qqq
                                                                                                                                                                      q
                                                                                                                                                                    q q
                                                                                                                                                                            q    q

                                                                                                                 q




                                                                                                                                                                                                           10000
                                                                                                                                                                q
                                                          4.5                                                                                                 q
                                                        10                 q                                                                                  q
                                                                                                                                                               q

                                                                                                                                                              q
                                                                     q q
                                                         104

                                                        103.5
                                                                0          20      40     60          80     0             20          40       60       80   0         20           40       60      80
                                                                                                                    Threads
                                                                                                      Graph size: scale 22, edge factor 16




ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy                                                                                                                                 20/26
Different batch sizes: Reactivity

                                                                                    Brute force                               Bloom filter                                      Sorted list

                                                              100
                                                            10−0.5
         Seconds between updates, both metric and STINGER




                                                             10−1




                                                                                                                                                                                                           100
                                                                                                                                                                    q
                                                                           qq
                                                            10−1.5   q
                                                                     q
                                                                                                  q
                                                                                                  q
                                                                                                  q   q


                                                             10−2     q
                                                                     q q
                                                                                                                                 q        q
                                                                                                                                          q                                               q
                                                                       q                                                                                                             q
                                                                                                               q
                                                            10−2.5                                q
                                                                                                  q                      q           q
                                                                                                                                               q
                                                                                                                                                                        q                 q       q
                                                                                                                   q
                                                             10−3                                                  q                                            q
                                                                                                                                                                    q




                                                              100
                                                            10−0.5   q
                                                                     q
                                                                     q                                                                                                                                             Machine
                                                             10−1    qq




                                                                                                                                                                                                           1000
                                                                                                                                                            q
                                                                                                               q                                            q
                                                                                                                                                            q                                                         4 x E7−8870
                                                            10−1.5                                             q
                                                                                                                                                       q
                                                                                                                   q                               q
                                                             10−2
                                                                                                                    q
                                                                                                                    qq
                                                                                                                                                                q                             q                       2 x X5570
                                                            10−2.5
                                                             10−3

                                                              100        q q
                                                                                                                                                            q
                                                                               q                                                                            q
                                                            10−0.5                                                 q
                                                                                                                                                            q
                                                                                                                                                            q
                                                                                                                                                              q
                                                                                                                    q                                             q q
                                                             10−1                                                  qq                                             qqq




                                                                                                                                                                                                           10000
                                                                                                                         q   q                                      q       q    q
                                                                                                                         q                         q                                          q
                                                                                                                                                                                              q   q   q

                                                            10−1.5
                                                             10−2
                                                            10−2.5
                                                             10−3

                                                                     0         20      40     60          80   0         20          40       60       80   0           20           40       60      80
                                                                                                                    Threads
                                                                                                      Graph size: scale 22, edge factor 16




ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy                                                                                                                                 21/26
Connected components


           • Maintain a mapping from vertex to
               component.
           • Global property, unlike triangle
               counts
           • In “scale free” social networks:
                • Often one big component, and
                • many tiny ones.
           • Edge changes often sit within
               components.
           • Remaining insertions merge
               components.
           • Deletions are more difficult...



ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   22/26
Connected components: Deleted edges

    The difficult case
        • Very few deletions
          matter.
        • Determining which
          matter may require a
          large graph search.
               • Re-running static
                 component
                 detection.
               • (Long history, see
                 related work in
                 [MTAAP11].)
        • Coping mechanisms:
            • Heuristics.
            • Second level of
              batching.

ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   23/26
Deletion heuristics



       Rule out effect-less deletions
           • Use the spanning tree by-product of static connected
               component algorithms.
           • Ignore deletions when one of the following occur:
               1 The deleted edge is not in the spanning tree.
               2 If the endpoints share a common neighbor∗ .
               3 If the loose endpoint can reach the root∗ .

           • In the last two (∗), also fix the spanning tree.

                                     Rules out 99.7% of deletions.




ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   24/26
Connected components: Performance

                                                        106   2.4e+03 1.6e+04                                                  6.4e+03
                                                              3.2e+03 2.0e+03
                                                        105




                                                                                                                                         100
         Updates per seconds, both metric and STINGER




                                                        104

                                                        103



                                                        106   1.7e+04 7.7e+04                                                  1.4e+04
                                                              1.9e+04 2.0e+04
                                                        105                                                                                      Machine




                                                                                                                                         1000
                                                        104
                                                                                                                                                 a   4 x E7−8870
                                                                                                                                                 a   2 x X5570
                                                          3
                                                        10



                                                        106   5.8e+04 1.3e+05                                                  1.5e+05
                                                              5.5e+04 1.1e+05
                                                          5
                                                        10




                                                                                                                                         10000
                                                        104

                                                        103


                                                               12 4 6 8   12 16    24      32      40      48     56     64   72   80
                                                                                                Threads
                                                                                  Graph size: scale 22, edge factor 16




ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy                                                                25/26
Conclusions

           • On commercial Intel-based platforms, our STING system
               supports update rates of
                   • 100 000 updates per second for clustering coefficients, and
                   • 70 000 updates per second for component labels.
           • The rates are on batches small enough to respond 100 or 70
               times per second, respectively.
           • Batching updates exposes parallelism without much delay.
           • Reducing graph searches (memory access) with better heuristics
               improves performance more with fewer paths to memory.

       Software freely available
                           http://www.cc.gatech.edu/stinger/



ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   26/26
Acknowledgment of support




ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy   27/26

More Related Content

What's hot

Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012
Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012
Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012Preferred Networks
 
Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...
Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...
Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...Amit Sheth
 
Educating a New Breed of Data Scientists for Scientific Data Management
Educating a New Breed of Data Scientists for Scientific Data Management Educating a New Breed of Data Scientists for Scientific Data Management
Educating a New Breed of Data Scientists for Scientific Data Management Jian Qin
 
The Semantic Web: It's for Real
The Semantic Web: It's for RealThe Semantic Web: It's for Real
The Semantic Web: It's for RealJames Hendler
 
EPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data TalkEPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data TalkAdina Chuang Howe
 
The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration James Hendler
 
Unexpected Challenges in Large Scale Machine Learning by Charles Parker
 Unexpected Challenges in Large Scale Machine Learning by Charles Parker Unexpected Challenges in Large Scale Machine Learning by Charles Parker
Unexpected Challenges in Large Scale Machine Learning by Charles ParkerBigMine
 
VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...
VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...
VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...Keiichiro Ono
 
Taming the Big Data Beast - Together
Taming the Big Data Beast - TogetherTaming the Big Data Beast - Together
Taming the Big Data Beast - TogetherKennisalliantie
 
Building the Pacific Research Platform: Supernetworks for Big Data Science
Building the Pacific Research Platform: Supernetworks for Big Data ScienceBuilding the Pacific Research Platform: Supernetworks for Big Data Science
Building the Pacific Research Platform: Supernetworks for Big Data ScienceLarry Smarr
 
(Tugdual grall) no sql-hadoop
(Tugdual grall)   no sql-hadoop(Tugdual grall)   no sql-hadoop
(Tugdual grall) no sql-hadoopNAVER D2
 
Physical Cyber Social Computing
Physical Cyber Social ComputingPhysical Cyber Social Computing
Physical Cyber Social ComputingAmit Sheth
 
g-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionalityg-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking FunctionalityNicholas Loulloudes
 
Large scale computing
Large scale computing Large scale computing
Large scale computing Bhupesh Bansal
 
Eyeo 2019-Lightning-Cytoscape
Eyeo 2019-Lightning-CytoscapeEyeo 2019-Lightning-Cytoscape
Eyeo 2019-Lightning-CytoscapeKeiichiro Ono
 
Knowledge Will Propel Machine Understanding of Big Data
Knowledge Will Propel Machine Understanding of Big DataKnowledge Will Propel Machine Understanding of Big Data
Knowledge Will Propel Machine Understanding of Big DataAmit Sheth
 
DATA CENTRIC EDUCATION & LEARNING
 DATA CENTRIC EDUCATION & LEARNING DATA CENTRIC EDUCATION & LEARNING
DATA CENTRIC EDUCATION & LEARNINGdatasciencekorea
 
Philosophy of Deep Learning
Philosophy of Deep LearningPhilosophy of Deep Learning
Philosophy of Deep LearningMelanie Swan
 

What's hot (20)

Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012
Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012
Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012
 
Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...
Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...
Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...
 
Educating a New Breed of Data Scientists for Scientific Data Management
Educating a New Breed of Data Scientists for Scientific Data Management Educating a New Breed of Data Scientists for Scientific Data Management
Educating a New Breed of Data Scientists for Scientific Data Management
 
Using CINET
Using CINETUsing CINET
Using CINET
 
The Semantic Web: It's for Real
The Semantic Web: It's for RealThe Semantic Web: It's for Real
The Semantic Web: It's for Real
 
EPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data TalkEPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data Talk
 
The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration
 
Unexpected Challenges in Large Scale Machine Learning by Charles Parker
 Unexpected Challenges in Large Scale Machine Learning by Charles Parker Unexpected Challenges in Large Scale Machine Learning by Charles Parker
Unexpected Challenges in Large Scale Machine Learning by Charles Parker
 
VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...
VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...
VIZBI 2015 Tutorial: Cytoscape, IPython, Docker, and Reproducible Network Dat...
 
Taming the Big Data Beast - Together
Taming the Big Data Beast - TogetherTaming the Big Data Beast - Together
Taming the Big Data Beast - Together
 
Building the Pacific Research Platform: Supernetworks for Big Data Science
Building the Pacific Research Platform: Supernetworks for Big Data ScienceBuilding the Pacific Research Platform: Supernetworks for Big Data Science
Building the Pacific Research Platform: Supernetworks for Big Data Science
 
Lecture #03
Lecture #03Lecture #03
Lecture #03
 
(Tugdual grall) no sql-hadoop
(Tugdual grall)   no sql-hadoop(Tugdual grall)   no sql-hadoop
(Tugdual grall) no sql-hadoop
 
Physical Cyber Social Computing
Physical Cyber Social ComputingPhysical Cyber Social Computing
Physical Cyber Social Computing
 
g-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionalityg-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionality
 
Large scale computing
Large scale computing Large scale computing
Large scale computing
 
Eyeo 2019-Lightning-Cytoscape
Eyeo 2019-Lightning-CytoscapeEyeo 2019-Lightning-Cytoscape
Eyeo 2019-Lightning-Cytoscape
 
Knowledge Will Propel Machine Understanding of Big Data
Knowledge Will Propel Machine Understanding of Big DataKnowledge Will Propel Machine Understanding of Big Data
Knowledge Will Propel Machine Understanding of Big Data
 
DATA CENTRIC EDUCATION & LEARNING
 DATA CENTRIC EDUCATION & LEARNING DATA CENTRIC EDUCATION & LEARNING
DATA CENTRIC EDUCATION & LEARNING
 
Philosophy of Deep Learning
Philosophy of Deep LearningPhilosophy of Deep Learning
Philosophy of Deep Learning
 

Viewers also liked

A massive protocluster of galaxies at a redshift of z<5.3
A massive protocluster of galaxies at a redshift of z<5.3A massive protocluster of galaxies at a redshift of z<5.3
A massive protocluster of galaxies at a redshift of z<5.3Sérgio Sacani
 
La gente-que-me-gusta-milespowerpoints.com
La gente-que-me-gusta-milespowerpoints.comLa gente-que-me-gusta-milespowerpoints.com
La gente-que-me-gusta-milespowerpoints.comagosto1967
 
Klenk & Hoursch Krisenkommunikation Umfrageergebnisse (deutsch)
Klenk & Hoursch Krisenkommunikation Umfrageergebnisse (deutsch)Klenk & Hoursch Krisenkommunikation Umfrageergebnisse (deutsch)
Klenk & Hoursch Krisenkommunikation Umfrageergebnisse (deutsch)Tobias Mueller
 
Inver.giro 3ºc 13 14
Inver.giro 3ºc 13 14Inver.giro 3ºc 13 14
Inver.giro 3ºc 13 14jcallejcb
 
Google tools 101 the kuyu project
Google tools 101   the kuyu projectGoogle tools 101   the kuyu project
Google tools 101 the kuyu project✅ Jo Webber
 
"Mobilmachung" nachgeblättert. Versuch einer Systematisierung der Angebote un...
"Mobilmachung" nachgeblättert. Versuch einer Systematisierung der Angebote un..."Mobilmachung" nachgeblättert. Versuch einer Systematisierung der Angebote un...
"Mobilmachung" nachgeblättert. Versuch einer Systematisierung der Angebote un...Martin Krieg
 
Netcoo bericht zu nwa
Netcoo bericht zu nwa Netcoo bericht zu nwa
Netcoo bericht zu nwa Otmar Lindner
 
El Salvador: migraciones, indígenas y cultura
El Salvador: migraciones, indígenas y culturaEl Salvador: migraciones, indígenas y cultura
El Salvador: migraciones, indígenas y culturaMiguel Huezo-Mixco
 
Drumm moore, vol.1 introducción a la planificación del ecoturismo
Drumm   moore, vol.1 introducción a la planificación del ecoturismoDrumm   moore, vol.1 introducción a la planificación del ecoturismo
Drumm moore, vol.1 introducción a la planificación del ecoturismodanijhoe
 
Profesores Regalones
Profesores RegalonesProfesores Regalones
Profesores Regalonespedro peña
 
Artesmarciales 120402070109-phpapp02
Artesmarciales 120402070109-phpapp02Artesmarciales 120402070109-phpapp02
Artesmarciales 120402070109-phpapp02Cristian Chiriboga
 
Mercadeo y ventas - JCI Argentina
Mercadeo y ventas - JCI ArgentinaMercadeo y ventas - JCI Argentina
Mercadeo y ventas - JCI ArgentinaJCI Argentina
 
Personalizace & automatizace 2014: Etnetera o technologii Soyka
Personalizace & automatizace 2014: Etnetera o technologii SoykaPersonalizace & automatizace 2014: Etnetera o technologii Soyka
Personalizace & automatizace 2014: Etnetera o technologii SoykaBESTETO
 

Viewers also liked (20)

A massive protocluster of galaxies at a redshift of z<5.3
A massive protocluster of galaxies at a redshift of z<5.3A massive protocluster of galaxies at a redshift of z<5.3
A massive protocluster of galaxies at a redshift of z<5.3
 
La gente-que-me-gusta-milespowerpoints.com
La gente-que-me-gusta-milespowerpoints.comLa gente-que-me-gusta-milespowerpoints.com
La gente-que-me-gusta-milespowerpoints.com
 
Klenk & Hoursch Krisenkommunikation Umfrageergebnisse (deutsch)
Klenk & Hoursch Krisenkommunikation Umfrageergebnisse (deutsch)Klenk & Hoursch Krisenkommunikation Umfrageergebnisse (deutsch)
Klenk & Hoursch Krisenkommunikation Umfrageergebnisse (deutsch)
 
Joangarcia tecnica
Joangarcia tecnicaJoangarcia tecnica
Joangarcia tecnica
 
Inver.giro 3ºc 13 14
Inver.giro 3ºc 13 14Inver.giro 3ºc 13 14
Inver.giro 3ºc 13 14
 
Mi familia es unida
Mi  familia  es  unidaMi  familia  es  unida
Mi familia es unida
 
Recompensar en Navidad
Recompensar en NavidadRecompensar en Navidad
Recompensar en Navidad
 
Google tools 101 the kuyu project
Google tools 101   the kuyu projectGoogle tools 101   the kuyu project
Google tools 101 the kuyu project
 
"Mobilmachung" nachgeblättert. Versuch einer Systematisierung der Angebote un...
"Mobilmachung" nachgeblättert. Versuch einer Systematisierung der Angebote un..."Mobilmachung" nachgeblättert. Versuch einer Systematisierung der Angebote un...
"Mobilmachung" nachgeblättert. Versuch einer Systematisierung der Angebote un...
 
Netcoo bericht zu nwa
Netcoo bericht zu nwa Netcoo bericht zu nwa
Netcoo bericht zu nwa
 
El Salvador: migraciones, indígenas y cultura
El Salvador: migraciones, indígenas y culturaEl Salvador: migraciones, indígenas y cultura
El Salvador: migraciones, indígenas y cultura
 
Video marketing en internet
Video marketing en internetVideo marketing en internet
Video marketing en internet
 
Drumm moore, vol.1 introducción a la planificación del ecoturismo
Drumm   moore, vol.1 introducción a la planificación del ecoturismoDrumm   moore, vol.1 introducción a la planificación del ecoturismo
Drumm moore, vol.1 introducción a la planificación del ecoturismo
 
Profesores Regalones
Profesores RegalonesProfesores Regalones
Profesores Regalones
 
Artesmarciales 120402070109-phpapp02
Artesmarciales 120402070109-phpapp02Artesmarciales 120402070109-phpapp02
Artesmarciales 120402070109-phpapp02
 
朋程法說會8255
朋程法說會8255朋程法說會8255
朋程法說會8255
 
Bethesda
BethesdaBethesda
Bethesda
 
Mercadeo y ventas - JCI Argentina
Mercadeo y ventas - JCI ArgentinaMercadeo y ventas - JCI Argentina
Mercadeo y ventas - JCI Argentina
 
Personalizace & automatizace 2014: Etnetera o technologii Soyka
Personalizace & automatizace 2014: Etnetera o technologii SoykaPersonalizace & automatizace 2014: Etnetera o technologii Soyka
Personalizace & automatizace 2014: Etnetera o technologii Soyka
 
Enfermedad fúngica invasora
Enfermedad fúngica invasoraEnfermedad fúngica invasora
Enfermedad fúngica invasora
 

Similar to ICASSP 2012: Analysis of Streaming Social Networks and Graphs on Multicore Architectures

SIAM PP 2012: Scalable Algorithms for Analysis of Massive, Streaming Graphs
SIAM PP 2012: Scalable Algorithms for Analysis of Massive, Streaming Graphs SIAM PP 2012: Scalable Algorithms for Analysis of Massive, Streaming Graphs
SIAM PP 2012: Scalable Algorithms for Analysis of Massive, Streaming Graphs Jason Riedy
 
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsSTING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsJason Riedy
 
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsSTING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsJason Riedy
 
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsSTING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsJason Riedy
 
High-Performance Analysis of Streaming Graphs
High-Performance Analysis of Streaming Graphs High-Performance Analysis of Streaming Graphs
High-Performance Analysis of Streaming Graphs Jason Riedy
 
MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo...
 MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo... MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo...
MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo...Peter Rose
 
DATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONS
DATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONSDATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONS
DATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONSijdms
 
Scalable and Efficient Algorithms for Analysis of Massive, Streaming Graphs
Scalable and Efficient Algorithms for Analysis of Massive, Streaming GraphsScalable and Efficient Algorithms for Analysis of Massive, Streaming Graphs
Scalable and Efficient Algorithms for Analysis of Massive, Streaming GraphsJason Riedy
 
Big Data HPC Convergence and a bunch of other things
Big Data HPC Convergence and a bunch of other thingsBig Data HPC Convergence and a bunch of other things
Big Data HPC Convergence and a bunch of other thingsGeoffrey Fox
 
Graham Pryor
Graham PryorGraham Pryor
Graham PryorEduserv
 
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUMETHE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUMEGigaom
 
STING: A Framework for Analyzing Spacio-Temporal Interaction Networks and Graphs
STING: A Framework for Analyzing Spacio-Temporal Interaction Networks and GraphsSTING: A Framework for Analyzing Spacio-Temporal Interaction Networks and Graphs
STING: A Framework for Analyzing Spacio-Temporal Interaction Networks and GraphsJason Riedy
 
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...Tal Lavian Ph.D.
 
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...i_scienceEU
 
Introduction to STINGER
Introduction to STINGERIntroduction to STINGER
Introduction to STINGERrobertmccoll
 

Similar to ICASSP 2012: Analysis of Streaming Social Networks and Graphs on Multicore Architectures (20)

SIAM PP 2012: Scalable Algorithms for Analysis of Massive, Streaming Graphs
SIAM PP 2012: Scalable Algorithms for Analysis of Massive, Streaming Graphs SIAM PP 2012: Scalable Algorithms for Analysis of Massive, Streaming Graphs
SIAM PP 2012: Scalable Algorithms for Analysis of Massive, Streaming Graphs
 
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsSTING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
 
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsSTING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
 
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsSTING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
 
High-Performance Analysis of Streaming Graphs
High-Performance Analysis of Streaming Graphs High-Performance Analysis of Streaming Graphs
High-Performance Analysis of Streaming Graphs
 
Are you ready for BIG DATA?
Are you ready for BIG DATA?Are you ready for BIG DATA?
Are you ready for BIG DATA?
 
Big Data
Big Data Big Data
Big Data
 
MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo...
 MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo... MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo...
MMTF-Spark: Interactive, Scalable, and Reproducible Datamining of 3D Macromo...
 
DATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONS
DATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONSDATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONS
DATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONS
 
Scalable and Efficient Algorithms for Analysis of Massive, Streaming Graphs
Scalable and Efficient Algorithms for Analysis of Massive, Streaming GraphsScalable and Efficient Algorithms for Analysis of Massive, Streaming Graphs
Scalable and Efficient Algorithms for Analysis of Massive, Streaming Graphs
 
FR.pptx
FR.pptxFR.pptx
FR.pptx
 
Big Data HPC Convergence and a bunch of other things
Big Data HPC Convergence and a bunch of other thingsBig Data HPC Convergence and a bunch of other things
Big Data HPC Convergence and a bunch of other things
 
STI Summit 2011 - Visual analytics and linked data
STI Summit 2011 - Visual analytics and linked dataSTI Summit 2011 - Visual analytics and linked data
STI Summit 2011 - Visual analytics and linked data
 
Graham Pryor
Graham PryorGraham Pryor
Graham Pryor
 
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUMETHE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
 
STING: A Framework for Analyzing Spacio-Temporal Interaction Networks and Graphs
STING: A Framework for Analyzing Spacio-Temporal Interaction Networks and GraphsSTING: A Framework for Analyzing Spacio-Temporal Interaction Networks and Graphs
STING: A Framework for Analyzing Spacio-Temporal Interaction Networks and Graphs
 
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
 
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
 
Introduction to STINGER
Introduction to STINGERIntroduction to STINGER
Introduction to STINGER
 
BigDataCSEKeyNote_2012
BigDataCSEKeyNote_2012BigDataCSEKeyNote_2012
BigDataCSEKeyNote_2012
 

More from Jason Riedy

Lucata at the HPEC GraphBLAS BoF
Lucata at the HPEC GraphBLAS BoFLucata at the HPEC GraphBLAS BoF
Lucata at the HPEC GraphBLAS BoFJason Riedy
 
LAGraph 2021-10-13
LAGraph 2021-10-13LAGraph 2021-10-13
LAGraph 2021-10-13Jason Riedy
 
Lucata at the HPEC GraphBLAS BoF
Lucata at the HPEC GraphBLAS BoFLucata at the HPEC GraphBLAS BoF
Lucata at the HPEC GraphBLAS BoFJason Riedy
 
Graph analysis and novel architectures
Graph analysis and novel architecturesGraph analysis and novel architectures
Graph analysis and novel architecturesJason Riedy
 
GraphBLAS and Emus
GraphBLAS and EmusGraphBLAS and Emus
GraphBLAS and EmusJason Riedy
 
Reproducible Linear Algebra from Application to Architecture
Reproducible Linear Algebra from Application to ArchitectureReproducible Linear Algebra from Application to Architecture
Reproducible Linear Algebra from Application to ArchitectureJason Riedy
 
PEARC19: Wrangling Rogues: A Case Study on Managing Experimental Post-Moore A...
PEARC19: Wrangling Rogues: A Case Study on Managing Experimental Post-Moore A...PEARC19: Wrangling Rogues: A Case Study on Managing Experimental Post-Moore A...
PEARC19: Wrangling Rogues: A Case Study on Managing Experimental Post-Moore A...Jason Riedy
 
ICIAM 2019: Reproducible Linear Algebra from Application to Architecture
ICIAM 2019: Reproducible Linear Algebra from Application to ArchitectureICIAM 2019: Reproducible Linear Algebra from Application to Architecture
ICIAM 2019: Reproducible Linear Algebra from Application to ArchitectureJason Riedy
 
ICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph Analysis
ICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph AnalysisICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph Analysis
ICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph AnalysisJason Riedy
 
Novel Architectures for Applications in Data Science and Beyond
Novel Architectures for Applications in Data Science and BeyondNovel Architectures for Applications in Data Science and Beyond
Novel Architectures for Applications in Data Science and BeyondJason Riedy
 
Characterization of Emu Chick with Microbenchmarks
Characterization of Emu Chick with MicrobenchmarksCharacterization of Emu Chick with Microbenchmarks
Characterization of Emu Chick with MicrobenchmarksJason Riedy
 
CRNCH 2018 Summit: Rogues Gallery Update
CRNCH 2018 Summit: Rogues Gallery UpdateCRNCH 2018 Summit: Rogues Gallery Update
CRNCH 2018 Summit: Rogues Gallery UpdateJason Riedy
 
Augmented Arithmetic Operations Proposed for IEEE-754 2018
Augmented Arithmetic Operations Proposed for IEEE-754 2018Augmented Arithmetic Operations Proposed for IEEE-754 2018
Augmented Arithmetic Operations Proposed for IEEE-754 2018Jason Riedy
 
Graph Analysis: New Algorithm Models, New Architectures
Graph Analysis: New Algorithm Models, New ArchitecturesGraph Analysis: New Algorithm Models, New Architectures
Graph Analysis: New Algorithm Models, New ArchitecturesJason Riedy
 
CRNCH Rogues Gallery: A Community Core for Novel Computing Platforms
CRNCH Rogues Gallery: A Community Core for Novel Computing PlatformsCRNCH Rogues Gallery: A Community Core for Novel Computing Platforms
CRNCH Rogues Gallery: A Community Core for Novel Computing PlatformsJason Riedy
 
CRNCH Rogues Gallery: A Community Core for Novel Computing Platforms
CRNCH Rogues Gallery: A Community Core for Novel Computing PlatformsCRNCH Rogues Gallery: A Community Core for Novel Computing Platforms
CRNCH Rogues Gallery: A Community Core for Novel Computing PlatformsJason Riedy
 
A New Algorithm Model for Massive-Scale Streaming Graph Analysis
A New Algorithm Model for Massive-Scale Streaming Graph AnalysisA New Algorithm Model for Massive-Scale Streaming Graph Analysis
A New Algorithm Model for Massive-Scale Streaming Graph AnalysisJason Riedy
 
High-Performance Analysis of Streaming Graphs
High-Performance Analysis of Streaming GraphsHigh-Performance Analysis of Streaming Graphs
High-Performance Analysis of Streaming GraphsJason Riedy
 
Updating PageRank for Streaming Graphs
Updating PageRank for Streaming GraphsUpdating PageRank for Streaming Graphs
Updating PageRank for Streaming GraphsJason Riedy
 
Graph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear AlgebraGraph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear AlgebraJason Riedy
 

More from Jason Riedy (20)

Lucata at the HPEC GraphBLAS BoF
Lucata at the HPEC GraphBLAS BoFLucata at the HPEC GraphBLAS BoF
Lucata at the HPEC GraphBLAS BoF
 
LAGraph 2021-10-13
LAGraph 2021-10-13LAGraph 2021-10-13
LAGraph 2021-10-13
 
Lucata at the HPEC GraphBLAS BoF
Lucata at the HPEC GraphBLAS BoFLucata at the HPEC GraphBLAS BoF
Lucata at the HPEC GraphBLAS BoF
 
Graph analysis and novel architectures
Graph analysis and novel architecturesGraph analysis and novel architectures
Graph analysis and novel architectures
 
GraphBLAS and Emus
GraphBLAS and EmusGraphBLAS and Emus
GraphBLAS and Emus
 
Reproducible Linear Algebra from Application to Architecture
Reproducible Linear Algebra from Application to ArchitectureReproducible Linear Algebra from Application to Architecture
Reproducible Linear Algebra from Application to Architecture
 
PEARC19: Wrangling Rogues: A Case Study on Managing Experimental Post-Moore A...
PEARC19: Wrangling Rogues: A Case Study on Managing Experimental Post-Moore A...PEARC19: Wrangling Rogues: A Case Study on Managing Experimental Post-Moore A...
PEARC19: Wrangling Rogues: A Case Study on Managing Experimental Post-Moore A...
 
ICIAM 2019: Reproducible Linear Algebra from Application to Architecture
ICIAM 2019: Reproducible Linear Algebra from Application to ArchitectureICIAM 2019: Reproducible Linear Algebra from Application to Architecture
ICIAM 2019: Reproducible Linear Algebra from Application to Architecture
 
ICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph Analysis
ICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph AnalysisICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph Analysis
ICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph Analysis
 
Novel Architectures for Applications in Data Science and Beyond
Novel Architectures for Applications in Data Science and BeyondNovel Architectures for Applications in Data Science and Beyond
Novel Architectures for Applications in Data Science and Beyond
 
Characterization of Emu Chick with Microbenchmarks
Characterization of Emu Chick with MicrobenchmarksCharacterization of Emu Chick with Microbenchmarks
Characterization of Emu Chick with Microbenchmarks
 
CRNCH 2018 Summit: Rogues Gallery Update
CRNCH 2018 Summit: Rogues Gallery UpdateCRNCH 2018 Summit: Rogues Gallery Update
CRNCH 2018 Summit: Rogues Gallery Update
 
Augmented Arithmetic Operations Proposed for IEEE-754 2018
Augmented Arithmetic Operations Proposed for IEEE-754 2018Augmented Arithmetic Operations Proposed for IEEE-754 2018
Augmented Arithmetic Operations Proposed for IEEE-754 2018
 
Graph Analysis: New Algorithm Models, New Architectures
Graph Analysis: New Algorithm Models, New ArchitecturesGraph Analysis: New Algorithm Models, New Architectures
Graph Analysis: New Algorithm Models, New Architectures
 
CRNCH Rogues Gallery: A Community Core for Novel Computing Platforms
CRNCH Rogues Gallery: A Community Core for Novel Computing PlatformsCRNCH Rogues Gallery: A Community Core for Novel Computing Platforms
CRNCH Rogues Gallery: A Community Core for Novel Computing Platforms
 
CRNCH Rogues Gallery: A Community Core for Novel Computing Platforms
CRNCH Rogues Gallery: A Community Core for Novel Computing PlatformsCRNCH Rogues Gallery: A Community Core for Novel Computing Platforms
CRNCH Rogues Gallery: A Community Core for Novel Computing Platforms
 
A New Algorithm Model for Massive-Scale Streaming Graph Analysis
A New Algorithm Model for Massive-Scale Streaming Graph AnalysisA New Algorithm Model for Massive-Scale Streaming Graph Analysis
A New Algorithm Model for Massive-Scale Streaming Graph Analysis
 
High-Performance Analysis of Streaming Graphs
High-Performance Analysis of Streaming GraphsHigh-Performance Analysis of Streaming Graphs
High-Performance Analysis of Streaming Graphs
 
Updating PageRank for Streaming Graphs
Updating PageRank for Streaming GraphsUpdating PageRank for Streaming Graphs
Updating PageRank for Streaming Graphs
 
Graph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear AlgebraGraph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear Algebra
 

Recently uploaded

Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 

Recently uploaded (20)

Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 

ICASSP 2012: Analysis of Streaming Social Networks and Graphs on Multicore Architectures

  • 1. Analysis of Streaming Social Networks and Graphs on Multicore Architectures Jason Riedy, Henning Meyerhenke (KIT), David A. Bader, David Ediger, Timothy G. Mattson (Intel MPRL) 29 March, 2012
  • 2. Outline Motivation Graphs are pervasive! Why analyze data streams? Overall streaming approach Technical STING and STINGER Clustering coefficients Connected components Conclusions ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 2/26
  • 3. Exascale Data Analysis Health care Finding outbreaks, population epidemiology Social networks Advertising, searching, grouping Intelligence Decisions at scale, regulating algorithms Systems biology Understanding interactions, drug design Power grid Disruptions, conservation Simulation Discrete events, cracking meshes The data is full of semantically rich relationships. Graphs! Graphs! Graphs! ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 3/26
  • 4. These are not easy graphs. Yifan Hu’s (AT&T) visualization of the Livejournal data set ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 4/26
  • 5. But no shortage of structure... Protein interactions, Giot et al., “A Protein Interaction Map of Drosophila melanogaster”, Jason’s network via LinkedIn Labs Science 302, 1722-1736, 2003. • Globally, there rarely are good, balanced separators in the scientific computing sense. • Locally, there are clusters or communities and many levels of detail. ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 5/26
  • 6. Also no shortage of data... Existing (some out-of-date) data volumes NYSE 1.5 TB generated daily into a maintained 8 PB archive Google “Several dozen” 1PB data sets (CACM, Jan 2010) LHC 15 PB per year (avg. 21 TB daily) http://public.web.cern.ch/public/en/lhc/ Computing-en.html Wal-Mart 536 TB, 1B entries daily (2006) EBay 2 PB, traditional DB, and 6.5PB streaming, 17 trillion records, 1.5B records/day, each web click is 50-150 details. http://www.dbms2.com/2009/04/30/ ebays-two-enormous-data-warehouses/ Faceboot 845 M users... and growing. • All data is rich and semantic (graphs!) and changing. • Base data rates include items and not relationships. ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 6/26
  • 7. General approaches • High-performance static graph analysis • Develop techniques that apply to unchanging massive graphs. • Provides useful after-the-fact information, starting points. • Serves many existing applications well: market research, much bioinformatics, ... • High-performance streaming graph analysis • Focus on the dynamic changes within massive graphs. • Find trends or new information as they appear. • Serves upcoming applications: fault or threat detection, trend analysis, ... Both very important to different areas. Remaining focus is on streaming. Note: Not CS theory streaming, but analysis of streaming data. ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 7/26
  • 8. Why analyze data streams? Data transfer • 1 Gb Ethernet: 8.7TB daily at Data volumes 100%, 5-6TB daily realistic NYSE 1.5TB daily • Multi-TB storage on 10GE: 300TB LHC 41TB daily daily read, 90TB daily write Facebook Who knows? • CPU ↔ Memory: QPI,HT: 2PB/day@100% Data growth Speed growth • Facebook: > 2×/yr • Ethernet/IB/etc.: 4× in next 2 • Twitter: > 10×/yr years. Maybe. • Growing sources: • Flash storage, direct: 10× write, Bioinformatics, 4× read. Relatively huge cost. µsensors, security ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 8/26
  • 9. Overall streaming approach Protein interactions, Giot et al., “A Protein Interaction Map of Drosophila melanogaster”, Jason’s network via LinkedIn Labs Science 302, 1722-1736, 2003. Assumptions • A graph represents some real-world phenomenon. • But not necessarily exactly! • Noise comes from lost updates, partial information, ... ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 9/26
  • 10. Overall streaming approach Protein interactions, Giot et al., “A Protein Interaction Map of Drosophila melanogaster”, Jason’s network via LinkedIn Labs Science 302, 1722-1736, 2003. Assumptions • We target massive, “social network” graphs. • Small diameter, power-law degrees • Small changes in massive graphs often are unrelated. ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 10/26
  • 11. Overall streaming approach Protein interactions, Giot et al., “A Protein Interaction Map of Drosophila melanogaster”, Jason’s network via LinkedIn Labs Science 302, 1722-1736, 2003. Assumptions • The graph changes, but we don’t need a continuous view. • We can accumulate changes into batches... • But not so many that it impedes responsiveness. ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 11/26
  • 12. Difficulties for performance • What partitioning methods apply? • Geometric? Nope. • Balanced? Nope. • Is there a single, useful decomposition? Not likely. • Some partitions exist, but they don’t often help with balanced bisection or memory locality. • Performance needs new approaches, not just standard scientific Jason’s network via LinkedIn Labs computing methods. ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 12/26
  • 13. STING’s focus Control action prediction summary Source data Simulation / query Viz • STING manages queries against changing graph data. • Visualization and control often are application specific. • Ideal: Maintain many persistent graph analysis kernels. • Keep one current snapshot of the graph resident. • Let kernels maintain smaller histories. • Also (a harder goal), coordinate the kernels’ cooperation. • Gather data into a typed graph structure, STINGER. ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 13/26
  • 14. STINGER STING Extensible Representation: • Rule #1: No explicit locking. • Rely on atomic operations. • Massive graph: Scattered updates, scattered reads rarely conflict. • Time stamps: provide some thresholds for time. ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 14/26
  • 15. Initial results Prototype STING and STINGER Monitoring the following properties: 1 clustering coefficients, 2 connected components, and 3 community structure (in progress). High-level • Support high rates of change, over 10k updates per second. • Performance scales somewhat with available processing. http://www.cc.gatech.edu/stinger/ ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 15/26
  • 16. Experimental setup Unless otherwise noted Line Model Speed (GHz) Sockets Cores Nehalem X5570 2.93 2 4 Westmere E7-8870 2.40 4 10 • Westmere loaned by Intel (thank you!) • All memory: 1067MHz DDR3, installed appropriately • Implementations: OpenMP, gcc 4.6.1, Linux ≈ 3.0 kernel • Artificial graph and edge stream generated by R-MAT [Chakrabarti, Zhan, & Faloutsos]. • Scale x, edge factor f ⇒ 2x vertices, ≈ f · 2x edges. • Edge actions: 7/8th insertions, 1/8th deletions • Results over five batches of edge actions. • Caveat: No vector instructions, low-level optimizations yet. ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 16/26
  • 17. Clustering coefficients • Used to measure “small-world-ness” [Watts & Strogatz] and potential i m community structure • Larger clustering coefficient ⇒ more v inter-connected j n • Roughly the ratio of the number of actual to potential triangles • Defined in terms of triplets. • i – v – j is a closed triplet (triangle). • m – v – n is an open triplet. • Clustering coefficient: # of closed triplets / total # of triplets • Locally around v or globally for entire graph. ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 17/26
  • 18. Updating triangle counts Given Edge {u, v } to be inserted (+) or deleted (-) Approach Search for vertices adjacent to both u and v , update counts on those and u and v Three methods Brute force Intersect neighbors of u and v by iterating over each, O(du dv ) time. Sorted list Sort u’s neighbors. For each neighbor of v , check if in the sorted list. Compressed bits Summarize u’s neighbors in a bit array. Reduces check for v ’s neighbors to O(1) time each. Approximate with Bloom filters. [MTAAP10] All rely on atomic addition. ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 18/26
  • 19. Batches of 10k actions Brute force Bloom filter Sorted list 1.7e+04 3.4e+04 8.8e+04 1.2e+05 8.8e+04 1.4e+05 105.5 1.5e+04 1.3e+05 1.2e+05 5.1e+03 2.4e+04 2.2e+04 Updates per seconds, both metric and STINGER 3.9e+03 2.0e+04 1.7e+04 q q q q q q q q q 5 q q q q 10 q q qq q q q q q q q q Machine 104.5 q a 4 x E7−8870 q q a 2 x X5570 q q q q q 104 103.5 0 20 40 60 80 0 20 40 60 80 0 20 40 60 80 Threads Graph size: scale 22, edge factor 16 ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 19/26
  • 20. Different batch sizes Brute force Bloom filter Sorted list 105.5 105 q q q q q q q q q q Updates per seconds, both metric and STINGER q q q 100 4.5 q 10 q q q q q q q q q 104 q qq q 3.5 q q 10 105.5 105 q q q q q q q q Machine 1000 104.5 q q 4 x E7−8870 q q q 2 x X5570 104 q q q q q q 103.5 105.5 q q q q q q q 105 qq q q q qqq q q q q q q 10000 q 4.5 q 10 q q q q q q 104 103.5 0 20 40 60 80 0 20 40 60 80 0 20 40 60 80 Threads Graph size: scale 22, edge factor 16 ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 20/26
  • 21. Different batch sizes: Reactivity Brute force Bloom filter Sorted list 100 10−0.5 Seconds between updates, both metric and STINGER 10−1 100 q qq 10−1.5 q q q q q q 10−2 q q q q q q q q q q 10−2.5 q q q q q q q q q 10−3 q q q 100 10−0.5 q q q Machine 10−1 qq 1000 q q q q 4 x E7−8870 10−1.5 q q q q 10−2 q qq q q 2 x X5570 10−2.5 10−3 100 q q q q q 10−0.5 q q q q q q q 10−1 qq qqq 10000 q q q q q q q q q q q 10−1.5 10−2 10−2.5 10−3 0 20 40 60 80 0 20 40 60 80 0 20 40 60 80 Threads Graph size: scale 22, edge factor 16 ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 21/26
  • 22. Connected components • Maintain a mapping from vertex to component. • Global property, unlike triangle counts • In “scale free” social networks: • Often one big component, and • many tiny ones. • Edge changes often sit within components. • Remaining insertions merge components. • Deletions are more difficult... ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 22/26
  • 23. Connected components: Deleted edges The difficult case • Very few deletions matter. • Determining which matter may require a large graph search. • Re-running static component detection. • (Long history, see related work in [MTAAP11].) • Coping mechanisms: • Heuristics. • Second level of batching. ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 23/26
  • 24. Deletion heuristics Rule out effect-less deletions • Use the spanning tree by-product of static connected component algorithms. • Ignore deletions when one of the following occur: 1 The deleted edge is not in the spanning tree. 2 If the endpoints share a common neighbor∗ . 3 If the loose endpoint can reach the root∗ . • In the last two (∗), also fix the spanning tree. Rules out 99.7% of deletions. ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 24/26
  • 25. Connected components: Performance 106 2.4e+03 1.6e+04 6.4e+03 3.2e+03 2.0e+03 105 100 Updates per seconds, both metric and STINGER 104 103 106 1.7e+04 7.7e+04 1.4e+04 1.9e+04 2.0e+04 105 Machine 1000 104 a 4 x E7−8870 a 2 x X5570 3 10 106 5.8e+04 1.3e+05 1.5e+05 5.5e+04 1.1e+05 5 10 10000 104 103 12 4 6 8 12 16 24 32 40 48 56 64 72 80 Threads Graph size: scale 22, edge factor 16 ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 25/26
  • 26. Conclusions • On commercial Intel-based platforms, our STING system supports update rates of • 100 000 updates per second for clustering coefficients, and • 70 000 updates per second for component labels. • The rates are on batches small enough to respond 100 or 70 times per second, respectively. • Batching updates exposes parallelism without much delay. • Reducing graph searches (memory access) with better heuristics improves performance more with fewer paths to memory. Software freely available http://www.cc.gatech.edu/stinger/ ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 26/26
  • 27. Acknowledgment of support ICASSP 2012—Analysis of Streaming Social Networks and Graphs on Multicore Architectures—Jason Riedy 27/26