This document summarizes two interfaces presented as part of a VAST challenge solution: a News Analyzer and Email Analyzer. The News Analyzer allows exploration of news articles to identify events using search, entity recognition, and visualization tools. The Email Analyzer visualizes email networks through a co-occurrence matrix, radial graph, and word cloud to examine relationships. Both interfaces were designed specifically for the challenge and utilize techniques like search engines, named entity recognition, spectral co-clustering, and D3 visualization.
1. Safeguarding Abila through Multiple Data Perspectives
VAST 2014 Grand Challenge Award: Effective Analysis and Presentation
VAST 2014 Mini Challenge 2 Award: Honorable Mention for Effective Presentation
Parang Saraf; Patrick Butler; Naren Ramakrishnan
Discovery Analytics Center, Department of Computer Science, Virginia Tech
1
Presented By: Parang Saraf
3. VAST Challenge Solution From DAC
• The DAC solution offers three key advantages:
1. Provides an efficient front-end interface for user-centered
exploration of data
2. Very little analysis or cleaning of data is performed in the
backend, thereby helping an analyst to understand the
data better
! Example: Faulty news sources or GPS coordinates are displayed
3. Offers an intuitive interface to present data in several
different ways
• Each Interface was designed from scratch
specifically for the VAST Challenge
3
5. Mini Challenge - I
• Two Interfaces:
– News Analyzer
• Helps an Analyst
explore news articles
and Identify Events
– Email Analyzer
• Helps an Analyst
visualize and examine
Email network
5
26. Email Analyzer
26
Spectral Co-Clustering
• Given an n x m matrix of n
documents and m words, the
algorithm performs co-clustering of
documents and words.
• The clustering problem is posed in
terms of finding minimum cut vertex
partitions in a bipartite graph
between document and words
• We provide an m x m matrix where
rows and columns denote
employees and a cell denotes the
number of emails exchanged
• Implemented using the scikit-learn
package