SlideShare a Scribd company logo
1 of 2
Safeguarding Abila: Spatio-Temporal Activity Modeling
VAST 2014 Mini Challenge 2 Award: Honorable Mention for Effective Presentation
Parang Saraf∗ Patrick Butler† Naren Ramakrishnan‡
Discovery Analytics Center
Department of Computer Science
Virginia Tech
ABSTRACT
We introduce a system for visual analysis of GPS tracking and
financial data. This system was developed in response to VAST
Mini-Challenge 2 and comprises of different interfaces for mining
spatio-temporal and financial data.
Index Terms: H.5.2 [Information Interfaces and Presentation
(e.g., HCI)]: User Interfaces—Interaction styles (e.g., commands,
menus, forms, direct manipulation)
1 INTRODUCTION AND PROBLEM OVERVIEW
The VAST 2014 Mini Challenge 2 describes a hypothetical sce-
nario where some of the employees of an imaginary organization,
GAStech have gone missing and it is speculated that an environ-
mental activist group, Protectors of Kronos (POK) is responsible
behind the disappearance. The provided dataset includes two weeks
of GPS tracking data for several company cars assigned to em-
ployees, credit & loyalty card information of employees and ESRI
shapefiles for the fictional city of Abila & country Kronos. The
challenge requires identification of suspicious activities hidden in
data and prioritization of all such activities in order to determine
any unscrupulous persons and locations that are worthy of report-
ing to law enforcement agencies.
2 SYSTEM DESIGN
We developed a web-based visual analytics system for analyzing
geo-spatial, temporal and financial transaction data. The system
provides several widgets that empower an analyst to unearth aber-
rations buried in data. Google Maps was used to visualize spa-
tial data and the Javascript-based graphical libraries d3.js [1] &
nvd3.js were used for plots.
2.1 Geo-Spatial Data Analyzer
The creators for Mini Challenge 2, quite strategically, left out point
type data in the provided ESRI shapefiles. Absence of this crucial
information makes it difficult to identify “Points of Interest (POI)”.
A POI is defined as a specific point location that someone may find
useful or interesting, e.g., shopping center, restaurant, etc. In or-
der to identify POIs by using only the provided GPS tracking data,
following methodology is implemented:
1. For each of the cars, all the geo-coordinates where the car is
stationary for more than 5 minutes are identified. These points
refer to places that users might have found useful.
2. Spatial clustering is performed on these geo-coordinates in
order to represent points in the spatial vicinity of each other as
∗e-mail: parang@cs.vt.edu
†e-mail: pabutler@vt.edu
‡e-mail: naren@cs.vt.edu
one single location. These clusters have a radius of 25 meters
and describe a POI which is frequented by users.
3. Establishment names such as xyz coffee shop are associated
with these spatially identified POIs. The step involves charac-
terization of POIs as home, work and recreational locations.
The system provides three different interfaces for characteriz-
ing POIs.
The interface for characterizing recreational POIs (see Figure 1),
utilizes credit card swipe time information along with user location
to visualize a recreational establishment on map. It is assumed that
if a user is swiping his card at a particular establishment, then he
is present there. Based on this assumption, credit card transactions
are grouped by recreational establishments. Selecting a particular
establishment displays the location of all the customers that vis-
ited that establishment in the past two weeks. The interface also
displays credit and loyalty card transaction information in tabular
format for each of the establishments.
Figure 1: Characterizing Recreational POIs
The second interface provides a more generic platform for an-
alyzing all types of POIs. It introduces two analytical views, viz.
POI distribution over time and POI frequency over time. POI dis-
tribution over time (see Figure 3) is a scatter plot that displays all
the POIs where a particular user was present over the 2 week du-
ration. This helps in identifying home and work POI for each of
the users by assuming that a user spends his nights at home and his
working hours in the office. The widget also allows comparison of
POI distribution for several users.
POI frequency over time (see Figure 4) displays the total number
of users present at a particular POI during the 24 hour window. The
plot helps in characterizing POIs as home, recreational, or work
locations. For example, if several users are present at a particular
POI during work hours (8 am till noon and 2 pm till 5 pm), then
that POI would be classified as an office building.
The third interface (see Figure 5) allows for query and visual-
ization of spatio-temporal GPS data. Using this interface, an ana-
lyst can generate selective playback of locations sequences visited
359
IEEE Symposium on Visual Analytics Science and Technology 2014
November 9-14, Paris, France
978-1-4799-6227-3/14/$31.00 ©2014 IEEE
Figure 2: User Spending Comparison
Figure 3: POI Distribution over Time Plots
Figure 4: POI Frequency over Time Plots
by employees. These sequences are generated by choosing date,
time, POIs and employees. Queries such as find all the employees
from the security department who were present at xyz restaurant
on weekdays from noon till 2 pm can be easily visualized using the
interface. Further, the POI frequency bar graph and POI distribu-
tion by cars scatter plot, which are updated for each playback time
instance, provides a mapping between cars location and POIs.
2.2 Financial Data Analyzer
A fourth interface (see Figure 2) was designed to analyze financial
transaction data that included credit and loyalty card information.
The interface provides three different ways to visualize employees’
spending patterns: a) employee vs employee spending comparison
that plots an employee’s total spending against all other employees.
The comparison can be made either at company level or department
level; b) employee spending distribution that compares total spend-
ing of an employee either across establishments or across days dur-
Figure 5: User Location Playback interface
ing the two week window; c) establishment sales distribution that
compares total sales of an establishment either across employees or
across days for the two week window.
3 RESULTS
The POI identification methodology described in Section 2 resulted
in 129 POIs that required classification as home, office or recre-
ational. During the classification process, car 9 and car 28 appeared
to have broken GPS as there was no trend associated with their lo-
cation data. These two cars accounted for 64 POI locations vis-
ited only by them and no other employee, further strengthening the
claim that these POIs are a result of corrupt data and need to be
ignored from any further analysis. Of the remaining 65 POIs, 22
were classified as home, 1 as work and 24 as recreational, thereby
leaving 18 POIs unclassified that were marked as suspicious. Em-
ployees visiting these suspicious POIs were scrutinized further for
any questionable behavior or activities. This analysis in conjunction
with aberrations identified in employees’ spending data provided
enough compelling clues necessary for reporting specific individu-
als as key to the solution to this mini challenge.
ACKNOWLEDGEMENTS
We would like to thank Ritika Dokania for her creative inputs and
feedback on the visualization, as well as for lending her voice to the
explanatory video that describes the system. This work is partially
supported by US NSF Grant CCF-0937133.
REFERENCES
[1] M. Bostock, V. Ogievetsky, and J. Heer. D3: Data-driven documents.
IEEE Trans. Visualization & Comp. Graphics (Proc. InfoVis), 2011.
360

More Related Content

Viewers also liked

Barcelona, Antonio Gaudi y sus obras
Barcelona, Antonio Gaudi y sus obrasBarcelona, Antonio Gaudi y sus obras
Barcelona, Antonio Gaudi y sus obrasCarlos Colomer
 
Cap. 02 rm - r. c. hibbeler - 7ª ed
Cap. 02   rm - r. c. hibbeler - 7ª edCap. 02   rm - r. c. hibbeler - 7ª ed
Cap. 02 rm - r. c. hibbeler - 7ª ededuardodias dias
 
Evaluación de proceso mate 5° 2016
Evaluación de proceso mate   5° 2016Evaluación de proceso mate   5° 2016
Evaluación de proceso mate 5° 2016Colegio
 
The good language teacher
The good language teacherThe good language teacher
The good language teacherFayez Habbal
 
Manual transmission final
Manual transmission finalManual transmission final
Manual transmission finalSagar Ahire
 
Reducing Concentration Uncertainty in Geological Structures by Conditioning o...
Reducing Concentration Uncertainty in Geological Structures by Conditioning o...Reducing Concentration Uncertainty in Geological Structures by Conditioning o...
Reducing Concentration Uncertainty in Geological Structures by Conditioning o...Amro Elfeki
 

Viewers also liked (9)

Barcelona, Antonio Gaudi y sus obras
Barcelona, Antonio Gaudi y sus obrasBarcelona, Antonio Gaudi y sus obras
Barcelona, Antonio Gaudi y sus obras
 
Cap. 02 rm - r. c. hibbeler - 7ª ed
Cap. 02   rm - r. c. hibbeler - 7ª edCap. 02   rm - r. c. hibbeler - 7ª ed
Cap. 02 rm - r. c. hibbeler - 7ª ed
 
Evaluación de proceso mate 5° 2016
Evaluación de proceso mate   5° 2016Evaluación de proceso mate   5° 2016
Evaluación de proceso mate 5° 2016
 
LUCY NATECHO'S CV
LUCY NATECHO'S CVLUCY NATECHO'S CV
LUCY NATECHO'S CV
 
Habitat Pop Up Shop
Habitat Pop Up ShopHabitat Pop Up Shop
Habitat Pop Up Shop
 
The good language teacher
The good language teacherThe good language teacher
The good language teacher
 
LWC15建一個有溫度的數位典藏報告人:臺北市議會圖書館 鄭有容館員
LWC15建一個有溫度的數位典藏報告人:臺北市議會圖書館 鄭有容館員LWC15建一個有溫度的數位典藏報告人:臺北市議會圖書館 鄭有容館員
LWC15建一個有溫度的數位典藏報告人:臺北市議會圖書館 鄭有容館員
 
Manual transmission final
Manual transmission finalManual transmission final
Manual transmission final
 
Reducing Concentration Uncertainty in Geological Structures by Conditioning o...
Reducing Concentration Uncertainty in Geological Structures by Conditioning o...Reducing Concentration Uncertainty in Geological Structures by Conditioning o...
Reducing Concentration Uncertainty in Geological Structures by Conditioning o...
 

Similar to Safeguarding Abila: Spatio-Temporal Activity Modeling

Safeguarding Abila through Multiple Data Perspectives
Safeguarding Abila through Multiple Data PerspectivesSafeguarding Abila through Multiple Data Perspectives
Safeguarding Abila through Multiple Data PerspectivesParang Saraf
 
IRJET- Analysis of Trajectories
IRJET- Analysis of TrajectoriesIRJET- Analysis of Trajectories
IRJET- Analysis of TrajectoriesIRJET Journal
 
Automatic Itinerary Voyage Suggestion using SoNet in Big Data
Automatic Itinerary Voyage Suggestion using SoNet in Big DataAutomatic Itinerary Voyage Suggestion using SoNet in Big Data
Automatic Itinerary Voyage Suggestion using SoNet in Big DataIRJET Journal
 
IRJET- Next Location Prediction
IRJET-  	  Next Location PredictionIRJET-  	  Next Location Prediction
IRJET- Next Location PredictionIRJET Journal
 
IRJET- Location-Based Route Recommendation System with Effective Query Keywords
IRJET- Location-Based Route Recommendation System with Effective Query KeywordsIRJET- Location-Based Route Recommendation System with Effective Query Keywords
IRJET- Location-Based Route Recommendation System with Effective Query KeywordsIRJET Journal
 
IRJET- Smart Bus for Smart City using IoT
IRJET-  	  Smart Bus for Smart City using IoTIRJET-  	  Smart Bus for Smart City using IoT
IRJET- Smart Bus for Smart City using IoTIRJET Journal
 
Personalized Route Recommendation for self-drive tourists based on V2V commun...
Personalized Route Recommendation for self-drive tourists based on V2V commun...Personalized Route Recommendation for self-drive tourists based on V2V commun...
Personalized Route Recommendation for self-drive tourists based on V2V commun...IRJET Journal
 
IRJET- Explore the World
IRJET- 	  Explore the WorldIRJET- 	  Explore the World
IRJET- Explore the WorldIRJET Journal
 
IRJET - Optimized Travel Planner
IRJET -  	  Optimized Travel PlannerIRJET -  	  Optimized Travel Planner
IRJET - Optimized Travel PlannerIRJET Journal
 
Determination and visualization of density210409
Determination and visualization of density210409 Determination and visualization of density210409
Determination and visualization of density210409 Kenji Sugihara
 
Visual Analytics: Traffic Collisions in Italy
Visual Analytics: Traffic Collisions in ItalyVisual Analytics: Traffic Collisions in Italy
Visual Analytics: Traffic Collisions in ItalyRoberto Falconi
 
IRJET-0 Bus Tracker Via GPS using Andriod Application
IRJET-0 	  Bus Tracker Via GPS using Andriod ApplicationIRJET-0 	  Bus Tracker Via GPS using Andriod Application
IRJET-0 Bus Tracker Via GPS using Andriod ApplicationIRJET Journal
 
A Review on Tourist Analyzer
A Review on Tourist AnalyzerA Review on Tourist Analyzer
A Review on Tourist AnalyzerIRJET Journal
 
Logo Detection Using AI ML
Logo Detection Using AI MLLogo Detection Using AI ML
Logo Detection Using AI MLIRJET Journal
 
User Category Based Estimation of Location Popularity using the Road GPS Traj...
User Category Based Estimation of Location Popularity using the Road GPS Traj...User Category Based Estimation of Location Popularity using the Road GPS Traj...
User Category Based Estimation of Location Popularity using the Road GPS Traj...Waqas Tariq
 
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...IOSR Journals
 
TOURIST PLACE RECOMMENDATION SYSTEM
TOURIST PLACE RECOMMENDATION SYSTEMTOURIST PLACE RECOMMENDATION SYSTEM
TOURIST PLACE RECOMMENDATION SYSTEMIJARIIT
 
Social Friend Overlying Communities Based on Social Network Context
Social Friend Overlying Communities Based on Social Network ContextSocial Friend Overlying Communities Based on Social Network Context
Social Friend Overlying Communities Based on Social Network ContextIRJET Journal
 
Virtual Contact Discovery using Facial Recognition
Virtual Contact Discovery using Facial RecognitionVirtual Contact Discovery using Facial Recognition
Virtual Contact Discovery using Facial RecognitionIRJET Journal
 

Similar to Safeguarding Abila: Spatio-Temporal Activity Modeling (20)

Safeguarding Abila through Multiple Data Perspectives
Safeguarding Abila through Multiple Data PerspectivesSafeguarding Abila through Multiple Data Perspectives
Safeguarding Abila through Multiple Data Perspectives
 
IRJET- Analysis of Trajectories
IRJET- Analysis of TrajectoriesIRJET- Analysis of Trajectories
IRJET- Analysis of Trajectories
 
Automatic Itinerary Voyage Suggestion using SoNet in Big Data
Automatic Itinerary Voyage Suggestion using SoNet in Big DataAutomatic Itinerary Voyage Suggestion using SoNet in Big Data
Automatic Itinerary Voyage Suggestion using SoNet in Big Data
 
IRJET- Next Location Prediction
IRJET-  	  Next Location PredictionIRJET-  	  Next Location Prediction
IRJET- Next Location Prediction
 
IRJET- Location-Based Route Recommendation System with Effective Query Keywords
IRJET- Location-Based Route Recommendation System with Effective Query KeywordsIRJET- Location-Based Route Recommendation System with Effective Query Keywords
IRJET- Location-Based Route Recommendation System with Effective Query Keywords
 
IRJET- Smart Bus for Smart City using IoT
IRJET-  	  Smart Bus for Smart City using IoTIRJET-  	  Smart Bus for Smart City using IoT
IRJET- Smart Bus for Smart City using IoT
 
Personalized Route Recommendation for self-drive tourists based on V2V commun...
Personalized Route Recommendation for self-drive tourists based on V2V commun...Personalized Route Recommendation for self-drive tourists based on V2V commun...
Personalized Route Recommendation for self-drive tourists based on V2V commun...
 
IRJET- Explore the World
IRJET- 	  Explore the WorldIRJET- 	  Explore the World
IRJET- Explore the World
 
IRJET - Optimized Travel Planner
IRJET -  	  Optimized Travel PlannerIRJET -  	  Optimized Travel Planner
IRJET - Optimized Travel Planner
 
Determination and visualization of density210409
Determination and visualization of density210409 Determination and visualization of density210409
Determination and visualization of density210409
 
Visual Analytics: Traffic Collisions in Italy
Visual Analytics: Traffic Collisions in ItalyVisual Analytics: Traffic Collisions in Italy
Visual Analytics: Traffic Collisions in Italy
 
IRJET-0 Bus Tracker Via GPS using Andriod Application
IRJET-0 	  Bus Tracker Via GPS using Andriod ApplicationIRJET-0 	  Bus Tracker Via GPS using Andriod Application
IRJET-0 Bus Tracker Via GPS using Andriod Application
 
A Review on Tourist Analyzer
A Review on Tourist AnalyzerA Review on Tourist Analyzer
A Review on Tourist Analyzer
 
Logo Detection Using AI ML
Logo Detection Using AI MLLogo Detection Using AI ML
Logo Detection Using AI ML
 
User Category Based Estimation of Location Popularity using the Road GPS Traj...
User Category Based Estimation of Location Popularity using the Road GPS Traj...User Category Based Estimation of Location Popularity using the Road GPS Traj...
User Category Based Estimation of Location Popularity using the Road GPS Traj...
 
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
 
TOURIST PLACE RECOMMENDATION SYSTEM
TOURIST PLACE RECOMMENDATION SYSTEMTOURIST PLACE RECOMMENDATION SYSTEM
TOURIST PLACE RECOMMENDATION SYSTEM
 
iTimer - Count On Your Time
iTimer - Count On Your TimeiTimer - Count On Your Time
iTimer - Count On Your Time
 
Social Friend Overlying Communities Based on Social Network Context
Social Friend Overlying Communities Based on Social Network ContextSocial Friend Overlying Communities Based on Social Network Context
Social Friend Overlying Communities Based on Social Network Context
 
Virtual Contact Discovery using Facial Recognition
Virtual Contact Discovery using Facial RecognitionVirtual Contact Discovery using Facial Recognition
Virtual Contact Discovery using Facial Recognition
 

More from Parang Saraf

Email and Network Analyzer
Email and Network AnalyzerEmail and Network Analyzer
Email and Network AnalyzerParang Saraf
 
Slides: Safeguarding Abila through Multiple Data Perspectives
Slides: Safeguarding Abila through Multiple Data PerspectivesSlides: Safeguarding Abila through Multiple Data Perspectives
Slides: Safeguarding Abila through Multiple Data PerspectivesParang Saraf
 
Slides: Safeguarding Abila: Real-time Streaming Analysis
Slides: Safeguarding Abila: Real-time Streaming AnalysisSlides: Safeguarding Abila: Real-time Streaming Analysis
Slides: Safeguarding Abila: Real-time Streaming AnalysisParang Saraf
 
Slides: Safeguarding Abila: Spatio-Temporal Activity Modeling
Slides: Safeguarding Abila: Spatio-Temporal Activity ModelingSlides: Safeguarding Abila: Spatio-Temporal Activity Modeling
Slides: Safeguarding Abila: Spatio-Temporal Activity ModelingParang Saraf
 
Safeguarding Abila: Discovering Evolving Activist Networks
Safeguarding Abila: Discovering Evolving Activist NetworksSafeguarding Abila: Discovering Evolving Activist Networks
Safeguarding Abila: Discovering Evolving Activist NetworksParang Saraf
 
EMBERS AutoGSR: Automated Coding of Civil Unrest Events
EMBERS AutoGSR: Automated Coding of Civil Unrest EventsEMBERS AutoGSR: Automated Coding of Civil Unrest Events
EMBERS AutoGSR: Automated Coding of Civil Unrest EventsParang Saraf
 
EMBERS at 4 years: Experiences operating an Open Source Indicators Forecastin...
EMBERS at 4 years: Experiences operating an Open Source Indicators Forecastin...EMBERS at 4 years: Experiences operating an Open Source Indicators Forecastin...
EMBERS at 4 years: Experiences operating an Open Source Indicators Forecastin...Parang Saraf
 
Slides: Forex-Foreteller: Currency Trend Modeling using News Articles
Slides: Forex-Foreteller: Currency Trend Modeling using News ArticlesSlides: Forex-Foreteller: Currency Trend Modeling using News Articles
Slides: Forex-Foreteller: Currency Trend Modeling using News ArticlesParang Saraf
 
Slides: Epidemiological Modeling of News and Rumors on Twitter
Slides: Epidemiological Modeling of News and Rumors on TwitterSlides: Epidemiological Modeling of News and Rumors on Twitter
Slides: Epidemiological Modeling of News and Rumors on TwitterParang Saraf
 
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...Parang Saraf
 
EMBERS AutoGSR: Automated Coding of Civil Unrest Events
EMBERS AutoGSR: Automated Coding of Civil Unrest EventsEMBERS AutoGSR: Automated Coding of Civil Unrest Events
EMBERS AutoGSR: Automated Coding of Civil Unrest EventsParang Saraf
 
DMAP: Data Aggregation and Presentation Framework
DMAP: Data Aggregation and Presentation FrameworkDMAP: Data Aggregation and Presentation Framework
DMAP: Data Aggregation and Presentation FrameworkParang Saraf
 
Concurrent Inference of Topic Models and Distributed Vector Representations
Concurrent Inference of Topic Models and Distributed Vector RepresentationsConcurrent Inference of Topic Models and Distributed Vector Representations
Concurrent Inference of Topic Models and Distributed Vector RepresentationsParang Saraf
 
Bayesian Model Fusion for Forecasting Civil Unrest
Bayesian Model Fusion for Forecasting Civil UnrestBayesian Model Fusion for Forecasting Civil Unrest
Bayesian Model Fusion for Forecasting Civil UnrestParang Saraf
 
‘Beating the News’ with EMBERS: Forecasting Civil Unrest using Open Source In...
‘Beating the News’ with EMBERS: Forecasting Civil Unrest using Open Source In...‘Beating the News’ with EMBERS: Forecasting Civil Unrest using Open Source In...
‘Beating the News’ with EMBERS: Forecasting Civil Unrest using Open Source In...Parang Saraf
 
Safeguarding Abila: Real-time Streaming Analysis
Safeguarding Abila: Real-time Streaming AnalysisSafeguarding Abila: Real-time Streaming Analysis
Safeguarding Abila: Real-time Streaming AnalysisParang Saraf
 
Safeguarding Abila: Discovering Evolving Activist Networks
Safeguarding Abila: Discovering Evolving Activist NetworksSafeguarding Abila: Discovering Evolving Activist Networks
Safeguarding Abila: Discovering Evolving Activist NetworksParang Saraf
 
Forex-Foreteller: Currency Trend Modeling using News Articles
Forex-Foreteller: Currency Trend Modeling using News ArticlesForex-Foreteller: Currency Trend Modeling using News Articles
Forex-Foreteller: Currency Trend Modeling using News ArticlesParang Saraf
 
Epidemiological Modeling of News and Rumors on Twitter
Epidemiological Modeling of News and Rumors on TwitterEpidemiological Modeling of News and Rumors on Twitter
Epidemiological Modeling of News and Rumors on TwitterParang Saraf
 
Merseyside Crime Analysis
Merseyside Crime AnalysisMerseyside Crime Analysis
Merseyside Crime AnalysisParang Saraf
 

More from Parang Saraf (20)

Email and Network Analyzer
Email and Network AnalyzerEmail and Network Analyzer
Email and Network Analyzer
 
Slides: Safeguarding Abila through Multiple Data Perspectives
Slides: Safeguarding Abila through Multiple Data PerspectivesSlides: Safeguarding Abila through Multiple Data Perspectives
Slides: Safeguarding Abila through Multiple Data Perspectives
 
Slides: Safeguarding Abila: Real-time Streaming Analysis
Slides: Safeguarding Abila: Real-time Streaming AnalysisSlides: Safeguarding Abila: Real-time Streaming Analysis
Slides: Safeguarding Abila: Real-time Streaming Analysis
 
Slides: Safeguarding Abila: Spatio-Temporal Activity Modeling
Slides: Safeguarding Abila: Spatio-Temporal Activity ModelingSlides: Safeguarding Abila: Spatio-Temporal Activity Modeling
Slides: Safeguarding Abila: Spatio-Temporal Activity Modeling
 
Safeguarding Abila: Discovering Evolving Activist Networks
Safeguarding Abila: Discovering Evolving Activist NetworksSafeguarding Abila: Discovering Evolving Activist Networks
Safeguarding Abila: Discovering Evolving Activist Networks
 
EMBERS AutoGSR: Automated Coding of Civil Unrest Events
EMBERS AutoGSR: Automated Coding of Civil Unrest EventsEMBERS AutoGSR: Automated Coding of Civil Unrest Events
EMBERS AutoGSR: Automated Coding of Civil Unrest Events
 
EMBERS at 4 years: Experiences operating an Open Source Indicators Forecastin...
EMBERS at 4 years: Experiences operating an Open Source Indicators Forecastin...EMBERS at 4 years: Experiences operating an Open Source Indicators Forecastin...
EMBERS at 4 years: Experiences operating an Open Source Indicators Forecastin...
 
Slides: Forex-Foreteller: Currency Trend Modeling using News Articles
Slides: Forex-Foreteller: Currency Trend Modeling using News ArticlesSlides: Forex-Foreteller: Currency Trend Modeling using News Articles
Slides: Forex-Foreteller: Currency Trend Modeling using News Articles
 
Slides: Epidemiological Modeling of News and Rumors on Twitter
Slides: Epidemiological Modeling of News and Rumors on TwitterSlides: Epidemiological Modeling of News and Rumors on Twitter
Slides: Epidemiological Modeling of News and Rumors on Twitter
 
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
 
EMBERS AutoGSR: Automated Coding of Civil Unrest Events
EMBERS AutoGSR: Automated Coding of Civil Unrest EventsEMBERS AutoGSR: Automated Coding of Civil Unrest Events
EMBERS AutoGSR: Automated Coding of Civil Unrest Events
 
DMAP: Data Aggregation and Presentation Framework
DMAP: Data Aggregation and Presentation FrameworkDMAP: Data Aggregation and Presentation Framework
DMAP: Data Aggregation and Presentation Framework
 
Concurrent Inference of Topic Models and Distributed Vector Representations
Concurrent Inference of Topic Models and Distributed Vector RepresentationsConcurrent Inference of Topic Models and Distributed Vector Representations
Concurrent Inference of Topic Models and Distributed Vector Representations
 
Bayesian Model Fusion for Forecasting Civil Unrest
Bayesian Model Fusion for Forecasting Civil UnrestBayesian Model Fusion for Forecasting Civil Unrest
Bayesian Model Fusion for Forecasting Civil Unrest
 
‘Beating the News’ with EMBERS: Forecasting Civil Unrest using Open Source In...
‘Beating the News’ with EMBERS: Forecasting Civil Unrest using Open Source In...‘Beating the News’ with EMBERS: Forecasting Civil Unrest using Open Source In...
‘Beating the News’ with EMBERS: Forecasting Civil Unrest using Open Source In...
 
Safeguarding Abila: Real-time Streaming Analysis
Safeguarding Abila: Real-time Streaming AnalysisSafeguarding Abila: Real-time Streaming Analysis
Safeguarding Abila: Real-time Streaming Analysis
 
Safeguarding Abila: Discovering Evolving Activist Networks
Safeguarding Abila: Discovering Evolving Activist NetworksSafeguarding Abila: Discovering Evolving Activist Networks
Safeguarding Abila: Discovering Evolving Activist Networks
 
Forex-Foreteller: Currency Trend Modeling using News Articles
Forex-Foreteller: Currency Trend Modeling using News ArticlesForex-Foreteller: Currency Trend Modeling using News Articles
Forex-Foreteller: Currency Trend Modeling using News Articles
 
Epidemiological Modeling of News and Rumors on Twitter
Epidemiological Modeling of News and Rumors on TwitterEpidemiological Modeling of News and Rumors on Twitter
Epidemiological Modeling of News and Rumors on Twitter
 
Merseyside Crime Analysis
Merseyside Crime AnalysisMerseyside Crime Analysis
Merseyside Crime Analysis
 

Recently uploaded

Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGILLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGIThomas Poetter
 

Recently uploaded (20)

Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGILLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
 

Safeguarding Abila: Spatio-Temporal Activity Modeling

  • 1. Safeguarding Abila: Spatio-Temporal Activity Modeling VAST 2014 Mini Challenge 2 Award: Honorable Mention for Effective Presentation Parang Saraf∗ Patrick Butler† Naren Ramakrishnan‡ Discovery Analytics Center Department of Computer Science Virginia Tech ABSTRACT We introduce a system for visual analysis of GPS tracking and financial data. This system was developed in response to VAST Mini-Challenge 2 and comprises of different interfaces for mining spatio-temporal and financial data. Index Terms: H.5.2 [Information Interfaces and Presentation (e.g., HCI)]: User Interfaces—Interaction styles (e.g., commands, menus, forms, direct manipulation) 1 INTRODUCTION AND PROBLEM OVERVIEW The VAST 2014 Mini Challenge 2 describes a hypothetical sce- nario where some of the employees of an imaginary organization, GAStech have gone missing and it is speculated that an environ- mental activist group, Protectors of Kronos (POK) is responsible behind the disappearance. The provided dataset includes two weeks of GPS tracking data for several company cars assigned to em- ployees, credit & loyalty card information of employees and ESRI shapefiles for the fictional city of Abila & country Kronos. The challenge requires identification of suspicious activities hidden in data and prioritization of all such activities in order to determine any unscrupulous persons and locations that are worthy of report- ing to law enforcement agencies. 2 SYSTEM DESIGN We developed a web-based visual analytics system for analyzing geo-spatial, temporal and financial transaction data. The system provides several widgets that empower an analyst to unearth aber- rations buried in data. Google Maps was used to visualize spa- tial data and the Javascript-based graphical libraries d3.js [1] & nvd3.js were used for plots. 2.1 Geo-Spatial Data Analyzer The creators for Mini Challenge 2, quite strategically, left out point type data in the provided ESRI shapefiles. Absence of this crucial information makes it difficult to identify “Points of Interest (POI)”. A POI is defined as a specific point location that someone may find useful or interesting, e.g., shopping center, restaurant, etc. In or- der to identify POIs by using only the provided GPS tracking data, following methodology is implemented: 1. For each of the cars, all the geo-coordinates where the car is stationary for more than 5 minutes are identified. These points refer to places that users might have found useful. 2. Spatial clustering is performed on these geo-coordinates in order to represent points in the spatial vicinity of each other as ∗e-mail: parang@cs.vt.edu †e-mail: pabutler@vt.edu ‡e-mail: naren@cs.vt.edu one single location. These clusters have a radius of 25 meters and describe a POI which is frequented by users. 3. Establishment names such as xyz coffee shop are associated with these spatially identified POIs. The step involves charac- terization of POIs as home, work and recreational locations. The system provides three different interfaces for characteriz- ing POIs. The interface for characterizing recreational POIs (see Figure 1), utilizes credit card swipe time information along with user location to visualize a recreational establishment on map. It is assumed that if a user is swiping his card at a particular establishment, then he is present there. Based on this assumption, credit card transactions are grouped by recreational establishments. Selecting a particular establishment displays the location of all the customers that vis- ited that establishment in the past two weeks. The interface also displays credit and loyalty card transaction information in tabular format for each of the establishments. Figure 1: Characterizing Recreational POIs The second interface provides a more generic platform for an- alyzing all types of POIs. It introduces two analytical views, viz. POI distribution over time and POI frequency over time. POI dis- tribution over time (see Figure 3) is a scatter plot that displays all the POIs where a particular user was present over the 2 week du- ration. This helps in identifying home and work POI for each of the users by assuming that a user spends his nights at home and his working hours in the office. The widget also allows comparison of POI distribution for several users. POI frequency over time (see Figure 4) displays the total number of users present at a particular POI during the 24 hour window. The plot helps in characterizing POIs as home, recreational, or work locations. For example, if several users are present at a particular POI during work hours (8 am till noon and 2 pm till 5 pm), then that POI would be classified as an office building. The third interface (see Figure 5) allows for query and visual- ization of spatio-temporal GPS data. Using this interface, an ana- lyst can generate selective playback of locations sequences visited 359 IEEE Symposium on Visual Analytics Science and Technology 2014 November 9-14, Paris, France 978-1-4799-6227-3/14/$31.00 ©2014 IEEE
  • 2. Figure 2: User Spending Comparison Figure 3: POI Distribution over Time Plots Figure 4: POI Frequency over Time Plots by employees. These sequences are generated by choosing date, time, POIs and employees. Queries such as find all the employees from the security department who were present at xyz restaurant on weekdays from noon till 2 pm can be easily visualized using the interface. Further, the POI frequency bar graph and POI distribu- tion by cars scatter plot, which are updated for each playback time instance, provides a mapping between cars location and POIs. 2.2 Financial Data Analyzer A fourth interface (see Figure 2) was designed to analyze financial transaction data that included credit and loyalty card information. The interface provides three different ways to visualize employees’ spending patterns: a) employee vs employee spending comparison that plots an employee’s total spending against all other employees. The comparison can be made either at company level or department level; b) employee spending distribution that compares total spend- ing of an employee either across establishments or across days dur- Figure 5: User Location Playback interface ing the two week window; c) establishment sales distribution that compares total sales of an establishment either across employees or across days for the two week window. 3 RESULTS The POI identification methodology described in Section 2 resulted in 129 POIs that required classification as home, office or recre- ational. During the classification process, car 9 and car 28 appeared to have broken GPS as there was no trend associated with their lo- cation data. These two cars accounted for 64 POI locations vis- ited only by them and no other employee, further strengthening the claim that these POIs are a result of corrupt data and need to be ignored from any further analysis. Of the remaining 65 POIs, 22 were classified as home, 1 as work and 24 as recreational, thereby leaving 18 POIs unclassified that were marked as suspicious. Em- ployees visiting these suspicious POIs were scrutinized further for any questionable behavior or activities. This analysis in conjunction with aberrations identified in employees’ spending data provided enough compelling clues necessary for reporting specific individu- als as key to the solution to this mini challenge. ACKNOWLEDGEMENTS We would like to thank Ritika Dokania for her creative inputs and feedback on the visualization, as well as for lending her voice to the explanatory video that describes the system. This work is partially supported by US NSF Grant CCF-0937133. REFERENCES [1] M. Bostock, V. Ogievetsky, and J. Heer. D3: Data-driven documents. IEEE Trans. Visualization & Comp. Graphics (Proc. InfoVis), 2011. 360