1. Pathways to SEASR
National Center for Supercomputing Applications!
University of Illinois at Urbana-Champaign
The SEASR project and its Meandre infrastructure!
are sponsored by The Andrew W. Mellon Foundation
3. Welcome
• Dr. Robert Wilhelmson,
– Division Director – CyberApplications and Communities
– National Center for Supercomputing Applications
• Dr. Chris Mackie
– Program Officer – Research in Information Technology
– The Andrew W. Mellon Foundation
5. Agenda Day 1 Morning
• Thursday, January 15, 2009
• 8:00am Registration and Breakfast
• 8:30am Introduction, Michael Welge, SEASR, NCSA
• 9:00am SEASR Overview and Workshop Goals, Michael
Welge
• 10:00am Break
• 10:30am SEASR Application Examples and
Demonstrations
– Zotero and SEASR, Xavier Llorà, Boris Capitanu
– UIMA and SEASR, Mike Haberman
– Audio Analysis and SEASR (NEMA, NESTER), J. Stephen Downie
• Noon Lunch
6. Agenda Day 1 Afternoon
• 1:00pm SEASR Architecture, Xavier Llorà and Bernie Acs
– Workshop Plan Presentation (5 min each) by J. Stephen Downie
• 2:30pm Break
• 3:00pm SEASR Tools with Hands On Demo
– Community Hub, Loretta Auvil
– Workbench, Boris Capitanu
– Server Interface, Xavier Llorà
– ZigZag, Xavier Llorà
– Eclipse Plugin, Amit Kumar
• 4:00 SEASR Adoptability and Sustainability, John Unsworth
• 4:30pm End of Day
• 6:00pm Dinner
7. Agenda Day 2 Morning
• Friday, January 16, 2009
• 8:00am Registration and Breakfast
• 8:30am SEASR Application Examples and Demonstrations (continued)
– MONK and SEASR, Stéfan Sinclair
– FutureLens and SEASR and MONK, Andrey Puretskiy
– DISCUS and SEASR, Xavier Llorà
– Visualization and SEASR, Loretta Auvil
– Workshop Plan Presentation (5 min each) by Teams
• 10:00am Break
• 10:30am SEASR Installation, Xavier Llorà and Loretta Auvil
– Workshop Plan Presentation (5 min each) by Teams
• Noon Lunch
8. Agenda Day 2 Afternoon
• 1:00pm SEASR Wrap Up and Summary, Michael Welge
and Loretta Auvil
• Workshop Plan Presentation (5 min each) by Teams
• 2:30pm Break
• 3:00pm SEASR Lab Session, Michael Welge and Loretta
Auvil Workshop Plan Presentation (5 min each) by Teams
• 4:30pm End of Day
9. Workshop Objective
• The objective of the workshop is:
• To explain and demonstrate the utility of SEASR
for digital humanities, and to bring you to a point
where you could deploy, contribute and utilize the
SEASR environment.
•
SEASR + TOOLS + EXEMPLARS + HANDS ON
10. Workshop Goals
• The goals of the workshop are:
• LEARN: Provide a detailed understanding of the SEASR framework
• LEARN: Provide a foundation and examples for participant teams to
use SEASR in a study or inquiry
• ADOPT: Share participant generated research plans to utilize SEASR
• INSTALL: Provide detailed instructions on how to install, build
components, integrate existing applications, and maintain the SEASR
environment
• SUPPORT: Develop plans for resolution of issues raised by the user
community in utilization of SEASR
• SUSTAIN: Develop a plan for community driven future development
and dissemination of SEASR
• Learn + Adopt + Sustain
11. SEASR Synergy
Technology
Scholarly Pathfinders
Roadmaps
Roadmaps Communities
Partnerships Integrated Project Team
Requirements
Analysis & Specification
SESOA
Execution Engine
University
Security
International Development & Interaction UIs
Government System Integration
Data Mining
Visualization
Community Hub
Prototype or Production
Research
Research Scholarly
Applications Discoveries
“Infrastructure is a set of practices, standards, technology and cultures that
work together to provide a foundation for digital scholarship” Digital Commonwealth 2003
12. SEASR: Reach + Relevance + Reuse + Repeatability
SEASR emphasizes flexibility, scalability, modularity,
provides community hub and access to heterogeneous
data and computational systems
– Semantic driven environment for SOA interoperability
– Encourages sharing and participation for building communities
– Modular construction allows flows to be modified and configured
to encourage reusability within and across domains
– Enables a mashup and integration of tools
– Data-intensive flows can be executed on a simple desktop or a
large cluster(s) without modification
– Computation can be created for distributed execution on servers
where the content lives
– User accessibility to control trust and compliance with required
copyright license of content
– Relies on standardized Resource Description Framework (RDF) to
define components and flow
13. SEASR Apps SEASR Plugins SEASR Web Apps SEASR Services
Meandre Data‐Intensive Flows
SEASR Components
Developer Tools
Data Analy5cs Visualiza5on
Gateway Connec5ons Descrip5ve Sta5s5cs Graphing
Data Persistence Predic5ve Modeling Modeling Vis
Data Transforma5on Discovery Info Vis (small mul5ples)
Natural Lang Processing
Component Repository Component Discovery
Meandre Infrastructure
Shared Stores File Systems Metadata Stores SOA Gateways
Virtualiza5on Infrastructure
16. Participant Project Plan
• Participant Project Plan Guide
– Research Objective
– Data Source(s)
– Transformation(s)
– Query, Descriptive, and/or Analysis
– Evaluation – Table, Visualization
– Interaction
– Outcome
17. Workbench
• Web-based UI
• Components and flows
are retrieved from server
• Additional locations of
components and flows
can be added to server
• Create flow using a
graphical drag and drop
interface
• Change property values
• Execute the flow
19. SEASR @ Work – Zotero
• Plugin to Firefox
• Zotero manages the
collection
• Launch SEASR Analytics
– Citation Analysis uses the
JUNG network importance
algorithms to rank the authors
in the citation network that is
exported as RDF data from
Zotero to SEASR
– Zotero Export to Fedora
through SEASR
– Saves results from SEASR
Analytics to a Collection
• Launch MONK Processing
– MONK DB Ingestion Workflow
20. SEASR @ Work – Fedora
Interactive Web
Application
Web Service
21. SEASR @ Work – Entity Mash-up
• Entity Extraction
with OpenNLP
• Locations
viewed on
Google Map
• Dates viewed on
Simile Timeline
22. SEASR @ Work – Audio Analysis
• NEMA: Executes a
SEASR flow for each
run
– Loads audio data
– Extracts features for
every 10 sec moving
window of audio
– Loads and applies the
models
– Sends results back to the
WebUI
• NESTER: Annotation of
Audio via Spectral
Analysis
23. SEASR @ Work – MONK
Executes flows for
each analysis
requested
– Predictive
modeling using
Naïve Bayes
– Predictive
modeling using
Support Vector
Machines (SVM)
24. SEASR @ Work – DISCUS
On-demand usage of
•
analytics while surfing
– While navigating
request analytics to be
performed on page
– Text extraction and
cleaning
Summarization and key
•
work extraction
– List the important
terms on the page
being analyzed
– Provide relevant short
summaries
Visual maps
•
– Provide a visual
representation of the
key concepts
– Show the graph of
relations between
concepts
25. SEASR and UIMA : Emotion Tracking
Goal is to have this type of Visualization to track emotions across a
text document (Leveraging flare.prefuse.org)
28. Agenda
• Thursday, January 15, 2009
• 10:00am Break
• 10:30am SEASR Application Examples and
Demonstrations
– Zotero and SEASR, Xavier Llorà, Boris Capitanu
– UIMA and SEASR, Mike Haberman
– Audio Analysis and SEASR (NEMA, NESTER), J.
Stephen Downie
• 12:00pm Lunch
29. Pathways to SEASR
National Center for Supercomputing Applications!
University of Illinois at Urbana-Champaign
The SEASR project and its Meandre infrastructure!
are sponsored by The Andrew W. Mellon Foundation