SlideShare ist ein Scribd-Unternehmen logo
1 von 31
Studying Archives of Online Behavior
Computational Qualitative Research Seminar
James Howison
University of Texas at Austin
Link to slides on twitter @jameshowison
Readings at
https://www.dropbox.com/sh/1gx9s2zlnxvumbz/AAAV9uSAJHsiPeJ
hSsNnnM9Pa?dl=0
Readings
• The presentation and discussion will draw on:
– Howison, J., & Crowston, K. (2014). Collaboration through open superposition: A theory of the
open source way. MIS Quarterly, 38(1), 29–50.
– Howison, J., Wiggins, A., & Crowston, K. (2011). Validity Issues in the Use of Social Network
Analysis with Digital Trace Data. Journal of the Association for Information Systems, 12(12),
Article 2.
– Geiger, R. S., & Ribes, D. (2011). Trace Ethnography: Following Coordination through
Documentary Practices. In Proceedings of the 44th Hawaii International Conference on System
Sciences (HICSS 2011) (pp. 1–10). Waikoloa, HI. http://doi.org/10.1109/HICSS.2011.455
– Annabi, H., Crowston, K., & Heckman, R. (2008). Depicting What Really Matters: Using
Episodes to Study Latent Phenomenon. In Proceedings of the International Conference on
Information Systems (ICIS).
– The methodological appendix for the Howison and Crowston Superposition article.
To the Archives!
The evidence is here, somewhere.
CC Credit:
http://www.flickr.com/
photos/hamadryades/
Opportunities of online archive studies
• Quantity
• Granularity
• Accessibility
– Much is openly available
– Or the organization can provide bulk access
– (compare to ethnography and getting individual
cooperation)
• Emic'ness
Emic'ness?
Emic: in their words (from the inside)
Etic: in your words (from the outside)
Naturalistic: the archives are primary to the
users and the activity themselves:
"documentary traces are the primary mechanism in
which users themselves know their distributed
communities and act within them.”
(Ribes and Geiger, 2011)
Yet, many challenges
We are using the system (and the system that
archived and presents the traces) as a data
collection method.
But the systems were not built for research.
So we need to ask, for any research question:
How well do the archives represent the activity,
as it happened?
Individual Exercise (6 mins)
1. Pick a system that renders online archives of
something you are interested in.
– Can be your project for this course or something you
choose right now.
– Slight preference for an archive showing traces from
more than 1 person
2. Go and find a specific archive page and read it.
3. Write a sentence or two about what is
happening there.
Quick Group discussion (4 mins)
• Let’s hear from a few participants about their
choices.
Individual exercise II (6 mins)
• How might archives diverge from experience?
1. How did the system record activity at the time?
2. How did the conversion to archives occur?
3. How is your experience of reading the archives
different from the experience of the participants
in the activity that was archived?
Discussion in groups
• Group discuss questions (go question by
question, not person by person)
– How recorded? (each person speak)
– How converted? …
– How is reading experience different?
Most surprising?
• One person from each group report back
aspect that was most surprising.
Archival transformation
• Deletions
– Some data is periodically purged from databases, after all
they are running a website, not a research database.
• Overlaps
– When database dumps are pulled periodically
• Re-calculations
– Historical depictions on a site (e.g., counts of messages,
members, or other data such as downloads) might be later
creations or re-calculations
– Can you rely on participants having seen those figures at
the time?
Database schemas
are not research ontologies
• Databases (or websites) often use words that are
very exciting for research
– “Friends”, “Followers”, “Assignment”, “Member”
• But their meaning may have very, very little to do
with the sociological/theoretical concept
– At best they are a hint that something interesting is
happening, but often are interpreted literally!
• Examples from Sourceforge
– use of “assigned to” field on close.
– “member list” does not show who is active (no one
was ever removed!)
Non-archived activity
PublicPrivate
Errors
Warnings
Code
Local
Binary
Application
Logs
Stack
Dumps
Variables
Stacks
Commit
Log
Annoucement
Email
Discussion
Emails
Release
Notes
Bug
report
Discussion
of Bug
Bug
Repository
Testing
Builds
from CVS
Public
Release
CVS
Check-in
Coding
Compiling
Debugging
Public
Release
Binary
Release
Source
Release
Private Public
CVS
Check-out
Local
testing
Reasoning with missing/complete data
• Trouble both ways
• Assuming that the data are complete (rather
than a system selected sample)
• Can miss important activities or whole archives that
need to be integrated.
• Oddly enough, when data are complete issues
can also emerge
– See discussion in JASIST validity in SNA paper.
Hidden readership
• Archives almost never tell you who read what,
and when they read it.
– Might be key to interpretation (or might be
irrelevant)
– Definitely crucial to any argument about
information flow (and almost all interpretations of
SNA measures are about information flow).
• You may be able to impute readership from
responses, but it’s a weak signal.
Activity traces scattered
through archives
• Participants experience a flow of activities
across different systems
– Linked by time and order that they occur
• But they are archived by different systems
– If you just read the mailing list you miss so much
– And yet so many studies *want* their archive to
be the only one (so much easier to analyze).
Release Notes Dev Email Bug Tracker RFE TrackerUser Forum
TaskOutcome
Task
Relevant
Documents
TaskOutcome
Task
Relevant
Documents
TaskOutcome
Task
Relevant
Documents
CVS
Search and assign
Relevant Documents
Pacing of activities
• Participant observation in an open source
project highlighted the role of pacing.
– Rapid replies indicated interest and importance
but also availability
– Very long gaps (sometimes years) indicated
deferral and return.
• In other work I was reading archives and
found pacing hard to appreciate; it was very
salient in participant observation but hidden
in studies relying on trace data alone.
An episode
How to represent pacing?
Time stamps
Representing pacing
• Calculate gaps?
Reading gaps doesn’t help, easy to ignore,
make them harder to ignore?
Visualize events
What is to be done?
• Sufficient engagement with the system and community
to adequately interpret the traces.
• Use a system and see how your data is archived.
• When you think a phenomena/construct can be
operationalized computationally, at least show some
narrative examples from the dataset.
• Complement archives with interviews and/or surveys
– Archives make great prompts for interviews
– Lakhani and Wolf (2003) survey immediately after a post.
• Gaskin et al (2014) “Zooming in and out of
sociomaterial routines” MISQ.
An ontology for trace data studies
• Document
– Archived content. E.g., An e-mail message, tracker comment, release note, pull-request,
log entry.
– Provides evidence for events and actions.
– One document may provide evidence for multiple events and actions.
• Event
– An event causes documents to be archived. Sending an email, releasing a
version.
• Action
– The contextualized meaning of an event. e.g., contributing code, showing
leadership (can be at quite different conceptual levels in different studies.)
• Participant
– An actor (typically a person, but could be a machine or bot)
• Identifier
– A string associated with a participant.
– Many identifiers could refer to one participant (e.g, email and username)
– but many participants may act through one identifier (e.g., “admin account”)
Episodes
• A unit of analysis, facilitating comparison and
summary (e.g., counting)
– Compare to content analysis or nlp that counts
mentions of concepts, database queries that
count documents, surveys that measure attitudes.
– The detail provided by trace data renders episodes
more accessible, research to be more granular,
closer to the work.
• Ideally emic (meaningful to and recognizable
by participants)
Ok, but how to store this?
• Moving from documents and events to actions
and outcomes is interpretative work
– I do the qualitative first, then hope to make it
computable (e.g, through machine learning)
• It is akin to content analysis but a much more
complicated ontology
– Content analysis (classic or grounded theory) assigns
Codes to Documents
– Software like Atlas ti has trouble handling coding of
structured data (dates, linked documents like threads,
multiple identifiers for single participant.).
I use RDF
• Resource Description Format
– Triples: James hasEmail james@howison.name
– URLs working natively (making viewing original archives easy)
• Retains original data structure
– e.g., Document in thread by Identifier
– Allows ad-hoc addition of structure (schemaless)
– Allows inheritance (e.g., MailingListEvent a
CommunicationEvent)
• Allows you to overlay higher level structure
– e.g., Action(s) in (ordered) Episode by Participant
– And then apply codes to Actions (storing when, who, why)
• Querying via SPARQL, Validation via RDF rules (aka SPIN)
An episode
Showing an example

Weitere ähnliche Inhalte

Was ist angesagt?

Electronic Laboratory Notebooks
Electronic Laboratory NotebooksElectronic Laboratory Notebooks
Electronic Laboratory NotebooksKristin Briney
 
Remembrance of data past
Remembrance of data pastRemembrance of data past
Remembrance of data pastAmélie Marian
 
Blogs for Information Management
Blogs for Information ManagementBlogs for Information Management
Blogs for Information ManagementChristina Pikas
 
Data Science, Data Curation, and Human-Data Interaction
Data Science, Data Curation, and Human-Data InteractionData Science, Data Curation, and Human-Data Interaction
Data Science, Data Curation, and Human-Data InteractionUniversity of Washington
 
Data management for researchers
Data management for researchersData management for researchers
Data management for researchersDirk Roorda
 
Referencing mudcd it_id
Referencing mudcd it_idReferencing mudcd it_id
Referencing mudcd it_idlibrarymudc
 
Hacking the research process final version cil 2014
Hacking the research process final version   cil 2014Hacking the research process final version   cil 2014
Hacking the research process final version cil 2014Cheryl Peltier-Davis
 
Working with data.open.ac.uk, the Linked Data Platform of the Open University
Working with data.open.ac.uk, the Linked Data Platform of the Open UniversityWorking with data.open.ac.uk, the Linked Data Platform of the Open University
Working with data.open.ac.uk, the Linked Data Platform of the Open UniversityMathieu d'Aquin
 
Social Media in Science and Altmetrics - New Ways of Measuring Research Impact
Social Media in Science and Altmetrics - New Ways of Measuring Research Impact Social Media in Science and Altmetrics - New Ways of Measuring Research Impact
Social Media in Science and Altmetrics - New Ways of Measuring Research Impact Christoph Lutz
 
Linked Open Data for Libraries
Linked Open Data for LibrariesLinked Open Data for Libraries
Linked Open Data for LibrariesLukas Koster
 
From Search to Predictions in Tagged Information Spaces
From Search to Predictions in Tagged Information SpacesFrom Search to Predictions in Tagged Information Spaces
From Search to Predictions in Tagged Information SpacesChristoph Trattner
 
Facilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic MarkupFacilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic MarkupJames Hendler
 
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the WebBeyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the WebStefan Dietze
 
Cognitive Models in Recommender Systems
Cognitive Models in Recommender SystemsCognitive Models in Recommender Systems
Cognitive Models in Recommender SystemsChristoph Trattner
 
LUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked DataLUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked DataMathieu d'Aquin
 
TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22jodischneider
 
Recommending Items in Social Tagging Systems Using Tag and Time Information
Recommending Items in Social Tagging Systems Using Tag and Time InformationRecommending Items in Social Tagging Systems Using Tag and Time Information
Recommending Items in Social Tagging Systems Using Tag and Time InformationChristoph Trattner
 
Owning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your PatronsOwning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your PatronsRobert H. McDonald
 

Was ist angesagt? (20)

Electronic Laboratory Notebooks
Electronic Laboratory NotebooksElectronic Laboratory Notebooks
Electronic Laboratory Notebooks
 
Remembrance of data past
Remembrance of data pastRemembrance of data past
Remembrance of data past
 
Blogs for Information Management
Blogs for Information ManagementBlogs for Information Management
Blogs for Information Management
 
Data Science, Data Curation, and Human-Data Interaction
Data Science, Data Curation, and Human-Data InteractionData Science, Data Curation, and Human-Data Interaction
Data Science, Data Curation, and Human-Data Interaction
 
Data management for researchers
Data management for researchersData management for researchers
Data management for researchers
 
Referencing mudcd it_id
Referencing mudcd it_idReferencing mudcd it_id
Referencing mudcd it_id
 
Hacking the research process final version cil 2014
Hacking the research process final version   cil 2014Hacking the research process final version   cil 2014
Hacking the research process final version cil 2014
 
Working with data.open.ac.uk, the Linked Data Platform of the Open University
Working with data.open.ac.uk, the Linked Data Platform of the Open UniversityWorking with data.open.ac.uk, the Linked Data Platform of the Open University
Working with data.open.ac.uk, the Linked Data Platform of the Open University
 
Social Media in Science and Altmetrics - New Ways of Measuring Research Impact
Social Media in Science and Altmetrics - New Ways of Measuring Research Impact Social Media in Science and Altmetrics - New Ways of Measuring Research Impact
Social Media in Science and Altmetrics - New Ways of Measuring Research Impact
 
NISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to RealityNISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to Reality
 
Linked Open Data for Libraries
Linked Open Data for LibrariesLinked Open Data for Libraries
Linked Open Data for Libraries
 
From Search to Predictions in Tagged Information Spaces
From Search to Predictions in Tagged Information SpacesFrom Search to Predictions in Tagged Information Spaces
From Search to Predictions in Tagged Information Spaces
 
Facilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic MarkupFacilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic Markup
 
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the WebBeyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
 
Cognitive Models in Recommender Systems
Cognitive Models in Recommender SystemsCognitive Models in Recommender Systems
Cognitive Models in Recommender Systems
 
LUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked DataLUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked Data
 
TSEM 102 Sp 2012
TSEM 102 Sp 2012TSEM 102 Sp 2012
TSEM 102 Sp 2012
 
TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22
 
Recommending Items in Social Tagging Systems Using Tag and Time Information
Recommending Items in Social Tagging Systems Using Tag and Time InformationRecommending Items in Social Tagging Systems Using Tag and Time Information
Recommending Items in Social Tagging Systems Using Tag and Time Information
 
Owning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your PatronsOwning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your Patrons
 

Andere mochten auch

Andere mochten auch (7)

Trabajo busqueda y gestion de la web
Trabajo busqueda y gestion de la webTrabajo busqueda y gestion de la web
Trabajo busqueda y gestion de la web
 
Analisi del DNA Fetale - Prenatal Safe Test a Monza
Analisi del DNA Fetale - Prenatal Safe Test a MonzaAnalisi del DNA Fetale - Prenatal Safe Test a Monza
Analisi del DNA Fetale - Prenatal Safe Test a Monza
 
La llorona
La lloronaLa llorona
La llorona
 
11 разв (43)
11 разв (43)11 разв (43)
11 разв (43)
 
2011 Calendar
2011 Calendar2011 Calendar
2011 Calendar
 
App53 quintero justicia-ambiental_final-cf
App53 quintero justicia-ambiental_final-cfApp53 quintero justicia-ambiental_final-cf
App53 quintero justicia-ambiental_final-cf
 
Long Term Planning Presentation with Kathryn McCurdy - POP Winter Conference
Long Term Planning Presentation with Kathryn McCurdy - POP Winter ConferenceLong Term Planning Presentation with Kathryn McCurdy - POP Winter Conference
Long Term Planning Presentation with Kathryn McCurdy - POP Winter Conference
 

Ähnlich wie Studying archives of online behavior

Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)TimelessFuture
 
Web-Scale Discovery: Post Implementation
Web-Scale Discovery: Post ImplementationWeb-Scale Discovery: Post Implementation
Web-Scale Discovery: Post ImplementationRachel Vacek
 
Investing in a time of desruptive change
Investing in a time of desruptive changeInvesting in a time of desruptive change
Investing in a time of desruptive changeJisc
 
Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsRebecca Grant
 
Creating an Urban Legend: A System for Electrophysiology Data Management and ...
Creating an Urban Legend: A System for Electrophysiology Data Management and ...Creating an Urban Legend: A System for Electrophysiology Data Management and ...
Creating an Urban Legend: A System for Electrophysiology Data Management and ...Anita de Waard
 
Scholarly Information Practices In The Online Environment
Scholarly Information Practices In The Online EnvironmentScholarly Information Practices In The Online Environment
Scholarly Information Practices In The Online EnvironmentOCLC Research
 
Research Data Management in the Humanities and Social Sciences
Research Data Management in the Humanities and Social SciencesResearch Data Management in the Humanities and Social Sciences
Research Data Management in the Humanities and Social SciencesCelia Emmelhainz
 
Webscale Discovery with the Enduser in Mind
Webscale Discovery with the Enduser in Mind Webscale Discovery with the Enduser in Mind
Webscale Discovery with the Enduser in Mind Debra Kolah
 
Web analytics webinar
Web analytics webinarWeb analytics webinar
Web analytics webinarJim Jansen
 
Web analytics presentation
Web analytics presentationWeb analytics presentation
Web analytics presentationJim Jansen
 
Thinking about technology .... differently
Thinking about technology .... differentlyThinking about technology .... differently
Thinking about technology .... differentlylisld
 
LIBRARY ASSESSMENT
LIBRARY ASSESSMENTLIBRARY ASSESSMENT
LIBRARY ASSESSMENTJen Rutner
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the WebRinke Hoekstra
 
Proposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkProposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkAravind Sesagiri Raamkumar
 
Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...
Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...
Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...Globus
 
Alone Together: Patterns of collaboration in free and open source software de...
Alone Together: Patterns of collaboration in free and open source software de...Alone Together: Patterns of collaboration in free and open source software de...
Alone Together: Patterns of collaboration in free and open source software de...James Howison
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Paul Groth
 
Where Have We Been & Where Are We Going?
Where Have We Been & Where Are We Going?Where Have We Been & Where Are We Going?
Where Have We Been & Where Are We Going?Philip Bourne
 
Digital Library Infrastructure for a Million Books
Digital Library Infrastructure for a Million BooksDigital Library Infrastructure for a Million Books
Digital Library Infrastructure for a Million BooksSteve Toub
 

Ähnlich wie Studying archives of online behavior (20)

Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)
 
Web-Scale Discovery: Post Implementation
Web-Scale Discovery: Post ImplementationWeb-Scale Discovery: Post Implementation
Web-Scale Discovery: Post Implementation
 
Investing in a time of desruptive change
Investing in a time of desruptive changeInvesting in a time of desruptive change
Investing in a time of desruptive change
 
Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research Methods
 
Creating an Urban Legend: A System for Electrophysiology Data Management and ...
Creating an Urban Legend: A System for Electrophysiology Data Management and ...Creating an Urban Legend: A System for Electrophysiology Data Management and ...
Creating an Urban Legend: A System for Electrophysiology Data Management and ...
 
Scholarly Information Practices In The Online Environment
Scholarly Information Practices In The Online EnvironmentScholarly Information Practices In The Online Environment
Scholarly Information Practices In The Online Environment
 
Research Data Management in the Humanities and Social Sciences
Research Data Management in the Humanities and Social SciencesResearch Data Management in the Humanities and Social Sciences
Research Data Management in the Humanities and Social Sciences
 
Webscale Discovery with the Enduser in Mind
Webscale Discovery with the Enduser in Mind Webscale Discovery with the Enduser in Mind
Webscale Discovery with the Enduser in Mind
 
Web analytics webinar
Web analytics webinarWeb analytics webinar
Web analytics webinar
 
Web analytics presentation
Web analytics presentationWeb analytics presentation
Web analytics presentation
 
Thinking about technology .... differently
Thinking about technology .... differentlyThinking about technology .... differently
Thinking about technology .... differently
 
LIBRARY ASSESSMENT
LIBRARY ASSESSMENTLIBRARY ASSESSMENT
LIBRARY ASSESSMENT
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the Web
 
Proposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkProposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender Framework
 
Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...
Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...
Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...
 
Alone Together: Patterns of collaboration in free and open source software de...
Alone Together: Patterns of collaboration in free and open source software de...Alone Together: Patterns of collaboration in free and open source software de...
Alone Together: Patterns of collaboration in free and open source software de...
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.
 
Where Have We Been & Where Are We Going?
Where Have We Been & Where Are We Going?Where Have We Been & Where Are We Going?
Where Have We Been & Where Are We Going?
 
EDS for JIBS
EDS for JIBSEDS for JIBS
EDS for JIBS
 
Digital Library Infrastructure for a Million Books
Digital Library Infrastructure for a Million BooksDigital Library Infrastructure for a Million Books
Digital Library Infrastructure for a Million Books
 

Mehr von James Howison

Sustainability in Scientific Software: Ecosystem complexity and Software Vis...
Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...James Howison
 
Scientific Software - what happens after the grant?
Scientific Software - what happens after the grant?Scientific Software - what happens after the grant?
Scientific Software - what happens after the grant?James Howison
 
How to Ask questions people want to answer
How to Ask questions people want to answerHow to Ask questions people want to answer
How to Ask questions people want to answerJames Howison
 
Software Citation and a Proposal (NSF workshop at Havard Medical School)
Software Citation and a Proposal (NSF workshop at Havard Medical School)Software Citation and a Proposal (NSF workshop at Havard Medical School)
Software Citation and a Proposal (NSF workshop at Havard Medical School)James Howison
 
Heartbeat: measuring installed base by analyzing downloads and Scientific S...
Heartbeat: measuring installed base by analyzing downloads and Scientific S...Heartbeat: measuring installed base by analyzing downloads and Scientific S...
Heartbeat: measuring installed base by analyzing downloads and Scientific S...James Howison
 
Open Superposition and lessons for scientific software development
Open Superposition and lessons for scientific software developmentOpen Superposition and lessons for scientific software development
Open Superposition and lessons for scientific software developmentJames Howison
 
Howison CAREER journey - NSF CISE CAREER Workshop 2016
Howison CAREER journey - NSF CISE CAREER Workshop 2016Howison CAREER journey - NSF CISE CAREER Workshop 2016
Howison CAREER journey - NSF CISE CAREER Workshop 2016James Howison
 
SciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsSciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsJames Howison
 
Howison rutgers-open superposition
Howison rutgers-open superpositionHowison rutgers-open superposition
Howison rutgers-open superpositionJames Howison
 
Howison i conf-transition
Howison i conf-transitionHowison i conf-transition
Howison i conf-transitionJames Howison
 
Software in the scientific literature: Problems with seeing, finding, and usi...
Software in the scientific literature: Problems with seeing, finding, and usi...Software in the scientific literature: Problems with seeing, finding, and usi...
Software in the scientific literature: Problems with seeing, finding, and usi...James Howison
 
Sharing the spoils: the trouble with reputation as a motivation and reward fo...
Sharing the spoils: the trouble with reputation as a motivation and reward fo...Sharing the spoils: the trouble with reputation as a motivation and reward fo...
Sharing the spoils: the trouble with reputation as a motivation and reward fo...James Howison
 
Scientific software sustainability and ecosystem complexity
Scientific software sustainability and ecosystem complexityScientific software sustainability and ecosystem complexity
Scientific software sustainability and ecosystem complexityJames Howison
 
Five Models for Interaction Between Science Enterprises and Organization Scie...
Five Models for Interaction Between Science Enterprises and Organization Scie...Five Models for Interaction Between Science Enterprises and Organization Scie...
Five Models for Interaction Between Science Enterprises and Organization Scie...James Howison
 

Mehr von James Howison (16)

Sustainability in Scientific Software: Ecosystem complexity and Software Vis...
Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...
 
Scientific Software - what happens after the grant?
Scientific Software - what happens after the grant?Scientific Software - what happens after the grant?
Scientific Software - what happens after the grant?
 
How to Ask questions people want to answer
How to Ask questions people want to answerHow to Ask questions people want to answer
How to Ask questions people want to answer
 
Software Citation and a Proposal (NSF workshop at Havard Medical School)
Software Citation and a Proposal (NSF workshop at Havard Medical School)Software Citation and a Proposal (NSF workshop at Havard Medical School)
Software Citation and a Proposal (NSF workshop at Havard Medical School)
 
Heartbeat: measuring installed base by analyzing downloads and Scientific S...
Heartbeat: measuring installed base by analyzing downloads and Scientific S...Heartbeat: measuring installed base by analyzing downloads and Scientific S...
Heartbeat: measuring installed base by analyzing downloads and Scientific S...
 
Open Superposition and lessons for scientific software development
Open Superposition and lessons for scientific software developmentOpen Superposition and lessons for scientific software development
Open Superposition and lessons for scientific software development
 
Howison CAREER journey - NSF CISE CAREER Workshop 2016
Howison CAREER journey - NSF CISE CAREER Workshop 2016Howison CAREER journey - NSF CISE CAREER Workshop 2016
Howison CAREER journey - NSF CISE CAREER Workshop 2016
 
SciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsSciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystems
 
Howison si2 keynote
Howison si2 keynoteHowison si2 keynote
Howison si2 keynote
 
Howison rutgers-open superposition
Howison rutgers-open superpositionHowison rutgers-open superposition
Howison rutgers-open superposition
 
Howison i conf-transition
Howison i conf-transitionHowison i conf-transition
Howison i conf-transition
 
Howison traces
Howison tracesHowison traces
Howison traces
 
Software in the scientific literature: Problems with seeing, finding, and usi...
Software in the scientific literature: Problems with seeing, finding, and usi...Software in the scientific literature: Problems with seeing, finding, and usi...
Software in the scientific literature: Problems with seeing, finding, and usi...
 
Sharing the spoils: the trouble with reputation as a motivation and reward fo...
Sharing the spoils: the trouble with reputation as a motivation and reward fo...Sharing the spoils: the trouble with reputation as a motivation and reward fo...
Sharing the spoils: the trouble with reputation as a motivation and reward fo...
 
Scientific software sustainability and ecosystem complexity
Scientific software sustainability and ecosystem complexityScientific software sustainability and ecosystem complexity
Scientific software sustainability and ecosystem complexity
 
Five Models for Interaction Between Science Enterprises and Organization Scie...
Five Models for Interaction Between Science Enterprises and Organization Scie...Five Models for Interaction Between Science Enterprises and Organization Scie...
Five Models for Interaction Between Science Enterprises and Organization Scie...
 

Kürzlich hochgeladen

%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburgmasabamasaba
 
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...WSO2
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrandmasabamasaba
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in sowetomasabamasaba
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park masabamasaba
 
Artyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxArtyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxAnnaArtyushina1
 
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyviewmasabamasaba
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024VictoriaMetrics
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...masabamasaba
 
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Bert Jan Schrijver
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisamasabamasaba
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2
 
WSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaSWSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaSWSO2
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2
 
WSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2
 
tonesoftg
tonesoftgtonesoftg
tonesoftglanshi9
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnAmarnathKambale
 

Kürzlich hochgeladen (20)

%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
 
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
Artyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxArtyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptx
 
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
 
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
WSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaSWSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaS
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go Platformless
 
WSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - Keynote
 
tonesoftg
tonesoftgtonesoftg
tonesoftg
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 

Studying archives of online behavior

  • 1. Studying Archives of Online Behavior Computational Qualitative Research Seminar James Howison University of Texas at Austin Link to slides on twitter @jameshowison Readings at https://www.dropbox.com/sh/1gx9s2zlnxvumbz/AAAV9uSAJHsiPeJ hSsNnnM9Pa?dl=0
  • 2. Readings • The presentation and discussion will draw on: – Howison, J., & Crowston, K. (2014). Collaboration through open superposition: A theory of the open source way. MIS Quarterly, 38(1), 29–50. – Howison, J., Wiggins, A., & Crowston, K. (2011). Validity Issues in the Use of Social Network Analysis with Digital Trace Data. Journal of the Association for Information Systems, 12(12), Article 2. – Geiger, R. S., & Ribes, D. (2011). Trace Ethnography: Following Coordination through Documentary Practices. In Proceedings of the 44th Hawaii International Conference on System Sciences (HICSS 2011) (pp. 1–10). Waikoloa, HI. http://doi.org/10.1109/HICSS.2011.455 – Annabi, H., Crowston, K., & Heckman, R. (2008). Depicting What Really Matters: Using Episodes to Study Latent Phenomenon. In Proceedings of the International Conference on Information Systems (ICIS). – The methodological appendix for the Howison and Crowston Superposition article.
  • 3. To the Archives! The evidence is here, somewhere. CC Credit: http://www.flickr.com/ photos/hamadryades/
  • 4. Opportunities of online archive studies • Quantity • Granularity • Accessibility – Much is openly available – Or the organization can provide bulk access – (compare to ethnography and getting individual cooperation) • Emic'ness
  • 5. Emic'ness? Emic: in their words (from the inside) Etic: in your words (from the outside) Naturalistic: the archives are primary to the users and the activity themselves: "documentary traces are the primary mechanism in which users themselves know their distributed communities and act within them.” (Ribes and Geiger, 2011)
  • 6. Yet, many challenges We are using the system (and the system that archived and presents the traces) as a data collection method. But the systems were not built for research. So we need to ask, for any research question: How well do the archives represent the activity, as it happened?
  • 7. Individual Exercise (6 mins) 1. Pick a system that renders online archives of something you are interested in. – Can be your project for this course or something you choose right now. – Slight preference for an archive showing traces from more than 1 person 2. Go and find a specific archive page and read it. 3. Write a sentence or two about what is happening there.
  • 8. Quick Group discussion (4 mins) • Let’s hear from a few participants about their choices.
  • 9. Individual exercise II (6 mins) • How might archives diverge from experience? 1. How did the system record activity at the time? 2. How did the conversion to archives occur? 3. How is your experience of reading the archives different from the experience of the participants in the activity that was archived?
  • 10. Discussion in groups • Group discuss questions (go question by question, not person by person) – How recorded? (each person speak) – How converted? … – How is reading experience different?
  • 11. Most surprising? • One person from each group report back aspect that was most surprising.
  • 12. Archival transformation • Deletions – Some data is periodically purged from databases, after all they are running a website, not a research database. • Overlaps – When database dumps are pulled periodically • Re-calculations – Historical depictions on a site (e.g., counts of messages, members, or other data such as downloads) might be later creations or re-calculations – Can you rely on participants having seen those figures at the time?
  • 13. Database schemas are not research ontologies • Databases (or websites) often use words that are very exciting for research – “Friends”, “Followers”, “Assignment”, “Member” • But their meaning may have very, very little to do with the sociological/theoretical concept – At best they are a hint that something interesting is happening, but often are interpreted literally! • Examples from Sourceforge – use of “assigned to” field on close. – “member list” does not show who is active (no one was ever removed!)
  • 15. Reasoning with missing/complete data • Trouble both ways • Assuming that the data are complete (rather than a system selected sample) • Can miss important activities or whole archives that need to be integrated. • Oddly enough, when data are complete issues can also emerge – See discussion in JASIST validity in SNA paper.
  • 16. Hidden readership • Archives almost never tell you who read what, and when they read it. – Might be key to interpretation (or might be irrelevant) – Definitely crucial to any argument about information flow (and almost all interpretations of SNA measures are about information flow). • You may be able to impute readership from responses, but it’s a weak signal.
  • 17. Activity traces scattered through archives • Participants experience a flow of activities across different systems – Linked by time and order that they occur • But they are archived by different systems – If you just read the mailing list you miss so much – And yet so many studies *want* their archive to be the only one (so much easier to analyze).
  • 18. Release Notes Dev Email Bug Tracker RFE TrackerUser Forum TaskOutcome Task Relevant Documents TaskOutcome Task Relevant Documents TaskOutcome Task Relevant Documents CVS Search and assign Relevant Documents
  • 19. Pacing of activities • Participant observation in an open source project highlighted the role of pacing. – Rapid replies indicated interest and importance but also availability – Very long gaps (sometimes years) indicated deferral and return. • In other work I was reading archives and found pacing hard to appreciate; it was very salient in participant observation but hidden in studies relying on trace data alone.
  • 21. How to represent pacing? Time stamps
  • 23. Reading gaps doesn’t help, easy to ignore, make them harder to ignore?
  • 25. What is to be done? • Sufficient engagement with the system and community to adequately interpret the traces. • Use a system and see how your data is archived. • When you think a phenomena/construct can be operationalized computationally, at least show some narrative examples from the dataset. • Complement archives with interviews and/or surveys – Archives make great prompts for interviews – Lakhani and Wolf (2003) survey immediately after a post. • Gaskin et al (2014) “Zooming in and out of sociomaterial routines” MISQ.
  • 26. An ontology for trace data studies • Document – Archived content. E.g., An e-mail message, tracker comment, release note, pull-request, log entry. – Provides evidence for events and actions. – One document may provide evidence for multiple events and actions. • Event – An event causes documents to be archived. Sending an email, releasing a version. • Action – The contextualized meaning of an event. e.g., contributing code, showing leadership (can be at quite different conceptual levels in different studies.) • Participant – An actor (typically a person, but could be a machine or bot) • Identifier – A string associated with a participant. – Many identifiers could refer to one participant (e.g, email and username) – but many participants may act through one identifier (e.g., “admin account”)
  • 27. Episodes • A unit of analysis, facilitating comparison and summary (e.g., counting) – Compare to content analysis or nlp that counts mentions of concepts, database queries that count documents, surveys that measure attitudes. – The detail provided by trace data renders episodes more accessible, research to be more granular, closer to the work. • Ideally emic (meaningful to and recognizable by participants)
  • 28. Ok, but how to store this? • Moving from documents and events to actions and outcomes is interpretative work – I do the qualitative first, then hope to make it computable (e.g, through machine learning) • It is akin to content analysis but a much more complicated ontology – Content analysis (classic or grounded theory) assigns Codes to Documents – Software like Atlas ti has trouble handling coding of structured data (dates, linked documents like threads, multiple identifiers for single participant.).
  • 29. I use RDF • Resource Description Format – Triples: James hasEmail james@howison.name – URLs working natively (making viewing original archives easy) • Retains original data structure – e.g., Document in thread by Identifier – Allows ad-hoc addition of structure (schemaless) – Allows inheritance (e.g., MailingListEvent a CommunicationEvent) • Allows you to overlay higher level structure – e.g., Action(s) in (ordered) Episode by Participant – And then apply codes to Actions (storing when, who, why) • Querying via SPARQL, Validation via RDF rules (aka SPIN)