1. Discovering Historic
Social Networks
Prototype Historical Resource Demo
Brian Tingle, California Digital Library
Digital Humanities 2011
Stanford University
2011 June 22
2. Meet the target users
Personas are fictional characters created to represent the different user types within a targeted demographic, attitude and/or behavior set that might use a site, brand
or product in a similar way. http://en.wikipedia.org/wiki/Persona_(marketing)
3. Meet the target users
Personas are fictional characters created to represent the different user types within a targeted demographic, attitude and/or behavior set that might use a site, brand
or product in a similar way. http://en.wikipedia.org/wiki/Persona_(marketing)
• Randy: Graduate student working on a PhD that involves biographies and the study of diplomatic
families and networks. Sometimes he comes to the site looking for information on specific people; other
times he is looking for information on a specific subject or event. He also TAs an undergraduate history
class and sometimes has to help students find topics for papers.
4. Meet the target users
Personas are fictional characters created to represent the different user types within a targeted demographic, attitude and/or behavior set that might use a site, brand
or product in a similar way. http://en.wikipedia.org/wiki/Persona_(marketing)
• Randy: Graduate student working on a PhD that involves biographies and the study of diplomatic
families and networks. Sometimes he comes to the site looking for information on specific people; other
times he is looking for information on a specific subject or event. He also TAs an undergraduate history
class and sometimes has to help students find topics for papers.
• Connie: Works at an institution that contributed records to the project. Is going to be asking
themselves how this site would be useful to their users. Wants to understand how their records were
used and what the added value is.
5. Meet the target users
Personas are fictional characters created to represent the different user types within a targeted demographic, attitude and/or behavior set that might use a site, brand
or product in a similar way. http://en.wikipedia.org/wiki/Persona_(marketing)
• Randy: Graduate student working on a PhD that involves biographies and the study of diplomatic
families and networks. Sometimes he comes to the site looking for information on specific people; other
times he is looking for information on a specific subject or event. He also TAs an undergraduate history
class and sometimes has to help students find topics for papers.
• Connie: Works at an institution that contributed records to the project. Is going to be asking
themselves how this site would be useful to their users. Wants to understand how their records were
used and what the added value is.
• Quincy: Library School Student working to QA record matching.
6. Meet the target users
Personas are fictional characters created to represent the different user types within a targeted demographic, attitude and/or behavior set that might use a site, brand
or product in a similar way. http://en.wikipedia.org/wiki/Persona_(marketing)
• Randy: Graduate student working on a PhD that involves biographies and the study of diplomatic
families and networks. Sometimes he comes to the site looking for information on specific people; other
times he is looking for information on a specific subject or event. He also TAs an undergraduate history
class and sometimes has to help students find topics for papers.
• Connie: Works at an institution that contributed records to the project. Is going to be asking
themselves how this site would be useful to their users. Wants to understand how their records were
used and what the added value is.
• Quincy: Library School Student working to QA record matching.
• Adele: Person doing authority work during collection processing.
7. Meet the target users
Personas are fictional characters created to represent the different user types within a targeted demographic, attitude and/or behavior set that might use a site, brand
or product in a similar way. http://en.wikipedia.org/wiki/Persona_(marketing)
• Randy: Graduate student working on a PhD that involves biographies and the study of diplomatic
families and networks. Sometimes he comes to the site looking for information on specific people; other
times he is looking for information on a specific subject or event. He also TAs an undergraduate history
class and sometimes has to help students find topics for papers.
• Connie: Works at an institution that contributed records to the project. Is going to be asking
themselves how this site would be useful to their users. Wants to understand how their records were
used and what the added value is.
• Quincy: Library School Student working to QA record matching.
• Adele: Person doing authority work during collection processing.
• Lenny: Lenny likes linked data, and wants to be able to mine the links that have been established
programatically.
31. Graph Schema
vertex
_id: auto-assigned by neo4j
_type: vertex
identity: the name of the entity (string) [indexed]
urls: n seperated list of source EAD files
entityType: 'corporateBody', 'family', or 'person'
edge
_id: auto-assigned by neo4j
_type: edge
_lable: 'correspondedWith' or 'associatedWith'
_inV: incoming vertex _id (from)
_outV: outgoing vertex _id (to)
from_name: from identity (string) denormalized
to_name: to identity (string) denormalized
32. internal id
indices/name-idx is an index on
“identity”; used to look up neo4j record
id
33. “bothE” shows in and out edges
vertices/103994/bothE
redundant data to save repeated
lookups
39. Front End Stack
• golden grid
http://code.google.com/p/the-golden-grid/
• form style http://formalize.me/
• jquery and jquery ui
• hoverIntent for advanced search
• google analytics with event tracking
40. XTF XSLT Framework
• pre filter - do special tokenization to create custom
EAC facets
• https://docs.google.com/document/d/
1wP9x6sdOZTagJNQXoyJfPh0Y6UzQgqLwLI86WSlIPbk/edit?hl=en_US
• query parser - CGI params to XTF query XML
• result formatter - XTF results to HTML
• doc formatter - EAC-CPF to HTML
• http://code.google.com/p/xtf-cpf/source/browse/?
name=xtf-cpf
41. social graph visualization
• EAC to graphML
https://code.google.com/p/eac-graph-load/
• graphML file with open license should be
viewable in other tools
• current demo uses Dracula Graph Library
• Ed Summer’s “snac hacks” post