3. Contextualization in Year 1
• Baseline
• Identification of global identifiers
• Authority and type of identity
• BBAW, SBB, NLI, UB Frankfurt, MPIWG, ÖNB:
– mostly contextualization of persons and
corporate bodies
• What can we do more?
27.11.13
9. DDC
The 1914 - 1918 Collection of the American Jewish Joint
Distribution Committee is comprised of the records of the
New York headquarters for the period from the Joint's origins
providing emergency relief through World
War I.
DC
D
27.11.13
10
12. Workflow
• Ingestion through Omnom
• Contextualization in DM2E Triplestore
• Common input vocabulary – but not really
consistent
• Saved as independent triples – no change of
original data
27.11.13
13. SILK Demo
•
•
•
•
27.11.13
Workbench to create Linkage Rules with a GUI
Transformations and Normalizations
Similarity metrics to compare values
Aggregators to combine various comparisons
14
16. Structured Data
Combination of Datatype Properties
a1
year
“1991“
a2
name
name
similarity
“C. Brodley“
“Brodley, Carla“
year
“1991“
similarity
project data
GND
27.11.13
18
19. Limitations
Needs high computing power
No on-the-fly change of linkage rules
Not well-suited for structured data
Sparse metadata: get information out of
transcriptions? Named Entity Recognition?
Know your data! Results have to be checked.
27.11.13
20. DM2E Silk Workbench
• Put behind SSO
• No user management
• Keep own sources (at least GND)
• Possibly keep contextualization job to some
power users
27.11.13