Diese Präsentation wurde erfolgreich gemeldet.
Wir verwenden Ihre LinkedIn Profilangaben und Informationen zu Ihren Aktivitäten, um Anzeigen zu personalisieren und Ihnen relevantere Inhalte anzuzeigen. Sie können Ihre Anzeigeneinstellungen jederzeit ändern.

OpenRefine for Numismatic Data Cleaning: An Introduction

122 Aufrufe

Veröffentlicht am

Introductory slides for a workshop on OpenRefine reconciliation for Nomisma.org and Online Coins of the Roman Empire (http://numismatics.org/ocre). Test CSV is available at http://numishare.blogspot.com/2018/04/openrefine-workshop-materials-for.html

Veröffentlicht in: Technologie
  • Als Erste(r) kommentieren

  • Gehören Sie zu den Ersten, denen das gefällt!

OpenRefine for Numismatic Data Cleaning: An Introduction

  1. 1. OpenRefine for Numismatic Data Cleaning: An Introduction
  2. 2. Some Links Tutorials  https://programminghistorian.org/lessons/cleaning-data-with-openrefine  http://enipedia.tudelft.nl/wiki/OpenRefine_Tutorial Blog Posts about Nomisma/OCRE Reconciliation  http://numishare.blogspot.com/2017/10/nomisma-launches-openrefine.html  http://numishare.blogspot.com/2017/11/numishare-supports-openrefine.html
  3. 3. Nomisma Reconciliation Based on Keyword search: preferred, alternate labels in all languages and definitions.
  4. 4. OCRE Reconciliation Based on the "Title", which includes the number itself and the mint or authority (or portrait, depending on volume). Optional properties include: Authority, Mint, Denomination, Portrait, Issuer, etc. These properties could be keyword or exact match (recommended for accuracy, but after reconciling these to Nomisma English preferred labels first) RIC 1 – 5: Section headings based on authority/portrait RIC I (second edition) Augustus 1A RIC II Hadrian 18a RIC II Hadrian 18b RIC II Hadrian 2A (aureus) RIC II Hadrian 2A (denarius) RIC V Gallienus (joint reign) 2 RIC V Gallienus and Salonina 1 Multiple sections in RIC 5, empresses in separate sections vs. earlier volumes 2A in RIC was split to two URIs in OCRE for denomination Lower case letters were designated subtypes, but there is no Hadrian 18 parent type
  5. 5. RIC 6 – 9: Mint RIC VIII Treveri 14 RIC VIII Lugdunum 14 Multiple numbers per ruler (both Constantine II): mint required to improve accuracy RIC 10 – Section heading, but somewhat sequential RIC X Arcadius 1 RIC X Theodosius II (East) 201 Theodosius II 201 begins after Arcadius 159. Following emperor begins at next hundred mark.

×