SlideShare ist ein Scribd-Unternehmen logo
1 von 14
Possibilities of Digital
Analysis of Charter corpora
Georg Vogeler
IMC Leeds, 9.7.2009Georg Vogeler 2
Charter Corpora on the Web
 Württembergisches Urkundenbuch (http://maja.bsz-
bw.de/wubonline/
 CDLM (http://cdlm.unipv.it)
 DEEDS (http://www.utoronto.ca/deeds/)
 Monasterium.net (http://www.monasterium.net)
 Ut per litteras apostolicas …
(http://www.brepolis.net)
 Diplomatico Firenze
(http://www.archiviodistato.firenze.it/diplomatico)
IMC Leeds, 9.7.2009Georg Vogeler 3
What’s their advantage?
 Images
 Reconstructed archives
• Virtuelles Archiv Salzburg
• Archive of the Stift Ardagger
 Fast search
 => take the charter heritage as is
not as defined by organisational reasons
IMC Leeds, 9.7.2009Georg Vogeler 4
Online Corpus abolishes borders …
 between repositories
 between forms of representation
and
IMC Leeds, 9.7.2009Georg Vogeler 5
Research on set phrases
 Vernacular dating clauses
• Latin model: (Ulm 1275 März 29)
dirre dinge iſt gezivch herre Marquart von
Bleichen herre hartman von ſahſenhvſen vn
herre tecke von annenhoven. Datum · IIIIo · kl
· aprilis · anno dni · Mo · CCo · IXXVo.
• German model almost free from it:
Diz geſchach zehahberch an deme Ciſtage in der
phingeſtwochen / do von gotteſ geb̓vrte waren
zwelfhundert Sibenzig vn f̓vnf Jar
IMC Leeds, 9.7.2009Georg Vogeler 6
Dating Clauses
 13th century:
• Germany (de Boor 1975)
- South-western model:
• dis geschach do man zalte von gotes gebúrte zwelf
hundert und niun und niunzig jar.
- South-eastern model:
• ditz ist geschehen, do es waren von christes geburt
tousent zwaihundert und darnach in dem niun unde
niunzegisten jare.
IMC Leeds, 9.7.2009Georg Vogeler 7
In monasterium.net
for $u in //tenor[not(.='')]/ancestor::text[.//lang_MOM='Deutsch']
let $dat := substring($u//tenor, (string-length($u//tenor) - 200))
where number($u//date_sort) lt 14000001 and
number($u//date_sort) gt 13000000
order by $u//date_sort
return <dat><wo>{
$u/@b_name
} {
$u//issued/placeName/text()
}</wo>
<was> {
$dat
}</was></dat>
IMC Leeds, 9.7.2009Georg Vogeler 8
In monasterium.net
[Dd][aov]
([uv][ao]n|nach)
(([Gg]ot{1,2}[ei]{0,1}[sz])|
(([Cc]h|[cCkK])rist[eisz]*?)|
([uv]n{1,2}s[ei]{0,1}r[ei]{0,1}[sz]
[hH]er{1,2}[ei]{0,1}n))
([Gge]*?[PBpb][uv][oe°]{0,1}ri{0,1}[td][hte]*?)
(w[ao]^{0,1}[rzs][ien]*?)
IMC Leeds, 9.7.2009Georg Vogeler 9
In monasterium.net
IMC Leeds, 9.7.2009Georg Vogeler 10
Results
 13th century:
• 433 texts
• “zalt”-model: 24, all but 5 from the Chartularium
Sangallense
• “waren”-model: 137, all but 15 from the south-eastern
regions
 14th century:
• 8354 texts
• “zalt”-model: 2478, 964 not from St. Gallen
• “waren”-model: 350, only 13 from St. Gallen
IMC Leeds, 9.7.2009Georg Vogeler 11
Methods of Investigation
 Already in use
• Simple word selection/word count (Tock,
Brousseau, Parisse)
• Phrase statistics (Gervers/Margolin)
• Graphetic detail analysis (Fiebig)
• Hand identification by pattern analysis
(Schomaker/Burgers)
• Named entity recognition (Stoyan/Schmidt)
IMC Leeds, 9.7.2009Georg Vogeler 12
Possible Programming
 Testing/adapting existing algorithms
• Author identification tools
• Graphical variation tools
• Named Entity Recognition methods for clauses
 to find the connections between charters that
aren’t kept in the same archive/aren’t printed in
the same edition:
• e.g.: Influence of recipient on the charters
• Spread of formula, regions of legal culture
IMC Leeds, 9.7.2009Georg Vogeler 13
Early medieval diplomatics
 Add charters to the online corpora
 Add information to the online charter corpora
 Take text analytic software into consideration
 Ask your local computer scientist what he could
help you
Thank you for your attention
g.vogeler@lmu.de

Weitere ähnliche Inhalte

Ähnlich wie Possibilities of Digital Analysis of Charter corpora

Open Archives Initiative Protocol for Metadata Harvesting
Open Archives Initiative Protocol for Metadata HarvestingOpen Archives Initiative Protocol for Metadata Harvesting
Open Archives Initiative Protocol for Metadata Harvestingchessmu
 
Representing the world: How web users become web thinkers and web makers
Representing the world: How web users become web thinkers and web makersRepresenting the world: How web users become web thinkers and web makers
Representing the world: How web users become web thinkers and web makersjudell
 
Linking library and theatre data
Linking library and theatre dataLinking library and theatre data
Linking library and theatre dataLukas Koster
 
Open (linked) bibliographic data edmund chamberlain (university of cambridge)
Open (linked) bibliographic data   edmund chamberlain (university of cambridge)Open (linked) bibliographic data   edmund chamberlain (university of cambridge)
Open (linked) bibliographic data edmund chamberlain (university of cambridge)RDTF-Discovery
 
Open (linked) bibliographic data
Open (linked) bibliographic dataOpen (linked) bibliographic data
Open (linked) bibliographic dataEdmund Chamberlain
 
Hausstein data cite-dara-dasish2014
Hausstein data cite-dara-dasish2014Hausstein data cite-dara-dasish2014
Hausstein data cite-dara-dasish2014bhausstein
 
Serving Ireland's Geospatial Information as Linked Data
Serving Ireland's Geospatial Information as Linked DataServing Ireland's Geospatial Information as Linked Data
Serving Ireland's Geospatial Information as Linked DataChristophe Debruyne
 
Introduction To OpenStreetMap Fosscon2010
Introduction To OpenStreetMap Fosscon2010Introduction To OpenStreetMap Fosscon2010
Introduction To OpenStreetMap Fosscon2010rweait
 
Data Alchemy: Turn your Data into Gold
Data Alchemy: Turn your Data into GoldData Alchemy: Turn your Data into Gold
Data Alchemy: Turn your Data into GoldSøren Schaffstein
 
GeoServer for Spatio-temporal Data Handling With Examples For MetOc And Remot...
GeoServer for Spatio-temporal Data Handling With Examples For MetOc And Remot...GeoServer for Spatio-temporal Data Handling With Examples For MetOc And Remot...
GeoServer for Spatio-temporal Data Handling With Examples For MetOc And Remot...GeoSolutions
 
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...Nikolaos Konstantinou
 
GeoServer an introduction for beginners
GeoServer an introduction for beginnersGeoServer an introduction for beginners
GeoServer an introduction for beginnersGeoSolutions
 
WebRTC Tutorial by Dean Bubley of Disruptive Analysis & Tim Panton of Westhaw...
WebRTC Tutorial by Dean Bubley of Disruptive Analysis & Tim Panton of Westhaw...WebRTC Tutorial by Dean Bubley of Disruptive Analysis & Tim Panton of Westhaw...
WebRTC Tutorial by Dean Bubley of Disruptive Analysis & Tim Panton of Westhaw...Dean Bubley
 
Using islandora to build digital collections - 2016.01.29 OLA 2016
Using islandora to build digital collections - 2016.01.29 OLA 2016Using islandora to build digital collections - 2016.01.29 OLA 2016
Using islandora to build digital collections - 2016.01.29 OLA 2016KellliBee
 
iKNOW2014 - SimModel and IFC: a short introduction to the ontologies
iKNOW2014 - SimModel and IFC: a short introduction to the ontologiesiKNOW2014 - SimModel and IFC: a short introduction to the ontologies
iKNOW2014 - SimModel and IFC: a short introduction to the ontologiesPieter Pauwels
 
ADLUG 2012: Linking Linked Data
ADLUG 2012: Linking Linked DataADLUG 2012: Linking Linked Data
ADLUG 2012: Linking Linked DataAndrea Gazzarini
 
Virtual Environments for Research in Archaeology (Mark Baker)
Virtual Environments for Research in Archaeology (Mark Baker)Virtual Environments for Research in Archaeology (Mark Baker)
Virtual Environments for Research in Archaeology (Mark Baker)Onroerend Erfgoed
 
Bridging the Gap Between Print and Digital Environment
Bridging the Gap Between Print and Digital EnvironmentBridging the Gap Between Print and Digital Environment
Bridging the Gap Between Print and Digital EnvironmentAnita Riley
 
Edinburgh OldMapsOnline Workshop
Edinburgh OldMapsOnline WorkshopEdinburgh OldMapsOnline Workshop
Edinburgh OldMapsOnline WorkshopPetr Pridal
 

Ähnlich wie Possibilities of Digital Analysis of Charter corpora (20)

Open Archives Initiative Protocol for Metadata Harvesting
Open Archives Initiative Protocol for Metadata HarvestingOpen Archives Initiative Protocol for Metadata Harvesting
Open Archives Initiative Protocol for Metadata Harvesting
 
Representing the world: How web users become web thinkers and web makers
Representing the world: How web users become web thinkers and web makersRepresenting the world: How web users become web thinkers and web makers
Representing the world: How web users become web thinkers and web makers
 
Linking library and theatre data
Linking library and theatre dataLinking library and theatre data
Linking library and theatre data
 
Open (linked) bibliographic data edmund chamberlain (university of cambridge)
Open (linked) bibliographic data   edmund chamberlain (university of cambridge)Open (linked) bibliographic data   edmund chamberlain (university of cambridge)
Open (linked) bibliographic data edmund chamberlain (university of cambridge)
 
Open (linked) bibliographic data
Open (linked) bibliographic dataOpen (linked) bibliographic data
Open (linked) bibliographic data
 
Hausstein data cite-dara-dasish2014
Hausstein data cite-dara-dasish2014Hausstein data cite-dara-dasish2014
Hausstein data cite-dara-dasish2014
 
Serving Ireland's Geospatial Information as Linked Data
Serving Ireland's Geospatial Information as Linked DataServing Ireland's Geospatial Information as Linked Data
Serving Ireland's Geospatial Information as Linked Data
 
Introduction To OpenStreetMap Fosscon2010
Introduction To OpenStreetMap Fosscon2010Introduction To OpenStreetMap Fosscon2010
Introduction To OpenStreetMap Fosscon2010
 
Rani Pinchuk
Rani PinchukRani Pinchuk
Rani Pinchuk
 
Data Alchemy: Turn your Data into Gold
Data Alchemy: Turn your Data into GoldData Alchemy: Turn your Data into Gold
Data Alchemy: Turn your Data into Gold
 
GeoServer for Spatio-temporal Data Handling With Examples For MetOc And Remot...
GeoServer for Spatio-temporal Data Handling With Examples For MetOc And Remot...GeoServer for Spatio-temporal Data Handling With Examples For MetOc And Remot...
GeoServer for Spatio-temporal Data Handling With Examples For MetOc And Remot...
 
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
 
GeoServer an introduction for beginners
GeoServer an introduction for beginnersGeoServer an introduction for beginners
GeoServer an introduction for beginners
 
WebRTC Tutorial by Dean Bubley of Disruptive Analysis & Tim Panton of Westhaw...
WebRTC Tutorial by Dean Bubley of Disruptive Analysis & Tim Panton of Westhaw...WebRTC Tutorial by Dean Bubley of Disruptive Analysis & Tim Panton of Westhaw...
WebRTC Tutorial by Dean Bubley of Disruptive Analysis & Tim Panton of Westhaw...
 
Using islandora to build digital collections - 2016.01.29 OLA 2016
Using islandora to build digital collections - 2016.01.29 OLA 2016Using islandora to build digital collections - 2016.01.29 OLA 2016
Using islandora to build digital collections - 2016.01.29 OLA 2016
 
iKNOW2014 - SimModel and IFC: a short introduction to the ontologies
iKNOW2014 - SimModel and IFC: a short introduction to the ontologiesiKNOW2014 - SimModel and IFC: a short introduction to the ontologies
iKNOW2014 - SimModel and IFC: a short introduction to the ontologies
 
ADLUG 2012: Linking Linked Data
ADLUG 2012: Linking Linked DataADLUG 2012: Linking Linked Data
ADLUG 2012: Linking Linked Data
 
Virtual Environments for Research in Archaeology (Mark Baker)
Virtual Environments for Research in Archaeology (Mark Baker)Virtual Environments for Research in Archaeology (Mark Baker)
Virtual Environments for Research in Archaeology (Mark Baker)
 
Bridging the Gap Between Print and Digital Environment
Bridging the Gap Between Print and Digital EnvironmentBridging the Gap Between Print and Digital Environment
Bridging the Gap Between Print and Digital Environment
 
Edinburgh OldMapsOnline Workshop
Edinburgh OldMapsOnline WorkshopEdinburgh OldMapsOnline Workshop
Edinburgh OldMapsOnline Workshop
 

Mehr von Georg Vogeler

Standing-off Trees and Graphs : on the affordance of technologies for the edi...
Standing-off Trees and Graphs : on the affordance of technologies for the edi...Standing-off Trees and Graphs : on the affordance of technologies for the edi...
Standing-off Trees and Graphs : on the affordance of technologies for the edi...Georg Vogeler
 
Von IIIF zu IPIF? Ein Vorschlag für den Datenaustausch über Personen
Von IIIF zu IPIF? Ein Vorschlag für den Datenaustausch über PersonenVon IIIF zu IPIF? Ein Vorschlag für den Datenaustausch über Personen
Von IIIF zu IPIF? Ein Vorschlag für den Datenaustausch über PersonenGeorg Vogeler
 
Working digitally with Historical Documents
Working digitally with Historical DocumentsWorking digitally with Historical Documents
Working digitally with Historical DocumentsGeorg Vogeler
 
Digitising charter images : benefits and pitfalls
Digitising charter images : benefits and pitfallsDigitising charter images : benefits and pitfalls
Digitising charter images : benefits and pitfallsGeorg Vogeler
 
Transformationen: Zum Übergang aus langfristigen Editionsprojekten in die dig...
Transformationen:Zum Übergang aus langfristigen Editionsprojekten in die dig...Transformationen:Zum Übergang aus langfristigen Editionsprojekten in die dig...
Transformationen: Zum Übergang aus langfristigen Editionsprojekten in die dig...Georg Vogeler
 
Digital diplomatics - Defining a new scope of interpretation of historical do...
Digital diplomatics - Defining a new scope of interpretation of historical do...Digital diplomatics - Defining a new scope of interpretation of historical do...
Digital diplomatics - Defining a new scope of interpretation of historical do...Georg Vogeler
 
Vernetzung Zum Verhältnis von klassischen Formen der Archiverschließung und I...
VernetzungZum Verhältnis von klassischen Formen der Archiverschließung und I...VernetzungZum Verhältnis von klassischen Formen der Archiverschließung und I...
Vernetzung Zum Verhältnis von klassischen Formen der Archiverschließung und I...Georg Vogeler
 
Encoding Text About Things (Georg Vogeler)
Encoding Text About Things (Georg Vogeler)Encoding Text About Things (Georg Vogeler)
Encoding Text About Things (Georg Vogeler)Georg Vogeler
 
Results of “Digital Diplomatics” for the research with medieval documents
Results of “Digital Diplomatics” for the research with medieval documentsResults of “Digital Diplomatics” for the research with medieval documents
Results of “Digital Diplomatics” for the research with medieval documentsGeorg Vogeler
 
Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...
Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...
Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...Georg Vogeler
 
Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...
Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...
Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...Georg Vogeler
 
Medieval and Early Modern Accounts in the Digital Age
Medieval and Early Modern Accounts in the Digital AgeMedieval and Early Modern Accounts in the Digital Age
Medieval and Early Modern Accounts in the Digital AgeGeorg Vogeler
 
Why not edit medieval account books digitally?
Why not edit medieval account books digitally?Why not edit medieval account books digitally?
Why not edit medieval account books digitally?Georg Vogeler
 
Semantic Technologies in the Scholarly Edition of Medieval and Early Modern A...
Semantic Technologies in the Scholarly Edition of Medieval and Early Modern A...Semantic Technologies in the Scholarly Edition of Medieval and Early Modern A...
Semantic Technologies in the Scholarly Edition of Medieval and Early Modern A...Georg Vogeler
 

Mehr von Georg Vogeler (15)

Standing-off Trees and Graphs : on the affordance of technologies for the edi...
Standing-off Trees and Graphs : on the affordance of technologies for the edi...Standing-off Trees and Graphs : on the affordance of technologies for the edi...
Standing-off Trees and Graphs : on the affordance of technologies for the edi...
 
Von IIIF zu IPIF? Ein Vorschlag für den Datenaustausch über Personen
Von IIIF zu IPIF? Ein Vorschlag für den Datenaustausch über PersonenVon IIIF zu IPIF? Ein Vorschlag für den Datenaustausch über Personen
Von IIIF zu IPIF? Ein Vorschlag für den Datenaustausch über Personen
 
Working digitally with Historical Documents
Working digitally with Historical DocumentsWorking digitally with Historical Documents
Working digitally with Historical Documents
 
Digitising charter images : benefits and pitfalls
Digitising charter images : benefits and pitfallsDigitising charter images : benefits and pitfalls
Digitising charter images : benefits and pitfalls
 
Transformationen: Zum Übergang aus langfristigen Editionsprojekten in die dig...
Transformationen:Zum Übergang aus langfristigen Editionsprojekten in die dig...Transformationen:Zum Übergang aus langfristigen Editionsprojekten in die dig...
Transformationen: Zum Übergang aus langfristigen Editionsprojekten in die dig...
 
Digital diplomatics - Defining a new scope of interpretation of historical do...
Digital diplomatics - Defining a new scope of interpretation of historical do...Digital diplomatics - Defining a new scope of interpretation of historical do...
Digital diplomatics - Defining a new scope of interpretation of historical do...
 
Vernetzung Zum Verhältnis von klassischen Formen der Archiverschließung und I...
VernetzungZum Verhältnis von klassischen Formen der Archiverschließung und I...VernetzungZum Verhältnis von klassischen Formen der Archiverschließung und I...
Vernetzung Zum Verhältnis von klassischen Formen der Archiverschließung und I...
 
Encoding Text About Things (Georg Vogeler)
Encoding Text About Things (Georg Vogeler)Encoding Text About Things (Georg Vogeler)
Encoding Text About Things (Georg Vogeler)
 
Results of “Digital Diplomatics” for the research with medieval documents
Results of “Digital Diplomatics” for the research with medieval documentsResults of “Digital Diplomatics” for the research with medieval documents
Results of “Digital Diplomatics” for the research with medieval documents
 
Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...
Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...
Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...
 
Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...
Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...
Warum werden mittelalterliche und frühneuzeitliche Rechnungsbücher eigentlich...
 
Medieval and Early Modern Accounts in the Digital Age
Medieval and Early Modern Accounts in the Digital AgeMedieval and Early Modern Accounts in the Digital Age
Medieval and Early Modern Accounts in the Digital Age
 
Why not edit medieval account books digitally?
Why not edit medieval account books digitally?Why not edit medieval account books digitally?
Why not edit medieval account books digitally?
 
Semantic Technologies in the Scholarly Edition of Medieval and Early Modern A...
Semantic Technologies in the Scholarly Edition of Medieval and Early Modern A...Semantic Technologies in the Scholarly Edition of Medieval and Early Modern A...
Semantic Technologies in the Scholarly Edition of Medieval and Early Modern A...
 
Charter encoding
Charter encodingCharter encoding
Charter encoding
 

Kürzlich hochgeladen

Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxEran Akiva Sinbar
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsSérgio Sacani
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPirithiRaju
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...D. B. S. College Kanpur
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx023NiWayanAnggiSriWa
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationColumbia Weather Systems
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingNetHelix
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxmaryFF1
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 

Kürzlich hochgeladen (20)

Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptx
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive stars
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical Engineering
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
 
Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather Station
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdf
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 

Possibilities of Digital Analysis of Charter corpora

  • 1. Possibilities of Digital Analysis of Charter corpora Georg Vogeler
  • 2. IMC Leeds, 9.7.2009Georg Vogeler 2 Charter Corpora on the Web  Württembergisches Urkundenbuch (http://maja.bsz- bw.de/wubonline/  CDLM (http://cdlm.unipv.it)  DEEDS (http://www.utoronto.ca/deeds/)  Monasterium.net (http://www.monasterium.net)  Ut per litteras apostolicas … (http://www.brepolis.net)  Diplomatico Firenze (http://www.archiviodistato.firenze.it/diplomatico)
  • 3. IMC Leeds, 9.7.2009Georg Vogeler 3 What’s their advantage?  Images  Reconstructed archives • Virtuelles Archiv Salzburg • Archive of the Stift Ardagger  Fast search  => take the charter heritage as is not as defined by organisational reasons
  • 4. IMC Leeds, 9.7.2009Georg Vogeler 4 Online Corpus abolishes borders …  between repositories  between forms of representation and
  • 5. IMC Leeds, 9.7.2009Georg Vogeler 5 Research on set phrases  Vernacular dating clauses • Latin model: (Ulm 1275 März 29) dirre dinge iſt gezivch herre Marquart von Bleichen herre hartman von ſahſenhvſen vn herre tecke von annenhoven. Datum · IIIIo · kl · aprilis · anno dni · Mo · CCo · IXXVo. • German model almost free from it: Diz geſchach zehahberch an deme Ciſtage in der phingeſtwochen / do von gotteſ geb̓vrte waren zwelfhundert Sibenzig vn f̓vnf Jar
  • 6. IMC Leeds, 9.7.2009Georg Vogeler 6 Dating Clauses  13th century: • Germany (de Boor 1975) - South-western model: • dis geschach do man zalte von gotes gebúrte zwelf hundert und niun und niunzig jar. - South-eastern model: • ditz ist geschehen, do es waren von christes geburt tousent zwaihundert und darnach in dem niun unde niunzegisten jare.
  • 7. IMC Leeds, 9.7.2009Georg Vogeler 7 In monasterium.net for $u in //tenor[not(.='')]/ancestor::text[.//lang_MOM='Deutsch'] let $dat := substring($u//tenor, (string-length($u//tenor) - 200)) where number($u//date_sort) lt 14000001 and number($u//date_sort) gt 13000000 order by $u//date_sort return <dat><wo>{ $u/@b_name } { $u//issued/placeName/text() }</wo> <was> { $dat }</was></dat>
  • 8. IMC Leeds, 9.7.2009Georg Vogeler 8 In monasterium.net [Dd][aov] ([uv][ao]n|nach) (([Gg]ot{1,2}[ei]{0,1}[sz])| (([Cc]h|[cCkK])rist[eisz]*?)| ([uv]n{1,2}s[ei]{0,1}r[ei]{0,1}[sz] [hH]er{1,2}[ei]{0,1}n)) ([Gge]*?[PBpb][uv][oe°]{0,1}ri{0,1}[td][hte]*?) (w[ao]^{0,1}[rzs][ien]*?)
  • 9. IMC Leeds, 9.7.2009Georg Vogeler 9 In monasterium.net
  • 10. IMC Leeds, 9.7.2009Georg Vogeler 10 Results  13th century: • 433 texts • “zalt”-model: 24, all but 5 from the Chartularium Sangallense • “waren”-model: 137, all but 15 from the south-eastern regions  14th century: • 8354 texts • “zalt”-model: 2478, 964 not from St. Gallen • “waren”-model: 350, only 13 from St. Gallen
  • 11. IMC Leeds, 9.7.2009Georg Vogeler 11 Methods of Investigation  Already in use • Simple word selection/word count (Tock, Brousseau, Parisse) • Phrase statistics (Gervers/Margolin) • Graphetic detail analysis (Fiebig) • Hand identification by pattern analysis (Schomaker/Burgers) • Named entity recognition (Stoyan/Schmidt)
  • 12. IMC Leeds, 9.7.2009Georg Vogeler 12 Possible Programming  Testing/adapting existing algorithms • Author identification tools • Graphical variation tools • Named Entity Recognition methods for clauses  to find the connections between charters that aren’t kept in the same archive/aren’t printed in the same edition: • e.g.: Influence of recipient on the charters • Spread of formula, regions of legal culture
  • 13. IMC Leeds, 9.7.2009Georg Vogeler 13 Early medieval diplomatics  Add charters to the online corpora  Add information to the online charter corpora  Take text analytic software into consideration  Ask your local computer scientist what he could help you
  • 14. Thank you for your attention g.vogeler@lmu.de

Hinweis der Redaktion

  1. CDLM Württembergisches Urkundenbuch DEEDS Monasterium.net Ut per litteras apostolicas … Diplomatico Firenze
  2. The monasterium-project thus gives an insight into the possibilities of a Virtual European Charter Archive: The charters are just one corpus and you will find the documents in the Archives of the Archbishop of Salzburg although the Habsburgs transferred them all to Vienna, you will find all documents dealing with the dioceses of Passau that are incorporated to the capital of Austria before 1469. You will find documents concerning Bratislava, the capital of Slovakia from the times it was Capital of Hungary … Borders between forms: I explained last year with the Online Kemble for the Anglo Saxon Charters. This year I want to give an example how a corpus like monsterium.net can be used for diplomatic research – supported by the computer.
  3. Some of you might know that I had my own conference on Codicology and Palaeography in the Digital Age only a week ago. Thus I had not too much time to prepare this example: From vast variety of possible questions (Paarformeln, Bekräftigungsformenl; Angabe von Gründen für Beurkunden abhängig vom Aussteller? („Notturft“ bei Frauen), #Zustimmungsformeln und an schaden#, the relationship between vernacular formula and latin, function of the witnesses of the seal, seller taking responsibility for the correctness of ##proporty rights#; correlation between issuer, recipient and writing notary etc. etc.) I choose the dating clause: #Latin example#; There are several observations made from 13th century material: For Switzerland Peter Rück observed the introduction of the „modern“ dating style by counting days in a month from West to East, with continously #reluctance in the diocesis of Konstanz#; Helmut de Boor observed
  4. I prepared a selection of the full texts in mom-ca that are german and from the 13th century. They are from archives as indicated in this map (google maps): St. Gallen ist from the alemannic part and thus should prefer „zalt“ while the rest should prefer „waren“ And I made a search with regular expressions on it to identify the clauses in their variety
  5. I prepared a selection of the full texts in mom-ca that are german and from the 13th century. They are from archives as indicated in this map (google maps): St. Gallen ist from the alemannic part and thus should prefer „zalt“ while the rest should prefer „waren“ And I made a search with regular expressions on it to identify the clauses in their variety
  6. I prepared a selection of the full texts in mom-ca that are german and from the 13th century. They are from archives as indicated in this map (google maps): St. Gallen ist from the alemannic part and thus should prefer „zalt“ while the rest should prefer „waren“ And I made a search with regular expressions on it to identify the clauses in their variety
  7. 13th century confirmes the analysis of de Boor 14th century shows a significant change: the „zalt“-model isn‘t restricted to the alemannian region of south Germany and the waren model is much less spread than it was before. That fits into de Boors general observation that the zalt-model is more modern and is spreading already in the 13th century from west to east. If I wouldn‘t be occupied by research on the use of the documents of Frederic II at the moment, I would very much be inclined to continue this research. But I have to be careful: There are lot‘s of other techniques to be applied to digital charter copora:
  8. Let me give you some examples
  9. Author Idenfitication: Leeds 2008: problem of short formalistic texts: difficult to identy in general, thus of great interest for the computer linguists. Graphical Variation: edit-distance, developing soundex NER: Hidden-Markov-Model: training
  10. What could be the result of that for the early medieval diplomatists? You traditionally don’t deal with large corpora. But you could consider that: The CDLM provides a huge amount of data – and I haven’t read any study using the corpus. Unfortunately the ARTEM-Databases aren’t online, but I would so much interested to see research done with it. The online accessible corpora can be improved: Add charters to the online corpora By retro digitization and By digital edition Add information to the online charter corpora Online Editor of www.mom-ca.uni-koeln.de: there are at the moment 636 charters from before 1150, 171 of them without fulltexts. mom-ca provides the possibility to add text online, simply by registering yourself on the site. Why not enhancing the corpus? Take text analytic software into consideration Whereever your material comes from: take into consideration that there are already text analytic tools that could be useful for you. And if you imagine a tool but don’t find it or don’t know how to use it: Ask your local computer scientist what he could help you: and don’t be frustrated if he doesn’t understand you – there are lots of computer scientists supporting the work of historians!