SlideShare ist ein Scribd-Unternehmen logo
1 von 22
Downloaden Sie, um offline zu lesen
Roman du Mont Saint Michel
A medieval novel in three books.
Case Study Shared Canvas by University of
Caen, the British Library and Biblissima
Roman du Mont Saint Michel
Source:
Two manuscripts
- BL Additional 10289 (date 1275-1300)
- BL Additional 26876 (date 1340)
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
Roman du Mont Saint Michel
Editions
Print edition:
Guillaume de Saint-Pair, Le roman du Mont Saint-Michel, Les manuscrits
du Mont Saint-Michel : textes fondateurs II, C. Bougy (éd.), 2009.
Access point electronic edition:
http://www.unicaen.fr/puc/sources/gsp/index.php?page=sommaire
Online catalogue BL:
http://www.bl.uk/manuscripts/FullDisplay.aspx?ref=Add_MS_10289
http://www.bl.uk/catalogues/illuminatedmanuscripts/record.asp?
MSID=19373&CollID=27&NStart=26876
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
Roman du Mont Saint Michel
Electronic edition in TEI P5 compiling the
two textual witnesses (“A” and “B”)
Translation into modern french in TEI P5
No use of <surface> and <zone> (in 2006),
digitisation realised end of 2013 by BL
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
Electronic Editions in TEI-P5
- Chapter 11 of the TEI-Guidelines:
Representation of the Primary Source
- 11.1 Digital Facsimile
- 11.2.2 Embedded Transcription
=> use of <sourceDoc> in the case where such images
are complemented by a documentary transcription
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
Electronic Editions in TEI
facsimile contains a representation of some written source in the form of a set of images rather
than as transcribed or encoded text.
sourceDoc contains a transcription or other representation of a single source document
potentially forming part of a dossier génétique or collection of sources.
surface defines a written surface as a two-dimensional coordinate space, optionally grouping
one or more graphic representations of that space, zones of interest within that space, and
transcriptions of the writing within them.
surfaceGrp defines any kind of useful grouping of written surfaces, for example the recto and
verso of a single leaf, which the encoder wishes to treat as a single unit.
zone defines any two-dimensional area within a surface element.
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
Technical details MIRADOR / TEI
MIRADOR displays TEI-transcriptions line by
line transformed to JSON-LD
JSON-LD (= RDF Serialization)
Technical details MIRADOR / TEI
http://demos.biblissima-condorcet.fr/mirador/
1. Go to “BL Add MS 10289”
2. Click on the icon i(nfo)
3. http://sanddragon.bl.uk/IIIFMetadataService/add_ms_10289.json
JSON-LD for a Canvas
{"@id": "http://sanddragon.bl.uk/IIIFMetadataService/canvas/folio-8v.json",
"@type": "sc:Canvas",
"label": "f 8v",
"height": 8552,
"width": 6049,
"images": [
{"@id": "http://sanddragon.bl.uk/IIIFMetadataService/annotation/a-folio-8v.json",
"@type": "oa:Annotation",
"motivation": "sc:painting",
"resource": {
"@id": "http://sanddragon.bl.uk/IIIFImageService/add_ms_10289_f008v/full/full/0/native.jpg",
"@type": "dctypes:Image",
"format": "image/jpeg",
"height": 8552,
"width": 6049,
"service": {"@id": "http://sanddragon.bl.uk/IIIFImageService/add_ms_10289_f008v",
"profile": "http://library.stanford.edu/iiif/image-api/1.1/compliance.html#level2}
},
"on": "http://sanddragon.bl.uk/IIIFMetadataService/canvas/folio-8v.json"}
],
"otherContent" : [{"@id" : "http://demos.biblissima-condorcet.fr/iiif/metadata/BL_Add_10289/list/transscript_8v.json",
"@type": "sc:AnnotationList"
} ]
}
TEI 2 JSON-LD for transcriptions
For each <pb> (= for each canvas) we
create an AnnotationList.
That list contains several resources, here
transcriptions per line.
Technical details MIRADOR / TEI
{
"@context": "http://www.shared-canvas.org/ns/context.json",
"@id": "http://demos.biblissima-condorcet.fr/iiif/metadata/BL_Add_10289/list/transscript_8v.json",
"@type": "sc:AnnotationList",
"resources":[
{
"@id":"http://demos.biblissima-condorcet.fr/iiif/metadata/BL_Add_10289/Annotation/8v_line449.json",
"@type":"oa:Annotation",
"motivation":"sc:painting",
"resource":{
"@type":"cnt:ContentAsText",
"chars":"De la forest a feit areine",
"format":"text/plain",
"language":"fr-FR"
},
"on":"http://sanddragon.bl.uk/IIIFMetadataService/canvas/folio-8v.json#xywh=1950,400,3500,220"
}
,
TEI-P5 <surface> and <zone>
TEI-P5:
@ulx, @uly, @lrx, @lry
(upper left x, upper left y,
lower right x, lower right y)
(0,0)
(6049,8552)
ulx
uly
lrx
lry
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
TEI-P5 versus Shared Canvas
Shared Canvas : x, y, w, h
(x, y, width, height)
(0,0)
x
y
w
8552
6049
h
Adding #xywh to resources (JSON-LD)
Approximate semiautomatic approach to
surfaces and zones for transcripts
+ easy to use
- not exact, works only for very regular
manuscripts
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
Adding #xywh to resources (JSON-LD)
Use any image manipulation
program, e.g. Gimp to
determine the average location
of the text on the images.
In the image manipulation
program the cursor’s position is usually
displayed below the image.
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
Adding #xywh to resources (JSON-LD)
Differenciate between recto and verso pages
…
<xsl:variable name="lx"><xsl:choose>
<xsl:when test="contains($pg,'r')">
<xsl:value-of select="$r_x"/></xsl:when>
<xsl:otherwise><xsl:value-of select="$v_x"/></xsl:otherwise>
</xsl:choose>
</xsl:variable>
…
Compute the average hight of a line
line_height = text_height / number_of_lines
Line n starts at
y = text_y + (n - 1) * line_height
→ <xsl:text>"on":"http://sanddragon.bl.uk/IIIFMetadataService/canvas/folio-</xsl:text><xsl:value-of select="$pg"/>
<xsl:text>.json#xywh=</xsl:text><xsl:value-of select="($lx)"/><xsl:text>,</xsl:text><xsl:value-of select="($y)"/><xsl:text>,
</xsl:text><xsl:value-of select="($text_width)"/><xsl:text>,</xsl:text><xsl:value-of select="($line_height)"/>
<xsl:text>"</xsl:text>
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
Adding <zones> and <surfaces> in TEI
Parallel approach to transscript -
<facsimile>
referencing via @xml:id and #facs
+ high level of detail possible
+ text can be encoded with more possibilities
+ text and image data can be seperated
- complexity is difficult for transformation and display
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
Technical details TEI
...
… <facsimile>
...
<surface ulx="0" uly="0" lrx="6049" lry="8552">
<zone xml:id=”f8v”
ulx="0" uly="0" lrx="6049" lry="8552">
<graphic url="http://sanddragon.bl.uk/IIIFImageService/add_ms_10289_f008v/full/full/
0/native.jpg"/>
</zone>
<zone xml:id=”vers449”
ulx="2023" uly="540" lrx="5363" lry="736"/>
<zone xml:id=”vers450”
ulx="2023" uly="737" lrx="5363" lry="944"/>
…
</facsimile> ...
...<div><div><lg> …
<l n="449" aid:pstyle="txt_Original_Vers" xml:id="vers449" facs=”#vers449”>
<pb ed="A" n="8v" xml:id=”f8v” facs=”#f8v”/>De la forest a feit areine</l>
<l n="450" aid:pstyle="txt_Original_Vers" xml:id="vers450" facs=”#vers450”>
Entor le mont, et bele et pleine<note type="marginal" xml:id="AFRftn207">
areigne.</note>.</l> …
</div></div></lg>...
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
Technical details MIRADOR / TEI
XSL code “ulx, uly, lrx, lry” (TEI - facsimile)
to “x, y, w, h” (Shared Canvas)
...
<xsl:for-each-group select="/TEI/text/body//lg/l" group-starting-with="/TEI/text/body//lg/l[pb]">
…
<xsl:for-each select="current-group()">
...
<xsl:variable name="id"><xsl:value-of select="substring-after(@facs,’#’)"/></xsl:variable>
<!-- width = lower right x - upper left x →
<xsl:variable name="width"><xsl:value-of select="/TEI/facsimile/surface/zone[@xml:id=$id]/@lrx -
/TEI/facsimile/surface/zone[@xml:id=$id]/@ulx"/></xsl:variable>
<!-- height = lower right y - upper left y -->
<xsl:variable name="height"><xsl:value-of select="/TEI/facsimile/surface/zone[@xml:id=$id]/@lry -
/TEI/facsimile/surface/zone[@xml:id=$id]/@uly"/></xsl:variable>
<xsl:text>"on":"http://sanddragon.bl.uk/IIIFMetadataService/canvas/folio-</xsl:text><xsl:value-of
select="$pg"/><xsl:text>.json#xywh=</xsl:text><xsl:value-of select="@lrx"/><xsl:text>,</xsl:text><xsl:value-of select="@lry"/
><xsl:text>,</xsl:text><xsl:value-of select="($width)"/><xsl:text>,</xsl:text><xsl:value-of select="($height)"/><xsl:text>"</xsl:text>
… </xsl:for-each>
…</xsl:for-each-group> ...
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
Technical details MIRADOR / TEI
Embedded approach to transscript -
<sourceDoc>
+ direct approach
+ clearity due to the limitation to transscripts only
+ simpler for XSLT transforms
- mixing of text and image data
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
Technical details TEI
… <sourceDoc>
...
<surface ulx="0" uly="0" lrx="6049" lry="8552">
<zone xml:id=”f8v”ulx="0" uly="0" lrx="6049" lry="8552">
<graphic url="http://sanddragon.bl.uk/IIIFImageService/add_ms_10289_f008v/full/full/0/native.jpg"/>
</zone>
<zone ulx="2023" uly="540" lrx="5363" lry="736 »>
<line n="449" aid:pstyle="txt_Original_Vers" xml:id="vers449" >
<pb ed="A" n="8v" xml:id=”f8v” facs=”#f8v”/>De la forest a feit areine</line>
</zone>
<zone ulx="2023" uly="737" lrx="5363" lry="944 »>
<line n="450" aid:pstyle="txt_Original_Vers" xml:id="vers450">Entor le mont,
et bele et pleine<note type="marginal" xml:id="AFRftn207">areigne.</note>.</line>
</zone>
…
</sourceDoc> ...
...
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
Technical details MIRADOR / TEI
XSL code “ulx, uly, lrx, lry” (TEI - sourceDoc)
to “x, y, w, h” (Shared Canvas)
...
<xsl:for-each select="/TEI/sourceDoc//surface" group-starting-with="TEI/sourceDoc//surface">
…
<xsl:for-each select= ".//line">
...
<xsl:variable name="id"><xsl:value-of select="@xml:id)"/></xsl:variable>
<!-- width = lower right x - upper left x -->
<xsl:variable name="width"><xsl:value-of select="../zone/@lrx - ../zone/@ulx"/></xsl:variable>
<!-- height = lower right y - upper left y -->
<xsl:variable name="height"><xsl:value-of select="../zone/@lry - ../zone/@uly"/></xsl:variable>
<xsl:text>"on":"http://sanddragon.bl.uk/IIIFMetadataService/canvas/folio-</xsl:text><xsl:value-of
select="$pg"/><xsl:text>.json#xywh=</xsl:text><xsl:value-of select="../zone/@lrx"/><xsl:text>,</xsl:text><xsl:value-of select="../zone/
@lry"/><xsl:text>,</xsl:text><xsl:value-of select="($width)"/><xsl:text>,</xsl:text><xsl:value-of select="($height)"/><xsl:text>"</xsl:text>
…
</xsl:for-each>
…
</xsl:for-each-group>...
Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)

Weitere ähnliche Inhalte

Ähnlich wie Roman du Mont Saint-Michel: Biblissima's case study with the University of Caen and the British Library

Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...
Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...
Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...
MongoDB
 
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Bradley Allen
 
A Model of the Scholarly Community
A Model of the Scholarly CommunityA Model of the Scholarly Community
A Model of the Scholarly Community
Marko Rodriguez
 
Maria Patterson - Building a community fountain around your data stream
Maria Patterson - Building a community fountain around your data streamMaria Patterson - Building a community fountain around your data stream
Maria Patterson - Building a community fountain around your data stream
PyData
 
DB-IR-ranking
DB-IR-rankingDB-IR-ranking
DB-IR-ranking
FELIX75
 

Ähnlich wie Roman du Mont Saint-Michel: Biblissima's case study with the University of Caen and the British Library (20)

Mapping Subsets of Scholarly Information
Mapping Subsets of Scholarly InformationMapping Subsets of Scholarly Information
Mapping Subsets of Scholarly Information
 
OpenEdition Lab projects in Text Mining
OpenEdition Lab projects in Text MiningOpenEdition Lab projects in Text Mining
OpenEdition Lab projects in Text Mining
 
Scalding Big (Ad)ta
Scalding Big (Ad)taScalding Big (Ad)ta
Scalding Big (Ad)ta
 
Scala and big data in ICM. Scoobie, Scalding, Spark, Stratosphere. Scalar 2014
Scala and big data in ICM. Scoobie, Scalding, Spark, Stratosphere. Scalar 2014Scala and big data in ICM. Scoobie, Scalding, Spark, Stratosphere. Scalar 2014
Scala and big data in ICM. Scoobie, Scalding, Spark, Stratosphere. Scalar 2014
 
A Generic Mapping-based Query Translation from SPARQL to Various Target Datab...
A Generic Mapping-based Query Translation from SPARQL to Various Target Datab...A Generic Mapping-based Query Translation from SPARQL to Various Target Datab...
A Generic Mapping-based Query Translation from SPARQL to Various Target Datab...
 
Collective entity linking with WSRM DocEng'19
Collective entity linking with WSRM DocEng'19Collective entity linking with WSRM DocEng'19
Collective entity linking with WSRM DocEng'19
 
Knowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything ProjectKnowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything Project
 
bridging formal semantics and social semantics on the web
bridging formal semantics and social semantics on the webbridging formal semantics and social semantics on the web
bridging formal semantics and social semantics on the web
 
Moving Library Metadata Toward Linked Data: Opportunities Provided by the eX...
Moving Library Metadata Toward Linked Data:  Opportunities Provided by the eX...Moving Library Metadata Toward Linked Data:  Opportunities Provided by the eX...
Moving Library Metadata Toward Linked Data: Opportunities Provided by the eX...
 
Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...
Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...
Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...
 
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
 
Small Data in the large with Oppidum
Small Data in the large with OppidumSmall Data in the large with Oppidum
Small Data in the large with Oppidum
 
A Model of the Scholarly Community
A Model of the Scholarly CommunityA Model of the Scholarly Community
A Model of the Scholarly Community
 
Biblissima et IIIF (MAE)
Biblissima et IIIF (MAE)Biblissima et IIIF (MAE)
Biblissima et IIIF (MAE)
 
Maria Patterson - Building a community fountain around your data stream
Maria Patterson - Building a community fountain around your data streamMaria Patterson - Building a community fountain around your data stream
Maria Patterson - Building a community fountain around your data stream
 
20130912 YTC_Reynold Xin_Spark and Shark
20130912 YTC_Reynold Xin_Spark and Shark20130912 YTC_Reynold Xin_Spark and Shark
20130912 YTC_Reynold Xin_Spark and Shark
 
Integrating Heterogeneous Data Sources in the Web of Data
Integrating Heterogeneous Data Sources in the Web of DataIntegrating Heterogeneous Data Sources in the Web of Data
Integrating Heterogeneous Data Sources in the Web of Data
 
For paper
For paperFor paper
For paper
 
ACCU 2011 Introduction to Scala: An Object Functional Programming Language
ACCU 2011 Introduction to Scala: An Object Functional Programming LanguageACCU 2011 Introduction to Scala: An Object Functional Programming Language
ACCU 2011 Introduction to Scala: An Object Functional Programming Language
 
DB-IR-ranking
DB-IR-rankingDB-IR-ranking
DB-IR-ranking
 

Mehr von Equipex Biblissima

Mehr von Equipex Biblissima (20)

Da Biblissima a Biblissima+ : per un osservatorio delle culture scritte
Da Biblissima a Biblissima+ : per un osservatorio delle culture scritteDa Biblissima a Biblissima+ : per un osservatorio delle culture scritte
Da Biblissima a Biblissima+ : per un osservatorio delle culture scritte
 
eScriptorium: An Open Source Platform for Historical Document Analysis
eScriptorium: An Open Source Platform for Historical Document AnalysiseScriptorium: An Open Source Platform for Historical Document Analysis
eScriptorium: An Open Source Platform for Historical Document Analysis
 
Annotate (E-ReColNat) : annotation rapide d’images et de vidéos en sciences n...
Annotate (E-ReColNat) : annotation rapide d’images et de vidéos en sciences n...Annotate (E-ReColNat) : annotation rapide d’images et de vidéos en sciences n...
Annotate (E-ReColNat) : annotation rapide d’images et de vidéos en sciences n...
 
Appliquer les techniques d'apprentissage profond pour détecter les enluminure...
Appliquer les techniques d'apprentissage profond pour détecter les enluminure...Appliquer les techniques d'apprentissage profond pour détecter les enluminure...
Appliquer les techniques d'apprentissage profond pour détecter les enluminure...
 
Représentations du chant du Moyen Âge dans les images IIIF
Représentations du chant du Moyen Âge dans les images IIIFReprésentations du chant du Moyen Âge dans les images IIIF
Représentations du chant du Moyen Âge dans les images IIIF
 
Réflexions et explorations croisées autour de IIIF, Omeka-s et NumaHOP à la B...
Réflexions et explorations croisées autour de IIIF, Omeka-s et NumaHOP à la B...Réflexions et explorations croisées autour de IIIF, Omeka-s et NumaHOP à la B...
Réflexions et explorations croisées autour de IIIF, Omeka-s et NumaHOP à la B...
 
Mise en œuvre de IIIF pour la reconnaissance automatique de documents
Mise en œuvre de IIIF pour la reconnaissance automatique de documentsMise en œuvre de IIIF pour la reconnaissance automatique de documents
Mise en œuvre de IIIF pour la reconnaissance automatique de documents
 
Nakala et IIIF
Nakala et IIIFNakala et IIIF
Nakala et IIIF
 
Actualités et perspectives de IIIF
Actualités et perspectives de IIIFActualités et perspectives de IIIF
Actualités et perspectives de IIIF
 
Mieux diffuser et valoriser ses images sur le Web grâce aux standards IIIF
Mieux diffuser et valoriser ses images sur le Web grâce aux standards IIIFMieux diffuser et valoriser ses images sur le Web grâce aux standards IIIF
Mieux diffuser et valoriser ses images sur le Web grâce aux standards IIIF
 
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...
 
IIIF360: A Service to Support and Promote IIIF in France
IIIF360: A Service to Support and Promote IIIF in FranceIIIF360: A Service to Support and Promote IIIF in France
IIIF360: A Service to Support and Promote IIIF in France
 
The Biblissima Authority File of Geographical Names
The Biblissima Authority File of Geographical NamesThe Biblissima Authority File of Geographical Names
The Biblissima Authority File of Geographical Names
 
Les référentiels Biblissima : épine dorsale du portail Biblissima et de IIIF-...
Les référentiels Biblissima : épine dorsale du portail Biblissima et de IIIF-...Les référentiels Biblissima : épine dorsale du portail Biblissima et de IIIF-...
Les référentiels Biblissima : épine dorsale du portail Biblissima et de IIIF-...
 
Introduction aux protocoles IIIF. Formation Enssib 23.01.2019 (Régis Robineau)
Introduction aux protocoles IIIF. Formation Enssib 23.01.2019 (Régis Robineau)Introduction aux protocoles IIIF. Formation Enssib 23.01.2019 (Régis Robineau)
Introduction aux protocoles IIIF. Formation Enssib 23.01.2019 (Régis Robineau)
 
Biblissima: Connecting Manuscripts Collections
Biblissima: Connecting Manuscripts CollectionsBiblissima: Connecting Manuscripts Collections
Biblissima: Connecting Manuscripts Collections
 
IIIF et Biblissima
IIIF et BiblissimaIIIF et Biblissima
IIIF et Biblissima
 
A la recherche du patrimoine écrit avec le portail Biblissima
A la recherche du patrimoine écrit avec le portail BiblissimaA la recherche du patrimoine écrit avec le portail Biblissima
A la recherche du patrimoine écrit avec le portail Biblissima
 
Browse and Visualize Manuscripts Illuminations with IIIF
Browse and Visualize Manuscripts Illuminations with IIIFBrowse and Visualize Manuscripts Illuminations with IIIF
Browse and Visualize Manuscripts Illuminations with IIIF
 
Les descripteurs des bases iconographiques Mandragore (BnF) et Initiale (IRHT...
Les descripteurs des bases iconographiques Mandragore (BnF) et Initiale (IRHT...Les descripteurs des bases iconographiques Mandragore (BnF) et Initiale (IRHT...
Les descripteurs des bases iconographiques Mandragore (BnF) et Initiale (IRHT...
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Kürzlich hochgeladen (20)

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

Roman du Mont Saint-Michel: Biblissima's case study with the University of Caen and the British Library

  • 1. Roman du Mont Saint Michel A medieval novel in three books. Case Study Shared Canvas by University of Caen, the British Library and Biblissima
  • 2. Roman du Mont Saint Michel Source: Two manuscripts - BL Additional 10289 (date 1275-1300) - BL Additional 26876 (date 1340) Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 3. Roman du Mont Saint Michel Editions Print edition: Guillaume de Saint-Pair, Le roman du Mont Saint-Michel, Les manuscrits du Mont Saint-Michel : textes fondateurs II, C. Bougy (éd.), 2009. Access point electronic edition: http://www.unicaen.fr/puc/sources/gsp/index.php?page=sommaire Online catalogue BL: http://www.bl.uk/manuscripts/FullDisplay.aspx?ref=Add_MS_10289 http://www.bl.uk/catalogues/illuminatedmanuscripts/record.asp? MSID=19373&CollID=27&NStart=26876 Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 4. Roman du Mont Saint Michel Electronic edition in TEI P5 compiling the two textual witnesses (“A” and “B”) Translation into modern french in TEI P5 No use of <surface> and <zone> (in 2006), digitisation realised end of 2013 by BL Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 5. Electronic Editions in TEI-P5 - Chapter 11 of the TEI-Guidelines: Representation of the Primary Source - 11.1 Digital Facsimile - 11.2.2 Embedded Transcription => use of <sourceDoc> in the case where such images are complemented by a documentary transcription Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 6. Electronic Editions in TEI facsimile contains a representation of some written source in the form of a set of images rather than as transcribed or encoded text. sourceDoc contains a transcription or other representation of a single source document potentially forming part of a dossier génétique or collection of sources. surface defines a written surface as a two-dimensional coordinate space, optionally grouping one or more graphic representations of that space, zones of interest within that space, and transcriptions of the writing within them. surfaceGrp defines any kind of useful grouping of written surfaces, for example the recto and verso of a single leaf, which the encoder wishes to treat as a single unit. zone defines any two-dimensional area within a surface element. Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 7. Technical details MIRADOR / TEI MIRADOR displays TEI-transcriptions line by line transformed to JSON-LD JSON-LD (= RDF Serialization)
  • 8. Technical details MIRADOR / TEI http://demos.biblissima-condorcet.fr/mirador/ 1. Go to “BL Add MS 10289” 2. Click on the icon i(nfo) 3. http://sanddragon.bl.uk/IIIFMetadataService/add_ms_10289.json
  • 9. JSON-LD for a Canvas {"@id": "http://sanddragon.bl.uk/IIIFMetadataService/canvas/folio-8v.json", "@type": "sc:Canvas", "label": "f 8v", "height": 8552, "width": 6049, "images": [ {"@id": "http://sanddragon.bl.uk/IIIFMetadataService/annotation/a-folio-8v.json", "@type": "oa:Annotation", "motivation": "sc:painting", "resource": { "@id": "http://sanddragon.bl.uk/IIIFImageService/add_ms_10289_f008v/full/full/0/native.jpg", "@type": "dctypes:Image", "format": "image/jpeg", "height": 8552, "width": 6049, "service": {"@id": "http://sanddragon.bl.uk/IIIFImageService/add_ms_10289_f008v", "profile": "http://library.stanford.edu/iiif/image-api/1.1/compliance.html#level2} }, "on": "http://sanddragon.bl.uk/IIIFMetadataService/canvas/folio-8v.json"} ], "otherContent" : [{"@id" : "http://demos.biblissima-condorcet.fr/iiif/metadata/BL_Add_10289/list/transscript_8v.json", "@type": "sc:AnnotationList" } ] }
  • 10. TEI 2 JSON-LD for transcriptions For each <pb> (= for each canvas) we create an AnnotationList. That list contains several resources, here transcriptions per line.
  • 11. Technical details MIRADOR / TEI { "@context": "http://www.shared-canvas.org/ns/context.json", "@id": "http://demos.biblissima-condorcet.fr/iiif/metadata/BL_Add_10289/list/transscript_8v.json", "@type": "sc:AnnotationList", "resources":[ { "@id":"http://demos.biblissima-condorcet.fr/iiif/metadata/BL_Add_10289/Annotation/8v_line449.json", "@type":"oa:Annotation", "motivation":"sc:painting", "resource":{ "@type":"cnt:ContentAsText", "chars":"De la forest a feit areine", "format":"text/plain", "language":"fr-FR" }, "on":"http://sanddragon.bl.uk/IIIFMetadataService/canvas/folio-8v.json#xywh=1950,400,3500,220" } ,
  • 12. TEI-P5 <surface> and <zone> TEI-P5: @ulx, @uly, @lrx, @lry (upper left x, upper left y, lower right x, lower right y) (0,0) (6049,8552) ulx uly lrx lry Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 13. TEI-P5 versus Shared Canvas Shared Canvas : x, y, w, h (x, y, width, height) (0,0) x y w 8552 6049 h
  • 14. Adding #xywh to resources (JSON-LD) Approximate semiautomatic approach to surfaces and zones for transcripts + easy to use - not exact, works only for very regular manuscripts Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 15. Adding #xywh to resources (JSON-LD) Use any image manipulation program, e.g. Gimp to determine the average location of the text on the images. In the image manipulation program the cursor’s position is usually displayed below the image. Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 16. Adding #xywh to resources (JSON-LD) Differenciate between recto and verso pages … <xsl:variable name="lx"><xsl:choose> <xsl:when test="contains($pg,'r')"> <xsl:value-of select="$r_x"/></xsl:when> <xsl:otherwise><xsl:value-of select="$v_x"/></xsl:otherwise> </xsl:choose> </xsl:variable> … Compute the average hight of a line line_height = text_height / number_of_lines Line n starts at y = text_y + (n - 1) * line_height → <xsl:text>"on":"http://sanddragon.bl.uk/IIIFMetadataService/canvas/folio-</xsl:text><xsl:value-of select="$pg"/> <xsl:text>.json#xywh=</xsl:text><xsl:value-of select="($lx)"/><xsl:text>,</xsl:text><xsl:value-of select="($y)"/><xsl:text>, </xsl:text><xsl:value-of select="($text_width)"/><xsl:text>,</xsl:text><xsl:value-of select="($line_height)"/> <xsl:text>"</xsl:text> Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 17. Adding <zones> and <surfaces> in TEI Parallel approach to transscript - <facsimile> referencing via @xml:id and #facs + high level of detail possible + text can be encoded with more possibilities + text and image data can be seperated - complexity is difficult for transformation and display Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 18. Technical details TEI ... … <facsimile> ... <surface ulx="0" uly="0" lrx="6049" lry="8552"> <zone xml:id=”f8v” ulx="0" uly="0" lrx="6049" lry="8552"> <graphic url="http://sanddragon.bl.uk/IIIFImageService/add_ms_10289_f008v/full/full/ 0/native.jpg"/> </zone> <zone xml:id=”vers449” ulx="2023" uly="540" lrx="5363" lry="736"/> <zone xml:id=”vers450” ulx="2023" uly="737" lrx="5363" lry="944"/> … </facsimile> ... ...<div><div><lg> … <l n="449" aid:pstyle="txt_Original_Vers" xml:id="vers449" facs=”#vers449”> <pb ed="A" n="8v" xml:id=”f8v” facs=”#f8v”/>De la forest a feit areine</l> <l n="450" aid:pstyle="txt_Original_Vers" xml:id="vers450" facs=”#vers450”> Entor le mont, et bele et pleine<note type="marginal" xml:id="AFRftn207"> areigne.</note>.</l> … </div></div></lg>... Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 19. Technical details MIRADOR / TEI XSL code “ulx, uly, lrx, lry” (TEI - facsimile) to “x, y, w, h” (Shared Canvas) ... <xsl:for-each-group select="/TEI/text/body//lg/l" group-starting-with="/TEI/text/body//lg/l[pb]"> … <xsl:for-each select="current-group()"> ... <xsl:variable name="id"><xsl:value-of select="substring-after(@facs,’#’)"/></xsl:variable> <!-- width = lower right x - upper left x → <xsl:variable name="width"><xsl:value-of select="/TEI/facsimile/surface/zone[@xml:id=$id]/@lrx - /TEI/facsimile/surface/zone[@xml:id=$id]/@ulx"/></xsl:variable> <!-- height = lower right y - upper left y --> <xsl:variable name="height"><xsl:value-of select="/TEI/facsimile/surface/zone[@xml:id=$id]/@lry - /TEI/facsimile/surface/zone[@xml:id=$id]/@uly"/></xsl:variable> <xsl:text>"on":"http://sanddragon.bl.uk/IIIFMetadataService/canvas/folio-</xsl:text><xsl:value-of select="$pg"/><xsl:text>.json#xywh=</xsl:text><xsl:value-of select="@lrx"/><xsl:text>,</xsl:text><xsl:value-of select="@lry"/ ><xsl:text>,</xsl:text><xsl:value-of select="($width)"/><xsl:text>,</xsl:text><xsl:value-of select="($height)"/><xsl:text>"</xsl:text> … </xsl:for-each> …</xsl:for-each-group> ... Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 20. Technical details MIRADOR / TEI Embedded approach to transscript - <sourceDoc> + direct approach + clearity due to the limitation to transscripts only + simpler for XSLT transforms - mixing of text and image data Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 21. Technical details TEI … <sourceDoc> ... <surface ulx="0" uly="0" lrx="6049" lry="8552"> <zone xml:id=”f8v”ulx="0" uly="0" lrx="6049" lry="8552"> <graphic url="http://sanddragon.bl.uk/IIIFImageService/add_ms_10289_f008v/full/full/0/native.jpg"/> </zone> <zone ulx="2023" uly="540" lrx="5363" lry="736 »> <line n="449" aid:pstyle="txt_Original_Vers" xml:id="vers449" > <pb ed="A" n="8v" xml:id=”f8v” facs=”#f8v”/>De la forest a feit areine</line> </zone> <zone ulx="2023" uly="737" lrx="5363" lry="944 »> <line n="450" aid:pstyle="txt_Original_Vers" xml:id="vers450">Entor le mont, et bele et pleine<note type="marginal" xml:id="AFRftn207">areigne.</note>.</line> </zone> … </sourceDoc> ... ... Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)
  • 22. Technical details MIRADOR / TEI XSL code “ulx, uly, lrx, lry” (TEI - sourceDoc) to “x, y, w, h” (Shared Canvas) ... <xsl:for-each select="/TEI/sourceDoc//surface" group-starting-with="TEI/sourceDoc//surface"> … <xsl:for-each select= ".//line"> ... <xsl:variable name="id"><xsl:value-of select="@xml:id)"/></xsl:variable> <!-- width = lower right x - upper left x --> <xsl:variable name="width"><xsl:value-of select="../zone/@lrx - ../zone/@ulx"/></xsl:variable> <!-- height = lower right y - upper left y --> <xsl:variable name="height"><xsl:value-of select="../zone/@lry - ../zone/@uly"/></xsl:variable> <xsl:text>"on":"http://sanddragon.bl.uk/IIIFMetadataService/canvas/folio-</xsl:text><xsl:value-of select="$pg"/><xsl:text>.json#xywh=</xsl:text><xsl:value-of select="../zone/@lrx"/><xsl:text>,</xsl:text><xsl:value-of select="../zone/ @lry"/><xsl:text>,</xsl:text><xsl:value-of select="($width)"/><xsl:text>,</xsl:text><xsl:value-of select="($height)"/><xsl:text>"</xsl:text> … </xsl:for-each> … </xsl:for-each-group>... Stefanie Gehrke - Pool Biblissima. Training School COST-IRHT (02.04.2014)