SlideShare ist ein Scribd-Unternehmen logo
1 von 31
Downloaden Sie, um offline zu lesen
eScriptorium: An Open Source
Platform for
Historical Document Analysis
Daniel Stökl Ben Ezra
Peter Stokes
Marc Bui
Ben Kiessling
Robin Tissot
eScriptorium
• Blog: http://escripta.hypotheses.org
• UI Code: https://gitlab.inria.fr/scripta/escriptorium
• AI Code: https://github.com/mittagessen/kraken
Funded by: PSL IRIS Scripta, H2020 Resilience, MENESR, DIM STCN Ile de
France, EquipEx Biblissima+, [indirectement: Mellon, MCC]
eScriptorium Universe
Scripta PSL:
eScriptorium
LectauRep
INRIA
ANF
openITI
North-Eastern
U Maryland
ERC
Vietnamica
EPHE
U-Bib
Heidelberg
?
National
Library of
Israel ?
ENC
Sorbonne
Université
DIM STCN
Observatoir
e de Paris
IRHT
H2020
Resilience
• manuscriptologIA
High Performance
Computing Cluster at
mesoPSL
Biblissima+
TGIR Huma-Num
current
• Import:
IIIF
pdf, imgfiles (jpg, png, …), alto, PageXML,
trained segmentation or transcription models
• Ergonomic UI for manual segmentation, transcription and (soon) annotation.
4 panels (facsimile, segmentation, transcription, text-annotation)
( user definable architectures)
↓ Metadata imported via iiif
current
• Import: IIIF, pdf, imgfiles (jpg, png, …), alto, PageXML,
trained segmentation or transcription models (user definable architectures)
• Ergonomic UI for manual segmentation, transcription and (soon) annotation.
4 panels (facsimile, segmentation, transcription, text-annotation)
Ergonomic transcription e.g. of vertical or oblique lines
BL ms Add. 27296
Transcription font size automatically adapted to manuscript line
current
• Import: IIIF, pdf, imgfiles (jpg, png, …), alto, PageXML,
trained segmentation or transcription models (user definable architectures)
• Ergonomic UI for manual segmentation, transcription and (soon) annotation.
4 panels (facsimile, segmentation, transcription, text-annotation)
• Automatic segmentation (lines, semantic lines and regions, also overlapping)
based on user-defined ontologies.
• Automatic transcription according to the principles set by the user.
• Export: alto 4(!), PageXML, txt, imgfiles (jpg, png ,…)
trained segmentation or transcription models
• Powerful and growing API
Segmentation and Transcription
Demonstration
↑ User definable
segmentation ontology
Locate illuminations through layout segmentation
Automatic segmentation result of ms specific model
Ergonomic correction
Jbaiter Mirador textoverlay plugin
eScriptorium (near) FUTURE
Scripta PSL:
eScriptorium
LectauRep
INRIA
ANF openITI
North-Eastern
U Maryland
ERC
Vietnamica
EPHE
U-Bib
Heidelberg
?
National
Library of
Israel ?
ENC
Sorbonne
Université
DIM STCN
Observatoir
e de Paris
IRHT
H2020
Resilience
• Search
• Trainable reading order
• Prototype for text annotation (NE,
ecdotic) with TEI-Export
• Prototype for image annotation
(e.g. Digipal / Archetype)
• manuscriptologIA
High Performance
Computing Cluster at
mesoPSL
• Customizable virtual
keyboard
• Vertical interface for Chinese
• Automatic textalignment
• Additional simplified interface
• Improved project management
• Crowdsourcing interface
Biblissima+
TGIR Huma-Num
Transcription created automatically without
specific transcription BnF syr 341
Judeo-Arabic+Hebrew, Ox. Bodl. Pococke 295,
Maimonides, Mishnah Commentary
Greek papyri (with WÜ, HD, B)
Greek papyri (with WÜ, HD, B)
eScriptorium used for Dead Sea Scroll Glyph alignment
Automatic letter level alignment
Images of Dead Sea Scrolls by
Shay Halevy Courtesy Israel
Antiquities Authority
p. 3558:
Please stay tuned for upcoming workshops
Contact: daniel.stoekl@ephe.psl.eu, peter.stokes@ephe.psl.eu
https://escripta.hypotheses.org
Many thanks to
Bibliothèque nationale de France
National Library of Israel (Ktiv!)
Bayerische Staatsbibliothek München
Biblioteca Apostolica Vaticana
Bodleian Library, Oxford
Cambridge University Library
Israel Antiquities Authority, Jerusalem
Staatsbibliothek Berlin, Preußischer Kulturbesitz
Intro tutorial: https://lectaurep.hypotheses.org/documentation/prendre-en-main-escriptorium

Weitere ähnliche Inhalte

Was ist angesagt?

The semantic web
The semantic web The semantic web
The semantic web ap
 
How Graph Databases efficiently store, manage and query connected data at s...
How Graph Databases efficiently  store, manage and query  connected data at s...How Graph Databases efficiently  store, manage and query  connected data at s...
How Graph Databases efficiently store, manage and query connected data at s...jexp
 
Neo4j in Production: A look at Neo4j in the Real World
Neo4j in Production: A look at Neo4j in the Real WorldNeo4j in Production: A look at Neo4j in the Real World
Neo4j in Production: A look at Neo4j in the Real WorldNeo4j
 
FIWARE Global Summit - Idra: A Solution for Open Data Interoperability
FIWARE Global Summit - Idra: A Solution for Open Data InteroperabilityFIWARE Global Summit - Idra: A Solution for Open Data Interoperability
FIWARE Global Summit - Idra: A Solution for Open Data InteroperabilityFIWARE
 
Python Style Guide
Python Style GuidePython Style Guide
Python Style GuideJiayun Zhou
 
Sakhr Arabic OCR User's Guide
Sakhr Arabic OCR User's GuideSakhr Arabic OCR User's Guide
Sakhr Arabic OCR User's GuideWael Sharba
 
Introduction to Vertica (Architecture & More)
Introduction to Vertica (Architecture & More)Introduction to Vertica (Architecture & More)
Introduction to Vertica (Architecture & More)LivePerson
 
Vector databases and neural search
Vector databases and neural searchVector databases and neural search
Vector databases and neural searchDmitry Kan
 
A Brief Introduction to SKOS
A Brief Introduction to SKOSA Brief Introduction to SKOS
A Brief Introduction to SKOSHeather Hedden
 
딥 러닝 자연어 처리 학습을 위한 PPT! (Deep Learning for Natural Language Processing)
딥 러닝 자연어 처리 학습을 위한 PPT! (Deep Learning for Natural Language Processing)딥 러닝 자연어 처리 학습을 위한 PPT! (Deep Learning for Natural Language Processing)
딥 러닝 자연어 처리 학습을 위한 PPT! (Deep Learning for Natural Language Processing)WON JOON YOO
 
Kyo - Functional Scala 2023.pdf
Kyo - Functional Scala 2023.pdfKyo - Functional Scala 2023.pdf
Kyo - Functional Scala 2023.pdfFlavio W. Brasil
 
Query Understanding: A Manifesto
Query Understanding: A ManifestoQuery Understanding: A Manifesto
Query Understanding: A ManifestoDaniel Tunkelang
 
Building Next-Generation Web APIs with JSON-LD and Hydra
Building Next-Generation Web APIs with JSON-LD and HydraBuilding Next-Generation Web APIs with JSON-LD and Hydra
Building Next-Generation Web APIs with JSON-LD and HydraMarkus Lanthaler
 
Semantic Web - Ontologies
Semantic Web - OntologiesSemantic Web - Ontologies
Semantic Web - OntologiesSerge Linckels
 
TensorFlow Object Detection | Realtime Object Detection with TensorFlow | Ten...
TensorFlow Object Detection | Realtime Object Detection with TensorFlow | Ten...TensorFlow Object Detection | Realtime Object Detection with TensorFlow | Ten...
TensorFlow Object Detection | Realtime Object Detection with TensorFlow | Ten...Edureka!
 
Intro to Graphs and Neo4j
Intro to Graphs and Neo4jIntro to Graphs and Neo4j
Intro to Graphs and Neo4jjexp
 
Introduction to Overleaf Workshop
Introduction to Overleaf WorkshopIntroduction to Overleaf Workshop
Introduction to Overleaf WorkshopOlga Scrivner
 

Was ist angesagt? (20)

The semantic web
The semantic web The semantic web
The semantic web
 
How Graph Databases efficiently store, manage and query connected data at s...
How Graph Databases efficiently  store, manage and query  connected data at s...How Graph Databases efficiently  store, manage and query  connected data at s...
How Graph Databases efficiently store, manage and query connected data at s...
 
Neo4j in Production: A look at Neo4j in the Real World
Neo4j in Production: A look at Neo4j in the Real WorldNeo4j in Production: A look at Neo4j in the Real World
Neo4j in Production: A look at Neo4j in the Real World
 
FIWARE Global Summit - Idra: A Solution for Open Data Interoperability
FIWARE Global Summit - Idra: A Solution for Open Data InteroperabilityFIWARE Global Summit - Idra: A Solution for Open Data Interoperability
FIWARE Global Summit - Idra: A Solution for Open Data Interoperability
 
Python Style Guide
Python Style GuidePython Style Guide
Python Style Guide
 
Sakhr Arabic OCR User's Guide
Sakhr Arabic OCR User's GuideSakhr Arabic OCR User's Guide
Sakhr Arabic OCR User's Guide
 
Introduction to Vertica (Architecture & More)
Introduction to Vertica (Architecture & More)Introduction to Vertica (Architecture & More)
Introduction to Vertica (Architecture & More)
 
Vector databases and neural search
Vector databases and neural searchVector databases and neural search
Vector databases and neural search
 
Zenoh: The Genesis
Zenoh: The GenesisZenoh: The Genesis
Zenoh: The Genesis
 
A Brief Introduction to SKOS
A Brief Introduction to SKOSA Brief Introduction to SKOS
A Brief Introduction to SKOS
 
딥 러닝 자연어 처리 학습을 위한 PPT! (Deep Learning for Natural Language Processing)
딥 러닝 자연어 처리 학습을 위한 PPT! (Deep Learning for Natural Language Processing)딥 러닝 자연어 처리 학습을 위한 PPT! (Deep Learning for Natural Language Processing)
딥 러닝 자연어 처리 학습을 위한 PPT! (Deep Learning for Natural Language Processing)
 
Php
PhpPhp
Php
 
Kyo - Functional Scala 2023.pdf
Kyo - Functional Scala 2023.pdfKyo - Functional Scala 2023.pdf
Kyo - Functional Scala 2023.pdf
 
SHACL by example
SHACL by exampleSHACL by example
SHACL by example
 
Query Understanding: A Manifesto
Query Understanding: A ManifestoQuery Understanding: A Manifesto
Query Understanding: A Manifesto
 
Building Next-Generation Web APIs with JSON-LD and Hydra
Building Next-Generation Web APIs with JSON-LD and HydraBuilding Next-Generation Web APIs with JSON-LD and Hydra
Building Next-Generation Web APIs with JSON-LD and Hydra
 
Semantic Web - Ontologies
Semantic Web - OntologiesSemantic Web - Ontologies
Semantic Web - Ontologies
 
TensorFlow Object Detection | Realtime Object Detection with TensorFlow | Ten...
TensorFlow Object Detection | Realtime Object Detection with TensorFlow | Ten...TensorFlow Object Detection | Realtime Object Detection with TensorFlow | Ten...
TensorFlow Object Detection | Realtime Object Detection with TensorFlow | Ten...
 
Intro to Graphs and Neo4j
Intro to Graphs and Neo4jIntro to Graphs and Neo4j
Intro to Graphs and Neo4j
 
Introduction to Overleaf Workshop
Introduction to Overleaf WorkshopIntroduction to Overleaf Workshop
Introduction to Overleaf Workshop
 

Ähnlich wie eScriptorium: An Open Source Platform for Historical Document Analysis

How community software supports language documentation and data analysis
How community software supports language documentation and data analysisHow community software supports language documentation and data analysis
How community software supports language documentation and data analysisPeter Bouda
 
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
2015 bioinformatics python_introduction_wim_vancriekinge_vfinalProf. Wim Van Criekinge
 
Travis Oliphant "Python for Speed, Scale, and Science"
Travis Oliphant "Python for Speed, Scale, and Science"Travis Oliphant "Python for Speed, Scale, and Science"
Travis Oliphant "Python for Speed, Scale, and Science"Fwdays
 
Curation Technologies for Multilingual Europe
Curation Technologies for Multilingual EuropeCuration Technologies for Multilingual Europe
Curation Technologies for Multilingual EuropeGeorg Rehm
 
Deep Dive into Apache MXNet on AWS
Deep Dive into Apache MXNet on AWSDeep Dive into Apache MXNet on AWS
Deep Dive into Apache MXNet on AWSKristana Kane
 
Introduction to libre « fulltext » technology
Introduction to libre « fulltext » technologyIntroduction to libre « fulltext » technology
Introduction to libre « fulltext » technologyRobert Viseur
 
Why Python
Why PythonWhy Python
Why Pythonarnav
 
3 python packages
3 python packages3 python packages
3 python packagesFEG
 
Automation in VLSI related tasks.
Automation in VLSI related tasks.Automation in VLSI related tasks.
Automation in VLSI related tasks.Shariful Islam
 
If You Have The Content, Then Apache Has The Technology!
If You Have The Content, Then Apache Has The Technology!If You Have The Content, Then Apache Has The Technology!
If You Have The Content, Then Apache Has The Technology!gagravarr
 
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...Andrii Vozniuk
 
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech TalksA Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech TalksAmazon Web Services
 
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech TalksA Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech TalksAmazon Web Services
 
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...Simplilearn
 
Machine learning from software developers point of view
Machine learning from software developers point of viewMachine learning from software developers point of view
Machine learning from software developers point of viewPierre Paci
 
ANN-Lecture2-Python Startup.pptx
ANN-Lecture2-Python Startup.pptxANN-Lecture2-Python Startup.pptx
ANN-Lecture2-Python Startup.pptxShahzadAhmadJoiya3
 
Digitization in theory and practice
Digitization in theory and practiceDigitization in theory and practice
Digitization in theory and practiceHelen Nneka Okpala
 

Ähnlich wie eScriptorium: An Open Source Platform for Historical Document Analysis (20)

How community software supports language documentation and data analysis
How community software supports language documentation and data analysisHow community software supports language documentation and data analysis
How community software supports language documentation and data analysis
 
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
 
Travis Oliphant "Python for Speed, Scale, and Science"
Travis Oliphant "Python for Speed, Scale, and Science"Travis Oliphant "Python for Speed, Scale, and Science"
Travis Oliphant "Python for Speed, Scale, and Science"
 
Curation Technologies for Multilingual Europe
Curation Technologies for Multilingual EuropeCuration Technologies for Multilingual Europe
Curation Technologies for Multilingual Europe
 
Deep Dive into Apache MXNet on AWS
Deep Dive into Apache MXNet on AWSDeep Dive into Apache MXNet on AWS
Deep Dive into Apache MXNet on AWS
 
Introduction to libre « fulltext » technology
Introduction to libre « fulltext » technologyIntroduction to libre « fulltext » technology
Introduction to libre « fulltext » technology
 
Why Python
Why PythonWhy Python
Why Python
 
3 python packages
3 python packages3 python packages
3 python packages
 
Automation in VLSI related tasks.
Automation in VLSI related tasks.Automation in VLSI related tasks.
Automation in VLSI related tasks.
 
If You Have The Content, Then Apache Has The Technology!
If You Have The Content, Then Apache Has The Technology!If You Have The Content, Then Apache Has The Technology!
If You Have The Content, Then Apache Has The Technology!
 
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...
 
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech TalksA Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
 
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech TalksA Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
 
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
 
Machine learning from software developers point of view
Machine learning from software developers point of viewMachine learning from software developers point of view
Machine learning from software developers point of view
 
Amazon Deep Learning
Amazon Deep LearningAmazon Deep Learning
Amazon Deep Learning
 
ANN-Lecture2-Python Startup.pptx
ANN-Lecture2-Python Startup.pptxANN-Lecture2-Python Startup.pptx
ANN-Lecture2-Python Startup.pptx
 
P1 2018 python
P1 2018 pythonP1 2018 python
P1 2018 python
 
P1 2017 python
P1 2017 pythonP1 2017 python
P1 2017 python
 
Digitization in theory and practice
Digitization in theory and practiceDigitization in theory and practice
Digitization in theory and practice
 

Mehr von Equipex Biblissima

Da Biblissima a Biblissima+ : per un osservatorio delle culture scritte
Da Biblissima a Biblissima+ : per un osservatorio delle culture scritteDa Biblissima a Biblissima+ : per un osservatorio delle culture scritte
Da Biblissima a Biblissima+ : per un osservatorio delle culture scritteEquipex Biblissima
 
Annotate (E-ReColNat) : annotation rapide d’images et de vidéos en sciences n...
Annotate (E-ReColNat) : annotation rapide d’images et de vidéos en sciences n...Annotate (E-ReColNat) : annotation rapide d’images et de vidéos en sciences n...
Annotate (E-ReColNat) : annotation rapide d’images et de vidéos en sciences n...Equipex Biblissima
 
Appliquer les techniques d'apprentissage profond pour détecter les enluminure...
Appliquer les techniques d'apprentissage profond pour détecter les enluminure...Appliquer les techniques d'apprentissage profond pour détecter les enluminure...
Appliquer les techniques d'apprentissage profond pour détecter les enluminure...Equipex Biblissima
 
Représentations du chant du Moyen Âge dans les images IIIF
Représentations du chant du Moyen Âge dans les images IIIFReprésentations du chant du Moyen Âge dans les images IIIF
Représentations du chant du Moyen Âge dans les images IIIFEquipex Biblissima
 
Réflexions et explorations croisées autour de IIIF, Omeka-s et NumaHOP à la B...
Réflexions et explorations croisées autour de IIIF, Omeka-s et NumaHOP à la B...Réflexions et explorations croisées autour de IIIF, Omeka-s et NumaHOP à la B...
Réflexions et explorations croisées autour de IIIF, Omeka-s et NumaHOP à la B...Equipex Biblissima
 
Mise en œuvre de IIIF pour la reconnaissance automatique de documents
Mise en œuvre de IIIF pour la reconnaissance automatique de documentsMise en œuvre de IIIF pour la reconnaissance automatique de documents
Mise en œuvre de IIIF pour la reconnaissance automatique de documentsEquipex Biblissima
 
Actualités et perspectives de IIIF
Actualités et perspectives de IIIFActualités et perspectives de IIIF
Actualités et perspectives de IIIFEquipex Biblissima
 
Mieux diffuser et valoriser ses images sur le Web grâce aux standards IIIF
Mieux diffuser et valoriser ses images sur le Web grâce aux standards IIIFMieux diffuser et valoriser ses images sur le Web grâce aux standards IIIF
Mieux diffuser et valoriser ses images sur le Web grâce aux standards IIIFEquipex Biblissima
 
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...Equipex Biblissima
 
IIIF360: A Service to Support and Promote IIIF in France
IIIF360: A Service to Support and Promote IIIF in FranceIIIF360: A Service to Support and Promote IIIF in France
IIIF360: A Service to Support and Promote IIIF in FranceEquipex Biblissima
 
The Biblissima Authority File of Geographical Names
The Biblissima Authority File of Geographical NamesThe Biblissima Authority File of Geographical Names
The Biblissima Authority File of Geographical NamesEquipex Biblissima
 
Les référentiels Biblissima : épine dorsale du portail Biblissima et de IIIF-...
Les référentiels Biblissima : épine dorsale du portail Biblissima et de IIIF-...Les référentiels Biblissima : épine dorsale du portail Biblissima et de IIIF-...
Les référentiels Biblissima : épine dorsale du portail Biblissima et de IIIF-...Equipex Biblissima
 
Introduction aux protocoles IIIF. Formation Enssib 23.01.2019 (Régis Robineau)
Introduction aux protocoles IIIF. Formation Enssib 23.01.2019 (Régis Robineau)Introduction aux protocoles IIIF. Formation Enssib 23.01.2019 (Régis Robineau)
Introduction aux protocoles IIIF. Formation Enssib 23.01.2019 (Régis Robineau)Equipex Biblissima
 
Biblissima: Connecting Manuscripts Collections
Biblissima: Connecting Manuscripts CollectionsBiblissima: Connecting Manuscripts Collections
Biblissima: Connecting Manuscripts CollectionsEquipex Biblissima
 
A la recherche du patrimoine écrit avec le portail Biblissima
A la recherche du patrimoine écrit avec le portail BiblissimaA la recherche du patrimoine écrit avec le portail Biblissima
A la recherche du patrimoine écrit avec le portail BiblissimaEquipex Biblissima
 
Browse and Visualize Manuscripts Illuminations with IIIF
Browse and Visualize Manuscripts Illuminations with IIIFBrowse and Visualize Manuscripts Illuminations with IIIF
Browse and Visualize Manuscripts Illuminations with IIIFEquipex Biblissima
 
Les descripteurs des bases iconographiques Mandragore (BnF) et Initiale (IRHT...
Les descripteurs des bases iconographiques Mandragore (BnF) et Initiale (IRHT...Les descripteurs des bases iconographiques Mandragore (BnF) et Initiale (IRHT...
Les descripteurs des bases iconographiques Mandragore (BnF) et Initiale (IRHT...Equipex Biblissima
 
A la recherche du patrimoine écrit avec le portail Biblissima
A la recherche du patrimoine écrit avec le portail BiblissimaA la recherche du patrimoine écrit avec le portail Biblissima
A la recherche du patrimoine écrit avec le portail BiblissimaEquipex Biblissima
 

Mehr von Equipex Biblissima (20)

Da Biblissima a Biblissima+ : per un osservatorio delle culture scritte
Da Biblissima a Biblissima+ : per un osservatorio delle culture scritteDa Biblissima a Biblissima+ : per un osservatorio delle culture scritte
Da Biblissima a Biblissima+ : per un osservatorio delle culture scritte
 
Annotate (E-ReColNat) : annotation rapide d’images et de vidéos en sciences n...
Annotate (E-ReColNat) : annotation rapide d’images et de vidéos en sciences n...Annotate (E-ReColNat) : annotation rapide d’images et de vidéos en sciences n...
Annotate (E-ReColNat) : annotation rapide d’images et de vidéos en sciences n...
 
Appliquer les techniques d'apprentissage profond pour détecter les enluminure...
Appliquer les techniques d'apprentissage profond pour détecter les enluminure...Appliquer les techniques d'apprentissage profond pour détecter les enluminure...
Appliquer les techniques d'apprentissage profond pour détecter les enluminure...
 
Représentations du chant du Moyen Âge dans les images IIIF
Représentations du chant du Moyen Âge dans les images IIIFReprésentations du chant du Moyen Âge dans les images IIIF
Représentations du chant du Moyen Âge dans les images IIIF
 
Réflexions et explorations croisées autour de IIIF, Omeka-s et NumaHOP à la B...
Réflexions et explorations croisées autour de IIIF, Omeka-s et NumaHOP à la B...Réflexions et explorations croisées autour de IIIF, Omeka-s et NumaHOP à la B...
Réflexions et explorations croisées autour de IIIF, Omeka-s et NumaHOP à la B...
 
Mise en œuvre de IIIF pour la reconnaissance automatique de documents
Mise en œuvre de IIIF pour la reconnaissance automatique de documentsMise en œuvre de IIIF pour la reconnaissance automatique de documents
Mise en œuvre de IIIF pour la reconnaissance automatique de documents
 
Nakala et IIIF
Nakala et IIIFNakala et IIIF
Nakala et IIIF
 
Actualités et perspectives de IIIF
Actualités et perspectives de IIIFActualités et perspectives de IIIF
Actualités et perspectives de IIIF
 
Mieux diffuser et valoriser ses images sur le Web grâce aux standards IIIF
Mieux diffuser et valoriser ses images sur le Web grâce aux standards IIIFMieux diffuser et valoriser ses images sur le Web grâce aux standards IIIF
Mieux diffuser et valoriser ses images sur le Web grâce aux standards IIIF
 
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...
 
IIIF360: A Service to Support and Promote IIIF in France
IIIF360: A Service to Support and Promote IIIF in FranceIIIF360: A Service to Support and Promote IIIF in France
IIIF360: A Service to Support and Promote IIIF in France
 
The Biblissima Authority File of Geographical Names
The Biblissima Authority File of Geographical NamesThe Biblissima Authority File of Geographical Names
The Biblissima Authority File of Geographical Names
 
Les référentiels Biblissima : épine dorsale du portail Biblissima et de IIIF-...
Les référentiels Biblissima : épine dorsale du portail Biblissima et de IIIF-...Les référentiels Biblissima : épine dorsale du portail Biblissima et de IIIF-...
Les référentiels Biblissima : épine dorsale du portail Biblissima et de IIIF-...
 
Introduction aux protocoles IIIF. Formation Enssib 23.01.2019 (Régis Robineau)
Introduction aux protocoles IIIF. Formation Enssib 23.01.2019 (Régis Robineau)Introduction aux protocoles IIIF. Formation Enssib 23.01.2019 (Régis Robineau)
Introduction aux protocoles IIIF. Formation Enssib 23.01.2019 (Régis Robineau)
 
Biblissima: Connecting Manuscripts Collections
Biblissima: Connecting Manuscripts CollectionsBiblissima: Connecting Manuscripts Collections
Biblissima: Connecting Manuscripts Collections
 
IIIF et Biblissima
IIIF et BiblissimaIIIF et Biblissima
IIIF et Biblissima
 
A la recherche du patrimoine écrit avec le portail Biblissima
A la recherche du patrimoine écrit avec le portail BiblissimaA la recherche du patrimoine écrit avec le portail Biblissima
A la recherche du patrimoine écrit avec le portail Biblissima
 
Browse and Visualize Manuscripts Illuminations with IIIF
Browse and Visualize Manuscripts Illuminations with IIIFBrowse and Visualize Manuscripts Illuminations with IIIF
Browse and Visualize Manuscripts Illuminations with IIIF
 
Les descripteurs des bases iconographiques Mandragore (BnF) et Initiale (IRHT...
Les descripteurs des bases iconographiques Mandragore (BnF) et Initiale (IRHT...Les descripteurs des bases iconographiques Mandragore (BnF) et Initiale (IRHT...
Les descripteurs des bases iconographiques Mandragore (BnF) et Initiale (IRHT...
 
A la recherche du patrimoine écrit avec le portail Biblissima
A la recherche du patrimoine écrit avec le portail BiblissimaA la recherche du patrimoine écrit avec le portail Biblissima
A la recherche du patrimoine écrit avec le portail Biblissima
 

Kürzlich hochgeladen

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 

Kürzlich hochgeladen (20)

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 

eScriptorium: An Open Source Platform for Historical Document Analysis

  • 1. eScriptorium: An Open Source Platform for Historical Document Analysis Daniel Stökl Ben Ezra Peter Stokes Marc Bui Ben Kiessling Robin Tissot
  • 2. eScriptorium • Blog: http://escripta.hypotheses.org • UI Code: https://gitlab.inria.fr/scripta/escriptorium • AI Code: https://github.com/mittagessen/kraken Funded by: PSL IRIS Scripta, H2020 Resilience, MENESR, DIM STCN Ile de France, EquipEx Biblissima+, [indirectement: Mellon, MCC]
  • 3. eScriptorium Universe Scripta PSL: eScriptorium LectauRep INRIA ANF openITI North-Eastern U Maryland ERC Vietnamica EPHE U-Bib Heidelberg ? National Library of Israel ? ENC Sorbonne Université DIM STCN Observatoir e de Paris IRHT H2020 Resilience • manuscriptologIA High Performance Computing Cluster at mesoPSL Biblissima+ TGIR Huma-Num
  • 4.
  • 5.
  • 6.
  • 7. current • Import: IIIF pdf, imgfiles (jpg, png, …), alto, PageXML, trained segmentation or transcription models • Ergonomic UI for manual segmentation, transcription and (soon) annotation. 4 panels (facsimile, segmentation, transcription, text-annotation) ( user definable architectures)
  • 9. current • Import: IIIF, pdf, imgfiles (jpg, png, …), alto, PageXML, trained segmentation or transcription models (user definable architectures) • Ergonomic UI for manual segmentation, transcription and (soon) annotation. 4 panels (facsimile, segmentation, transcription, text-annotation)
  • 10. Ergonomic transcription e.g. of vertical or oblique lines
  • 11. BL ms Add. 27296 Transcription font size automatically adapted to manuscript line
  • 12. current • Import: IIIF, pdf, imgfiles (jpg, png, …), alto, PageXML, trained segmentation or transcription models (user definable architectures) • Ergonomic UI for manual segmentation, transcription and (soon) annotation. 4 panels (facsimile, segmentation, transcription, text-annotation) • Automatic segmentation (lines, semantic lines and regions, also overlapping) based on user-defined ontologies. • Automatic transcription according to the principles set by the user. • Export: alto 4(!), PageXML, txt, imgfiles (jpg, png ,…) trained segmentation or transcription models • Powerful and growing API
  • 13. Segmentation and Transcription Demonstration ↑ User definable segmentation ontology
  • 14.
  • 15. Locate illuminations through layout segmentation
  • 16. Automatic segmentation result of ms specific model
  • 18.
  • 20. eScriptorium (near) FUTURE Scripta PSL: eScriptorium LectauRep INRIA ANF openITI North-Eastern U Maryland ERC Vietnamica EPHE U-Bib Heidelberg ? National Library of Israel ? ENC Sorbonne Université DIM STCN Observatoir e de Paris IRHT H2020 Resilience • Search • Trainable reading order • Prototype for text annotation (NE, ecdotic) with TEI-Export • Prototype for image annotation (e.g. Digipal / Archetype) • manuscriptologIA High Performance Computing Cluster at mesoPSL • Customizable virtual keyboard • Vertical interface for Chinese • Automatic textalignment • Additional simplified interface • Improved project management • Crowdsourcing interface Biblissima+ TGIR Huma-Num
  • 21. Transcription created automatically without specific transcription BnF syr 341
  • 22.
  • 23.
  • 24. Judeo-Arabic+Hebrew, Ox. Bodl. Pococke 295, Maimonides, Mishnah Commentary
  • 25.
  • 26. Greek papyri (with WÜ, HD, B)
  • 27. Greek papyri (with WÜ, HD, B)
  • 28. eScriptorium used for Dead Sea Scroll Glyph alignment Automatic letter level alignment Images of Dead Sea Scrolls by Shay Halevy Courtesy Israel Antiquities Authority
  • 29.
  • 31. Please stay tuned for upcoming workshops Contact: daniel.stoekl@ephe.psl.eu, peter.stokes@ephe.psl.eu https://escripta.hypotheses.org Many thanks to Bibliothèque nationale de France National Library of Israel (Ktiv!) Bayerische Staatsbibliothek München Biblioteca Apostolica Vaticana Bodleian Library, Oxford Cambridge University Library Israel Antiquities Authority, Jerusalem Staatsbibliothek Berlin, Preußischer Kulturbesitz Intro tutorial: https://lectaurep.hypotheses.org/documentation/prendre-en-main-escriptorium