SlideShare ist ein Scribd-Unternehmen logo
1 von 68
INTERDISCIPLINARY CHALLENGE USING TEI FOR
BUILDING MULTILINGUAL DIGITAL CORPORA
A MAGHREB DH CASE STUDY
Mokhtar Ben Henda
Digital Humanities Institute – Beirut
American University of Beirut
2-6 March 2015
The general scope
 Creating a DH community of practice in
the Maghreb region
 Treat multilingual and interdisciplinary
resources
 Acquire new research and editing habits with
SH digital resources
 Using TEI as technical solution
 For its collegiate dynamics, open to non-
Western linguistic diversity
 For its accuracy to maintain, enrich and
document open guidelines on diversity
 For its ability to ensure the interoperability of
all produced resources
Delimiters
Multilingualism as a core question
Hybridization of
oral and written languages
in the Maghreb
Languages hybridization
 Use of classical or literary (scholar) Arabic in formal contexts :
 Administration, politics, media, education, religion…
 Dialects in daily life (trade, traditions, …)
 “Mother” tongue, “native” language ?
 Use of Arabic dialects:
 Morocco : Haḍarī (Fez, Rabat), Bedui (Marrakech), ‘Arūbī (west)
 Algeria: wahrani (West), Ksantini (East), Dziri (Center)
 Tunsia: Beldi (Tunis), Sahli (Center), Bedui (South)
 Ibn Žaldūn, al-Muqaddima, p.984 : (1377) «... Aboriginal Language
[Algerian] has taken over the Arabic language. The result is a mixed
language, in which Berber dominates and in which North African
dialects are farthest from Classical Arabic ».
Writing hybridization
A calendar page for November 1, 1895
with Ottoman-Turkish, Hebrew, Latin, Arabic scripts
Turkish and
Arabic manuscript
dictionary. 1201
AH [1786/87 AD].
Writing hybridization
Notary records in Arabic and French
(1342 Hg/1923 AD)
Strong claims of Berber as a
national language
Berber languages
 It is estimated that there are between 14 and 25 million speakers of
Berber languages
Tachelhit 3 million Morocco
Tamazight 3 million Morocco
Kabyle 2.5 to 6 million Algeria
Tarifit 1.7 million Morocco
Tachawit 1.4 million Algeria
Tamajaq 640,000 Niger
Tamasheq (Tuareg) 281,000 Mali
Tamajeq 250,000 Niger
Nafusi 167,000 Libya
Tamasheq has five varieties.
Tachelhit has a dialect called Susiua.
Tamazight has two major varieties: Central Altas and Southern Oran.
Tarifit has at least two dialects, one of which may be a separate language
Kabyle has two dialects: Greater and Lesser Kabyle.
Berber writing systems
 Berber languages are written with
Roman, Arabic and Tifinagh alphabets
The female name "Saida" written in tifinagh Berber language poetry in Arabic script
with its translation in French
Berber writing systems
 Kabylian manuscript of the XVIII Century
Berber « Rosetta » stone
Berber « Rosetta stone » from Dugga (Tunisia)
Phoenician and Tifinagh scripts (British Museum)
The Tifinagh alphabet is
thought to have been
derived from the Phoenician
alphabet.
Berber spreading in Maghreb
 60% in Morocco, 50% in Algeria, 2% in Tunisia
Berber recognition as a national language
 Algeria: constitutional amendment (2002)
Berber recognition as a national language
 Morocco « Constitutional amendment, July 2011)
IRCAM
IRCAM
 Tunisia
 New constitution 2014: recognizing minorities without nominating them
 NGOs acting for a constitutional amendment
So what ? … (hypothesis)
 A significant amount of resources in various
languages and writing systems have been
accumulated throughout years
 It does not exist today vast digital corpora of
texts in vernacular idioms and scripts (Unlike
many traditional textual resources : printed
documents and medieval manuscripts),
 But, since constitutional amendments
(Algeria and Morocco), there is a growing
awareness that these resources should be
collected, studied and digitized
A large reservoir of
unrecorded oral
heritage
A large reservoir of
popular music
A large reservoir of
multilingual city
printing
A large amount of
untreated notary
records
A large quantity of
manuscripts to be
digitized
How to subscribe to this challenge ?
 Need for a strategy
 Need for a working frame
 Need for competencies
 Need for data resources
 Need for … incentives !
Incentives
 DH general context in France
 THATCamp Paris 2010 : HD Manifest
 …
 III. Declaration
 5. We, professionals of the digital humanities, are building a
community of practice that is solidary, open, welcoming and
freely accessible.
 6. We are a community without borders. We are a multilingual
and multidisciplinary community.
 7. Our objectives are the advancement of knowledge, the
improvement of research quality in our disciplines, the
enrichment of knowledge and of collective patrimony, in the
academic sphere and beyond it.
 8. We call for the integration of digital culture in the definition
of the general culture of the twenty-first century.
http://tcp.hypotheses.org/411
Incentives
 France: 2014
Incentives
 Research projects experience
 BNFB (Bibliothèque Numérique Franco-
Berbère) : OIF/FFI Project - 2011 (100 000 €)
 Enrich and update, with automated procedures,
records repositories that describe and make
available documents in Berber languages
(Timbuktu manuscripts)
 IDEFI-CréaTIC: Unesco Chair-ITEN (2012) :
Innovative Training (5,2 million €)
 ISO standards:
 E-Learning vocabulary standards (SC36/WG1)
 E-Learning MLR (Metadata for Learning
standard)
 E-learning MLO (Metadata for learning
opportunities) (Afnor/GT1 mirror)
Incentives
 Research background
 Information and Communication Studies
 Multilingual data encoding standards
 Digital data processing
 Research affiliations
 Universities (Algeria, France, Morocco,
Tunisia)
 Research Laboratories
 Professional associations
 International Organizations
Key incentive
 Support from international TEI referees
 Laurent ROMARY : director of DARIAH & member of
the TEI council
 Lu BURNARD: Oxford Univ., godfather of TEI
Guidelines & consultant TGIR Huma-Num, France
Laurent ROMARY
Lu BURNARD
The roadmap
 Building partnerships
 Fund raising
 Building work environment
 Work organization
 Implementation, production and evaluation
Partnerships
 Individuals and institutions
Fund raising
 OIF/FFI (2010-2013)
 100 000 €
 BNFB project
 ISCC/CNRS (2012-2013)
 10 000 €
 HumanitéDigitMaghreb
 LEDEN, MSH Paris-Nord (2014-2015)
 3000 €
 TEI/MEI Euromed
 Chaire Unesco-ITEN (provisional)
 HD-MUREN
Work organization (structuring)
SH Corpus:
Document structures modeling
Poetry
Theatre Stories
Music Manuscript Speech
Fables
Others
Video
Peintings
Work environment
 Preferring
 A data description system ?
 Document centered system
 Document structuring
 to
 A Data base management
system
 Flat scanning
 Referencing metadata
Work environment
 oXygen
Work environment
 oXygen
 oXygen XML Editor supports a large number of text encodings.
 Bidirectional Text Support in Text Editing Mode (Unicode BIDI
Algorithm)
 Editing right-to left
XML documents
 Ability to create and
edit XML content in
Arabic
Work environment
http://www.tei-c.org/Roma/
Work environment
http://www.tei-c.org/oxgarage/
Work environment
 Image Markup Tool
Work environment
 Xaira (XML search engine)
 Explore a corpus language:
 Search words, forms, phrases or XML tags
 Analyze the way how they are used (frequency, stats, forms)
Work environment
 Omeka CMS platform
Work plan (provisional)
 Sharing tasks and responsibilities
 Describing roles
 Training sessions
 Resource analysis
 Structuring (segmentation)
 TEI/XML Encoding
 Collections organization
 Referencing (TeiHeader)
 DC indexing (metadata harvesting)
 Exploitation
 Content research
 Communication
Work organization (sharing)
SH Specialists
XML/TEI technicians
Info-Com. specialists
Arts & SH Corpus
Work organization (structuring)
 SHS SPECIALISTS identify logical structure of documents
according to the genres (Novel, poem, theatre, story, painting,
music, …)
Work organization (encoding)
 XML/TEI TECHNICIANS convert data models prepared by SH
specialists into TEI schemas using specialized software
XML files
Work organization (indexing)
 INFORMATION SPECIALISTS (librarians) add referencing data
(TEIHeader) and description metadata in the Data base
CMS / DB
Some examples (TEIVerse)
 Identify within a poem
 Verse metric
 Different types of verse
 Different verse clusters (verses,
quatrains)
 Patterns of stanzas
 Rhythm
 ...
The granularity may vary between large segments of
text and very tiny character strings
Some examples (TEIVerse)
<text>
<body>
<head>Heureux qui, comme Ulysse, a fait un beau voyage
</head>
<lg>
<l>Heureux qui, comme Ulysse, a fait un beau voyage </l>
<l> Ou comme cestuy-là qui conquit la toison </l>
<l> Et puis est retourné, plein d'usage et raison </l>
<l> Vivre entre ses parents le reste de son âge ! </l>
</lg>
<lg>
<l> Quand reverrai-je, hélas, de mon petit village </l>
<l> Fumer la cheminée, et en quelle saison </l>
<l> Reverrai-je le clos de ma pauvre maison </l>
<l> Qui m'est une province, et beaucoup davantage ?
</lg>
<lg>
<l> Plus me plaît le séjour qu'ont bâti mes aïeux </l>
<l> Que des palais Romains le front audacieux </l>
<l> Plus que le marbre dur me plaît l'ardoise fine : </l>
</lg>
<lg>
<l> Plus mon Loir gaulois, que le Tibre latin </l>
<l> Plus mon petit Liré, que le mont Palatin </l>
<l> Et plus que l'air marin la douceur angevine </l>
</lg>
</body>
</text>
<teiHeader>
<fileDesc>
<titleStmt>
<Title>Heureux qui comme
Ulysse</title>
<respStmt>
<resp>Auteur</resp>
<Name>Joacquim Du
Belay</name>
</respStmt>
</titleStmt>
<sourceDesc>
<Bibl>Recueil Les regrets
</bibl>
</sourceDesc>
</fileDesc>
</teiHeader>
<text>…
Some examples (TEIDrama)
 Identify within a piece of theatre:
Interpretation and production
Speech on stage
Stage direction (“didascalies”)
Types of exposure
Dénouement
Prologue
 …
 Interventions et indications scéniques
<sp who=“P”>
<speaker>Paul</speaker>
<p>Il fait chaud
aujourd’hui…</p>
</sp>
<stage>Paul allume une
cigarette</stage>
<surface> : a page, a stela, all objects
with inscription
 Surface contains zones and lines
 It has coordinates
<zone>: An area of the surface
determined arbitrarily for editorial
needs. Areas can overlap: overlaping is
defined according to spacial
coordinates
 Can contain <line>
 Disposes of coordinates
<line> : a continuum of text clearly
identified by editor
 Can contain text and <zone>
 Without coordinates
Some examples (TEIManuscript)
Some examples (TEIManuscript)
Some examples (TEIManuscript)
<sourceDoc>
<surface ulx="0" uly="0" lrx="200" lry="300">
<zone ulx="10" uly="43" lrx="185" lry="84"
rotate="0”>
<zone>
<line rend="right"> 1 April 2009</line>
</zone>
<line>Fed Birds in the park today.</line>
<line>Might write an articleabout</line>
<line>the Thick-billed Warbler.</line>
</zone>
<zone ulx="9" uly="20" lrx="70" lry="60"
rotate="90”>
<line>Samaria is a Greek</line>
<line>brand of water that</line>
<line>comes from the natural</line>
<line>springs of Stilos, in</line>
<line>Crete</line>
</zone>
</surface>
</sourceDoc>
<surface
ulx="0"
uly="0"
lrx="700"
lry="1000">
<!-- ... -->
</surface>
<zone
ulx="93"
uly="681"
lrx="967"
lry="1568">
<graphic url=« gb.jpg"/>
</zone>
Some examples (TEIManuscript)
<surface xml:id=“s1” ulx="0" uly="0"
lrx="50" lry="50">
<zone xml:id=“s1-z1” ulx="1" uly="1"
lrx="10" lry="10">
<line>Poem</line>
<!-- ... -->
<line>the head</line>
</zone>
<zone xml:id=“s1-z2” ulx="4" uly="4"
lrx="20" lry="20">
<surface xml:id=“s2” ulx="0" uly="0"
lrx="100" lry="100">
<zone xml:id=“s2-z1” ulx="10"
uly="10" lrx="90" lry="95"> Spring has just
set in here, and the weather […] a steamer
</zone>
</surface>
</zone>
</surface>
s1
s1-z1
s1-z2
s2
s2-z1
Some examples (Graph)
 Image Markup Tool
Some examples (Graph)
<teiHeader>
<fileDesc>
<titleStmt>
<title>The Image Markup Logo</title>
</titleStmt>
<publicationStmt>
<p></p>
</publicationStmt>
<sourceDesc>
<p>377 x 259</p>
</sourceDesc>
</fileDesc>
<encodingDesc>
</encodingDesc>
</teiHeader>
Some examples (Graph)
 Image Markup Tool
Some encoding examples (TEISpeech)
 Encoding speech according to a
« Timeline »
 Possibility to describe different
speech phenomena of oral
communication and sounds
during a speech
 pauses, words overlaps, tonal
changes, vocal expressions (coughs,
laughs, sneezing, …), gestures etc.
© The Music Encoding Initiative. http://music-encoding.org/
Some examples (MEI)
A tunisian song encoded in TEI <TeiHeader>
Métadonnées(TEIHeader)Espacedenom
A tunisian song encoded in TEI <text>
‫الليل‬ ‫في‬ ‫الياسمينة‬ ‫تحت‬
‫نسمة‬‫والورد‬‫محاذيني‬
‫األغصان‬‫تــــميل‬ ‫عليا‬
‫تمسحلي‬‫دمعة‬ ‫في‬‫عيني‬
‫تحت‬‫الياسمينة‬‫اتكيت‬
‫عدلت‬‫وغنيت‬ ‫الـــعود‬
‫وتناطر‬‫وبكيت‬ ‫دمعي‬
‫تفكرتك‬‫كنت‬ ‫كيف‬‫تجيني‬
‫جنينة‬‫النوار‬ ‫مزينها‬
‫فاحت‬‫األزهار‬ ‫ريحت‬ ‫من‬
‫تفكرتك‬‫النار‬ ‫شعــــلت‬
‫عملت‬‫لهليبة‬‫في‬‫قليبي‬
‫متوحش‬‫محتار‬ ‫وحدي‬
‫ال‬‫أطيار‬ ‫حس‬ ‫ال‬ ‫و‬ ‫قمرة‬
‫كان‬‫األشجار‬ ‫ع‬ ‫النسمة‬
‫توانس‬‫فيا‬‫وتواسيني‬
Stanza(Quatrain)
A tunisian song encoded in MEI <music>
A Arabic/French fable encoded with Oxygen
59
A TEI-based critical apparat of an Arabic document
Titre du document : ‫ّم‬‫ج‬‫المن‬ ‫بن‬ ‫يحيى‬ ‫رسالة‬ [Epître de Yahya ibn al-Munajjm sur la musique]
Nature : Fichier Word (converti en TEI P5 XML par Oxgarage)
Langue : Arabe
Genre littéraire : Catalogue de manuscrit de musique
Réalisation : Anas GHRAB
Schéma TEI : All
Extrait du bloc
<TEIHeader> :
<teiHeader>
<fileDesc xml:idx="001">
<titleStmt>
<title type="full">
<title type="main">‫ّم‬‫ج‬‫المن‬ ‫بن‬ ‫يحيى‬ ‫/<رسالة‬title>
<title type="sub">Titre translitéré</title>
<title type="alt">Epître de Yahya ibn al-Munajjm sur la musique</title>
</title>
<author>
<persName>
<forename>‫/<يحيى‬forename>
<genName>‫يحيى‬ ‫/<بن‬genName>
<surname>‫ّم‬‫ج‬‫المن‬ ‫/<بن‬surname>
<addName></addName>
</persName>
<date>...
<placeName>
<settlement type="city">...</settlement>
<country>...</country>
</placeName>
</date>
<date>...
<placeName>
<settlement type="city">...</settlement>
<country>...</country>
</placeName>
</date>
</author>
<respStmt>
<resp>‫/<تحقيق‬resp>
<persName>‫غراب‬ ‫/<أنس‬persName>
</respStmt>
</titleStmt>
<publicationStmt>
Extrait du bloc
<Text>
<text>
<front>
<div type="abstract">
<head>‫الكتاب‬ ‫محتوى‬ ‫/<وصف‬head>
</div>
<div type="contents">
<head>‫الكتاب‬ ‫/<فهرس‬head>
</div>
</front>
<body>
<p xml:idx="001">
ِ‫م‬‫ِي‬‫ح‬‫ه‬‫ر‬‫ال‬ ِ‫ن‬‫َا‬‫م‬ْ‫هح‬‫ر‬‫ال‬ ِ ‫ه‬‫اَّلل‬ ِ‫م‬ ْ‫س‬ِ‫ب‬
</p>
<p xml:idx="002">
‫ال‬ ‫ِي‬‫ف‬ ّ َ‫اَّلل‬ِ‫ب‬ ‫ِد‬‫ض‬َ‫ت‬‫ُع‬‫م‬‫ال‬ َ‫منين‬ْ‫ؤ‬ُ‫م‬‫ال‬ ِ‫ِير‬‫م‬َ‫أ‬ ‫ى‬َ‫َول‬‫م‬ ِ‫م‬ِّ‫ج‬َ‫ن‬ُ‫م‬‫ال‬ ‫َى‬‫ي‬ْ‫َح‬‫ي‬ ُ‫بن‬ ّ‫ي‬ِ‫ل‬َ‫ع‬ ُ‫بن‬ ‫َى‬‫ي‬ْ‫َح‬‫ي‬ ُ‫ة‬َ‫ال‬ َ‫س‬ِ‫ر‬‫َى‬‫ق‬‫ِي‬‫س‬‫ُو‬‫م‬
</p>
<p xml:idx="003">
ْ‫ي‬َ‫ل‬َ‫ع‬ َ‫ون‬ُ‫َك‬‫ي‬ ْ‫ن‬َ‫أ‬ ُ‫ب‬ِ‫ج‬َ‫ي‬ ‫َا‬‫م‬َ‫و‬ ،‫ِي‬ّ‫ن‬َ‫غ‬ُ‫م‬‫ال‬ َ‫ة‬َ‫ف‬ِ‫ص‬ ‫َا‬‫ذ‬َ‫ه‬ َ‫ل‬ْ‫ب‬َ‫ق‬ ‫ِي‬‫ذ‬‫ه‬‫ال‬ ‫َا‬‫ن‬ِ‫ب‬‫َا‬‫ت‬ِ‫ك‬ ‫ِي‬‫ف‬ ‫َا‬‫ن‬ْ‫ر‬َ‫َك‬‫ذ‬ ْ‫َد‬‫ق‬ ُ‫ل‬‫و‬ُ‫َق‬‫ن‬َ‫م‬ ‫َا‬‫ن‬ْ‫ف‬َ‫ص‬َ‫و‬َ‫و‬ ،ِ‫ه‬َ‫ر‬ْ‫م‬َ‫أ‬ َ‫اآلن‬ ُ‫ِف‬‫ص‬َ‫ن‬َ‫و‬ ،َ‫ك‬ِ‫ل‬َ‫ذ‬ ْ‫ِن‬‫م‬ ً‫ة‬َ‫ي‬‫َا‬‫ه‬ِ‫ن‬ ِ‫ه‬‫ِي‬‫ف‬ ‫ا‬
ٍ‫ع‬ِ‫ب‬ْ‫ص‬‫ا‬ َ‫ع‬ِ‫ق‬‫َا‬‫و‬َ‫م‬َ‫و‬ ، ُ‫ِف‬‫ل‬َ‫ت‬ْ‫َخ‬‫ي‬َ‫و‬ ‫َا‬‫ه‬ْ‫ن‬ِ‫م‬ ُ‫ِف‬‫ل‬َ‫ت‬ْ‫َأ‬‫ي‬ ‫َا‬‫م‬َ‫و‬ ،‫َا‬‫ه‬َ‫د‬َ‫د‬َ‫ع‬َ‫و‬ ِ‫م‬َ‫غ‬‫ه‬‫ن‬‫ال‬ٍ‫ع‬ِ‫ب‬ْ‫ص‬‫ا‬ٍ‫َر‬‫ت‬َ‫و‬ ْ‫ِن‬‫م‬ٍ‫َر‬‫ت‬َ‫و‬َ‫و‬ ،ُ‫ه‬‫ا‬‫ه‬‫م‬ َ‫س‬ ‫َا‬‫م‬ ُ‫ن‬ِّ‫ي‬َ‫ب‬ُ‫ن‬َ‫و‬ ،ٍ‫ن‬‫َا‬‫ت‬ ْ‫َس‬‫د‬ ِّ‫ل‬ُ‫ك‬ ْ‫ِن‬‫م‬ ٍ‫ة‬َ‫م‬ْ‫غ‬َ‫ن‬ ِّ‫ل‬ُ‫ك‬ َ‫ع‬ِ‫ض‬ْ‫و‬َ‫م‬
‫ِي‬‫ل‬ِ‫ص‬ْ‫و‬َ‫م‬‫ال‬ َ‫م‬‫ِي‬‫ه‬‫َا‬‫ر‬ْ‫ب‬‫ا‬ ُ‫بن‬ ُ‫َاق‬‫ح‬ ْ‫س‬ِ‫إ‬«‫َى‬‫ر‬ْ‫َج‬‫م‬‫ال‬»ْ‫ُس‬‫و‬‫ال‬ ‫َى‬‫ر‬ْ‫َج‬‫م‬ِ‫ب‬ ‫َا‬‫ه‬َ‫ض‬ْ‫ع‬َ‫ب‬ َ‫م‬ َ‫َس‬‫ر‬ ‫ِي‬‫ت‬‫ه‬‫ال‬ ِ‫ت‬‫َا‬‫و‬ْ‫ص‬َ‫األ‬ ‫ِي‬‫ف‬،ِ‫َر‬‫ص‬ْ‫ن‬ِ‫ب‬‫ال‬ ‫َى‬‫ر‬ْ‫َج‬‫م‬ِ‫ب‬ ‫َا‬‫ه‬َ‫ض‬ْ‫ع‬َ‫ب‬َ‫و‬ ،‫ى‬َ‫ط‬
ْ‫هن‬‫م‬ِ‫م‬ ‫ِه‬‫ئ‬‫َا‬‫ر‬َ‫ظ‬ُ‫ن‬َ‫و‬ َ‫َاق‬‫ح‬ ْ‫س‬ِ‫إ‬ َ‫ل‬ْ‫ث‬ِ‫م‬ ِّ‫ي‬ِ‫ب‬َ‫ر‬َ‫ع‬‫ال‬ ِ‫ء‬‫َا‬‫ن‬ِ‫غ‬‫ال‬ ِ‫ب‬‫َا‬‫ح‬ْ‫ص‬َ‫أ‬ َ‫ْن‬‫ي‬َ‫ب‬ ‫َا‬‫م‬ َ‫ف‬ َ‫َِل‬‫ت‬ْ‫َاخ‬‫و‬
<app>
<lem>َ‫ع‬َ‫م‬َ‫ج‬</lem>
<rdg wit="#Ram">‫/<جميع‬rdg>
</app>
َ‫د‬َ‫ع‬ ‫ِي‬‫ف‬ ِ‫ء‬‫َا‬‫م‬َ‫د‬ُ‫الق‬ ِ‫ة‬َ‫ف‬ِ‫س‬ َ‫ََل‬‫ف‬‫ال‬ َ‫ِن‬‫م‬ ‫َى‬‫ق‬‫ِي‬‫س‬‫ُو‬‫م‬‫ال‬ ِ‫ب‬‫َا‬‫ح‬ْ‫ص‬َ‫أ‬ َ‫ْن‬‫ي‬َ‫ب‬َ‫و‬ ،ِ‫َل‬‫م‬َ‫ع‬‫َال‬‫و‬ ِ‫ة‬َ‫ع‬‫َا‬‫ن‬ِّ‫ص‬‫ال‬ِ‫ب‬ َ‫م‬ْ‫ِل‬‫ع‬‫ال‬َ‫َان‬‫ك‬ ْ‫ذ‬ِ‫إ‬ ِ‫م‬َ‫غ‬‫ه‬‫ن‬‫ال‬ ِ‫د‬‫ه‬‫ن‬َ‫أ‬ َ‫ُون‬‫م‬َ‫ع‬ْ‫ز‬َ‫ي‬ ُ‫ه‬ُ‫َاؤ‬‫ر‬َ‫ظ‬ُ‫ن‬َ‫و‬ ُ‫َاق‬‫ح‬ ْ‫س‬ِ‫إ‬
،َ‫ة‬َ‫ر‬ َ‫َش‬‫ع‬ ‫ِي‬‫ن‬‫َا‬‫م‬َ‫ث‬ ‫َا‬‫ه‬‫ه‬‫ن‬َ‫أ‬ َ‫ُون‬‫م‬َ‫ع‬ْ‫ز‬َ‫ي‬ ‫َى‬‫ق‬‫ِي‬‫س‬‫ُو‬‫م‬‫ال‬ ُ‫َاب‬‫ح‬ْ‫ص‬َ‫أ‬ َ‫َان‬‫ك‬َ‫و‬ ٌ‫ر‬ ْ‫َش‬‫ع‬ َ‫م‬َ‫غ‬‫ه‬‫ن‬‫ال‬
<app>
<lem>.َ‫م‬‫َا‬‫غ‬ْ‫ن‬َ‫َاأل‬‫و‬ َ‫َان‬‫ح‬ْ‫ل‬َ‫األ‬ ُ‫ع‬َ‫م‬ْ‫َج‬‫ي‬ ‫َا‬‫م‬ِ‫ل‬ ٌ‫ح‬ْ‫ر‬ َ‫ش‬ ‫َا‬‫ه‬َ‫د‬ْ‫ع‬َ‫ب‬َ‫و‬</lem>
<rdg wit="#Ram"> ‫يجمع‬ ‫ما‬ ‫شرح‬ ‫وبعده‬‫االلحار‬‫واالفهام‬ </rdg>
<rdg wit="#BM"> ‫يجتمع‬ ‫بما‬ ‫شرح‬ ‫ونقدمه‬‫االلحار‬‫واالفهام‬ </rdg>
<rdg wit="#Zak">‫واألنغام‬ ‫األلحان‬ ‫يجتمع‬ ‫ما‬ ‫شرح‬ ‫/<ونقدم‬rdg>
</app>
Critical diagnosis
 Facing many problems
 DH & TEI are very new concepts for academic community
(resistance)
 Vertical clustering of disciplines in the same university
 Some cultural and linguistic sensitiveness observed between
partners
 Need for more training and interdisciplinary
research
 Nomadic training team
 Group publishing
The stylistic notes by Rétif de la Bretonne: a hypertext before digital era
Henri HUDRISIER (1), Sofia BEZINA (2), Hichem ISMAIL (3), Rachid ZGHIBI (4), Sihem
ZGHIDI (5), Loula Abd-elrazak (6), Laurent ROMARY (7), Mokhtar BEN HENDA (8),
Arnaud LABORDERIE (9), Ghislaine AZEMARD (10)
General orientation
 A strategy of support and orientation
 French translation of the full version of "TEI P5
Guidelines" (1500 pages counting appendices
including about 600 pages are translated to
date)
 An Arabic TEI P5 version is also under attempt
Rachid ZGHIBI, Tunis
Henri HUDRISIER, PAris8
General orientation
 Further research projects
 2014-2015: Project & MEI TEI-Euromed
(Encoding with MEI Maghreb traditional
music)
Sylvaine MARTIN,
LEDEN, MSH Paris Nord
General orientation
 Further research projects
 2012: The Unesco Chair-ITEN:
“Innovation, Transmission, Edition
Numérique”
 Bringing academic and entrepreneurial
partners together to discuss the
contemporary changes education
related to the effects of digital
innovation
 2012: IDEFI-CréaTIC project
 budget of 5,2 M€ on 8 years
 17 training programs
 24 workshops open to more than 500
students
HD-Muren
 Implement and test an environment where different stakeholders
in the academic communities learn together about new
computing technologies and their impact on the humanities
 DH & TEI at school?
General orientation
 More positioning within international
communities of practice in the DH
 Humanistica (2014 membership)
 ThatCamps: St Malo, 2013 / Lyon 2014 /
Beirut, 2015 / Paris 2015)
Thank you for your attention

Weitere ähnliche Inhalte

Ähnlich wie TEI for building multilingual corpora

Europeana Regia presentation at eChallenges 2011 conference
Europeana Regia presentation at eChallenges 2011 conferenceEuropeana Regia presentation at eChallenges 2011 conference
Europeana Regia presentation at eChallenges 2011 conferenceEuropeana Regia
 
03_ For a sociology of translator For a sociology of translator.ppt
03_ For a sociology of translator For a sociology of translator.ppt03_ For a sociology of translator For a sociology of translator.ppt
03_ For a sociology of translator For a sociology of translator.pptnzroghani
 
Promoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language TechnologyPromoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language Technologytechiaith
 
Building the Biblissima Observatory
Building the Biblissima ObservatoryBuilding the Biblissima Observatory
Building the Biblissima ObservatoryEquipex Biblissima
 
Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...
Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...
Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...Equipex Biblissima
 
Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...
Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...
Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...SSHOC
 
Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...
Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...
Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...OpenEdition
 
Laura Welcher - The Rosetta Project and The Language Commons
Laura Welcher - The Rosetta Project and The Language CommonsLaura Welcher - The Rosetta Project and The Language Commons
Laura Welcher - The Rosetta Project and The Language Commonslongnow
 
Multilingual challenges in Europeana
Multilingual challenges in EuropeanaMultilingual challenges in Europeana
Multilingual challenges in EuropeanaAntoine Isaac
 
Biblissima and data modelling - EuropeanaTech 2015 Paris
Biblissima and data modelling - EuropeanaTech 2015 ParisBiblissima and data modelling - EuropeanaTech 2015 Paris
Biblissima and data modelling - EuropeanaTech 2015 ParisEquipex Biblissima
 
Challenges and opportunities of aggregating multiple databases in Europeana
Challenges and opportunities of aggregating multiple databases in EuropeanaChallenges and opportunities of aggregating multiple databases in Europeana
Challenges and opportunities of aggregating multiple databases in EuropeanaDavid Haskiya
 
Europeana 2019 - Connect Communities
Europeana 2019 - Connect CommunitiesEuropeana 2019 - Connect Communities
Europeana 2019 - Connect CommunitiesEuropeana
 
Challenges for the Language Technology Industry
Challenges for the Language Technology IndustryChallenges for the Language Technology Industry
Challenges for the Language Technology IndustryAntoine Isaac
 
TraduXio project - Cosi10
TraduXio project - Cosi10TraduXio project - Cosi10
TraduXio project - Cosi10PhilippeLacour
 
LIBER. The power of Association
LIBER. The power of AssociationLIBER. The power of Association
LIBER. The power of AssociationLIBER Europe
 
(Un)writing the histories of Humanities Computing(s)
(Un)writing the histories of Humanities Computing(s)(Un)writing the histories of Humanities Computing(s)
(Un)writing the histories of Humanities Computing(s)Edward Vanhoutte
 
Learning and Text Analysis for Ontology Engineering
Learning and Text Analysis for Ontology EngineeringLearning and Text Analysis for Ontology Engineering
Learning and Text Analysis for Ontology Engineeringbutest
 
Plurilingualism and intercomprehension teaching forum 2012
Plurilingualism and intercomprehension teaching forum 2012Plurilingualism and intercomprehension teaching forum 2012
Plurilingualism and intercomprehension teaching forum 2012Tita Beaven
 
Digital text as a phenomenon of culture
Digital text as a phenomenon of cultureDigital text as a phenomenon of culture
Digital text as a phenomenon of cultureTeodora Petkova
 

Ähnlich wie TEI for building multilingual corpora (20)

Europeana Regia presentation at eChallenges 2011 conference
Europeana Regia presentation at eChallenges 2011 conferenceEuropeana Regia presentation at eChallenges 2011 conference
Europeana Regia presentation at eChallenges 2011 conference
 
03_ For a sociology of translator For a sociology of translator.ppt
03_ For a sociology of translator For a sociology of translator.ppt03_ For a sociology of translator For a sociology of translator.ppt
03_ For a sociology of translator For a sociology of translator.ppt
 
Promoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language TechnologyPromoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language Technology
 
Building the Biblissima Observatory
Building the Biblissima ObservatoryBuilding the Biblissima Observatory
Building the Biblissima Observatory
 
Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...
Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...
Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...
 
Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...
Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...
Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...
 
Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...
Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...
Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...
 
Laura Welcher - The Rosetta Project and The Language Commons
Laura Welcher - The Rosetta Project and The Language CommonsLaura Welcher - The Rosetta Project and The Language Commons
Laura Welcher - The Rosetta Project and The Language Commons
 
Multilingual challenges in Europeana
Multilingual challenges in EuropeanaMultilingual challenges in Europeana
Multilingual challenges in Europeana
 
Biblissima and data modelling - EuropeanaTech 2015 Paris
Biblissima and data modelling - EuropeanaTech 2015 ParisBiblissima and data modelling - EuropeanaTech 2015 Paris
Biblissima and data modelling - EuropeanaTech 2015 Paris
 
Challenges and opportunities of aggregating multiple databases in Europeana
Challenges and opportunities of aggregating multiple databases in EuropeanaChallenges and opportunities of aggregating multiple databases in Europeana
Challenges and opportunities of aggregating multiple databases in Europeana
 
Europeana 2019 - Connect Communities
Europeana 2019 - Connect CommunitiesEuropeana 2019 - Connect Communities
Europeana 2019 - Connect Communities
 
Challenges for the Language Technology Industry
Challenges for the Language Technology IndustryChallenges for the Language Technology Industry
Challenges for the Language Technology Industry
 
TraduXio project - Cosi10
TraduXio project - Cosi10TraduXio project - Cosi10
TraduXio project - Cosi10
 
LIBER. The power of Association
LIBER. The power of AssociationLIBER. The power of Association
LIBER. The power of Association
 
(Un)writing the histories of Humanities Computing(s)
(Un)writing the histories of Humanities Computing(s)(Un)writing the histories of Humanities Computing(s)
(Un)writing the histories of Humanities Computing(s)
 
Introducing DARIAH
Introducing DARIAHIntroducing DARIAH
Introducing DARIAH
 
Learning and Text Analysis for Ontology Engineering
Learning and Text Analysis for Ontology EngineeringLearning and Text Analysis for Ontology Engineering
Learning and Text Analysis for Ontology Engineering
 
Plurilingualism and intercomprehension teaching forum 2012
Plurilingualism and intercomprehension teaching forum 2012Plurilingualism and intercomprehension teaching forum 2012
Plurilingualism and intercomprehension teaching forum 2012
 
Digital text as a phenomenon of culture
Digital text as a phenomenon of cultureDigital text as a phenomenon of culture
Digital text as a phenomenon of culture
 

Mehr von Mokhtar Ben Henda

Webinaire de formation sur les REL
Webinaire de formation sur les RELWebinaire de formation sur les REL
Webinaire de formation sur les RELMokhtar Ben Henda
 
La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...
La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...
La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...Mokhtar Ben Henda
 
Modalités & outils d’évaluation dans l’apprentissages en ligne
Modalités & outils d’évaluation dans l’apprentissages en ligneModalités & outils d’évaluation dans l’apprentissages en ligne
Modalités & outils d’évaluation dans l’apprentissages en ligneMokhtar Ben Henda
 
Écosystème d’ensemble d’un dispositif de formation hybride
Écosystème d’ensemble d’un dispositif de formation hybride Écosystème d’ensemble d’un dispositif de formation hybride
Écosystème d’ensemble d’un dispositif de formation hybride Mokhtar Ben Henda
 
Évaluation & Valorisation des résultats de la recherche
Évaluation & Valorisation des résultats de la rechercheÉvaluation & Valorisation des résultats de la recherche
Évaluation & Valorisation des résultats de la rechercheMokhtar Ben Henda
 
Modeling a Framework for Production and Dissemination of Open Educational Res...
Modeling a Framework for Production and Dissemination of Open Educational Res...Modeling a Framework for Production and Dissemination of Open Educational Res...
Modeling a Framework for Production and Dissemination of Open Educational Res...Mokhtar Ben Henda
 
Transformation Numerique à l'UEL (Vietnam)
Transformation Numerique à l'UEL (Vietnam)Transformation Numerique à l'UEL (Vietnam)
Transformation Numerique à l'UEL (Vietnam)Mokhtar Ben Henda
 
Classe inversée au TLU, Hanoï
Classe inversée au TLU, HanoïClasse inversée au TLU, Hanoï
Classe inversée au TLU, HanoïMokhtar Ben Henda
 
Formation Hybride au TLU, Hanoï
Formation Hybride au TLU, HanoïFormation Hybride au TLU, Hanoï
Formation Hybride au TLU, HanoïMokhtar Ben Henda
 
Gender-Equality_Scenarios.pptx
Gender-Equality_Scenarios.pptxGender-Equality_Scenarios.pptx
Gender-Equality_Scenarios.pptxMokhtar Ben Henda
 
Normes technologiques internationales : potentiels d’Innovation et de créativ...
Normes technologiques internationales : potentiels d’Innovation et de créativ...Normes technologiques internationales : potentiels d’Innovation et de créativ...
Normes technologiques internationales : potentiels d’Innovation et de créativ...Mokhtar Ben Henda
 
Modeling 30% online & 70% face-to-face learning
Modeling 30% online & 70% face-to-face learningModeling 30% online & 70% face-to-face learning
Modeling 30% online & 70% face-to-face learningMokhtar Ben Henda
 
Webinaires AUF ENS-Luang Prabang Laos
Webinaires AUF ENS-Luang Prabang LaosWebinaires AUF ENS-Luang Prabang Laos
Webinaires AUF ENS-Luang Prabang LaosMokhtar Ben Henda
 
مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...
مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...
مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...Mokhtar Ben Henda
 
Open education between the specifications of Creative Commons and Copyright
Open education between the specifications of Creative Commons and CopyrightOpen education between the specifications of Creative Commons and Copyright
Open education between the specifications of Creative Commons and CopyrightMokhtar Ben Henda
 
Les fondamentaux de la classe inversée
Les fondamentaux de la classe inverséeLes fondamentaux de la classe inversée
Les fondamentaux de la classe inverséeMokhtar Ben Henda
 
L’intégrité académique en relation avec l’offre de formation d’un établisseme...
L’intégrité académique en relation avec l’offre de formation d’un établisseme...L’intégrité académique en relation avec l’offre de formation d’un établisseme...
L’intégrité académique en relation avec l’offre de formation d’un établisseme...Mokhtar Ben Henda
 
Animation des activités d'une classe virtuelle
Animation des activités d'une classe virtuelleAnimation des activités d'une classe virtuelle
Animation des activités d'une classe virtuelleMokhtar Ben Henda
 

Mehr von Mokhtar Ben Henda (20)

Webinaire de formation sur les REL
Webinaire de formation sur les RELWebinaire de formation sur les REL
Webinaire de formation sur les REL
 
La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...
La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...
La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...
 
Modalités & outils d’évaluation dans l’apprentissages en ligne
Modalités & outils d’évaluation dans l’apprentissages en ligneModalités & outils d’évaluation dans l’apprentissages en ligne
Modalités & outils d’évaluation dans l’apprentissages en ligne
 
Écosystème d’ensemble d’un dispositif de formation hybride
Écosystème d’ensemble d’un dispositif de formation hybride Écosystème d’ensemble d’un dispositif de formation hybride
Écosystème d’ensemble d’un dispositif de formation hybride
 
Évaluation & Valorisation des résultats de la recherche
Évaluation & Valorisation des résultats de la rechercheÉvaluation & Valorisation des résultats de la recherche
Évaluation & Valorisation des résultats de la recherche
 
Modeling a Framework for Production and Dissemination of Open Educational Res...
Modeling a Framework for Production and Dissemination of Open Educational Res...Modeling a Framework for Production and Dissemination of Open Educational Res...
Modeling a Framework for Production and Dissemination of Open Educational Res...
 
Transformation Numerique à l'UEL (Vietnam)
Transformation Numerique à l'UEL (Vietnam)Transformation Numerique à l'UEL (Vietnam)
Transformation Numerique à l'UEL (Vietnam)
 
Classe inversée au TLU, Hanoï
Classe inversée au TLU, HanoïClasse inversée au TLU, Hanoï
Classe inversée au TLU, Hanoï
 
Classe virtuelle au TLU
Classe virtuelle au TLUClasse virtuelle au TLU
Classe virtuelle au TLU
 
Formation Hybride au TLU, Hanoï
Formation Hybride au TLU, HanoïFormation Hybride au TLU, Hanoï
Formation Hybride au TLU, Hanoï
 
Ressouces Éducatives
Ressouces ÉducativesRessouces Éducatives
Ressouces Éducatives
 
Gender-Equality_Scenarios.pptx
Gender-Equality_Scenarios.pptxGender-Equality_Scenarios.pptx
Gender-Equality_Scenarios.pptx
 
Normes technologiques internationales : potentiels d’Innovation et de créativ...
Normes technologiques internationales : potentiels d’Innovation et de créativ...Normes technologiques internationales : potentiels d’Innovation et de créativ...
Normes technologiques internationales : potentiels d’Innovation et de créativ...
 
Modeling 30% online & 70% face-to-face learning
Modeling 30% online & 70% face-to-face learningModeling 30% online & 70% face-to-face learning
Modeling 30% online & 70% face-to-face learning
 
Webinaires AUF ENS-Luang Prabang Laos
Webinaires AUF ENS-Luang Prabang LaosWebinaires AUF ENS-Luang Prabang Laos
Webinaires AUF ENS-Luang Prabang Laos
 
مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...
مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...
مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...
 
Open education between the specifications of Creative Commons and Copyright
Open education between the specifications of Creative Commons and CopyrightOpen education between the specifications of Creative Commons and Copyright
Open education between the specifications of Creative Commons and Copyright
 
Les fondamentaux de la classe inversée
Les fondamentaux de la classe inverséeLes fondamentaux de la classe inversée
Les fondamentaux de la classe inversée
 
L’intégrité académique en relation avec l’offre de formation d’un établisseme...
L’intégrité académique en relation avec l’offre de formation d’un établisseme...L’intégrité académique en relation avec l’offre de formation d’un établisseme...
L’intégrité académique en relation avec l’offre de formation d’un établisseme...
 
Animation des activités d'une classe virtuelle
Animation des activités d'une classe virtuelleAnimation des activités d'une classe virtuelle
Animation des activités d'une classe virtuelle
 

Kürzlich hochgeladen

Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024eCommerce Institute
 
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxMohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxmohammadalnahdi22
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024eCommerce Institute
 
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...henrik385807
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Kayode Fayemi
 
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...Sheetaleventcompany
 
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )Pooja Nehwal
 
Presentation on Engagement in Book Clubs
Presentation on Engagement in Book ClubsPresentation on Engagement in Book Clubs
Presentation on Engagement in Book Clubssamaasim06
 
Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Vipesco
 
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfCTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfhenrik385807
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Delhi Call girls
 
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxNikitaBankoti2
 
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, YardstickSaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, Yardsticksaastr
 
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Hasting Chen
 
Mathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptxMathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptxMoumonDas2
 
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyCall Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyPooja Nehwal
 
Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Chameera Dedduwage
 
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Pooja Nehwal
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AITatiana Gurgel
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesPooja Nehwal
 

Kürzlich hochgeladen (20)

Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
 
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxMohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024
 
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
 
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
 
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
 
Presentation on Engagement in Book Clubs
Presentation on Engagement in Book ClubsPresentation on Engagement in Book Clubs
Presentation on Engagement in Book Clubs
 
Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510
 
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfCTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
 
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
 
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, YardstickSaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
 
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
 
Mathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptxMathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptx
 
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyCall Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
 
Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)
 
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AI
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
 

TEI for building multilingual corpora

  • 1. INTERDISCIPLINARY CHALLENGE USING TEI FOR BUILDING MULTILINGUAL DIGITAL CORPORA A MAGHREB DH CASE STUDY Mokhtar Ben Henda Digital Humanities Institute – Beirut American University of Beirut 2-6 March 2015
  • 2. The general scope  Creating a DH community of practice in the Maghreb region  Treat multilingual and interdisciplinary resources  Acquire new research and editing habits with SH digital resources  Using TEI as technical solution  For its collegiate dynamics, open to non- Western linguistic diversity  For its accuracy to maintain, enrich and document open guidelines on diversity  For its ability to ensure the interoperability of all produced resources
  • 3. Delimiters Multilingualism as a core question Hybridization of oral and written languages in the Maghreb
  • 4. Languages hybridization  Use of classical or literary (scholar) Arabic in formal contexts :  Administration, politics, media, education, religion…  Dialects in daily life (trade, traditions, …)  “Mother” tongue, “native” language ?  Use of Arabic dialects:  Morocco : Haḍarī (Fez, Rabat), Bedui (Marrakech), ‘Arūbī (west)  Algeria: wahrani (West), Ksantini (East), Dziri (Center)  Tunsia: Beldi (Tunis), Sahli (Center), Bedui (South)  Ibn Žaldūn, al-Muqaddima, p.984 : (1377) «... Aboriginal Language [Algerian] has taken over the Arabic language. The result is a mixed language, in which Berber dominates and in which North African dialects are farthest from Classical Arabic ».
  • 5. Writing hybridization A calendar page for November 1, 1895 with Ottoman-Turkish, Hebrew, Latin, Arabic scripts Turkish and Arabic manuscript dictionary. 1201 AH [1786/87 AD].
  • 6. Writing hybridization Notary records in Arabic and French (1342 Hg/1923 AD) Strong claims of Berber as a national language
  • 7. Berber languages  It is estimated that there are between 14 and 25 million speakers of Berber languages Tachelhit 3 million Morocco Tamazight 3 million Morocco Kabyle 2.5 to 6 million Algeria Tarifit 1.7 million Morocco Tachawit 1.4 million Algeria Tamajaq 640,000 Niger Tamasheq (Tuareg) 281,000 Mali Tamajeq 250,000 Niger Nafusi 167,000 Libya Tamasheq has five varieties. Tachelhit has a dialect called Susiua. Tamazight has two major varieties: Central Altas and Southern Oran. Tarifit has at least two dialects, one of which may be a separate language Kabyle has two dialects: Greater and Lesser Kabyle.
  • 8. Berber writing systems  Berber languages are written with Roman, Arabic and Tifinagh alphabets The female name "Saida" written in tifinagh Berber language poetry in Arabic script with its translation in French
  • 9. Berber writing systems  Kabylian manuscript of the XVIII Century
  • 10. Berber « Rosetta » stone Berber « Rosetta stone » from Dugga (Tunisia) Phoenician and Tifinagh scripts (British Museum) The Tifinagh alphabet is thought to have been derived from the Phoenician alphabet.
  • 11. Berber spreading in Maghreb  60% in Morocco, 50% in Algeria, 2% in Tunisia
  • 12. Berber recognition as a national language  Algeria: constitutional amendment (2002)
  • 13. Berber recognition as a national language  Morocco « Constitutional amendment, July 2011)
  • 14. IRCAM
  • 15. IRCAM
  • 16.  Tunisia  New constitution 2014: recognizing minorities without nominating them  NGOs acting for a constitutional amendment
  • 17.
  • 18. So what ? … (hypothesis)  A significant amount of resources in various languages and writing systems have been accumulated throughout years  It does not exist today vast digital corpora of texts in vernacular idioms and scripts (Unlike many traditional textual resources : printed documents and medieval manuscripts),  But, since constitutional amendments (Algeria and Morocco), there is a growing awareness that these resources should be collected, studied and digitized A large reservoir of unrecorded oral heritage A large reservoir of popular music A large reservoir of multilingual city printing A large amount of untreated notary records A large quantity of manuscripts to be digitized
  • 19. How to subscribe to this challenge ?  Need for a strategy  Need for a working frame  Need for competencies  Need for data resources  Need for … incentives !
  • 20. Incentives  DH general context in France  THATCamp Paris 2010 : HD Manifest  …  III. Declaration  5. We, professionals of the digital humanities, are building a community of practice that is solidary, open, welcoming and freely accessible.  6. We are a community without borders. We are a multilingual and multidisciplinary community.  7. Our objectives are the advancement of knowledge, the improvement of research quality in our disciplines, the enrichment of knowledge and of collective patrimony, in the academic sphere and beyond it.  8. We call for the integration of digital culture in the definition of the general culture of the twenty-first century. http://tcp.hypotheses.org/411
  • 22. Incentives  Research projects experience  BNFB (Bibliothèque Numérique Franco- Berbère) : OIF/FFI Project - 2011 (100 000 €)  Enrich and update, with automated procedures, records repositories that describe and make available documents in Berber languages (Timbuktu manuscripts)  IDEFI-CréaTIC: Unesco Chair-ITEN (2012) : Innovative Training (5,2 million €)  ISO standards:  E-Learning vocabulary standards (SC36/WG1)  E-Learning MLR (Metadata for Learning standard)  E-learning MLO (Metadata for learning opportunities) (Afnor/GT1 mirror)
  • 23. Incentives  Research background  Information and Communication Studies  Multilingual data encoding standards  Digital data processing  Research affiliations  Universities (Algeria, France, Morocco, Tunisia)  Research Laboratories  Professional associations  International Organizations
  • 24. Key incentive  Support from international TEI referees  Laurent ROMARY : director of DARIAH & member of the TEI council  Lu BURNARD: Oxford Univ., godfather of TEI Guidelines & consultant TGIR Huma-Num, France Laurent ROMARY Lu BURNARD
  • 25.
  • 26. The roadmap  Building partnerships  Fund raising  Building work environment  Work organization  Implementation, production and evaluation
  • 28. Fund raising  OIF/FFI (2010-2013)  100 000 €  BNFB project  ISCC/CNRS (2012-2013)  10 000 €  HumanitéDigitMaghreb  LEDEN, MSH Paris-Nord (2014-2015)  3000 €  TEI/MEI Euromed  Chaire Unesco-ITEN (provisional)  HD-MUREN
  • 29. Work organization (structuring) SH Corpus: Document structures modeling Poetry Theatre Stories Music Manuscript Speech Fables Others Video Peintings
  • 30. Work environment  Preferring  A data description system ?  Document centered system  Document structuring  to  A Data base management system  Flat scanning  Referencing metadata
  • 32. Work environment  oXygen  oXygen XML Editor supports a large number of text encodings.  Bidirectional Text Support in Text Editing Mode (Unicode BIDI Algorithm)  Editing right-to left XML documents  Ability to create and edit XML content in Arabic
  • 36. Work environment  Xaira (XML search engine)  Explore a corpus language:  Search words, forms, phrases or XML tags  Analyze the way how they are used (frequency, stats, forms)
  • 38. Work plan (provisional)  Sharing tasks and responsibilities  Describing roles  Training sessions  Resource analysis  Structuring (segmentation)  TEI/XML Encoding  Collections organization  Referencing (TeiHeader)  DC indexing (metadata harvesting)  Exploitation  Content research  Communication
  • 39. Work organization (sharing) SH Specialists XML/TEI technicians Info-Com. specialists Arts & SH Corpus
  • 40. Work organization (structuring)  SHS SPECIALISTS identify logical structure of documents according to the genres (Novel, poem, theatre, story, painting, music, …)
  • 41. Work organization (encoding)  XML/TEI TECHNICIANS convert data models prepared by SH specialists into TEI schemas using specialized software XML files
  • 42. Work organization (indexing)  INFORMATION SPECIALISTS (librarians) add referencing data (TEIHeader) and description metadata in the Data base CMS / DB
  • 43. Some examples (TEIVerse)  Identify within a poem  Verse metric  Different types of verse  Different verse clusters (verses, quatrains)  Patterns of stanzas  Rhythm  ... The granularity may vary between large segments of text and very tiny character strings
  • 44. Some examples (TEIVerse) <text> <body> <head>Heureux qui, comme Ulysse, a fait un beau voyage </head> <lg> <l>Heureux qui, comme Ulysse, a fait un beau voyage </l> <l> Ou comme cestuy-là qui conquit la toison </l> <l> Et puis est retourné, plein d'usage et raison </l> <l> Vivre entre ses parents le reste de son âge ! </l> </lg> <lg> <l> Quand reverrai-je, hélas, de mon petit village </l> <l> Fumer la cheminée, et en quelle saison </l> <l> Reverrai-je le clos de ma pauvre maison </l> <l> Qui m'est une province, et beaucoup davantage ? </lg> <lg> <l> Plus me plaît le séjour qu'ont bâti mes aïeux </l> <l> Que des palais Romains le front audacieux </l> <l> Plus que le marbre dur me plaît l'ardoise fine : </l> </lg> <lg> <l> Plus mon Loir gaulois, que le Tibre latin </l> <l> Plus mon petit Liré, que le mont Palatin </l> <l> Et plus que l'air marin la douceur angevine </l> </lg> </body> </text> <teiHeader> <fileDesc> <titleStmt> <Title>Heureux qui comme Ulysse</title> <respStmt> <resp>Auteur</resp> <Name>Joacquim Du Belay</name> </respStmt> </titleStmt> <sourceDesc> <Bibl>Recueil Les regrets </bibl> </sourceDesc> </fileDesc> </teiHeader> <text>…
  • 45. Some examples (TEIDrama)  Identify within a piece of theatre: Interpretation and production Speech on stage Stage direction (“didascalies”) Types of exposure Dénouement Prologue  …  Interventions et indications scéniques <sp who=“P”> <speaker>Paul</speaker> <p>Il fait chaud aujourd’hui…</p> </sp> <stage>Paul allume une cigarette</stage>
  • 46. <surface> : a page, a stela, all objects with inscription  Surface contains zones and lines  It has coordinates <zone>: An area of the surface determined arbitrarily for editorial needs. Areas can overlap: overlaping is defined according to spacial coordinates  Can contain <line>  Disposes of coordinates <line> : a continuum of text clearly identified by editor  Can contain text and <zone>  Without coordinates Some examples (TEIManuscript)
  • 48. Some examples (TEIManuscript) <sourceDoc> <surface ulx="0" uly="0" lrx="200" lry="300"> <zone ulx="10" uly="43" lrx="185" lry="84" rotate="0”> <zone> <line rend="right"> 1 April 2009</line> </zone> <line>Fed Birds in the park today.</line> <line>Might write an articleabout</line> <line>the Thick-billed Warbler.</line> </zone> <zone ulx="9" uly="20" lrx="70" lry="60" rotate="90”> <line>Samaria is a Greek</line> <line>brand of water that</line> <line>comes from the natural</line> <line>springs of Stilos, in</line> <line>Crete</line> </zone> </surface> </sourceDoc> <surface ulx="0" uly="0" lrx="700" lry="1000"> <!-- ... --> </surface> <zone ulx="93" uly="681" lrx="967" lry="1568"> <graphic url=« gb.jpg"/> </zone>
  • 49. Some examples (TEIManuscript) <surface xml:id=“s1” ulx="0" uly="0" lrx="50" lry="50"> <zone xml:id=“s1-z1” ulx="1" uly="1" lrx="10" lry="10"> <line>Poem</line> <!-- ... --> <line>the head</line> </zone> <zone xml:id=“s1-z2” ulx="4" uly="4" lrx="20" lry="20"> <surface xml:id=“s2” ulx="0" uly="0" lrx="100" lry="100"> <zone xml:id=“s2-z1” ulx="10" uly="10" lrx="90" lry="95"> Spring has just set in here, and the weather […] a steamer </zone> </surface> </zone> </surface> s1 s1-z1 s1-z2 s2 s2-z1
  • 50. Some examples (Graph)  Image Markup Tool
  • 51. Some examples (Graph) <teiHeader> <fileDesc> <titleStmt> <title>The Image Markup Logo</title> </titleStmt> <publicationStmt> <p></p> </publicationStmt> <sourceDesc> <p>377 x 259</p> </sourceDesc> </fileDesc> <encodingDesc> </encodingDesc> </teiHeader>
  • 52. Some examples (Graph)  Image Markup Tool
  • 53. Some encoding examples (TEISpeech)  Encoding speech according to a « Timeline »  Possibility to describe different speech phenomena of oral communication and sounds during a speech  pauses, words overlaps, tonal changes, vocal expressions (coughs, laughs, sneezing, …), gestures etc.
  • 54. © The Music Encoding Initiative. http://music-encoding.org/ Some examples (MEI)
  • 55. A tunisian song encoded in TEI <TeiHeader> Métadonnées(TEIHeader)Espacedenom
  • 56. A tunisian song encoded in TEI <text> ‫الليل‬ ‫في‬ ‫الياسمينة‬ ‫تحت‬ ‫نسمة‬‫والورد‬‫محاذيني‬ ‫األغصان‬‫تــــميل‬ ‫عليا‬ ‫تمسحلي‬‫دمعة‬ ‫في‬‫عيني‬ ‫تحت‬‫الياسمينة‬‫اتكيت‬ ‫عدلت‬‫وغنيت‬ ‫الـــعود‬ ‫وتناطر‬‫وبكيت‬ ‫دمعي‬ ‫تفكرتك‬‫كنت‬ ‫كيف‬‫تجيني‬ ‫جنينة‬‫النوار‬ ‫مزينها‬ ‫فاحت‬‫األزهار‬ ‫ريحت‬ ‫من‬ ‫تفكرتك‬‫النار‬ ‫شعــــلت‬ ‫عملت‬‫لهليبة‬‫في‬‫قليبي‬ ‫متوحش‬‫محتار‬ ‫وحدي‬ ‫ال‬‫أطيار‬ ‫حس‬ ‫ال‬ ‫و‬ ‫قمرة‬ ‫كان‬‫األشجار‬ ‫ع‬ ‫النسمة‬ ‫توانس‬‫فيا‬‫وتواسيني‬ Stanza(Quatrain)
  • 57. A tunisian song encoded in MEI <music>
  • 58. A Arabic/French fable encoded with Oxygen 59
  • 59. A TEI-based critical apparat of an Arabic document Titre du document : ‫ّم‬‫ج‬‫المن‬ ‫بن‬ ‫يحيى‬ ‫رسالة‬ [Epître de Yahya ibn al-Munajjm sur la musique] Nature : Fichier Word (converti en TEI P5 XML par Oxgarage) Langue : Arabe Genre littéraire : Catalogue de manuscrit de musique Réalisation : Anas GHRAB Schéma TEI : All Extrait du bloc <TEIHeader> : <teiHeader> <fileDesc xml:idx="001"> <titleStmt> <title type="full"> <title type="main">‫ّم‬‫ج‬‫المن‬ ‫بن‬ ‫يحيى‬ ‫/<رسالة‬title> <title type="sub">Titre translitéré</title> <title type="alt">Epître de Yahya ibn al-Munajjm sur la musique</title> </title> <author> <persName> <forename>‫/<يحيى‬forename> <genName>‫يحيى‬ ‫/<بن‬genName> <surname>‫ّم‬‫ج‬‫المن‬ ‫/<بن‬surname> <addName></addName> </persName> <date>... <placeName> <settlement type="city">...</settlement> <country>...</country> </placeName> </date> <date>... <placeName> <settlement type="city">...</settlement> <country>...</country> </placeName> </date> </author> <respStmt> <resp>‫/<تحقيق‬resp> <persName>‫غراب‬ ‫/<أنس‬persName> </respStmt> </titleStmt> <publicationStmt> Extrait du bloc <Text> <text> <front> <div type="abstract"> <head>‫الكتاب‬ ‫محتوى‬ ‫/<وصف‬head> </div> <div type="contents"> <head>‫الكتاب‬ ‫/<فهرس‬head> </div> </front> <body> <p xml:idx="001"> ِ‫م‬‫ِي‬‫ح‬‫ه‬‫ر‬‫ال‬ ِ‫ن‬‫َا‬‫م‬ْ‫هح‬‫ر‬‫ال‬ ِ ‫ه‬‫اَّلل‬ ِ‫م‬ ْ‫س‬ِ‫ب‬ </p> <p xml:idx="002"> ‫ال‬ ‫ِي‬‫ف‬ ّ َ‫اَّلل‬ِ‫ب‬ ‫ِد‬‫ض‬َ‫ت‬‫ُع‬‫م‬‫ال‬ َ‫منين‬ْ‫ؤ‬ُ‫م‬‫ال‬ ِ‫ِير‬‫م‬َ‫أ‬ ‫ى‬َ‫َول‬‫م‬ ِ‫م‬ِّ‫ج‬َ‫ن‬ُ‫م‬‫ال‬ ‫َى‬‫ي‬ْ‫َح‬‫ي‬ ُ‫بن‬ ّ‫ي‬ِ‫ل‬َ‫ع‬ ُ‫بن‬ ‫َى‬‫ي‬ْ‫َح‬‫ي‬ ُ‫ة‬َ‫ال‬ َ‫س‬ِ‫ر‬‫َى‬‫ق‬‫ِي‬‫س‬‫ُو‬‫م‬ </p> <p xml:idx="003"> ْ‫ي‬َ‫ل‬َ‫ع‬ َ‫ون‬ُ‫َك‬‫ي‬ ْ‫ن‬َ‫أ‬ ُ‫ب‬ِ‫ج‬َ‫ي‬ ‫َا‬‫م‬َ‫و‬ ،‫ِي‬ّ‫ن‬َ‫غ‬ُ‫م‬‫ال‬ َ‫ة‬َ‫ف‬ِ‫ص‬ ‫َا‬‫ذ‬َ‫ه‬ َ‫ل‬ْ‫ب‬َ‫ق‬ ‫ِي‬‫ذ‬‫ه‬‫ال‬ ‫َا‬‫ن‬ِ‫ب‬‫َا‬‫ت‬ِ‫ك‬ ‫ِي‬‫ف‬ ‫َا‬‫ن‬ْ‫ر‬َ‫َك‬‫ذ‬ ْ‫َد‬‫ق‬ ُ‫ل‬‫و‬ُ‫َق‬‫ن‬َ‫م‬ ‫َا‬‫ن‬ْ‫ف‬َ‫ص‬َ‫و‬َ‫و‬ ،ِ‫ه‬َ‫ر‬ْ‫م‬َ‫أ‬ َ‫اآلن‬ ُ‫ِف‬‫ص‬َ‫ن‬َ‫و‬ ،َ‫ك‬ِ‫ل‬َ‫ذ‬ ْ‫ِن‬‫م‬ ً‫ة‬َ‫ي‬‫َا‬‫ه‬ِ‫ن‬ ِ‫ه‬‫ِي‬‫ف‬ ‫ا‬ ٍ‫ع‬ِ‫ب‬ْ‫ص‬‫ا‬ َ‫ع‬ِ‫ق‬‫َا‬‫و‬َ‫م‬َ‫و‬ ، ُ‫ِف‬‫ل‬َ‫ت‬ْ‫َخ‬‫ي‬َ‫و‬ ‫َا‬‫ه‬ْ‫ن‬ِ‫م‬ ُ‫ِف‬‫ل‬َ‫ت‬ْ‫َأ‬‫ي‬ ‫َا‬‫م‬َ‫و‬ ،‫َا‬‫ه‬َ‫د‬َ‫د‬َ‫ع‬َ‫و‬ ِ‫م‬َ‫غ‬‫ه‬‫ن‬‫ال‬ٍ‫ع‬ِ‫ب‬ْ‫ص‬‫ا‬ٍ‫َر‬‫ت‬َ‫و‬ ْ‫ِن‬‫م‬ٍ‫َر‬‫ت‬َ‫و‬َ‫و‬ ،ُ‫ه‬‫ا‬‫ه‬‫م‬ َ‫س‬ ‫َا‬‫م‬ ُ‫ن‬ِّ‫ي‬َ‫ب‬ُ‫ن‬َ‫و‬ ،ٍ‫ن‬‫َا‬‫ت‬ ْ‫َس‬‫د‬ ِّ‫ل‬ُ‫ك‬ ْ‫ِن‬‫م‬ ٍ‫ة‬َ‫م‬ْ‫غ‬َ‫ن‬ ِّ‫ل‬ُ‫ك‬ َ‫ع‬ِ‫ض‬ْ‫و‬َ‫م‬ ‫ِي‬‫ل‬ِ‫ص‬ْ‫و‬َ‫م‬‫ال‬ َ‫م‬‫ِي‬‫ه‬‫َا‬‫ر‬ْ‫ب‬‫ا‬ ُ‫بن‬ ُ‫َاق‬‫ح‬ ْ‫س‬ِ‫إ‬«‫َى‬‫ر‬ْ‫َج‬‫م‬‫ال‬»ْ‫ُس‬‫و‬‫ال‬ ‫َى‬‫ر‬ْ‫َج‬‫م‬ِ‫ب‬ ‫َا‬‫ه‬َ‫ض‬ْ‫ع‬َ‫ب‬ َ‫م‬ َ‫َس‬‫ر‬ ‫ِي‬‫ت‬‫ه‬‫ال‬ ِ‫ت‬‫َا‬‫و‬ْ‫ص‬َ‫األ‬ ‫ِي‬‫ف‬،ِ‫َر‬‫ص‬ْ‫ن‬ِ‫ب‬‫ال‬ ‫َى‬‫ر‬ْ‫َج‬‫م‬ِ‫ب‬ ‫َا‬‫ه‬َ‫ض‬ْ‫ع‬َ‫ب‬َ‫و‬ ،‫ى‬َ‫ط‬ ْ‫هن‬‫م‬ِ‫م‬ ‫ِه‬‫ئ‬‫َا‬‫ر‬َ‫ظ‬ُ‫ن‬َ‫و‬ َ‫َاق‬‫ح‬ ْ‫س‬ِ‫إ‬ َ‫ل‬ْ‫ث‬ِ‫م‬ ِّ‫ي‬ِ‫ب‬َ‫ر‬َ‫ع‬‫ال‬ ِ‫ء‬‫َا‬‫ن‬ِ‫غ‬‫ال‬ ِ‫ب‬‫َا‬‫ح‬ْ‫ص‬َ‫أ‬ َ‫ْن‬‫ي‬َ‫ب‬ ‫َا‬‫م‬ َ‫ف‬ َ‫َِل‬‫ت‬ْ‫َاخ‬‫و‬ <app> <lem>َ‫ع‬َ‫م‬َ‫ج‬</lem> <rdg wit="#Ram">‫/<جميع‬rdg> </app> َ‫د‬َ‫ع‬ ‫ِي‬‫ف‬ ِ‫ء‬‫َا‬‫م‬َ‫د‬ُ‫الق‬ ِ‫ة‬َ‫ف‬ِ‫س‬ َ‫ََل‬‫ف‬‫ال‬ َ‫ِن‬‫م‬ ‫َى‬‫ق‬‫ِي‬‫س‬‫ُو‬‫م‬‫ال‬ ِ‫ب‬‫َا‬‫ح‬ْ‫ص‬َ‫أ‬ َ‫ْن‬‫ي‬َ‫ب‬َ‫و‬ ،ِ‫َل‬‫م‬َ‫ع‬‫َال‬‫و‬ ِ‫ة‬َ‫ع‬‫َا‬‫ن‬ِّ‫ص‬‫ال‬ِ‫ب‬ َ‫م‬ْ‫ِل‬‫ع‬‫ال‬َ‫َان‬‫ك‬ ْ‫ذ‬ِ‫إ‬ ِ‫م‬َ‫غ‬‫ه‬‫ن‬‫ال‬ ِ‫د‬‫ه‬‫ن‬َ‫أ‬ َ‫ُون‬‫م‬َ‫ع‬ْ‫ز‬َ‫ي‬ ُ‫ه‬ُ‫َاؤ‬‫ر‬َ‫ظ‬ُ‫ن‬َ‫و‬ ُ‫َاق‬‫ح‬ ْ‫س‬ِ‫إ‬ ،َ‫ة‬َ‫ر‬ َ‫َش‬‫ع‬ ‫ِي‬‫ن‬‫َا‬‫م‬َ‫ث‬ ‫َا‬‫ه‬‫ه‬‫ن‬َ‫أ‬ َ‫ُون‬‫م‬َ‫ع‬ْ‫ز‬َ‫ي‬ ‫َى‬‫ق‬‫ِي‬‫س‬‫ُو‬‫م‬‫ال‬ ُ‫َاب‬‫ح‬ْ‫ص‬َ‫أ‬ َ‫َان‬‫ك‬َ‫و‬ ٌ‫ر‬ ْ‫َش‬‫ع‬ َ‫م‬َ‫غ‬‫ه‬‫ن‬‫ال‬ <app> <lem>.َ‫م‬‫َا‬‫غ‬ْ‫ن‬َ‫َاأل‬‫و‬ َ‫َان‬‫ح‬ْ‫ل‬َ‫األ‬ ُ‫ع‬َ‫م‬ْ‫َج‬‫ي‬ ‫َا‬‫م‬ِ‫ل‬ ٌ‫ح‬ْ‫ر‬ َ‫ش‬ ‫َا‬‫ه‬َ‫د‬ْ‫ع‬َ‫ب‬َ‫و‬</lem> <rdg wit="#Ram"> ‫يجمع‬ ‫ما‬ ‫شرح‬ ‫وبعده‬‫االلحار‬‫واالفهام‬ </rdg> <rdg wit="#BM"> ‫يجتمع‬ ‫بما‬ ‫شرح‬ ‫ونقدمه‬‫االلحار‬‫واالفهام‬ </rdg> <rdg wit="#Zak">‫واألنغام‬ ‫األلحان‬ ‫يجتمع‬ ‫ما‬ ‫شرح‬ ‫/<ونقدم‬rdg> </app>
  • 60. Critical diagnosis  Facing many problems  DH & TEI are very new concepts for academic community (resistance)  Vertical clustering of disciplines in the same university  Some cultural and linguistic sensitiveness observed between partners  Need for more training and interdisciplinary research  Nomadic training team  Group publishing The stylistic notes by Rétif de la Bretonne: a hypertext before digital era Henri HUDRISIER (1), Sofia BEZINA (2), Hichem ISMAIL (3), Rachid ZGHIBI (4), Sihem ZGHIDI (5), Loula Abd-elrazak (6), Laurent ROMARY (7), Mokhtar BEN HENDA (8), Arnaud LABORDERIE (9), Ghislaine AZEMARD (10)
  • 61.
  • 62. General orientation  A strategy of support and orientation  French translation of the full version of "TEI P5 Guidelines" (1500 pages counting appendices including about 600 pages are translated to date)  An Arabic TEI P5 version is also under attempt Rachid ZGHIBI, Tunis Henri HUDRISIER, PAris8
  • 63. General orientation  Further research projects  2014-2015: Project & MEI TEI-Euromed (Encoding with MEI Maghreb traditional music) Sylvaine MARTIN, LEDEN, MSH Paris Nord
  • 64. General orientation  Further research projects  2012: The Unesco Chair-ITEN: “Innovation, Transmission, Edition Numérique”  Bringing academic and entrepreneurial partners together to discuss the contemporary changes education related to the effects of digital innovation  2012: IDEFI-CréaTIC project  budget of 5,2 M€ on 8 years  17 training programs  24 workshops open to more than 500 students
  • 65.
  • 66. HD-Muren  Implement and test an environment where different stakeholders in the academic communities learn together about new computing technologies and their impact on the humanities  DH & TEI at school?
  • 67. General orientation  More positioning within international communities of practice in the DH  Humanistica (2014 membership)  ThatCamps: St Malo, 2013 / Lyon 2014 / Beirut, 2015 / Paris 2015)
  • 68. Thank you for your attention