SlideShare ist ein Scribd-Unternehmen logo
1 von 5
Downloaden Sie, um offline zu lesen
IOSR Journal of Engineering (IOSRJEN) www.iosrjen.org
ISSN (e): 2250-3021, ISSN (p): 2278-8719
Vol. 04, Issue 12 (December 2014), ||V2|| PP 32-36
International organization of Scientific Research 32 | P a g e
Arabic Inquiry-Answer Dialogue Acts Annotation Schema
AbdelRahim A. Elmadany1
, Sherif M. Abdou2
, Mervat Gheith1
1
Institute of Statistical Studies and Research (ISSR), Cairo University
Emails: ar_elmadany@hotmail.com, mervat_gheith@yahoo.com
2
Faculty of Computers and Information, Cairo University
Email: sh.ma.abdou@gmail.com
Abstract: We present an annotation schema as part of an effort to create a manually annotated corpus for
Arabic dialogue language understanding including spoken dialogue and written „chat‟ dialogue for inquiry-
answer domain. The proposed schema handles mainly the request and response acts that occurs frequently in
inquiry-answer debate conversations expressing request services, suggests, and offers. We applied the proposed
schema on 83 Arabic inquiry-answer dialogues.
I. Introduction
Arabic Natural Language Processing (ANLP) including Arabic Language Understanding (ALU) within
dialogue-based research has gained an increasing interest in the last few years. Building an ALU system requires
an annotated dialogue acts (DAs) corpora that is annotated according to a specified dialogue acts schema.
Inquiry-answer is an important part of industry world because it is the core of a successful business.
When quality services are met and exceed customer expectations, the customers are pleased. In order to ensure
this happens, service managers (inquiry-answer operators) need to be aware of the parts of the service delivery
experience that are open to cultural effects[1].
In this paper, we propose a dialogue acts annotation schema for Arabic inquiry-answer dialogues. This
schema defines the dialogue acts using multi-dimensions within communication functions based on request and
response that have certain questions, responses and the relation between them.
In this paper, we proposed the first version of Arabic inquiry-answer manually annotated dialogues
corpus, which contains 83 spoken and written ‘chat’ dialogues.
In the following sections, section 2 summarizes the main annotation schemas developed for English
language and Arabic language. Section 3 describes the proposed annotation schema. Section 4 describes the an-
notation corpus and section 5 includes the conclusion.
II. Related works
Dialogue act refers to the meaning of an utterance representation at the level of illocutionary force [2].
Speech acts terminology has been addressed by Searle (1969) based on Austin (1962) work.
The idea of a dialogue act plays a key role in studies of dialogue, especially in communicative behavior
understanding of dialogue participants, in building annotated dialogue corpora and in the design of dialogue
management systems for spoken human-computer dialogue.
Dialogue act is approximately the equivalent of the speech act of Searle (1969). Dialog acts are differ-
ent in different dialog systems. The research on dialog acts has increased since 1999, after spoken dialog sys-
tems became a commercial reality[3].
The MapTask project [4] proposed labeling schema using 12 dialogue acts based on two categories (1)
initiating moves (2) response. The VERBMOBIL project (1993-2000) aimed at the development of an automatic
speech to speech translation system for the languages German, American English and Japanese [5]. The
VERBMOBIL Project had two phases, the first phase proposed labeling schema using hierarchy of 43 dialogue
acts [6]. The second phase expanded the dialogues from meeting scheduling to comprehensive travel planning.
Thus change labeling schema to a hierarchy of 18 dialogue acts [7].
The DAMSL Dialogue Act Markup using Several Layers (DAMSL) were proposed as a general- pur-
pose schema [8-10] developed for multi-dimensional dialogue acts annotation. SWITCHBOARD-DAMSL is
improved version of DAMSL proposed by [11]when they wanted to annotate a large amount of transcribed
speech data ‘Switchboard Corpus’ because of the difficulty of consistently applying the DAMSL annotation
schema [11, 12]. SWITCHBOARD-DAMSL schema includes 220 dialogues acts, but it is still difficult to be
used for manual annotation because it is a very large set. Moreover, [11]reported 0.80 of Kappa score with the
220 dialogue acts and 130 dialogue acts occurred less than 10 times in the entire corpus [13]. To obtain enough
data per class for statistical modeling purposes, [11]proposed new dialogue act schema namely SWITCH-
BOARD contains 42 mutually exclusive dialogue acts types.
Arabic Inquiry-Answer Dialogue Acts Annotation Schema
International organization of Scientific Research 33 | P a g e
The ICSI-MRDA Meeting Room corpus use DA variant of the DAMSL dialogue acts schema like the
SWITCHBOARD corpus by combining the tags into single, distinct dialogue acts to reduce aspects of the mul-
tidimensional nature of the original DAMSL annotation scheme. There are 11 general tags and 39 specific acts
that are used over ICSI-MRDA Meeting Room corpus [14]. The AMI project, a European research project cen-
tered on multi-modal meeting room technology, uses 15 dialogue acts.
Dynamic Interpretation Theory (DIT) [15] reported dialogue acts schema with a number of dialogue act
types from DAMSL [8] and other schema. The DIT++ is a comprehensive system of dialogue act types obtained
by extending the acts of DIT [16]. DIT++ schema has 11 dimensions with around 95 communicative functions,
around 42 of which, like switchboard are for general purpose functions, whereas others cover elements of feed-
back, interaction management and the control of social obligations [13].
[17]has proposed a preliminary version of ISO DIS 24617-2:2010 as an international standard for anno-
tating dialogue with semantic information; in particular concerning the communicative functions of the utter-
ances, the kind of content they address, and the dependency relations to what was said and done earlier in the
dialogue. [18]has proposed the final version of ISO Standard 24617-2.
Most of the previous proposed annotation schemes to mark-up dialogue corpora in languages such as
English, German and Spanish. There are few efforts were done to propose dialogue acts annotation schema for
Arabic. First attempt was by [19] which proposed a DAs classifier with scheme that contains about 10 DAs (as-
sertion, declaration, denial, expressive evaluation, greeting, indirect request, question, promise/denial, response
to question, and short-response).
(Dbabis et al., 2012) proposed another DAs schema within an Arabic discussion that contains about 13
DAs within 6 categories: Social Obligation Management (e.g. Opening, Closing, Greeting … etc.), Turn Man-
agement (e.g. Acknowledgement, Calm, Clarify Request … etc.), Request (e.g. Question, Order, Promise …
etc.), Argumentation (e.g. Opinion, Appreciation, Accept, Reject … etc.), Answer, and Statement [20].
These schemas were used to mark-up dialogue corpora based on a general conversion discussion like
TV talk-show programs.
There are two other works concerned with understanding the Arabic inquiry-answer dialogue conver-
sions, [21] proposed an annotation schema to semantically label words on spoken Tunisian dialect turns which
are not segmented into utterances as shown in Figure 1. This schema was applied on TUDICOI corpus that con-
tains dialogues collected from the National Company of Railway in Tunisia (SNCFT). This schema was used to
mark-up word-by-word in dialogue turn not turns or utterances dialogue acts. This schema contains about 33
semantic labels for word annotation within 5 dimensions e.g. Train_Type, Trip_Time, and Destination.
[ ‫مع‬ Out] [ ‫وقتاش‬ Hour_Req] [ ‫إي‬Out] [ ‫إي‬Out] [‫التران‬Train] [ ‫يمشي‬ Departure_Cpt]
Figure 1 Example of semantic labeling from [21]
[22] proposed a dialogue acts classifier based on hotel reservations corpus that consist of 100 queries of
different types (negation, affirmation, interrogation and acceptance). They used a small test set that contained
140 utterances with 14 different dialogue acts that is uttered spontaneously to evaluate their system. This dialo-
gue acts are incomplete to build a inquiry-answer discourse structure because this tag set can’t annotate commu-
nicative functions related to social obligations, suggestions, miss understanding correction, warnings, explana-
tion … etc.
III. Annotation Schema
In this work, we started from dialogue acts in a previous annotation schema for either Arabic or Eng-
lish. We proposed a general dialogue acts annotation schema for inquiry-answer based request and response di-
mensions. Table 1,Table 2, and Table 3 describe the proposed dialogue acts based on request dimension within 6
dialogue acts, response dimension within 14 dialogue acts, and other dimension within 3 dialogue acts.
Request Acts
Dealing with request activities and tasks
Taking-Request
Dealing with taking request e.g. hello
Service-Question
Dealing with services request e.g. asking about service
information or required a service.
Confirm-Question
Happens when needs to confirmation about some informa-
tion.
YesNo-Question
Happens when needs Yes or No answer.
Arabic Inquiry-Answer Dialogue Acts Annotation Schema
International organization of Scientific Research 34 | P a g e
Choice-Question
Happens when needs select one answer from service mul-
tiple-choices question.
Other-Question
Happens when asking about non-service question e.g. mo-
bile number, email, or address.
Turn-Assign
Happens when wants to addressee the speaker to take the
turn e.g. Adam?
Table 1: Request dimension dialogue acts
Response Acts
Dealing with response activities and tasks
Service-Answer
Happens when answer a Service-Question or Choice-
Question.
Other-Answer
Happens when answer an Other-Question.
Agree
Describe agreement/accept answer from Confirm-Question
or YesNo-Question.
Disagree
Describe disagreement/reject answer from Confirm-
Question or YesNo-Question.
Greeting
Happens when speaker wants to greeting and welcome the
other speaker. Also describe greeting accept „return-
greeting‟.
Inform
Happens when speaker wants to explain or describe some-
thing to other speaker.
Thanking
Happens when speaker wants to thank the other speaker.
Also describe thanking accept.
Apology
Happens when speaker wants to apology.
MissUnderstandingSign
Happens when non-understanding the previous utterance.
Correct
Happens when correct an information in previous utterance
or in current utterance.
Pausing
Happens when needs to request more time or stealing time
e.g. just a moment.
Suggest
Happens when provides a suggestion.
Promise
Happens when provides a promise.
Warning
Happens when provides a warning action.
Offer
Happens when provides an offer to the customer.
Table 2: Response dimension dialogue acts
Other Acts
Dealing with neither request nor response activities and
tasks
Opening
Dealing with opening obligation utterance e.g. “Good even-
ing, Banque Misr, Ahmed Samy speaking”.
Closing
Dealing with closing obligation request e.g. “Thank you for
calling and goodbye”.
Self-Introduce
Happens when wants to introduce our self or organization.
Table 3: Other dimension dialogue acts
We set ‘Agree’ dialogue act to describe the agreement/acceptance answers as accept-confirmation, Yes-
Answer, accept-thanking, and accept-apology. In contrast, we set ‘Disagree’ dialogue act to describe the disa-
Arabic Inquiry-Answer Dialogue Acts Annotation Schema
International organization of Scientific Research 35 | P a g e
greement/reject answer as disconfirm, No-Answer, and Reject-thanking, Reject-apology. Our empirical analysis
for our corpus leaded to identifying ‘Opening’ dialogue act as a group of ‘Greeting’ act and ‘self-Introduce’ act.
For example from our corpus:
This turn tagged as ‘Opening’ dialogue act and segmented to three utterances as:
 ‫الخير‬ ‫مساء‬/msA' Alxyr/Good evening is tagged as ‘Greeting’ dialogue act.
 ‫مصر‬ ‫بنك‬/ bnk mSr/Banque Misr is tagged as ‘Self-Introduce’ dialogue act.
 ‫حضرتك‬ ‫مع‬ ‫احمد‬/ AHmd mE HDrtk/Ahmed speaking is tagged as ‘Self-Introduce’ dialogue act.
IV. Annotation Corpus
For the best of our knowledge, there is no corpus for annotated Arabic spoken or written ‘chat’ dialo-
gues in Arabic dialect. For this reason, we built our own corpus as the first version of manually annotated corpus
for Arabic dialogue language understanding including spoken dialogue and written ‘chat’ dialogue for inquiry-
answer domain.
Figure 2: Utterance Annotation Example
This corpus contains two parts (1) spoken dialogue which contains 52 phone calls recorded from Egyp-
tian’s banks inquiry-answer and Egypt Air Company with an average duration of two hour of talking time after
removing ads from calls. It consists of human-human discussions about providing services e.g. create new bank
account, service request, balance check and flight reservation. These phone calls were transcribed using Tran-
scriber®, a tool that is frequently used for segmenting, labeling and transcribing speech corpora. (2) Written
‘Chat’ dialogues, which contain 30 chat dialogues, recorded from mobile agencies web sites ‘KSA Zain, KSA
Mobily, and KSA STC’.
This corpus contains 3001 turns with average 6.7 words per turn and contains 4727 utterances with average
4.3 words per utterance. This corpus is manually segmented and annotated using the proposed schema. An an-
notation tool was developed to manually annotate dialogue turns and segment it is to utterances and label the
dialogue act. A sample-annotated turn is shown in Figure 2.
V. Conclusion
In this paper, we proposed a general dialogue acts schema for Arabic inquiry-answer dialogue either
spoken or written ‘chat’. We proposed the first version of the manually annotated inquiry-answer dialogue as a
part of Arabic language understanding methodologies.
In addition, we introduced the first version of annotated corpus that includes 83 inquiry-answer dialo-
gues from banks, Airlines Company, and mobile networks agency. In future work we plan to enrich this corpus
with inquiry-answer dialogues from other domains e.g. Online Markets, and Railway Networks in both forms
spoken and written ‘chat’ dialogues.
Reference
[1]. S. Carraher and A. Doerr, "A Six Continent Study of Customer Service," Proceedings of Academy of Marketing Studies, vol. 17, 2012.
[2]. J. Austin, How to Do Things with Words. Oxford: Oxford University Press, 1962.
Arabic: ‫حضرتك‬ ‫مع‬ ‫احمد‬ ‫مصر‬ ‫بنك‬ ‫الخير‬ ‫مساء‬
Buckwalter: msA' Alxyr bnk mSr AHmd mE HDrtk
English: Good evening, Banque Misr, Ahmed speaking
Arabic Inquiry-Answer Dialogue Acts Annotation Schema
International organization of Scientific Research 36 | P a g e
[3]. A. Stolcke, K. Ries, N. Coccaro, E. Shriberg, R. Bates, D. Jurafsky, P. Taylor, R. Martin, C. V. Ess-Dykema, and M. Meteer, "Dialogue
Act Modeling for Automatic Tagging and Recognition of Conversational Speech," Computational Linguistics, vol. 26, pp. 339-373,
2000.
[4]. A. Anderson, M. Bader, E. Bard, E. Boyle, and G. Doherty, "The HCRC Map Task Corpus," Language and Speech, vol. 34, pp. 351-
366, 1991.
[5]. W. Wahlster, Verbmobil: Foundations of Speech-To-Speech Translation: Springer, 2000.
[6]. S. Jekat, A. Klein, E. Maier, Ilona Maleck, M. Mast, and J. Quantz, "Dialogue Acts in VERBMOBIL. VM report 65," Universities of
Berlin, Saarbrcken and Stuttgart.1995.
[7]. J. Alexandersson, B. Buschbeck-Wolf, T. Fujinami, M. Kipp, S. Koch, E. Maier, N. Reithinger, B. S. mitz, and M. Siegel., "Dialogue
Acts in VERBMOBIL- 2 (second edition). VM report 226 , DFKI GmbH, ," Universities of Berlin, Saarbrcken and
Stuttgart1998.
[8]. J. Allen and M. Core, "Draft of DAMSL: Dialog Act Markup in Several Layers," 1997.
[9]. M. Core and J. Allen, "Coding Dialogs with the DAMSL annotation scheme," presented at the AAAI Fall Symposium on
Communicative Action in Humans and Machines, MIT, Cambridge, MA, 1997.
[10]. M. Core, M. Ishizaki, J. Moore, C. Nakatani, N. Reithinger, D. Traum, and S. Tutiya, "Report of The Third Workshop of the
Discours e Resource Initiative," Chiba University, Chiba, Japan1998.
[11]. D. Jurafsky, R. Bates, N. Coccaro, R. Martin, M. Meteer, K. Ries, E. Shriberg, A. Stolcke, P. Taylor, and C. E. Dykema., "Automatic
Detection of Discourse Structure for Speech Recognition and Understanding," presented at the IEEE Workshop on Speech
Recognition and Understanding, Santa Barbara, 1997.
[12]. D. Jurafsky, R. Bates, N. Coccaro, R. Martin, M. Meteer, K. Ries, E. Shriberg, A. Stolcke, P. Taylor, and C. V. Ess-Dykema,
"Switch-board Discourse Language Modeling Project Final Report," in ResearchNote 30, Center for Language and Speech
Processing, Johns Hopkins University, Baltimore, 1998.
[13]. N. Webb, "Cue-Based Dialogue Act Classification," Ph.D. dissertation, Department of Computer Science, University of Sheffield,
England, 2010.
[14]. E. Shriberg, R. Dhillon, S. Bhagat, J. Ang, and H. Carvey, " The ICSI Meeting Recorder Dialog Act (MRDA) Corpus," in Special
Interest Group on Discourse and Dialogue (SIGdial), Boston, USA, 2004.
[15]. H. Bunt, "Context and Dialogue Control. , 1," THINK, vol. 3, pp. 19-31, 1994.
[16]. H. Bunt and Y. Girard, "Designing an Open, Multidimensional Dialogue Act Taxonomy," in the 9th Workshop on the Semantics and
Pragmatics in Dialogue, 2005.
[17]. H. Bunt, J. Alexandersson, J. Carletta, J.-W. Choe, A. C. Fang, K. Hasida, K. Lee, V. Petukhova, A. Popescu-Belis, L. Romary, C.
Soria, and D. Traum, "Towards an ISO standard for dialogue act annotation," in the International Conference on Language
Resources and Systems (LREC 2010), Malta, Italy, 2010, pp. 2548–2558.
[18]. H. Bunt, J. Alexandersson, J. Carletta, J.-W. Choe, A. C. Fang, K. Hasida, K. Lee, V. Petukhova, A. Popescu-Belis, L. Romary, C.
Soria, and D. Traum, "ISO 24617-2: A semantically-based standard for dialogue annotation," in Proceedings of the International
Conference on Language Resources and Evaluation (LREC), Istanbul, Turkey, 2012.
[19]. L. Shala, V. Rus, and A. Graesser, "Automatic Speech Act Classification In Arabic," in Subjetividad y Procesos Cognitivos
Conference 2010, pp. 284-292.
[20]. S. B. Dbabis, F. Mallek, H. Ghorbel, and L. Belguith, "Dialogue Acts Annotation Scheme within Arabic discussion," in SemDial
2012 The 16th workshop on semantics and pragmatics of dialogue Sorbonne, Paris, France, 2012.
[21]. M. Graja, M. Jaoua, and L. H. Belguith, "Discriminative Framework for Spoken Tunisian Dialect Understanding," presented at the
SLSP, 2013.
[22]. C. Lhioui, A. Zouaghi, and M. Zrigui, "A Combined Method Based on Stochastic and Linguistic Paradigm for the Understanding of
Arabic Spontaneous Utterances," in CICLing 2013, Computational Linguistics and Intelligent Text Processing Lecture Notes in
Computer Science, Samos, Greece, 2013, pp. 549-558.

Weitere ähnliche Inhalte

Was ist angesagt?

The First English-Persian statistical machine translation
The First English-Persian statistical machine translationThe First English-Persian statistical machine translation
The First English-Persian statistical machine translation
Mahsa Mohaghegh
 
Part of speech tagger
Part of speech taggerPart of speech tagger
Part of speech tagger
arteimi
 
THE STRUCTURED COMPACT TAG-SET FOR LUGANDA
THE STRUCTURED COMPACT TAG-SET FOR LUGANDATHE STRUCTURED COMPACT TAG-SET FOR LUGANDA
THE STRUCTURED COMPACT TAG-SET FOR LUGANDA
ijnlc
 
13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation
RIILP
 
8. Qun Liu (DCU) Hybrid Solutions for Translation
8. Qun Liu (DCU) Hybrid Solutions for Translation8. Qun Liu (DCU) Hybrid Solutions for Translation
8. Qun Liu (DCU) Hybrid Solutions for Translation
RIILP
 
7. Trevor Cohn (usfd) Statistical Machine Translation
7. Trevor Cohn (usfd) Statistical Machine Translation7. Trevor Cohn (usfd) Statistical Machine Translation
7. Trevor Cohn (usfd) Statistical Machine Translation
RIILP
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliterator
Shashank Shisodia
 

Was ist angesagt? (16)

Make it simple with paraphrases: Automated paraphrasing for authoring aids an...
Make it simple with paraphrases: Automated paraphrasing for authoring aids an...Make it simple with paraphrases: Automated paraphrasing for authoring aids an...
Make it simple with paraphrases: Automated paraphrasing for authoring aids an...
 
The First English-Persian statistical machine translation
The First English-Persian statistical machine translationThe First English-Persian statistical machine translation
The First English-Persian statistical machine translation
 
Part of speech tagger
Part of speech taggerPart of speech tagger
Part of speech tagger
 
Communication
CommunicationCommunication
Communication
 
FIRE2014_IIT-P
FIRE2014_IIT-PFIRE2014_IIT-P
FIRE2014_IIT-P
 
THE STRUCTURED COMPACT TAG-SET FOR LUGANDA
THE STRUCTURED COMPACT TAG-SET FOR LUGANDATHE STRUCTURED COMPACT TAG-SET FOR LUGANDA
THE STRUCTURED COMPACT TAG-SET FOR LUGANDA
 
Surface realization
Surface realizationSurface realization
Surface realization
 
13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation
 
8. Qun Liu (DCU) Hybrid Solutions for Translation
8. Qun Liu (DCU) Hybrid Solutions for Translation8. Qun Liu (DCU) Hybrid Solutions for Translation
8. Qun Liu (DCU) Hybrid Solutions for Translation
 
STANDARD ARABIC VERBS INFLECTIONS USING NOOJ PLATFORM
STANDARD ARABIC VERBS INFLECTIONS USING NOOJ PLATFORMSTANDARD ARABIC VERBS INFLECTIONS USING NOOJ PLATFORM
STANDARD ARABIC VERBS INFLECTIONS USING NOOJ PLATFORM
 
7. Trevor Cohn (usfd) Statistical Machine Translation
7. Trevor Cohn (usfd) Statistical Machine Translation7. Trevor Cohn (usfd) Statistical Machine Translation
7. Trevor Cohn (usfd) Statistical Machine Translation
 
NEURAL SYMBOLIC ARABIC PARAPHRASING WITH AUTOMATIC EVALUATION
NEURAL SYMBOLIC ARABIC PARAPHRASING WITH AUTOMATIC EVALUATIONNEURAL SYMBOLIC ARABIC PARAPHRASING WITH AUTOMATIC EVALUATION
NEURAL SYMBOLIC ARABIC PARAPHRASING WITH AUTOMATIC EVALUATION
 
Modeling of Speech Synthesis of Standard Arabic Using an Expert System
Modeling of Speech Synthesis of Standard Arabic Using an Expert SystemModeling of Speech Synthesis of Standard Arabic Using an Expert System
Modeling of Speech Synthesis of Standard Arabic Using an Expert System
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliterator
 
Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)
 
An Efficient Approach to Produce Source Code by Interpreting Algorithm
An Efficient Approach to Produce Source Code by Interpreting AlgorithmAn Efficient Approach to Produce Source Code by Interpreting Algorithm
An Efficient Approach to Produce Source Code by Interpreting Algorithm
 

Andere mochten auch

Dario falasca cv_mar_2013_st
Dario falasca cv_mar_2013_stDario falasca cv_mar_2013_st
Dario falasca cv_mar_2013_st
dariofalasca
 
01.2015 gks graduate program guidelines(english korean)
01.2015 gks graduate program guidelines(english korean)01.2015 gks graduate program guidelines(english korean)
01.2015 gks graduate program guidelines(english korean)
enkhee12
 
CROSS-CULTURAL SCENARIOS : USA vs ARGENTINE
CROSS-CULTURAL SCENARIOS : USA vs ARGENTINECROSS-CULTURAL SCENARIOS : USA vs ARGENTINE
CROSS-CULTURAL SCENARIOS : USA vs ARGENTINE
Nur Fathin Amirah Musa
 
Imad Saed _ CV _ EU Foramt_November_ 2015
Imad Saed _ CV _ EU Foramt_November_ 2015Imad Saed _ CV _ EU Foramt_November_ 2015
Imad Saed _ CV _ EU Foramt_November_ 2015
Imad SAED
 
Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...
Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...
Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...
Annie Johanna
 
Cybercrime, Digital Investigation and Public Private Partnership by Francesca...
Cybercrime, Digital Investigation and Public Private Partnership by Francesca...Cybercrime, Digital Investigation and Public Private Partnership by Francesca...
Cybercrime, Digital Investigation and Public Private Partnership by Francesca...
Tech and Law Center
 
L.Egberts Final Thesis igitur
L.Egberts Final Thesis igiturL.Egberts Final Thesis igitur
L.Egberts Final Thesis igitur
L Egberts
 

Andere mochten auch (18)

New World Origins: Jesuit-Guarani Reductions tour in South America
New World Origins: Jesuit-Guarani Reductions tour in South AmericaNew World Origins: Jesuit-Guarani Reductions tour in South America
New World Origins: Jesuit-Guarani Reductions tour in South America
 
Dario falasca cv_mar_2013_st
Dario falasca cv_mar_2013_stDario falasca cv_mar_2013_st
Dario falasca cv_mar_2013_st
 
Characteristics of poverty in pakistan
Characteristics of poverty in pakistanCharacteristics of poverty in pakistan
Characteristics of poverty in pakistan
 
Rural urban poverty nexus impact of housing environment
Rural urban poverty nexus impact of housing environmentRural urban poverty nexus impact of housing environment
Rural urban poverty nexus impact of housing environment
 
OpenAIRE-COAR conference 2014: Aligning Repository Networks - RED CLARA/LaRef...
OpenAIRE-COAR conference 2014: Aligning Repository Networks - RED CLARA/LaRef...OpenAIRE-COAR conference 2014: Aligning Repository Networks - RED CLARA/LaRef...
OpenAIRE-COAR conference 2014: Aligning Repository Networks - RED CLARA/LaRef...
 
01.2015 gks graduate program guidelines(english korean)
01.2015 gks graduate program guidelines(english korean)01.2015 gks graduate program guidelines(english korean)
01.2015 gks graduate program guidelines(english korean)
 
CROSS-CULTURAL SCENARIOS : USA vs ARGENTINE
CROSS-CULTURAL SCENARIOS : USA vs ARGENTINECROSS-CULTURAL SCENARIOS : USA vs ARGENTINE
CROSS-CULTURAL SCENARIOS : USA vs ARGENTINE
 
MMA_C.V.2015
MMA_C.V.2015MMA_C.V.2015
MMA_C.V.2015
 
Imad Saed _ CV _ EU Foramt_November_ 2015
Imad Saed _ CV _ EU Foramt_November_ 2015Imad Saed _ CV _ EU Foramt_November_ 2015
Imad Saed _ CV _ EU Foramt_November_ 2015
 
Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...
Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...
Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...
 
Thesis: Title Page and Acknowledgements
Thesis: Title Page and AcknowledgementsThesis: Title Page and Acknowledgements
Thesis: Title Page and Acknowledgements
 
A Draft Report of my Final Thesis
A Draft Report of my Final ThesisA Draft Report of my Final Thesis
A Draft Report of my Final Thesis
 
Swartsfager-Resume
Swartsfager-ResumeSwartsfager-Resume
Swartsfager-Resume
 
Lessons learnt from applying PyData to GetYourGuide marketing
Lessons learnt from applying PyData to GetYourGuide marketingLessons learnt from applying PyData to GetYourGuide marketing
Lessons learnt from applying PyData to GetYourGuide marketing
 
Massive scale analytics with Stratosphere using R
Massive scale analytics with Stratosphere using RMassive scale analytics with Stratosphere using R
Massive scale analytics with Stratosphere using R
 
Cybercrime, Digital Investigation and Public Private Partnership by Francesca...
Cybercrime, Digital Investigation and Public Private Partnership by Francesca...Cybercrime, Digital Investigation and Public Private Partnership by Francesca...
Cybercrime, Digital Investigation and Public Private Partnership by Francesca...
 
PB - Participatory Budget; Case Studies: Brazil and Argentina
PB - Participatory Budget; Case Studies: Brazil and Argentina PB - Participatory Budget; Case Studies: Brazil and Argentina
PB - Participatory Budget; Case Studies: Brazil and Argentina
 
L.Egberts Final Thesis igitur
L.Egberts Final Thesis igiturL.Egberts Final Thesis igitur
L.Egberts Final Thesis igitur
 

Ähnlich wie E041223236

54-6 September 2002, Edinburgh, UK, Pages 5-11.Bos, Fost.docx
54-6 September 2002, Edinburgh, UK, Pages 5-11.Bos, Fost.docx54-6 September 2002, Edinburgh, UK, Pages 5-11.Bos, Fost.docx
54-6 September 2002, Edinburgh, UK, Pages 5-11.Bos, Fost.docx
blondellchancy
 
A Dialogue System for Telugu, a Resource-Poor Language
A Dialogue System for Telugu, a Resource-Poor LanguageA Dialogue System for Telugu, a Resource-Poor Language
A Dialogue System for Telugu, a Resource-Poor Language
Sravanthi Mullapudi
 
Cooperation and Collaboration in Natural Command Language Dialogue.docx
Cooperation and Collaboration in Natural Command Language Dialogue.docxCooperation and Collaboration in Natural Command Language Dialogue.docx
Cooperation and Collaboration in Natural Command Language Dialogue.docx
donnajames55
 
Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...
Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...
Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...
IJCI JOURNAL
 
Proposal of an Ontology Applied to Technical Debt on PL/SQL Development
Proposal of an Ontology Applied to Technical Debt on PL/SQL DevelopmentProposal of an Ontology Applied to Technical Debt on PL/SQL Development
Proposal of an Ontology Applied to Technical Debt on PL/SQL Development
Jorge Barreto
 
Developemnt and evaluation of a web based question answering system for arabi...
Developemnt and evaluation of a web based question answering system for arabi...Developemnt and evaluation of a web based question answering system for arabi...
Developemnt and evaluation of a web based question answering system for arabi...
ijnlc
 
Diacritic Oriented Arabic Information Retrieval System
Diacritic Oriented Arabic Information Retrieval SystemDiacritic Oriented Arabic Information Retrieval System
Diacritic Oriented Arabic Information Retrieval System
CSCJournals
 
ON THE UTILITY OF A SYLLABLE-LIKE SEGMENTATION FOR LEARNING A TRANSLITERATION...
ON THE UTILITY OF A SYLLABLE-LIKE SEGMENTATION FOR LEARNING A TRANSLITERATION...ON THE UTILITY OF A SYLLABLE-LIKE SEGMENTATION FOR LEARNING A TRANSLITERATION...
ON THE UTILITY OF A SYLLABLE-LIKE SEGMENTATION FOR LEARNING A TRANSLITERATION...
cscpconf
 

Ähnlich wie E041223236 (20)

Recent approaches to arabic dialogue acts classifications
Recent approaches to arabic dialogue acts classificationsRecent approaches to arabic dialogue acts classifications
Recent approaches to arabic dialogue acts classifications
 
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
 
54-6 September 2002, Edinburgh, UK, Pages 5-11.Bos, Fost.docx
54-6 September 2002, Edinburgh, UK, Pages 5-11.Bos, Fost.docx54-6 September 2002, Edinburgh, UK, Pages 5-11.Bos, Fost.docx
54-6 September 2002, Edinburgh, UK, Pages 5-11.Bos, Fost.docx
 
A Dialogue System for Telugu, a Resource-Poor Language
A Dialogue System for Telugu, a Resource-Poor LanguageA Dialogue System for Telugu, a Resource-Poor Language
A Dialogue System for Telugu, a Resource-Poor Language
 
ATAR: Attention-based LSTM for Arabizi transliteration
ATAR: Attention-based LSTM for Arabizi transliterationATAR: Attention-based LSTM for Arabizi transliteration
ATAR: Attention-based LSTM for Arabizi transliteration
 
Cooperation and Collaboration in Natural Command Language Dialogue.docx
Cooperation and Collaboration in Natural Command Language Dialogue.docxCooperation and Collaboration in Natural Command Language Dialogue.docx
Cooperation and Collaboration in Natural Command Language Dialogue.docx
 
Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...
Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...
Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...
 
An expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabicAn expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabic
 
Proposal of an Ontology Applied to Technical Debt on PL/SQL Development
Proposal of an Ontology Applied to Technical Debt on PL/SQL DevelopmentProposal of an Ontology Applied to Technical Debt on PL/SQL Development
Proposal of an Ontology Applied to Technical Debt on PL/SQL Development
 
A Context-based Numeral Reading Technique for Text to Speech Systems
A Context-based Numeral Reading Technique for Text to Speech Systems A Context-based Numeral Reading Technique for Text to Speech Systems
A Context-based Numeral Reading Technique for Text to Speech Systems
 
Developemnt and evaluation of a web based question answering system for arabi...
Developemnt and evaluation of a web based question answering system for arabi...Developemnt and evaluation of a web based question answering system for arabi...
Developemnt and evaluation of a web based question answering system for arabi...
 
Car-Following Parameters by Means of Cellular Automata in the Case of Evacuation
Car-Following Parameters by Means of Cellular Automata in the Case of EvacuationCar-Following Parameters by Means of Cellular Automata in the Case of Evacuation
Car-Following Parameters by Means of Cellular Automata in the Case of Evacuation
 
Diacritic Oriented Arabic Information Retrieval System
Diacritic Oriented Arabic Information Retrieval SystemDiacritic Oriented Arabic Information Retrieval System
Diacritic Oriented Arabic Information Retrieval System
 
Using ontology based context in the
Using ontology based context in theUsing ontology based context in the
Using ontology based context in the
 
ON THE UTILITY OF A SYLLABLE-LIKE SEGMENTATION FOR LEARNING A TRANSLITERATION...
ON THE UTILITY OF A SYLLABLE-LIKE SEGMENTATION FOR LEARNING A TRANSLITERATION...ON THE UTILITY OF A SYLLABLE-LIKE SEGMENTATION FOR LEARNING A TRANSLITERATION...
ON THE UTILITY OF A SYLLABLE-LIKE SEGMENTATION FOR LEARNING A TRANSLITERATION...
 
A NATURAL LANGUAGE REQUIREMENTS ENGINEERING APPROACH FOR MDA
A NATURAL LANGUAGE REQUIREMENTS ENGINEERING APPROACH FOR MDA A NATURAL LANGUAGE REQUIREMENTS ENGINEERING APPROACH FOR MDA
A NATURAL LANGUAGE REQUIREMENTS ENGINEERING APPROACH FOR MDA
 
A NATURAL LANGUAGE REQUIREMENTS ENGINEERING APPROACH FOR MDA
A NATURAL LANGUAGE REQUIREMENTS ENGINEERING APPROACH FOR MDA A NATURAL LANGUAGE REQUIREMENTS ENGINEERING APPROACH FOR MDA
A NATURAL LANGUAGE REQUIREMENTS ENGINEERING APPROACH FOR MDA
 
A NATURAL LANGUAGE REQUIREMENTS ENGINEERING APPROACH FOR MDA
A NATURAL LANGUAGE REQUIREMENTS ENGINEERING APPROACH FOR MDA A NATURAL LANGUAGE REQUIREMENTS ENGINEERING APPROACH FOR MDA
A NATURAL LANGUAGE REQUIREMENTS ENGINEERING APPROACH FOR MDA
 
A Natural Language Requirements Engineering Approach for MDA
A Natural Language Requirements Engineering Approach for MDAA Natural Language Requirements Engineering Approach for MDA
A Natural Language Requirements Engineering Approach for MDA
 
A natural language requirements engineering approach for mda
A natural language requirements engineering approach for mdaA natural language requirements engineering approach for mda
A natural language requirements engineering approach for mda
 

Mehr von IOSR-JEN

Mehr von IOSR-JEN (20)

C05921721
C05921721C05921721
C05921721
 
B05921016
B05921016B05921016
B05921016
 
A05920109
A05920109A05920109
A05920109
 
J05915457
J05915457J05915457
J05915457
 
I05914153
I05914153I05914153
I05914153
 
H05913540
H05913540H05913540
H05913540
 
G05913234
G05913234G05913234
G05913234
 
F05912731
F05912731F05912731
F05912731
 
E05912226
E05912226E05912226
E05912226
 
D05911621
D05911621D05911621
D05911621
 
C05911315
C05911315C05911315
C05911315
 
B05910712
B05910712B05910712
B05910712
 
A05910106
A05910106A05910106
A05910106
 
B05840510
B05840510B05840510
B05840510
 
I05844759
I05844759I05844759
I05844759
 
H05844346
H05844346H05844346
H05844346
 
G05843942
G05843942G05843942
G05843942
 
F05843238
F05843238F05843238
F05843238
 
E05842831
E05842831E05842831
E05842831
 
D05842227
D05842227D05842227
D05842227
 

Kürzlich hochgeladen

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Kürzlich hochgeladen (20)

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 

E041223236

  • 1. IOSR Journal of Engineering (IOSRJEN) www.iosrjen.org ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 12 (December 2014), ||V2|| PP 32-36 International organization of Scientific Research 32 | P a g e Arabic Inquiry-Answer Dialogue Acts Annotation Schema AbdelRahim A. Elmadany1 , Sherif M. Abdou2 , Mervat Gheith1 1 Institute of Statistical Studies and Research (ISSR), Cairo University Emails: ar_elmadany@hotmail.com, mervat_gheith@yahoo.com 2 Faculty of Computers and Information, Cairo University Email: sh.ma.abdou@gmail.com Abstract: We present an annotation schema as part of an effort to create a manually annotated corpus for Arabic dialogue language understanding including spoken dialogue and written „chat‟ dialogue for inquiry- answer domain. The proposed schema handles mainly the request and response acts that occurs frequently in inquiry-answer debate conversations expressing request services, suggests, and offers. We applied the proposed schema on 83 Arabic inquiry-answer dialogues. I. Introduction Arabic Natural Language Processing (ANLP) including Arabic Language Understanding (ALU) within dialogue-based research has gained an increasing interest in the last few years. Building an ALU system requires an annotated dialogue acts (DAs) corpora that is annotated according to a specified dialogue acts schema. Inquiry-answer is an important part of industry world because it is the core of a successful business. When quality services are met and exceed customer expectations, the customers are pleased. In order to ensure this happens, service managers (inquiry-answer operators) need to be aware of the parts of the service delivery experience that are open to cultural effects[1]. In this paper, we propose a dialogue acts annotation schema for Arabic inquiry-answer dialogues. This schema defines the dialogue acts using multi-dimensions within communication functions based on request and response that have certain questions, responses and the relation between them. In this paper, we proposed the first version of Arabic inquiry-answer manually annotated dialogues corpus, which contains 83 spoken and written ‘chat’ dialogues. In the following sections, section 2 summarizes the main annotation schemas developed for English language and Arabic language. Section 3 describes the proposed annotation schema. Section 4 describes the an- notation corpus and section 5 includes the conclusion. II. Related works Dialogue act refers to the meaning of an utterance representation at the level of illocutionary force [2]. Speech acts terminology has been addressed by Searle (1969) based on Austin (1962) work. The idea of a dialogue act plays a key role in studies of dialogue, especially in communicative behavior understanding of dialogue participants, in building annotated dialogue corpora and in the design of dialogue management systems for spoken human-computer dialogue. Dialogue act is approximately the equivalent of the speech act of Searle (1969). Dialog acts are differ- ent in different dialog systems. The research on dialog acts has increased since 1999, after spoken dialog sys- tems became a commercial reality[3]. The MapTask project [4] proposed labeling schema using 12 dialogue acts based on two categories (1) initiating moves (2) response. The VERBMOBIL project (1993-2000) aimed at the development of an automatic speech to speech translation system for the languages German, American English and Japanese [5]. The VERBMOBIL Project had two phases, the first phase proposed labeling schema using hierarchy of 43 dialogue acts [6]. The second phase expanded the dialogues from meeting scheduling to comprehensive travel planning. Thus change labeling schema to a hierarchy of 18 dialogue acts [7]. The DAMSL Dialogue Act Markup using Several Layers (DAMSL) were proposed as a general- pur- pose schema [8-10] developed for multi-dimensional dialogue acts annotation. SWITCHBOARD-DAMSL is improved version of DAMSL proposed by [11]when they wanted to annotate a large amount of transcribed speech data ‘Switchboard Corpus’ because of the difficulty of consistently applying the DAMSL annotation schema [11, 12]. SWITCHBOARD-DAMSL schema includes 220 dialogues acts, but it is still difficult to be used for manual annotation because it is a very large set. Moreover, [11]reported 0.80 of Kappa score with the 220 dialogue acts and 130 dialogue acts occurred less than 10 times in the entire corpus [13]. To obtain enough data per class for statistical modeling purposes, [11]proposed new dialogue act schema namely SWITCH- BOARD contains 42 mutually exclusive dialogue acts types.
  • 2. Arabic Inquiry-Answer Dialogue Acts Annotation Schema International organization of Scientific Research 33 | P a g e The ICSI-MRDA Meeting Room corpus use DA variant of the DAMSL dialogue acts schema like the SWITCHBOARD corpus by combining the tags into single, distinct dialogue acts to reduce aspects of the mul- tidimensional nature of the original DAMSL annotation scheme. There are 11 general tags and 39 specific acts that are used over ICSI-MRDA Meeting Room corpus [14]. The AMI project, a European research project cen- tered on multi-modal meeting room technology, uses 15 dialogue acts. Dynamic Interpretation Theory (DIT) [15] reported dialogue acts schema with a number of dialogue act types from DAMSL [8] and other schema. The DIT++ is a comprehensive system of dialogue act types obtained by extending the acts of DIT [16]. DIT++ schema has 11 dimensions with around 95 communicative functions, around 42 of which, like switchboard are for general purpose functions, whereas others cover elements of feed- back, interaction management and the control of social obligations [13]. [17]has proposed a preliminary version of ISO DIS 24617-2:2010 as an international standard for anno- tating dialogue with semantic information; in particular concerning the communicative functions of the utter- ances, the kind of content they address, and the dependency relations to what was said and done earlier in the dialogue. [18]has proposed the final version of ISO Standard 24617-2. Most of the previous proposed annotation schemes to mark-up dialogue corpora in languages such as English, German and Spanish. There are few efforts were done to propose dialogue acts annotation schema for Arabic. First attempt was by [19] which proposed a DAs classifier with scheme that contains about 10 DAs (as- sertion, declaration, denial, expressive evaluation, greeting, indirect request, question, promise/denial, response to question, and short-response). (Dbabis et al., 2012) proposed another DAs schema within an Arabic discussion that contains about 13 DAs within 6 categories: Social Obligation Management (e.g. Opening, Closing, Greeting … etc.), Turn Man- agement (e.g. Acknowledgement, Calm, Clarify Request … etc.), Request (e.g. Question, Order, Promise … etc.), Argumentation (e.g. Opinion, Appreciation, Accept, Reject … etc.), Answer, and Statement [20]. These schemas were used to mark-up dialogue corpora based on a general conversion discussion like TV talk-show programs. There are two other works concerned with understanding the Arabic inquiry-answer dialogue conver- sions, [21] proposed an annotation schema to semantically label words on spoken Tunisian dialect turns which are not segmented into utterances as shown in Figure 1. This schema was applied on TUDICOI corpus that con- tains dialogues collected from the National Company of Railway in Tunisia (SNCFT). This schema was used to mark-up word-by-word in dialogue turn not turns or utterances dialogue acts. This schema contains about 33 semantic labels for word annotation within 5 dimensions e.g. Train_Type, Trip_Time, and Destination. [ ‫مع‬ Out] [ ‫وقتاش‬ Hour_Req] [ ‫إي‬Out] [ ‫إي‬Out] [‫التران‬Train] [ ‫يمشي‬ Departure_Cpt] Figure 1 Example of semantic labeling from [21] [22] proposed a dialogue acts classifier based on hotel reservations corpus that consist of 100 queries of different types (negation, affirmation, interrogation and acceptance). They used a small test set that contained 140 utterances with 14 different dialogue acts that is uttered spontaneously to evaluate their system. This dialo- gue acts are incomplete to build a inquiry-answer discourse structure because this tag set can’t annotate commu- nicative functions related to social obligations, suggestions, miss understanding correction, warnings, explana- tion … etc. III. Annotation Schema In this work, we started from dialogue acts in a previous annotation schema for either Arabic or Eng- lish. We proposed a general dialogue acts annotation schema for inquiry-answer based request and response di- mensions. Table 1,Table 2, and Table 3 describe the proposed dialogue acts based on request dimension within 6 dialogue acts, response dimension within 14 dialogue acts, and other dimension within 3 dialogue acts. Request Acts Dealing with request activities and tasks Taking-Request Dealing with taking request e.g. hello Service-Question Dealing with services request e.g. asking about service information or required a service. Confirm-Question Happens when needs to confirmation about some informa- tion. YesNo-Question Happens when needs Yes or No answer.
  • 3. Arabic Inquiry-Answer Dialogue Acts Annotation Schema International organization of Scientific Research 34 | P a g e Choice-Question Happens when needs select one answer from service mul- tiple-choices question. Other-Question Happens when asking about non-service question e.g. mo- bile number, email, or address. Turn-Assign Happens when wants to addressee the speaker to take the turn e.g. Adam? Table 1: Request dimension dialogue acts Response Acts Dealing with response activities and tasks Service-Answer Happens when answer a Service-Question or Choice- Question. Other-Answer Happens when answer an Other-Question. Agree Describe agreement/accept answer from Confirm-Question or YesNo-Question. Disagree Describe disagreement/reject answer from Confirm- Question or YesNo-Question. Greeting Happens when speaker wants to greeting and welcome the other speaker. Also describe greeting accept „return- greeting‟. Inform Happens when speaker wants to explain or describe some- thing to other speaker. Thanking Happens when speaker wants to thank the other speaker. Also describe thanking accept. Apology Happens when speaker wants to apology. MissUnderstandingSign Happens when non-understanding the previous utterance. Correct Happens when correct an information in previous utterance or in current utterance. Pausing Happens when needs to request more time or stealing time e.g. just a moment. Suggest Happens when provides a suggestion. Promise Happens when provides a promise. Warning Happens when provides a warning action. Offer Happens when provides an offer to the customer. Table 2: Response dimension dialogue acts Other Acts Dealing with neither request nor response activities and tasks Opening Dealing with opening obligation utterance e.g. “Good even- ing, Banque Misr, Ahmed Samy speaking”. Closing Dealing with closing obligation request e.g. “Thank you for calling and goodbye”. Self-Introduce Happens when wants to introduce our self or organization. Table 3: Other dimension dialogue acts We set ‘Agree’ dialogue act to describe the agreement/acceptance answers as accept-confirmation, Yes- Answer, accept-thanking, and accept-apology. In contrast, we set ‘Disagree’ dialogue act to describe the disa-
  • 4. Arabic Inquiry-Answer Dialogue Acts Annotation Schema International organization of Scientific Research 35 | P a g e greement/reject answer as disconfirm, No-Answer, and Reject-thanking, Reject-apology. Our empirical analysis for our corpus leaded to identifying ‘Opening’ dialogue act as a group of ‘Greeting’ act and ‘self-Introduce’ act. For example from our corpus: This turn tagged as ‘Opening’ dialogue act and segmented to three utterances as:  ‫الخير‬ ‫مساء‬/msA' Alxyr/Good evening is tagged as ‘Greeting’ dialogue act.  ‫مصر‬ ‫بنك‬/ bnk mSr/Banque Misr is tagged as ‘Self-Introduce’ dialogue act.  ‫حضرتك‬ ‫مع‬ ‫احمد‬/ AHmd mE HDrtk/Ahmed speaking is tagged as ‘Self-Introduce’ dialogue act. IV. Annotation Corpus For the best of our knowledge, there is no corpus for annotated Arabic spoken or written ‘chat’ dialo- gues in Arabic dialect. For this reason, we built our own corpus as the first version of manually annotated corpus for Arabic dialogue language understanding including spoken dialogue and written ‘chat’ dialogue for inquiry- answer domain. Figure 2: Utterance Annotation Example This corpus contains two parts (1) spoken dialogue which contains 52 phone calls recorded from Egyp- tian’s banks inquiry-answer and Egypt Air Company with an average duration of two hour of talking time after removing ads from calls. It consists of human-human discussions about providing services e.g. create new bank account, service request, balance check and flight reservation. These phone calls were transcribed using Tran- scriber®, a tool that is frequently used for segmenting, labeling and transcribing speech corpora. (2) Written ‘Chat’ dialogues, which contain 30 chat dialogues, recorded from mobile agencies web sites ‘KSA Zain, KSA Mobily, and KSA STC’. This corpus contains 3001 turns with average 6.7 words per turn and contains 4727 utterances with average 4.3 words per utterance. This corpus is manually segmented and annotated using the proposed schema. An an- notation tool was developed to manually annotate dialogue turns and segment it is to utterances and label the dialogue act. A sample-annotated turn is shown in Figure 2. V. Conclusion In this paper, we proposed a general dialogue acts schema for Arabic inquiry-answer dialogue either spoken or written ‘chat’. We proposed the first version of the manually annotated inquiry-answer dialogue as a part of Arabic language understanding methodologies. In addition, we introduced the first version of annotated corpus that includes 83 inquiry-answer dialo- gues from banks, Airlines Company, and mobile networks agency. In future work we plan to enrich this corpus with inquiry-answer dialogues from other domains e.g. Online Markets, and Railway Networks in both forms spoken and written ‘chat’ dialogues. Reference [1]. S. Carraher and A. Doerr, "A Six Continent Study of Customer Service," Proceedings of Academy of Marketing Studies, vol. 17, 2012. [2]. J. Austin, How to Do Things with Words. Oxford: Oxford University Press, 1962. Arabic: ‫حضرتك‬ ‫مع‬ ‫احمد‬ ‫مصر‬ ‫بنك‬ ‫الخير‬ ‫مساء‬ Buckwalter: msA' Alxyr bnk mSr AHmd mE HDrtk English: Good evening, Banque Misr, Ahmed speaking
  • 5. Arabic Inquiry-Answer Dialogue Acts Annotation Schema International organization of Scientific Research 36 | P a g e [3]. A. Stolcke, K. Ries, N. Coccaro, E. Shriberg, R. Bates, D. Jurafsky, P. Taylor, R. Martin, C. V. Ess-Dykema, and M. Meteer, "Dialogue Act Modeling for Automatic Tagging and Recognition of Conversational Speech," Computational Linguistics, vol. 26, pp. 339-373, 2000. [4]. A. Anderson, M. Bader, E. Bard, E. Boyle, and G. Doherty, "The HCRC Map Task Corpus," Language and Speech, vol. 34, pp. 351- 366, 1991. [5]. W. Wahlster, Verbmobil: Foundations of Speech-To-Speech Translation: Springer, 2000. [6]. S. Jekat, A. Klein, E. Maier, Ilona Maleck, M. Mast, and J. Quantz, "Dialogue Acts in VERBMOBIL. VM report 65," Universities of Berlin, Saarbrcken and Stuttgart.1995. [7]. J. Alexandersson, B. Buschbeck-Wolf, T. Fujinami, M. Kipp, S. Koch, E. Maier, N. Reithinger, B. S. mitz, and M. Siegel., "Dialogue Acts in VERBMOBIL- 2 (second edition). VM report 226 , DFKI GmbH, ," Universities of Berlin, Saarbrcken and Stuttgart1998. [8]. J. Allen and M. Core, "Draft of DAMSL: Dialog Act Markup in Several Layers," 1997. [9]. M. Core and J. Allen, "Coding Dialogs with the DAMSL annotation scheme," presented at the AAAI Fall Symposium on Communicative Action in Humans and Machines, MIT, Cambridge, MA, 1997. [10]. M. Core, M. Ishizaki, J. Moore, C. Nakatani, N. Reithinger, D. Traum, and S. Tutiya, "Report of The Third Workshop of the Discours e Resource Initiative," Chiba University, Chiba, Japan1998. [11]. D. Jurafsky, R. Bates, N. Coccaro, R. Martin, M. Meteer, K. Ries, E. Shriberg, A. Stolcke, P. Taylor, and C. E. Dykema., "Automatic Detection of Discourse Structure for Speech Recognition and Understanding," presented at the IEEE Workshop on Speech Recognition and Understanding, Santa Barbara, 1997. [12]. D. Jurafsky, R. Bates, N. Coccaro, R. Martin, M. Meteer, K. Ries, E. Shriberg, A. Stolcke, P. Taylor, and C. V. Ess-Dykema, "Switch-board Discourse Language Modeling Project Final Report," in ResearchNote 30, Center for Language and Speech Processing, Johns Hopkins University, Baltimore, 1998. [13]. N. Webb, "Cue-Based Dialogue Act Classification," Ph.D. dissertation, Department of Computer Science, University of Sheffield, England, 2010. [14]. E. Shriberg, R. Dhillon, S. Bhagat, J. Ang, and H. Carvey, " The ICSI Meeting Recorder Dialog Act (MRDA) Corpus," in Special Interest Group on Discourse and Dialogue (SIGdial), Boston, USA, 2004. [15]. H. Bunt, "Context and Dialogue Control. , 1," THINK, vol. 3, pp. 19-31, 1994. [16]. H. Bunt and Y. Girard, "Designing an Open, Multidimensional Dialogue Act Taxonomy," in the 9th Workshop on the Semantics and Pragmatics in Dialogue, 2005. [17]. H. Bunt, J. Alexandersson, J. Carletta, J.-W. Choe, A. C. Fang, K. Hasida, K. Lee, V. Petukhova, A. Popescu-Belis, L. Romary, C. Soria, and D. Traum, "Towards an ISO standard for dialogue act annotation," in the International Conference on Language Resources and Systems (LREC 2010), Malta, Italy, 2010, pp. 2548–2558. [18]. H. Bunt, J. Alexandersson, J. Carletta, J.-W. Choe, A. C. Fang, K. Hasida, K. Lee, V. Petukhova, A. Popescu-Belis, L. Romary, C. Soria, and D. Traum, "ISO 24617-2: A semantically-based standard for dialogue annotation," in Proceedings of the International Conference on Language Resources and Evaluation (LREC), Istanbul, Turkey, 2012. [19]. L. Shala, V. Rus, and A. Graesser, "Automatic Speech Act Classification In Arabic," in Subjetividad y Procesos Cognitivos Conference 2010, pp. 284-292. [20]. S. B. Dbabis, F. Mallek, H. Ghorbel, and L. Belguith, "Dialogue Acts Annotation Scheme within Arabic discussion," in SemDial 2012 The 16th workshop on semantics and pragmatics of dialogue Sorbonne, Paris, France, 2012. [21]. M. Graja, M. Jaoua, and L. H. Belguith, "Discriminative Framework for Spoken Tunisian Dialect Understanding," presented at the SLSP, 2013. [22]. C. Lhioui, A. Zouaghi, and M. Zrigui, "A Combined Method Based on Stochastic and Linguistic Paradigm for the Understanding of Arabic Spontaneous Utterances," in CICLing 2013, Computational Linguistics and Intelligent Text Processing Lecture Notes in Computer Science, Samos, Greece, 2013, pp. 549-558.