SlideShare ist ein Scribd-Unternehmen logo
1 von 17
Downloaden Sie, um offline zu lesen
Question Classification
               Using Machine Learning Methods


                

                             Jennifer Lee
                     CSI 5386 Project Presentation
                               Fall 2008




                                   
Motivation
• An important step in QA
  – To classify the question to the anticipated type 
    of the answer (semantically).
  – More challenging than common search tasks.
• Q: What Canadian city has the largest 
  population?
  Answer type: city
The Ambiguity Problem
  • What is bipolar disorder?
  • What do bats eat?
  • What is the PH scale?
                
  • Hard to categorize those questions into one 
     single class
   – Need multiple class labels for a single 
     question.
Why Machine Learning?
• Manually constructed sets of rules to map a 
    question to its type is not efficient.
     – Requires the analysis of a large number of 
        questions.
                
     – Mapping questions into fine classes requires 
        the use of lexical items (specific words).
• A learned classifier enables one to define only a 
  small number of “type” features.
• Can be trained on a new taxonomy.
Li and Roth (2002):
       Learning Question Classifier
• Uses the SnoW learning architecture.
     – Hierarchical classifiers
     – 6 coarse classes: ABBREVATION, ENTITY, 
                
        DESCRIPTION, HUMAN, LOCATION, 
        NUMERIC VALUE.
     – 50 fine classes.



                         
Li and Roth (cont)
• UIUC question classification dataset
  – 5500 training (from TREC 8,9, including 500 
    rare questions).
  – 500 test datasets from TREC 10.
                
• Six primitive feature types:
  – Words, pos tags, chunks, named entities, head 
    chunks and semantically related words
• Semantically related word list for each question
  – “away” belongs to the sensor Rel(distance).

                            
Zhang and Lee (2003): 
       Question Classifcation using SVM
• Two kind of features:
   – Bag of words and bag of n­grams.
• SVM with kernel tree 
     – Use LIBVSM (Chang and Lin, 2001).
                
     – Take advantage of the syntactic structures of 
        questions.
     – Compare with Nearest Neighbors, Naïve 
        Bayes, Decision Tree, SnoW.

                            
Zhang and Lee (cont)
• Using the same dataset as Li and Roth
• Same two­layered question taxonomy
• Same assumption:
                
     – One question resides in only one category.
• Uses automated constructed features 
  – No semantically related word list




                           
Huang et al. (2008):
    QC using Head Words and their Hypernyms
    • In contrast to Li's, a compact feature set 
      was proposed:
       – Head word
                
       – Use WordNet to augment the semantic 
          features.
       – Adopt Lesk's word sense disambiguation 
          algorithm


                           
Huang et al. (cont)
• Again, use the same dataset.
• Other features:
    – Question wh­word, word  grams, word shape
• Classifiers:
                
    – Maximum Entropy Model
    – Support Vector Model – also adopt LIBVSM.
    – Obtained higher accuracy (89% and 89.2%). 



                          
Datasets for the project
• Same dataset as Li's:
    – http://l2r.cs.uiuc.edu/~cogcomp/Data/QA/QC/
• Additional datasets:
     – TREC QA: http://trec.nist.gov/data/qa.html
                




                            
Plan for the project
• Experiment with different feature types:
    – Head chunks, semantic features for head 
      chunk, named­entities, word grams and word 
      shape feature
                
• Use WordNet to automate the generation of 
  semantic features
    – Find hypernyms.
    – Apply Lesk's WSD to the head chunk.


                          
Head word sense disambiguation




                   
Resources
• Java interface to WordNet:
    – http://wordnet.princeton.edu/links#SQL
• A syntactic parser for extracting the head­
    chunk feature:
                
    – Berkeley parser (Petrov and Klein, 2007).
• Use the N­gram Statistics Package



                           
Resources (cont)
• Named entity recognizer, a relational 
  feature extraction language (FEX):
    – http://l2r.cs.uiuc.edu/~cogcomp/software.php
• Mallet Machine Learning Library:
    – http://mallet.cs.umass.edu/




                            
References
• Li, X. and D. Roth. 2002. Learning Question 
  Classifiers.The 19th international conference on 
  Computational linguistics, vol. 1, pp. 1–7.
• Zhang D. and W. S. Lee. 2003. Question 
  Classification using Support Vector Machines. 
  The ACM SIGIR conference in information 
  retrieval, pp. 26–32.
• Zhiheng Huang; Marcus Thint; Zengchang Qin. 
  Question Classification using Head Words and 
  their Hypernyms.
                          
References (cont)
• D. Roth, G. Kao, X. Li, R. Nagarajan, V. 
  Punyakanok, N. Rizzolo, W. Yih, C. O. Alm, and 
  L. G. Moran. 2002. Learning components for a 
  question answering system. In TREC­2001.
• Jonathan Brown – IR Lab.  Entity­Tagged 
  Language Models for Question Classification in a 
  QA System.
• Donald Metzler, W. Bruce Croft Analysis of 
  statistical question classification for fact­based 
  questions (2003).

                            

Weitere ähnliche Inhalte

Ähnlich wie Question Classifier

Strata San Jose 2016: Scalable Ensemble Learning with H2O
Strata San Jose 2016: Scalable Ensemble Learning with H2OStrata San Jose 2016: Scalable Ensemble Learning with H2O
Strata San Jose 2016: Scalable Ensemble Learning with H2OSri Ambati
 
Learnometrics: Metrics for Learning Objects
Learnometrics: Metrics for Learning ObjectsLearnometrics: Metrics for Learning Objects
Learnometrics: Metrics for Learning ObjectsXavier Ochoa
 
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017StampedeCon
 
Discovering Hot Topics in the Blogosphere
Discovering Hot Topics in the BlogosphereDiscovering Hot Topics in the Blogosphere
Discovering Hot Topics in the BlogosphereManolis Platakis
 
My thesis progress presentation
My thesis progress presentationMy thesis progress presentation
My thesis progress presentationJames Thomas
 
Stacked Ensembles in H2O
Stacked Ensembles in H2OStacked Ensembles in H2O
Stacked Ensembles in H2OSri Ambati
 
An Overview of Naïve Bayes Classifier
An Overview of Naïve Bayes Classifier An Overview of Naïve Bayes Classifier
An Overview of Naïve Bayes Classifier ananth
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningRahul Jain
 
Nlp and Neural Networks workshop
Nlp and Neural Networks workshopNlp and Neural Networks workshop
Nlp and Neural Networks workshopQuantUniversity
 
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...Isabelle Augenstein
 
Introduction to Mahout and Machine Learning
Introduction to Mahout and Machine LearningIntroduction to Mahout and Machine Learning
Introduction to Mahout and Machine LearningVarad Meru
 
Applications for Social Networking Strategies in an Agency Context: Exploitin...
Applications for Social Networking Strategies in an Agency Context: Exploitin...Applications for Social Networking Strategies in an Agency Context: Exploitin...
Applications for Social Networking Strategies in an Agency Context: Exploitin...BoaB Team
 
Charting the Digital Library Evaluation Domain with a Semantically Enhanced M...
Charting the Digital Library Evaluation Domain with a Semantically Enhanced M...Charting the Digital Library Evaluation Domain with a Semantically Enhanced M...
Charting the Digital Library Evaluation Domain with a Semantically Enhanced M...Giannis Tsakonas
 
Open IE tutorial 2018
Open IE tutorial 2018Open IE tutorial 2018
Open IE tutorial 2018Andre Freitas
 
Scalable Ensemble Machine Learning @ Harvard Health Policy Data Science Lab
Scalable Ensemble Machine Learning @ Harvard Health Policy Data Science LabScalable Ensemble Machine Learning @ Harvard Health Policy Data Science Lab
Scalable Ensemble Machine Learning @ Harvard Health Policy Data Science LabSri Ambati
 
Ph d sem_1@iitm
Ph d sem_1@iitmPh d sem_1@iitm
Ph d sem_1@iitmVinu Ev
 
Query Recommendation - Barcelona 2017
Query Recommendation - Barcelona 2017Query Recommendation - Barcelona 2017
Query Recommendation - Barcelona 2017Puya - Hossein Vahabi
 

Ähnlich wie Question Classifier (20)

Six Month
Six MonthSix Month
Six Month
 
Strata San Jose 2016: Scalable Ensemble Learning with H2O
Strata San Jose 2016: Scalable Ensemble Learning with H2OStrata San Jose 2016: Scalable Ensemble Learning with H2O
Strata San Jose 2016: Scalable Ensemble Learning with H2O
 
Learnometrics: Metrics for Learning Objects
Learnometrics: Metrics for Learning ObjectsLearnometrics: Metrics for Learning Objects
Learnometrics: Metrics for Learning Objects
 
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
 
Discovering Hot Topics in the Blogosphere
Discovering Hot Topics in the BlogosphereDiscovering Hot Topics in the Blogosphere
Discovering Hot Topics in the Blogosphere
 
My thesis progress presentation
My thesis progress presentationMy thesis progress presentation
My thesis progress presentation
 
Stacked Ensembles in H2O
Stacked Ensembles in H2OStacked Ensembles in H2O
Stacked Ensembles in H2O
 
An Overview of Naïve Bayes Classifier
An Overview of Naïve Bayes Classifier An Overview of Naïve Bayes Classifier
An Overview of Naïve Bayes Classifier
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Nlp and Neural Networks workshop
Nlp and Neural Networks workshopNlp and Neural Networks workshop
Nlp and Neural Networks workshop
 
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
 
Introduction to Mahout and Machine Learning
Introduction to Mahout and Machine LearningIntroduction to Mahout and Machine Learning
Introduction to Mahout and Machine Learning
 
Applications for Social Networking Strategies in an Agency Context: Exploitin...
Applications for Social Networking Strategies in an Agency Context: Exploitin...Applications for Social Networking Strategies in an Agency Context: Exploitin...
Applications for Social Networking Strategies in an Agency Context: Exploitin...
 
Introduction To R
Introduction To RIntroduction To R
Introduction To R
 
Charting the Digital Library Evaluation Domain with a Semantically Enhanced M...
Charting the Digital Library Evaluation Domain with a Semantically Enhanced M...Charting the Digital Library Evaluation Domain with a Semantically Enhanced M...
Charting the Digital Library Evaluation Domain with a Semantically Enhanced M...
 
Open IE tutorial 2018
Open IE tutorial 2018Open IE tutorial 2018
Open IE tutorial 2018
 
ACL-IJCNLP 2015
ACL-IJCNLP 2015ACL-IJCNLP 2015
ACL-IJCNLP 2015
 
Scalable Ensemble Machine Learning @ Harvard Health Policy Data Science Lab
Scalable Ensemble Machine Learning @ Harvard Health Policy Data Science LabScalable Ensemble Machine Learning @ Harvard Health Policy Data Science Lab
Scalable Ensemble Machine Learning @ Harvard Health Policy Data Science Lab
 
Ph d sem_1@iitm
Ph d sem_1@iitmPh d sem_1@iitm
Ph d sem_1@iitm
 
Query Recommendation - Barcelona 2017
Query Recommendation - Barcelona 2017Query Recommendation - Barcelona 2017
Query Recommendation - Barcelona 2017
 

Kürzlich hochgeladen

9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 

Kürzlich hochgeladen (20)

9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 

Question Classifier

  • 1. Question Classification Using Machine Learning Methods                  Jennifer Lee CSI 5386 Project Presentation Fall 2008    
  • 2. Motivation • An important step in QA – To classify the question to the anticipated type  of the answer (semantically). – More challenging than common search tasks. • Q: What Canadian city has the largest  population? Answer type: city
  • 3. The Ambiguity Problem • What is bipolar disorder? • What do bats eat? • What is the PH scale?                  • Hard to categorize those questions into one  single class – Need multiple class labels for a single  question.
  • 4. Why Machine Learning? • Manually constructed sets of rules to map a  question to its type is not efficient. – Requires the analysis of a large number of  questions.                  – Mapping questions into fine classes requires  the use of lexical items (specific words). • A learned classifier enables one to define only a  small number of “type” features. • Can be trained on a new taxonomy.
  • 5. Li and Roth (2002): Learning Question Classifier • Uses the SnoW learning architecture. – Hierarchical classifiers – 6 coarse classes: ABBREVATION, ENTITY,                   DESCRIPTION, HUMAN, LOCATION,  NUMERIC VALUE. – 50 fine classes.    
  • 6. Li and Roth (cont) • UIUC question classification dataset – 5500 training (from TREC 8,9, including 500  rare questions). – 500 test datasets from TREC 10.                  • Six primitive feature types: – Words, pos tags, chunks, named entities, head  chunks and semantically related words • Semantically related word list for each question – “away” belongs to the sensor Rel(distance).    
  • 7. Zhang and Lee (2003):  Question Classifcation using SVM • Two kind of features: – Bag of words and bag of n­grams. • SVM with kernel tree  – Use LIBVSM (Chang and Lin, 2001).                  – Take advantage of the syntactic structures of  questions. – Compare with Nearest Neighbors, Naïve  Bayes, Decision Tree, SnoW.    
  • 8. Zhang and Lee (cont) • Using the same dataset as Li and Roth • Same two­layered question taxonomy • Same assumption:                  – One question resides in only one category. • Uses automated constructed features  – No semantically related word list    
  • 9. Huang et al. (2008): QC using Head Words and their Hypernyms • In contrast to Li's, a compact feature set  was proposed: – Head word                  – Use WordNet to augment the semantic  features. – Adopt Lesk's word sense disambiguation  algorithm    
  • 10. Huang et al. (cont) • Again, use the same dataset. • Other features: – Question wh­word, word  grams, word shape • Classifiers:                  – Maximum Entropy Model – Support Vector Model – also adopt LIBVSM. – Obtained higher accuracy (89% and 89.2%).     
  • 11. Datasets for the project • Same dataset as Li's: – http://l2r.cs.uiuc.edu/~cogcomp/Data/QA/QC/ • Additional datasets: – TREC QA: http://trec.nist.gov/data/qa.html                     
  • 12. Plan for the project • Experiment with different feature types: – Head chunks, semantic features for head  chunk, named­entities, word grams and word  shape feature                  • Use WordNet to automate the generation of  semantic features – Find hypernyms. – Apply Lesk's WSD to the head chunk.    
  • 14. Resources • Java interface to WordNet: – http://wordnet.princeton.edu/links#SQL • A syntactic parser for extracting the head­ chunk feature:                  – Berkeley parser (Petrov and Klein, 2007). • Use the N­gram Statistics Package    
  • 15. Resources (cont) • Named entity recognizer, a relational  feature extraction language (FEX): – http://l2r.cs.uiuc.edu/~cogcomp/software.php • Mallet Machine Learning Library: – http://mallet.cs.umass.edu/    
  • 16. References • Li, X. and D. Roth. 2002. Learning Question  Classifiers.The 19th international conference on  Computational linguistics, vol. 1, pp. 1–7. • Zhang D. and W. S. Lee. 2003. Question  Classification using Support Vector Machines.  The ACM SIGIR conference in information  retrieval, pp. 26–32. • Zhiheng Huang; Marcus Thint; Zengchang Qin.  Question Classification using Head Words and  their Hypernyms.    
  • 17. References (cont) • D. Roth, G. Kao, X. Li, R. Nagarajan, V.  Punyakanok, N. Rizzolo, W. Yih, C. O. Alm, and  L. G. Moran. 2002. Learning components for a  question answering system. In TREC­2001. • Jonathan Brown – IR Lab.  Entity­Tagged  Language Models for Question Classification in a  QA System. • Donald Metzler, W. Bruce Croft Analysis of  statistical question classification for fact­based  questions (2003).