SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
Natural Language Processing

                  Daniel Dahlmeier

NUS Graduate School for Integrative Sciences and Engineering
              danielhe@comp.nus.edu.sg


            CSTalks 2 November 2011
Acknowledgments




  Examples and figures from Michael Collins’ lecture notes:
  http://www.cs.columbia.edu/∼mcollins.


  Some other figures are from Wikipedia: http://www.wikipedia.org.


  The rest I randomly found on the web.
Examples
                    What is NLP?
                     Background
                       NLP tasks
                   Why is it hard?
                    Related Stuff
                      Conclusion



Google translate




                                     3/25
Examples
                     What is NLP?
                      Background
                        NLP tasks
                    Why is it hard?
                     Related Stuff
                       Conclusion



IBM’s Watson computer wins at Jeopardy!




                                          4/25
Examples
        What is NLP?
         Background
           NLP tasks
       Why is it hard?
        Related Stuff
          Conclusion



Siri




                         5/25
Examples
                             What is NLP?
                              Background
                                NLP tasks
                            Why is it hard?
                             Related Stuff
                               Conclusion



What is Natural Language Processing?


   Natural Language Processing (NLP) or Computational Linguistics
   Language processing that goes beyond a “bag of words” representation.

   Example
       Translate from one language into the other.
       Answer natural language questions.
       Parse the syntactic/semantic structure of a sentence.

   The other NLP
   NLP = neuro-linguistic programming.


                                                                           6/25
Examples
                                What is NLP?
                                 Background
                                   NLP tasks
                               Why is it hard?
                                Related Stuff
                                  Conclusion



Background(s): Artificial Intelligence




   Talk to your computer
       Dave: Hello, HAL. Do you read me, HAL?
       HAL: Affirmative, Dave. I read you.
       Dave: Open the pod bay doors, HAL.
       HAL: I’m sorry, Dave. I’m afraid I can’t do that.

   The computer needs to ...
       Understand the user : Natural Language Understanding.
       Generate a well-formed reply : Natural Language Generation.
                                                                     7/25
Examples
                              What is NLP?
                               Background
                                 NLP tasks
                             Why is it hard?
                              Related Stuff
                                Conclusion



Background(s): Artificial Intelligence (cont.)




   Turing Test
       Experimenter talks to two parties A and B via a terminal.
       If C cannot distinguish which party is a computer and which is a
       human, we should consider the computer to be intelligent.
       Natural language is deeply intertwined with intelligence.
                                                                          8/25
Examples
                              What is NLP?
                               Background
                                 NLP tasks
                             Why is it hard?
                              Related Stuff
                                Conclusion



Background(s): Linguistics




   Generative Linguistics
       Humans can produce and understand an infinite number of
       sentences by means of a finite set of rules.
       Language is produced through a generative, recursive process in the
       human brain.
       The principles that underlie this process are universal to all
       languages (universal grammar).                                        9/25
Examples
                               What is NLP?
                                Background
                                  NLP tasks
                              Why is it hard?
                               Related Stuff
                                 Conclusion



Background(s): the Web



       “We are drowning in information but starved for knowledge.”
       by Edward Osborne Wilson

   Too much text to read...
       Wikipedia: over 3.7 million articles (English).
       PubMed: over 20 million citations.
       WWW: billions of pages, trillions of words.




                                                                     10/25
Examples
                              What is NLP?
                               Background
                                 NLP tasks
                             Why is it hard?
                              Related Stuff
                                Conclusion



Part-of-speech Tagging



   Part-of-speech tagging
       Input: a sentence.
       Output: a part-of-speech tag sequence, e.g., noun, verb, adjective,...

   Example
   Profits/N soared/V at/P Boeing/N Co./N ,/, easily/ADV topping/V
   forecasts/N on/P Wall/N Street/N ./.




                                                                                11/25
Examples
                               What is NLP?
                                Background
                                  NLP tasks
                              Why is it hard?
                               Related Stuff
                                 Conclusion



Named-entity recognition


   Named-entity recognition
       Input: a sentence.
       Output: a BIO-named entity tag sequence, e.g., PERSON,
       ORGANIZATION, OTHER.

   Example
   Profits/O soared/O at/O Boeing/B-ORG Co./I-ORG ,/O easily/O
   topping/O forecasts/O on/O Wall/O Street/O ./O




                                                                12/25
Examples
                             What is NLP?
                              Background
                                NLP tasks
                            Why is it hard?
                             Related Stuff
                               Conclusion



Word Sense Disambiguation



   Word sense disambiguation
       Input: a sentence.
       Output: the sense of each word in the sentence.

   Example
   I/sense1 can/sense1 can/sense2 a/sense1 can sense3 .




                                                          13/25
Examples
                               What is NLP?
                                Background
                                  NLP tasks
                              Why is it hard?
                               Related Stuff
                                 Conclusion



Parsing
   Parsing
       Input: a sentence.
       Output: the syntactic tree structure of the sentence.

   Example
   Boeing is located in Seattle.




                                                               14/25
Examples
                              What is NLP?
                               Background
                                 NLP tasks
                             Why is it hard?
                              Related Stuff
                                Conclusion



Machine translation


   Machine Translation
      Input: a sentence in language F .
       Output: the translated sentence in language E .

   Example
   Input: Syriens Pr¨sident Baschar al-Assad hat den Westen davor
                     a
   gewarnt, sich in die Angelegenheiten seines Landes einzumischen.

   Output: Syrian President Bashar al-Assad has warned the West against
   interfering in the affairs of his country.



                                                                          15/25
Examples
                           What is NLP?
                            Background
                              NLP tasks
                          Why is it hard?
                           Related Stuff
                             Conclusion



Why is it hard? ( example from L.Lee)




       “At last, a computer that understands you like your mother”




                                                                     16/25
Examples
                               What is NLP?
                                Background
                                  NLP tasks
                              Why is it hard?
                               Related Stuff
                                 Conclusion



Ambiguity of Natural Language



          “At last, a computer that understands you like your mother”

   This could mean...
     1   It understands you as well as your mother understands you.
     2   It understands (that) you like your mother.
     3   It understands you as well as it understands your mother.
   1 and 3: Does this mean well, or poorly?




                                                                        17/25
Examples
                               What is NLP?
                                Background
                                  NLP tasks
                              Why is it hard?
                               Related Stuff
                                 Conclusion



Ambiguity at the Acoustic Level




          “At last, a computer that understands you like your mother”

   This sounds like...
     1   “... a computer that understands you like your mother.”
     2   “... a computer that understands you lie cured mother.”




                                                                        18/25
Examples
                           What is NLP?
                            Background
                              NLP tasks
                          Why is it hard?
                           Related Stuff
                             Conclusion



Ambiguity at the Syntactic (structure) Level



       “At last, a computer that understands you like your mother”




                                                                     19/25
Examples
                        What is NLP?
                         Background
                           NLP tasks
                       Why is it hard?
                        Related Stuff
                          Conclusion



Ambiguity at the Syntactic (structure) Level
                   “List all flights on Tuesday.”




                                                   20/25
Examples
                                What is NLP?
                                 Background
                                   NLP tasks
                               Why is it hard?
                                Related Stuff
                                  Conclusion



Ambiguity at the Semantic (meaning) Level


   Definition of “mother”
     1   a woman who has given birth to a child
     2   a stringy slimy substance consisting of yeast cells and bacteria; is
         added to cider or wine to produce vinegar.

   More ambiguity
         They put money in the bank (= buried in mud?).
         I saw her duck with a telescope (= a duck carrying a telescope?).




                                                                                21/25
Examples
                              What is NLP?
                               Background
                                 NLP tasks
                             Why is it hard?
                              Related Stuff
                                Conclusion



Ambiguity at the Discourse (multi-clause) Level



   Anaphora resolution
   Alice says they’ve built a computer that understands you like your
   mother.
   But she ...
       ... doesn’t know any details (Alice)
       ... doesn’t understand me at all (my mother)




                                                                        22/25
Examples
                               What is NLP?
                                Background
                                  NLP tasks
                              Why is it hard?
                               Related Stuff
                                 Conclusion



Related Stuff

   Machine Learning
        This really made large-scale, open domain NLP applications possible.

   Information Retrieval
        Both need to “understand” language.

   Linguistics
        Interested in the nature of language.

   Psychology / Cognitive Science
        Both interested in human cognitive capabilities.


                                                                               23/25
Examples
                              What is NLP?
                               Background
                                 NLP tasks
                             Why is it hard?
                              Related Stuff
                                Conclusion



Conclusion


   What I have told you...
       What NLP is about.
       Some NLP tasks that people work on.
       Why it’s not that easy.

   What I haven’t told you
       How do you solve all these problems?
       How well does it work?
       What is left to be done?



                                               24/25
Examples
                            What is NLP?
                             Background
                               NLP tasks
                           Why is it hard?
                            Related Stuff
                              Conclusion



Would you like to know more?

   NLP courses at NUS
      CS4248: natural language processing
       CS6207: advanced natural language processing

   Books




   Jurafsky and Martin, Speech and Language Processing (2nd Edition)


                                                                       25/25

Weitere ähnliche Inhalte

Was ist angesagt?

Pal gov.tutorial4.session1 2.whatisontology
Pal gov.tutorial4.session1 2.whatisontologyPal gov.tutorial4.session1 2.whatisontology
Pal gov.tutorial4.session1 2.whatisontology
Mustafa Jarrar
 
Pal gov.tutorial4.session8 2.stepwisemethodologies
Pal gov.tutorial4.session8 2.stepwisemethodologiesPal gov.tutorial4.session8 2.stepwisemethodologies
Pal gov.tutorial4.session8 2.stepwisemethodologies
Mustafa Jarrar
 
Pal gov.tutorial4.session5.lab ontologytools
Pal gov.tutorial4.session5.lab ontologytoolsPal gov.tutorial4.session5.lab ontologytools
Pal gov.tutorial4.session5.lab ontologytools
Mustafa Jarrar
 
Pal gov.tutorial4.session8 1.ontologymodelingchallenges
Pal gov.tutorial4.session8 1.ontologymodelingchallengesPal gov.tutorial4.session8 1.ontologymodelingchallenges
Pal gov.tutorial4.session8 1.ontologymodelingchallenges
Mustafa Jarrar
 
Pal gov.tutorial4.session3.lab bankcustomerontology
Pal gov.tutorial4.session3.lab bankcustomerontologyPal gov.tutorial4.session3.lab bankcustomerontology
Pal gov.tutorial4.session3.lab bankcustomerontology
Mustafa Jarrar
 
Pal gov.tutorial4.session2.lab populationontology
Pal gov.tutorial4.session2.lab populationontologyPal gov.tutorial4.session2.lab populationontology
Pal gov.tutorial4.session2.lab populationontology
Mustafa Jarrar
 
Pal gov.tutorial4.outline
Pal gov.tutorial4.outlinePal gov.tutorial4.outline
Pal gov.tutorial4.outline
Mustafa Jarrar
 
Pal gov.tutorial4.session6 1.ontologyengineeringchallenges
Pal gov.tutorial4.session6 1.ontologyengineeringchallengesPal gov.tutorial4.session6 1.ontologyengineeringchallenges
Pal gov.tutorial4.session6 1.ontologyengineeringchallenges
Mustafa Jarrar
 
Pal gov.tutorial4.session14 rootinglegalpersonontology
Pal gov.tutorial4.session14 rootinglegalpersonontologyPal gov.tutorial4.session14 rootinglegalpersonontology
Pal gov.tutorial4.session14 rootinglegalpersonontology
Mustafa Jarrar
 
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Christoph Lange
 
Pal gov.tutorial4.session1 1.needforsharedsemantics
Pal gov.tutorial4.session1 1.needforsharedsemanticsPal gov.tutorial4.session1 1.needforsharedsemantics
Pal gov.tutorial4.session1 1.needforsharedsemantics
Mustafa Jarrar
 

Was ist angesagt? (20)

Pal gov.tutorial4.session1 2.whatisontology
Pal gov.tutorial4.session1 2.whatisontologyPal gov.tutorial4.session1 2.whatisontology
Pal gov.tutorial4.session1 2.whatisontology
 
Pal gov.tutorial4.session8 2.stepwisemethodologies
Pal gov.tutorial4.session8 2.stepwisemethodologiesPal gov.tutorial4.session8 2.stepwisemethodologies
Pal gov.tutorial4.session8 2.stepwisemethodologies
 
Pal gov.tutorial4.session5.lab ontologytools
Pal gov.tutorial4.session5.lab ontologytoolsPal gov.tutorial4.session5.lab ontologytools
Pal gov.tutorial4.session5.lab ontologytools
 
Pal gov.tutorial4.session8 1.ontologymodelingchallenges
Pal gov.tutorial4.session8 1.ontologymodelingchallengesPal gov.tutorial4.session8 1.ontologymodelingchallenges
Pal gov.tutorial4.session8 1.ontologymodelingchallenges
 
Pal gov.tutorial4.session3.lab bankcustomerontology
Pal gov.tutorial4.session3.lab bankcustomerontologyPal gov.tutorial4.session3.lab bankcustomerontology
Pal gov.tutorial4.session3.lab bankcustomerontology
 
Pal gov.tutorial4.session2.lab populationontology
Pal gov.tutorial4.session2.lab populationontologyPal gov.tutorial4.session2.lab populationontology
Pal gov.tutorial4.session2.lab populationontology
 
Pal gov.tutorial4.outline
Pal gov.tutorial4.outlinePal gov.tutorial4.outline
Pal gov.tutorial4.outline
 
Respeaking as a part of translation and interpreting curriculum
Respeaking as a part of translation and interpreting curriculumRespeaking as a part of translation and interpreting curriculum
Respeaking as a part of translation and interpreting curriculum
 
Pal gov.tutorial4.session6 1.ontologyengineeringchallenges
Pal gov.tutorial4.session6 1.ontologyengineeringchallengesPal gov.tutorial4.session6 1.ontologyengineeringchallenges
Pal gov.tutorial4.session6 1.ontologyengineeringchallenges
 
Deep learning for natural language embeddings
Deep learning for natural language embeddingsDeep learning for natural language embeddings
Deep learning for natural language embeddings
 
Practical Deep Learning for NLP
Practical Deep Learning for NLP Practical Deep Learning for NLP
Practical Deep Learning for NLP
 
Pal gov.tutorial4.session14 rootinglegalpersonontology
Pal gov.tutorial4.session14 rootinglegalpersonontologyPal gov.tutorial4.session14 rootinglegalpersonontology
Pal gov.tutorial4.session14 rootinglegalpersonontology
 
Eurecom and Aalto University at Mediaeval 2021
Eurecom and Aalto University at Mediaeval 2021Eurecom and Aalto University at Mediaeval 2021
Eurecom and Aalto University at Mediaeval 2021
 
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
 
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
 
Pal gov.tutorial4.session1 1.needforsharedsemantics
Pal gov.tutorial4.session1 1.needforsharedsemanticsPal gov.tutorial4.session1 1.needforsharedsemantics
Pal gov.tutorial4.session1 1.needforsharedsemantics
 
BERT
BERTBERT
BERT
 
KiwiPyCon 2014 talk - Understanding human language with Python
KiwiPyCon 2014 talk - Understanding human language with PythonKiwiPyCon 2014 talk - Understanding human language with Python
KiwiPyCon 2014 talk - Understanding human language with Python
 
DPC2007 Objects Of Desire (Kevlin Henney)
DPC2007 Objects Of Desire (Kevlin Henney)DPC2007 Objects Of Desire (Kevlin Henney)
DPC2007 Objects Of Desire (Kevlin Henney)
 
Deep Learning for Information Retrieval
Deep Learning for Information RetrievalDeep Learning for Information Retrieval
Deep Learning for Information Retrieval
 

Andere mochten auch (7)

CSTalks - Named Data Networks - 9 Feb
CSTalks - Named Data Networks - 9 FebCSTalks - Named Data Networks - 9 Feb
CSTalks - Named Data Networks - 9 Feb
 
CSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 NovCSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 Nov
 
CSTalks - Peer-to-peer - 16 Feb
CSTalks - Peer-to-peer - 16 FebCSTalks - Peer-to-peer - 16 Feb
CSTalks - Peer-to-peer - 16 Feb
 
CSTalks - Model Checking - 26 Jan
CSTalks - Model Checking - 26 JanCSTalks - Model Checking - 26 Jan
CSTalks - Model Checking - 26 Jan
 
(Paper Presentation) ZIGZAG: An Efficient Peer-to-Peer Scheme for Media Strea...
(Paper Presentation)ZIGZAG: An Efficient Peer-to-Peer Scheme forMedia Strea...(Paper Presentation)ZIGZAG: An Efficient Peer-to-Peer Scheme forMedia Strea...
(Paper Presentation) ZIGZAG: An Efficient Peer-to-Peer Scheme for Media Strea...
 
BMC 2012
BMC 2012BMC 2012
BMC 2012
 
CSTalks - GPGPU - 19 Jan
CSTalks  -  GPGPU - 19 JanCSTalks  -  GPGPU - 19 Jan
CSTalks - GPGPU - 19 Jan
 

Ähnlich wie CSTalks-Natural Language Processing-17Aug

Transfer_Learning_for_Natural_Language_P_v3_MEAP.pdf
Transfer_Learning_for_Natural_Language_P_v3_MEAP.pdfTransfer_Learning_for_Natural_Language_P_v3_MEAP.pdf
Transfer_Learning_for_Natural_Language_P_v3_MEAP.pdf
oranisalcani
 
13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation
RIILP
 

Ähnlich wie CSTalks-Natural Language Processing-17Aug (20)

AI_08_NLP.pptx
AI_08_NLP.pptxAI_08_NLP.pptx
AI_08_NLP.pptx
 
Portuguese Linguistic Tools: What, Why and How
Portuguese Linguistic Tools: What, Why and HowPortuguese Linguistic Tools: What, Why and How
Portuguese Linguistic Tools: What, Why and How
 
CS269-01 (1).pptx
CS269-01 (1).pptxCS269-01 (1).pptx
CS269-01 (1).pptx
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Testing speaking
Testing speakingTesting speaking
Testing speaking
 
Deep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDeep Learning for Natural Language Processing
Deep Learning for Natural Language Processing
 
srinu.pptx
srinu.pptxsrinu.pptx
srinu.pptx
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...
 
Transfer_Learning_for_Natural_Language_P_v3_MEAP.pdf
Transfer_Learning_for_Natural_Language_P_v3_MEAP.pdfTransfer_Learning_for_Natural_Language_P_v3_MEAP.pdf
Transfer_Learning_for_Natural_Language_P_v3_MEAP.pdf
 
Pycon India 2018 Natural Language Processing Workshop
Pycon India 2018   Natural Language Processing WorkshopPycon India 2018   Natural Language Processing Workshop
Pycon India 2018 Natural Language Processing Workshop
 
thesis_palogiannidi
thesis_palogiannidithesis_palogiannidi
thesis_palogiannidi
 
13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation
 
Nlp final
Nlp finalNlp final
Nlp final
 
ICDM 2019 Tutorial: Speech and Language Processing: New Tools and Applications
ICDM 2019 Tutorial: Speech and Language Processing: New Tools and ApplicationsICDM 2019 Tutorial: Speech and Language Processing: New Tools and Applications
ICDM 2019 Tutorial: Speech and Language Processing: New Tools and Applications
 
Yves Peirsman - Deep Learning for NLP
Yves Peirsman - Deep Learning for NLPYves Peirsman - Deep Learning for NLP
Yves Peirsman - Deep Learning for NLP
 
The Translation Game
The Translation GameThe Translation Game
The Translation Game
 
Eta2006 Nona
Eta2006 NonaEta2006 Nona
Eta2006 Nona
 
Wired_JSpring_08062022.pptx
Wired_JSpring_08062022.pptxWired_JSpring_08062022.pptx
Wired_JSpring_08062022.pptx
 
Nlp tutorial using python nltk (simple examples)
Nlp tutorial using python nltk (simple examples)Nlp tutorial using python nltk (simple examples)
Nlp tutorial using python nltk (simple examples)
 

Mehr von cstalks

CSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17AugCSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
cstalks
 
CSTalks-Visualizing Software Behavior-14Sep
CSTalks-Visualizing Software Behavior-14SepCSTalks-Visualizing Software Behavior-14Sep
CSTalks-Visualizing Software Behavior-14Sep
cstalks
 
CSTalks-Polymorphic heterogeneous multicore systems-17Aug
CSTalks-Polymorphic heterogeneous multicore systems-17AugCSTalks-Polymorphic heterogeneous multicore systems-17Aug
CSTalks-Polymorphic heterogeneous multicore systems-17Aug
cstalks
 
CSTalks-Quaternary Semantics Recomandation System-24 Aug
CSTalks-Quaternary Semantics Recomandation System-24 AugCSTalks-Quaternary Semantics Recomandation System-24 Aug
CSTalks-Quaternary Semantics Recomandation System-24 Aug
cstalks
 
CSTalks - The Multicore Midlife Crisis - 30 Mar
CSTalks - The Multicore Midlife Crisis - 30 MarCSTalks - The Multicore Midlife Crisis - 30 Mar
CSTalks - The Multicore Midlife Crisis - 30 Mar
cstalks
 
CSTalks - On machine learning - 2 Mar
CSTalks - On machine learning - 2 MarCSTalks - On machine learning - 2 Mar
CSTalks - On machine learning - 2 Mar
cstalks
 
CSTalks - Real movie recommendation - 9 Mar
CSTalks - Real movie recommendation - 9 MarCSTalks - Real movie recommendation - 9 Mar
CSTalks - Real movie recommendation - 9 Mar
cstalks
 
CSTalks-LifeBeyondPhD-16Mar
CSTalks-LifeBeyondPhD-16MarCSTalks-LifeBeyondPhD-16Mar
CSTalks-LifeBeyondPhD-16Mar
cstalks
 
CSTalks - Music Information Retrieval - 23 Feb
CSTalks - Music Information Retrieval - 23 FebCSTalks - Music Information Retrieval - 23 Feb
CSTalks - Music Information Retrieval - 23 Feb
cstalks
 

Mehr von cstalks (10)

CSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17AugCSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
 
CSTalks-Visualizing Software Behavior-14Sep
CSTalks-Visualizing Software Behavior-14SepCSTalks-Visualizing Software Behavior-14Sep
CSTalks-Visualizing Software Behavior-14Sep
 
CSTalks-Polymorphic heterogeneous multicore systems-17Aug
CSTalks-Polymorphic heterogeneous multicore systems-17AugCSTalks-Polymorphic heterogeneous multicore systems-17Aug
CSTalks-Polymorphic heterogeneous multicore systems-17Aug
 
CSTalks-Quaternary Semantics Recomandation System-24 Aug
CSTalks-Quaternary Semantics Recomandation System-24 AugCSTalks-Quaternary Semantics Recomandation System-24 Aug
CSTalks-Quaternary Semantics Recomandation System-24 Aug
 
CSTalks - Object detection and tracking - 25th May
CSTalks - Object detection and tracking - 25th MayCSTalks - Object detection and tracking - 25th May
CSTalks - Object detection and tracking - 25th May
 
CSTalks - The Multicore Midlife Crisis - 30 Mar
CSTalks - The Multicore Midlife Crisis - 30 MarCSTalks - The Multicore Midlife Crisis - 30 Mar
CSTalks - The Multicore Midlife Crisis - 30 Mar
 
CSTalks - On machine learning - 2 Mar
CSTalks - On machine learning - 2 MarCSTalks - On machine learning - 2 Mar
CSTalks - On machine learning - 2 Mar
 
CSTalks - Real movie recommendation - 9 Mar
CSTalks - Real movie recommendation - 9 MarCSTalks - Real movie recommendation - 9 Mar
CSTalks - Real movie recommendation - 9 Mar
 
CSTalks-LifeBeyondPhD-16Mar
CSTalks-LifeBeyondPhD-16MarCSTalks-LifeBeyondPhD-16Mar
CSTalks-LifeBeyondPhD-16Mar
 
CSTalks - Music Information Retrieval - 23 Feb
CSTalks - Music Information Retrieval - 23 FebCSTalks - Music Information Retrieval - 23 Feb
CSTalks - Music Information Retrieval - 23 Feb
 

Kürzlich hochgeladen

The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Krashi Coaching
 

Kürzlich hochgeladen (20)

Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 

CSTalks-Natural Language Processing-17Aug

  • 1. Natural Language Processing Daniel Dahlmeier NUS Graduate School for Integrative Sciences and Engineering danielhe@comp.nus.edu.sg CSTalks 2 November 2011
  • 2. Acknowledgments Examples and figures from Michael Collins’ lecture notes: http://www.cs.columbia.edu/∼mcollins. Some other figures are from Wikipedia: http://www.wikipedia.org. The rest I randomly found on the web.
  • 3. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Google translate 3/25
  • 4. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion IBM’s Watson computer wins at Jeopardy! 4/25
  • 5. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Siri 5/25
  • 6. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion What is Natural Language Processing? Natural Language Processing (NLP) or Computational Linguistics Language processing that goes beyond a “bag of words” representation. Example Translate from one language into the other. Answer natural language questions. Parse the syntactic/semantic structure of a sentence. The other NLP NLP = neuro-linguistic programming. 6/25
  • 7. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Background(s): Artificial Intelligence Talk to your computer Dave: Hello, HAL. Do you read me, HAL? HAL: Affirmative, Dave. I read you. Dave: Open the pod bay doors, HAL. HAL: I’m sorry, Dave. I’m afraid I can’t do that. The computer needs to ... Understand the user : Natural Language Understanding. Generate a well-formed reply : Natural Language Generation. 7/25
  • 8. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Background(s): Artificial Intelligence (cont.) Turing Test Experimenter talks to two parties A and B via a terminal. If C cannot distinguish which party is a computer and which is a human, we should consider the computer to be intelligent. Natural language is deeply intertwined with intelligence. 8/25
  • 9. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Background(s): Linguistics Generative Linguistics Humans can produce and understand an infinite number of sentences by means of a finite set of rules. Language is produced through a generative, recursive process in the human brain. The principles that underlie this process are universal to all languages (universal grammar). 9/25
  • 10. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Background(s): the Web “We are drowning in information but starved for knowledge.” by Edward Osborne Wilson Too much text to read... Wikipedia: over 3.7 million articles (English). PubMed: over 20 million citations. WWW: billions of pages, trillions of words. 10/25
  • 11. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Part-of-speech Tagging Part-of-speech tagging Input: a sentence. Output: a part-of-speech tag sequence, e.g., noun, verb, adjective,... Example Profits/N soared/V at/P Boeing/N Co./N ,/, easily/ADV topping/V forecasts/N on/P Wall/N Street/N ./. 11/25
  • 12. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Named-entity recognition Named-entity recognition Input: a sentence. Output: a BIO-named entity tag sequence, e.g., PERSON, ORGANIZATION, OTHER. Example Profits/O soared/O at/O Boeing/B-ORG Co./I-ORG ,/O easily/O topping/O forecasts/O on/O Wall/O Street/O ./O 12/25
  • 13. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Word Sense Disambiguation Word sense disambiguation Input: a sentence. Output: the sense of each word in the sentence. Example I/sense1 can/sense1 can/sense2 a/sense1 can sense3 . 13/25
  • 14. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Parsing Parsing Input: a sentence. Output: the syntactic tree structure of the sentence. Example Boeing is located in Seattle. 14/25
  • 15. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Machine translation Machine Translation Input: a sentence in language F . Output: the translated sentence in language E . Example Input: Syriens Pr¨sident Baschar al-Assad hat den Westen davor a gewarnt, sich in die Angelegenheiten seines Landes einzumischen. Output: Syrian President Bashar al-Assad has warned the West against interfering in the affairs of his country. 15/25
  • 16. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Why is it hard? ( example from L.Lee) “At last, a computer that understands you like your mother” 16/25
  • 17. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Ambiguity of Natural Language “At last, a computer that understands you like your mother” This could mean... 1 It understands you as well as your mother understands you. 2 It understands (that) you like your mother. 3 It understands you as well as it understands your mother. 1 and 3: Does this mean well, or poorly? 17/25
  • 18. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Ambiguity at the Acoustic Level “At last, a computer that understands you like your mother” This sounds like... 1 “... a computer that understands you like your mother.” 2 “... a computer that understands you lie cured mother.” 18/25
  • 19. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Ambiguity at the Syntactic (structure) Level “At last, a computer that understands you like your mother” 19/25
  • 20. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Ambiguity at the Syntactic (structure) Level “List all flights on Tuesday.” 20/25
  • 21. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Ambiguity at the Semantic (meaning) Level Definition of “mother” 1 a woman who has given birth to a child 2 a stringy slimy substance consisting of yeast cells and bacteria; is added to cider or wine to produce vinegar. More ambiguity They put money in the bank (= buried in mud?). I saw her duck with a telescope (= a duck carrying a telescope?). 21/25
  • 22. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Ambiguity at the Discourse (multi-clause) Level Anaphora resolution Alice says they’ve built a computer that understands you like your mother. But she ... ... doesn’t know any details (Alice) ... doesn’t understand me at all (my mother) 22/25
  • 23. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Related Stuff Machine Learning This really made large-scale, open domain NLP applications possible. Information Retrieval Both need to “understand” language. Linguistics Interested in the nature of language. Psychology / Cognitive Science Both interested in human cognitive capabilities. 23/25
  • 24. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Conclusion What I have told you... What NLP is about. Some NLP tasks that people work on. Why it’s not that easy. What I haven’t told you How do you solve all these problems? How well does it work? What is left to be done? 24/25
  • 25. Examples What is NLP? Background NLP tasks Why is it hard? Related Stuff Conclusion Would you like to know more? NLP courses at NUS CS4248: natural language processing CS6207: advanced natural language processing Books Jurafsky and Martin, Speech and Language Processing (2nd Edition) 25/25