SlideShare ist ein Scribd-Unternehmen logo
1 von 18
Downloaden Sie, um offline zu lesen
Language
                           Modeling in
                         Turner&Charniak
                              (2007)

                            Kilian Evang


                         Language Models
                         N-gram LMs

 Language Modeling in    Charniak’s LM

                         Determiner

Turner&Charniak (2007)   Selection
                         Method
                         Results
                         Reasons for Success

                         References
      Kilian Evang


       2009-11-30
Language
Recap: Language Models                                           Modeling in
                                                               Turner&Charniak
                                                                    (2007)

                                                                  Kilian Evang


                                                               Language Models
                                                               N-gram LMs
                                                               Charniak’s LM
  ◮   LMs assign probabilities to sentences                    Determiner
                                                               Selection
  ◮   a sentence is a complex event                            Method
                                                               Results
  ◮   LMs break it up into a sequence of “atomic” events       Reasons for Success

                                                               References
  ◮   each “atomic” event conditioned on certain previous
      events
  ◮   conditional probabilities approximated by counting and
      smoothing
Language
N-gram LMs                                             Modeling in
                                                     Turner&Charniak
                                                          (2007)

                                                        Kilian Evang


                                                     Language Models
                       n-gram LMs    Charniak’s LM   N-gram LMs
                                                     Charniak’s LM
 sequence represents   sentence                      Determiner
                                                     Selection
 p(sent) =             p(seq)                        Method

 events are            words,                        Results
                                                     Reasons for Success

                       end symbols                   References




 conditioned on        the n − 1
                       previous
                       events
Language
A Sentence – a Sequence of Events                    Modeling in
                                                   Turner&Charniak
                                                        (2007)

                                                      Kilian Evang


                                                   Language Models
                                                   N-gram LMs

Sentence                                           Charniak’s LM

                                                   Determiner
                                                   Selection
                                                   Method
                                                   Results
                                                   Reasons for Success

   put       the      ball        in   the   box   References




Event sequence
put, the, ball, in, the, box, ∆
Language
A Sentence – a Sequence of Events                    Modeling in
                                                   Turner&Charniak
                                                        (2007)

                                                      Kilian Evang


                                                   Language Models
                                                   N-gram LMs

Sentence                                           Charniak’s LM

                                                   Determiner
                                                   Selection
                                                   Method
                                                   Results
                                                   Reasons for Success

   put      the       ball      in     the   box   References




Conditional probability
p(wi = the|wi −2 = ball, wi −1 = in)
Language
N-gram LMs vs. Charniak’s Parsing LM                       Modeling in
                                                         Turner&Charniak
                                                              (2007)

                                                            Kilian Evang


                       n-gram LMs    Charniak’s LM       Language Models
                                                         N-gram LMs
 sequence represents   sentence      parse tree          Charniak’s LM

 p(sent) =             p(seq)           p(seq)           Determiner
                                                         Selection
                                     seq                 Method
 events are            words,        pre-terminals,      Results
                                                         Reasons for Success
                       end symbols   terminals,          References
                                     constituents,
                                     end symbols
 conditioned on        the n − 1     certain previous
                       previous      events, depending
                       events        on type
Language
A Parse Tree – a Sequence of Events                        Modeling in
                                                         Turner&Charniak
                                                              (2007)

                                                            Kilian Evang
Parse tree
         vp                                              Language Models
                                                         N-gram LMs
                                                         Charniak’s LM

                                                         Determiner
                    np                 pp                Selection
                                                         Method
                                                         Results
                                                         Reasons for Success
                                             np          References



  verb        det        noun   prep   det        noun

   put        the        ball    in    the        box
Language
A Parse Tree – a Sequence of Events                                  Modeling in
                                                                   Turner&Charniak
                                                                        (2007)

                                                                      Kilian Evang
Parse tree
           vp                                                      Language Models
                                                                   N-gram LMs
                                                                   Charniak’s LM

                                                                   Determiner
                      np                      pp                   Selection
                                                                   Method
                                                                   Results
                                                                   Reasons for Success
                                                    np             References



  verb          det        noun   prep        det        noun

   put          the        ball     in        the        box


Event sequence
verb, put, M, ∆, M, np, pp, ∆, noun, ball, M, det, ∆, M, ∆, the,
prep, in, M, ∆, M, np, ∆, noun, box, M, det, ∆, M, ∆, the
Language
Digression: Non-head Constituents                           Modeling in
                                                          Turner&Charniak
                                                               (2007)

                                                             Kilian Evang


                                                          Language Models
Tree fragment                                             N-gram LMs
                                                          Charniak’s LM

                                  l                       Determiner
                                                          Selection
                                                          Method
                                                          Results
            Lm     ...    L1      t      R1    ...   Rn   Reasons for Success

                                                          References


                                  h



Event sequence fragment
M, L1 , . . ., Lm , ∆, M, R1 , . . ., Rn , ∆
Language
A Parse Tree – a Sequence of Events                        Modeling in
                                                         Turner&Charniak
                                                              (2007)

                                                            Kilian Evang
Parse tree
                                                         Language Models
         vp                                              N-gram LMs
                                                         Charniak’s LM

                                                         Determiner
                    np                 pp                Selection
                                                         Method
                                                         Results
                                                         Reasons for Success
                                             np          References



  verb        det        noun   prep   det        noun

   put        the        ball    in    the        box


Conditional probability for a head pre-terminal
p(t = noun|l = np, m = vp, u = verb, i = put)
Language
A Parse Tree – a Sequence of Events                         Modeling in
                                                          Turner&Charniak
                                                               (2007)

                                                             Kilian Evang
Parse tree
                                                          Language Models
         vp                                               N-gram LMs
                                                          Charniak’s LM

                                                          Determiner
                    np                  pp                Selection
                                                          Method
                                                          Results
                                                          Reasons for Success
                                              np          References



  verb        det        noun   prep    det        noun

   put        the        ball    in     the        box


Conditional probability for a head terminal
p(h = ball|t = noun, l = np, m = vp, u = verb, i = put)
Language
A Parse Tree – a Sequence of Events                        Modeling in
                                                         Turner&Charniak
                                                              (2007)


Parse tree                                                  Kilian Evang


         vp                                              Language Models
                                                         N-gram LMs
                                                         Charniak’s LM

                                                         Determiner
                    np                  pp               Selection
                                                         Method
                                                         Results
                                                         Reasons for Success
                                             np
                                                         References



  verb        det        noun   prep   det        noun

   put        the        ball    in    the        box


Conditional probability for a non-head constituent
p(Li = det|Li −1 = M, h = ball, t = noun, l = np, m =
vp, u = verb)
Language
Overview: Conditioning                                                        Modeling in
                                                                            Turner&Charniak
                                                                                 (2007)

                                                                               Kilian Evang
 event type                    conditioned on
                                                                            Language Models
 head pre-terminal t           constituent label l,
                                                                            N-gram LMs
                               mother constituent label m,                  Charniak’s LM

                               mother constituent head pre-terminal u       Determiner
                                                                            Selection
                               mother consitutent head terminal i
                                                                            Method
 head terminal h               head pre-terminal t,                         Results
                                                                            Reasons for Success
                               constituent label l,
                                                                            References
                               mother constituent label m,
                               mother constituent head pre-terminal u
                               mother consitutent head terminal i
 non-head                      (part of) L1...i −1 (L1...m , R1...i −1 ),
 constituent label Li (Ri ),   head terminal h,
 end symbol ∆                  head pre-terminal t,
                               constituent label l,
                               mother constituent label m,
                               mother constituent head pre-terminal u
Language
Determiner Selection                                               Modeling in
                                                                 Turner&Charniak
                                                                      (2007)

                                                                    Kilian Evang


                                                                 Language Models
                                                                 N-gram LMs
                                                                 Charniak’s LM

                                                                 Determiner
  ◮   for each NP,                                               Selection
                                                                 Method
        ◮   for each possible determiner (the, a/an, null),      Results
                                                                 Reasons for Success
              ◮   determine probability of NP
                                                                 References
        ◮   choose determiner resulting in highest probability
  ◮   note: sufficient to determine probabilities for events
      that differ
Language
Determiner Selection – Example                                         Modeling in
                                                                     Turner&Charniak
                                                                          (2007)

                                                                        Kilian Evang
  ◮   “put [NP the ball] in the box”
                                                                     Language Models
        ◮   p(L1 = det|m = vp, u = verb, l = np, t = noun, h =       N-gram LMs
                                                                     Charniak’s LM
            ball) × p(L2 = ∆|m = vp, u = verb, l = np, t =
                                                                     Determiner
            noun, h = ball, L1 = det) × p(det → the|m = vp, u =      Selection
            verb, l = np, t = noun, h = ball, L1 = det)              Method
                                                                     Results
                                                                     Reasons for Success
  ◮   “put [NP a/an ball] in the box”
                                                                     References
        ◮   p(L1 = det|m = vp, u = verb, l = np, t = noun, h =
            ball) × p(L2 = ∆|m = vp, u = verb, l = np, t =
            noun, h = ball, L1 = det) × (p(det → a|m = vp, u =
            verb, l = np, t = noun, h = ball, L1 = det) + p(det →
            an|m = vp, u = verb, l = np, t = noun, h = ball, L1 =
            det))
  ◮   “put [NP ball] in the box”
        ◮   p(L1 = ∆|m = vp, u = verb, l = np, t = noun, h = ball)
Language
Results     Modeling in
          Turner&Charniak
               (2007)

             Kilian Evang


          Language Models
          N-gram LMs
          Charniak’s LM

          Determiner
          Selection
          Method
          Results
          Reasons for Success

          References
Language
Reasons for Success                                                      Modeling in
                                                                       Turner&Charniak
                                                                            (2007)

                                                                          Kilian Evang


                                                                       Language Models
                                                                       N-gram LMs
                                                                       Charniak’s LM

                                                                       Determiner
  ◮   syntactic structure allows for long-distance conditioning,       Selection

      e.g.                                                             Method
                                                                       Results
                                                                       Reasons for Success
            ◮   he [VP gave [NP the sultan of Brunei] [NP a cactus]]
                                                                       References
  ◮   constituent head enforces selectional preferences,
      reflected in head-first strategy
  ◮   ...
Language
References                                                       Modeling in
                                                               Turner&Charniak
                                                                    (2007)

                                                                  Kilian Evang


                                                               Language Models
    Eugene Charniak (2000)                                     N-gram LMs

    A Maximum-Entropy-Inspired Parser                          Charniak’s LM

                                                               Determiner
    Proceedings of the First Meeting of the North American     Selection

    Chapter of the Association for Computational Linguistics   Method
                                                               Results
                                                               Reasons for Success

    Eugene Charniak (2001)                                     References

    Immediate-Head Parsing for Language Models
    Proceedings of the 39th Annual Meeting of the
    Association for Computational Linguistics
    Jenine Turner & Eugene Charniak (2007)
    Language Modeling for Determiner Selection
    Proceedings of NAACL HLT 2007, Companion Volume

Weitere ähnliche Inhalte

Andere mochten auch

Smoky Mountain Secrets
Smoky Mountain SecretsSmoky Mountain Secrets
Smoky Mountain SecretsJeanMarlowe
 
2011 05 09_배달통
2011 05 09_배달통2011 05 09_배달통
2011 05 09_배달통Sanghoon Kim
 
Joshua Brewer - Lesson on Formulating Hypotheses - Final Draft
Joshua Brewer - Lesson on Formulating Hypotheses - Final DraftJoshua Brewer - Lesson on Formulating Hypotheses - Final Draft
Joshua Brewer - Lesson on Formulating Hypotheses - Final DraftMorningMaestro
 
I Npd Mfei 5 10
I Npd Mfei 5 10I Npd Mfei 5 10
I Npd Mfei 5 10kbmcgourty
 
Pdf negro 19-mayo-2014-01
Pdf negro 19-mayo-2014-01Pdf negro 19-mayo-2014-01
Pdf negro 19-mayo-2014-01imm
 
Vaya con dios1_(pp_tminimizer)
Vaya con dios1_(pp_tminimizer)Vaya con dios1_(pp_tminimizer)
Vaya con dios1_(pp_tminimizer)Vesdo 1
 
앱리뷰 말하는웨더알람
앱리뷰 말하는웨더알람앱리뷰 말하는웨더알람
앱리뷰 말하는웨더알람Sanghoon Kim
 
Documento compraventa terreno campeche
Documento compraventa terreno campecheDocumento compraventa terreno campeche
Documento compraventa terreno campecheErmilo De la Cruz
 
Cross Media for non-profits
Cross Media for non-profitsCross Media for non-profits
Cross Media for non-profitsRCAcrossmedia
 
Doha 2006 sukan asia
Doha 2006 sukan asiaDoha 2006 sukan asia
Doha 2006 sukan asiaOlimpikini
 
Increasing food security through strategies and investments to increase incom...
Increasing food security through strategies and investments to increase incom...Increasing food security through strategies and investments to increase incom...
Increasing food security through strategies and investments to increase incom...Bangladesh Food Security Investment Forum
 
elmuellocalmon
elmuellocalmonelmuellocalmon
elmuellocalmoncristian
 
Final project
Final project Final project
Final project imm
 
Funcionaris versus emprenedors
Funcionaris versus emprenedorsFuncionaris versus emprenedors
Funcionaris versus emprenedorsaleixclosmari
 
La nit del terror. Euclides
La nit del terror. Euclides La nit del terror. Euclides
La nit del terror. Euclides joan
 
Peru presentation
Peru presentationPeru presentation
Peru presentationanchefu
 
Presentatie Ssis Part I
Presentatie Ssis Part IPresentatie Ssis Part I
Presentatie Ssis Part Isecuserve1
 

Andere mochten auch (20)

Smoky Mountain Secrets
Smoky Mountain SecretsSmoky Mountain Secrets
Smoky Mountain Secrets
 
2011 05 09_배달통
2011 05 09_배달통2011 05 09_배달통
2011 05 09_배달통
 
Joshua Brewer - Lesson on Formulating Hypotheses - Final Draft
Joshua Brewer - Lesson on Formulating Hypotheses - Final DraftJoshua Brewer - Lesson on Formulating Hypotheses - Final Draft
Joshua Brewer - Lesson on Formulating Hypotheses - Final Draft
 
I Npd Mfei 5 10
I Npd Mfei 5 10I Npd Mfei 5 10
I Npd Mfei 5 10
 
Pdf negro 19-mayo-2014-01
Pdf negro 19-mayo-2014-01Pdf negro 19-mayo-2014-01
Pdf negro 19-mayo-2014-01
 
Dip.IFRS ACCA(UK)
Dip.IFRS ACCA(UK)Dip.IFRS ACCA(UK)
Dip.IFRS ACCA(UK)
 
Vaya con dios1_(pp_tminimizer)
Vaya con dios1_(pp_tminimizer)Vaya con dios1_(pp_tminimizer)
Vaya con dios1_(pp_tminimizer)
 
앱리뷰 말하는웨더알람
앱리뷰 말하는웨더알람앱리뷰 말하는웨더알람
앱리뷰 말하는웨더알람
 
Documento compraventa terreno campeche
Documento compraventa terreno campecheDocumento compraventa terreno campeche
Documento compraventa terreno campeche
 
Cross Media for non-profits
Cross Media for non-profitsCross Media for non-profits
Cross Media for non-profits
 
Doha 2006 sukan asia
Doha 2006 sukan asiaDoha 2006 sukan asia
Doha 2006 sukan asia
 
앱리뷰 Dayta
앱리뷰 Dayta앱리뷰 Dayta
앱리뷰 Dayta
 
Increasing food security through strategies and investments to increase incom...
Increasing food security through strategies and investments to increase incom...Increasing food security through strategies and investments to increase incom...
Increasing food security through strategies and investments to increase incom...
 
elmuellocalmon
elmuellocalmonelmuellocalmon
elmuellocalmon
 
Final project
Final project Final project
Final project
 
2011 05 30_curing
2011 05 30_curing2011 05 30_curing
2011 05 30_curing
 
Funcionaris versus emprenedors
Funcionaris versus emprenedorsFuncionaris versus emprenedors
Funcionaris versus emprenedors
 
La nit del terror. Euclides
La nit del terror. Euclides La nit del terror. Euclides
La nit del terror. Euclides
 
Peru presentation
Peru presentationPeru presentation
Peru presentation
 
Presentatie Ssis Part I
Presentatie Ssis Part IPresentatie Ssis Part I
Presentatie Ssis Part I
 

Kürzlich hochgeladen

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 

Kürzlich hochgeladen (20)

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

Language Modeling in Turner&Charniak (2007)

  • 1. Language Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Language Modeling in Charniak’s LM Determiner Turner&Charniak (2007) Selection Method Results Reasons for Success References Kilian Evang 2009-11-30
  • 2. Language Recap: Language Models Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Charniak’s LM ◮ LMs assign probabilities to sentences Determiner Selection ◮ a sentence is a complex event Method Results ◮ LMs break it up into a sequence of “atomic” events Reasons for Success References ◮ each “atomic” event conditioned on certain previous events ◮ conditional probabilities approximated by counting and smoothing
  • 3. Language N-gram LMs Modeling in Turner&Charniak (2007) Kilian Evang Language Models n-gram LMs Charniak’s LM N-gram LMs Charniak’s LM sequence represents sentence Determiner Selection p(sent) = p(seq) Method events are words, Results Reasons for Success end symbols References conditioned on the n − 1 previous events
  • 4. Language A Sentence – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Sentence Charniak’s LM Determiner Selection Method Results Reasons for Success put the ball in the box References Event sequence put, the, ball, in, the, box, ∆
  • 5. Language A Sentence – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Sentence Charniak’s LM Determiner Selection Method Results Reasons for Success put the ball in the box References Conditional probability p(wi = the|wi −2 = ball, wi −1 = in)
  • 6. Language N-gram LMs vs. Charniak’s Parsing LM Modeling in Turner&Charniak (2007) Kilian Evang n-gram LMs Charniak’s LM Language Models N-gram LMs sequence represents sentence parse tree Charniak’s LM p(sent) = p(seq) p(seq) Determiner Selection seq Method events are words, pre-terminals, Results Reasons for Success end symbols terminals, References constituents, end symbols conditioned on the n − 1 certain previous previous events, depending events on type
  • 7. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Parse tree vp Language Models N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box
  • 8. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Parse tree vp Language Models N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box Event sequence verb, put, M, ∆, M, np, pp, ∆, noun, ball, M, det, ∆, M, ∆, the, prep, in, M, ∆, M, np, ∆, noun, box, M, det, ∆, M, ∆, the
  • 9. Language Digression: Non-head Constituents Modeling in Turner&Charniak (2007) Kilian Evang Language Models Tree fragment N-gram LMs Charniak’s LM l Determiner Selection Method Results Lm ... L1 t R1 ... Rn Reasons for Success References h Event sequence fragment M, L1 , . . ., Lm , ∆, M, R1 , . . ., Rn , ∆
  • 10. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Parse tree Language Models vp N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box Conditional probability for a head pre-terminal p(t = noun|l = np, m = vp, u = verb, i = put)
  • 11. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Parse tree Language Models vp N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box Conditional probability for a head terminal p(h = ball|t = noun, l = np, m = vp, u = verb, i = put)
  • 12. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Parse tree Kilian Evang vp Language Models N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box Conditional probability for a non-head constituent p(Li = det|Li −1 = M, h = ball, t = noun, l = np, m = vp, u = verb)
  • 13. Language Overview: Conditioning Modeling in Turner&Charniak (2007) Kilian Evang event type conditioned on Language Models head pre-terminal t constituent label l, N-gram LMs mother constituent label m, Charniak’s LM mother constituent head pre-terminal u Determiner Selection mother consitutent head terminal i Method head terminal h head pre-terminal t, Results Reasons for Success constituent label l, References mother constituent label m, mother constituent head pre-terminal u mother consitutent head terminal i non-head (part of) L1...i −1 (L1...m , R1...i −1 ), constituent label Li (Ri ), head terminal h, end symbol ∆ head pre-terminal t, constituent label l, mother constituent label m, mother constituent head pre-terminal u
  • 14. Language Determiner Selection Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Charniak’s LM Determiner ◮ for each NP, Selection Method ◮ for each possible determiner (the, a/an, null), Results Reasons for Success ◮ determine probability of NP References ◮ choose determiner resulting in highest probability ◮ note: sufficient to determine probabilities for events that differ
  • 15. Language Determiner Selection – Example Modeling in Turner&Charniak (2007) Kilian Evang ◮ “put [NP the ball] in the box” Language Models ◮ p(L1 = det|m = vp, u = verb, l = np, t = noun, h = N-gram LMs Charniak’s LM ball) × p(L2 = ∆|m = vp, u = verb, l = np, t = Determiner noun, h = ball, L1 = det) × p(det → the|m = vp, u = Selection verb, l = np, t = noun, h = ball, L1 = det) Method Results Reasons for Success ◮ “put [NP a/an ball] in the box” References ◮ p(L1 = det|m = vp, u = verb, l = np, t = noun, h = ball) × p(L2 = ∆|m = vp, u = verb, l = np, t = noun, h = ball, L1 = det) × (p(det → a|m = vp, u = verb, l = np, t = noun, h = ball, L1 = det) + p(det → an|m = vp, u = verb, l = np, t = noun, h = ball, L1 = det)) ◮ “put [NP ball] in the box” ◮ p(L1 = ∆|m = vp, u = verb, l = np, t = noun, h = ball)
  • 16. Language Results Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Charniak’s LM Determiner Selection Method Results Reasons for Success References
  • 17. Language Reasons for Success Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Charniak’s LM Determiner ◮ syntactic structure allows for long-distance conditioning, Selection e.g. Method Results Reasons for Success ◮ he [VP gave [NP the sultan of Brunei] [NP a cactus]] References ◮ constituent head enforces selectional preferences, reflected in head-first strategy ◮ ...
  • 18. Language References Modeling in Turner&Charniak (2007) Kilian Evang Language Models Eugene Charniak (2000) N-gram LMs A Maximum-Entropy-Inspired Parser Charniak’s LM Determiner Proceedings of the First Meeting of the North American Selection Chapter of the Association for Computational Linguistics Method Results Reasons for Success Eugene Charniak (2001) References Immediate-Head Parsing for Language Models Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics Jenine Turner & Eugene Charniak (2007) Language Modeling for Determiner Selection Proceedings of NAACL HLT 2007, Companion Volume