SlideShare ist ein Scribd-Unternehmen logo
1 von 9
Downloaden Sie, um offline zu lesen
A fully-automatic approach
 to answer geographic queries:
       GIRSA-WP at GikiP

   Johannes Leveling           Sven Hartrumpf

Intelligent Information and Communication Systems (IICS)
       University of Hagen (FernUniversität in Hagen)
                   58084 Hagen, Germany
       firstname.lastname@fernuni-hagen.de
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                                      Main idea
Main idea

GIRSA-WP
                            InSicht (Hartrumpf, 2005)
Semantic filter                    • open-domain QA system
Experiments                       • based on matching semantic network representations
and Results                         of question and documents
Conclusions                       • supports question decomposition
References                          e.g. temporal or geographical constraints
                       + GIRSA (Leveling and Hartrumpf, 2008)
                                  • textual GIR system
                                  • supports methods to boost recall
                                    e.g. normalizing location indicators
                                  • supports methods to boost precision
                                    e.g. metonymy recognition
                       = GIRSA-WP (GIRSA for Wikipedia)
                                  • automatic combination of InSicht and GIRSA



      J. Leveling, S. Hartrumpf                            GIRSA-WP                      2/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                                GIRSA-WP
Main idea

GIRSA-WP

Semantic filter

Experiments
and Results
                        • applies semantic filter on answer candidates
Conclusions             • merges results from InSicht and GIRSA by using the
References                  maximum score of documents
                        • returns list of Wikipedia article names
                        • simple multilingual approach:
                            follow German Wikipedia links to articles in English and
                            Portuguese




      J. Leveling, S. Hartrumpf                      GIRSA-WP                      3/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                      Semantic filter (1/2)
Main idea

GIRSA-WP
                        • in QA: check expected answer type of answer
Semantic filter
                          candidates
Experiments
and Results             • for GIRSA-WP: check semantic answer types
Conclusions               (semantic sort and features, see Helbig (2006))
References                        • extract word representing the answer type from topic
                                    title and description (the first noun not a proper noun)
                                  • parse these words with WOCADI, a syntactico-semantic
                                    parser (includes a disambiguation of words) and find
                                    semantic features corresponding to the extracted words
                                  • parse the answer candidates (titles of Wikipedia
                                    articles) and determine their semantic features
                                  • test if unification of semantic features succeeds;
                                    discard answer candidate, otherwise



      J. Leveling, S. Hartrumpf                            GIRSA-WP                           4/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                      Semantic filter (2/2)
Main idea

GIRSA-WP
                        • Which Swiss cantons border Germany?
Semantic filter

Experiments
                            → extracted word: cantons
and Results             • parse result: corresponding concept is canton
Conclusions
                            • artificial geographical entity or regional institution
References
                            • legal-person:+, movable:–, etc.
                        • answer candidate Cross-Border-Leasing:
                                  • prototypical-theoretical-concept
                                  • legal-person:–, movable:–
                              → semantic features not unifiable
                        • answer candidate Aargau:
                              → unifiable semantic features




      J. Leveling, S. Hartrumpf                            GIRSA-WP                   5/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                          Experiments and results
Main idea

GIRSA-WP

Semantic filter

Experiments             • six runs submitted:
and Results
                            three with threshold score of 0.01 and
Conclusions

References
                            varied settings for stemming, location name
                            normalization, and noun decompounding;
                            additional three experiments with threshold
                            score of 0.03
                        • 798 (372) answers found
                        • 79 correct answers in best run




      J. Leveling, S. Hartrumpf                     GIRSA-WP              6/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                 Conclusions (1/2)
Main idea

GIRSA-WP

Semantic filter
                    GikiP topics
Experiments             • are at least as difficult as QA or GeoCLEF topics
and Results

Conclusions
                        • aim at a wider range of expected answer types
References              • include complex geographic relations
                            (GP2: outside, GP4: on the border ),
                            restrictions on measurable properties
                            (GP3: more than, GP13: longer than), and
                            temporal constraints
                            (GP9: Renaissance, GP15: between 1980 and 1990)
                      ⇒ new challenge for QA and GIR community




      J. Leveling, S. Hartrumpf                   GIRSA-WP                    7/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                    Conclusions (2/2)
Main idea

GIRSA-WP
                        • GIRSA:
Semantic filter
                            • indexing single sentences was meant to ensure a high
Experiments
and Results                    precision (but did not work);
Conclusions                 • geographic entities have not been annotated at all in the
References                     Wikipedia documents
                        • InSicht:
                            • important information is given in tables (like inhabitant
                               numbers), but WOCADI ignores these
                            • the semantic matching approach is still too strict for the
                               IR oriented parts of GikiP queries (similarly for
                               GeoCLEF)
                      ⇒ tasks for future work



      J. Leveling, S. Hartrumpf                       GIRSA-WP                         8/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                 Selected References
Main idea

GIRSA-WP            Hartrumpf, S. (2005). Question answering using sentence parsing and
Semantic filter         semantic network matching. In Multilingual Information Access for
Experiments            Text, Speech and Images: 5th Workshop of the Cross-Language
and Results            Evaluation Forum, CLEF 2004 (edited by Peters, C.; Clough, P.;
Conclusions            Gonzalo, J.; Jones, G. J. F.; Kluck, M.; and Magnini, B.), volume 3491
References             of LNCS, pp. 512–521. Berlin: Springer.
                    Helbig, H. (2006). Knowledge Representation and the Semantics of
                       Natural Language. Berlin: Springer.
                    Leveling, J. and Hartrumpf, S. (2008). Inferring location names for
                       geographic information retrieval. In Advances in Multilingual and
                       Multimodal Information Retrieval: 8th Workshop of the
                       Cross-Language Evaluation Forum, CLEF 2007 (edited by Peters, C.;
                       Jijkoun, V.; Mandl, T.; Müller, H.; Oard, D. W.; Peñas, A.; Petras, V.;
                       and Santos, D.), volume 5152 of LNCS, pp. 773–780. Berlin:
                       Springer.



      J. Leveling, S. Hartrumpf                         GIRSA-WP                             9/9

Weitere ähnliche Inhalte

Kürzlich hochgeladen

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Kürzlich hochgeladen (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

Empfohlen

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Empfohlen (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

A Fully-automatic Approach to Answer Geographic Queries: GIRSA-WP at GikiP

  • 1. A fully-automatic approach to answer geographic queries: GIRSA-WP at GikiP Johannes Leveling Sven Hartrumpf Intelligent Information and Communication Systems (IICS) University of Hagen (FernUniversität in Hagen) 58084 Hagen, Germany firstname.lastname@fernuni-hagen.de
  • 2. GIRSA-WP J. Leveling, S. Hartrumpf Main idea Main idea GIRSA-WP InSicht (Hartrumpf, 2005) Semantic filter • open-domain QA system Experiments • based on matching semantic network representations and Results of question and documents Conclusions • supports question decomposition References e.g. temporal or geographical constraints + GIRSA (Leveling and Hartrumpf, 2008) • textual GIR system • supports methods to boost recall e.g. normalizing location indicators • supports methods to boost precision e.g. metonymy recognition = GIRSA-WP (GIRSA for Wikipedia) • automatic combination of InSicht and GIRSA J. Leveling, S. Hartrumpf GIRSA-WP 2/9
  • 3. GIRSA-WP J. Leveling, S. Hartrumpf GIRSA-WP Main idea GIRSA-WP Semantic filter Experiments and Results • applies semantic filter on answer candidates Conclusions • merges results from InSicht and GIRSA by using the References maximum score of documents • returns list of Wikipedia article names • simple multilingual approach: follow German Wikipedia links to articles in English and Portuguese J. Leveling, S. Hartrumpf GIRSA-WP 3/9
  • 4. GIRSA-WP J. Leveling, S. Hartrumpf Semantic filter (1/2) Main idea GIRSA-WP • in QA: check expected answer type of answer Semantic filter candidates Experiments and Results • for GIRSA-WP: check semantic answer types Conclusions (semantic sort and features, see Helbig (2006)) References • extract word representing the answer type from topic title and description (the first noun not a proper noun) • parse these words with WOCADI, a syntactico-semantic parser (includes a disambiguation of words) and find semantic features corresponding to the extracted words • parse the answer candidates (titles of Wikipedia articles) and determine their semantic features • test if unification of semantic features succeeds; discard answer candidate, otherwise J. Leveling, S. Hartrumpf GIRSA-WP 4/9
  • 5. GIRSA-WP J. Leveling, S. Hartrumpf Semantic filter (2/2) Main idea GIRSA-WP • Which Swiss cantons border Germany? Semantic filter Experiments → extracted word: cantons and Results • parse result: corresponding concept is canton Conclusions • artificial geographical entity or regional institution References • legal-person:+, movable:–, etc. • answer candidate Cross-Border-Leasing: • prototypical-theoretical-concept • legal-person:–, movable:– → semantic features not unifiable • answer candidate Aargau: → unifiable semantic features J. Leveling, S. Hartrumpf GIRSA-WP 5/9
  • 6. GIRSA-WP J. Leveling, S. Hartrumpf Experiments and results Main idea GIRSA-WP Semantic filter Experiments • six runs submitted: and Results three with threshold score of 0.01 and Conclusions References varied settings for stemming, location name normalization, and noun decompounding; additional three experiments with threshold score of 0.03 • 798 (372) answers found • 79 correct answers in best run J. Leveling, S. Hartrumpf GIRSA-WP 6/9
  • 7. GIRSA-WP J. Leveling, S. Hartrumpf Conclusions (1/2) Main idea GIRSA-WP Semantic filter GikiP topics Experiments • are at least as difficult as QA or GeoCLEF topics and Results Conclusions • aim at a wider range of expected answer types References • include complex geographic relations (GP2: outside, GP4: on the border ), restrictions on measurable properties (GP3: more than, GP13: longer than), and temporal constraints (GP9: Renaissance, GP15: between 1980 and 1990) ⇒ new challenge for QA and GIR community J. Leveling, S. Hartrumpf GIRSA-WP 7/9
  • 8. GIRSA-WP J. Leveling, S. Hartrumpf Conclusions (2/2) Main idea GIRSA-WP • GIRSA: Semantic filter • indexing single sentences was meant to ensure a high Experiments and Results precision (but did not work); Conclusions • geographic entities have not been annotated at all in the References Wikipedia documents • InSicht: • important information is given in tables (like inhabitant numbers), but WOCADI ignores these • the semantic matching approach is still too strict for the IR oriented parts of GikiP queries (similarly for GeoCLEF) ⇒ tasks for future work J. Leveling, S. Hartrumpf GIRSA-WP 8/9
  • 9. GIRSA-WP J. Leveling, S. Hartrumpf Selected References Main idea GIRSA-WP Hartrumpf, S. (2005). Question answering using sentence parsing and Semantic filter semantic network matching. In Multilingual Information Access for Experiments Text, Speech and Images: 5th Workshop of the Cross-Language and Results Evaluation Forum, CLEF 2004 (edited by Peters, C.; Clough, P.; Conclusions Gonzalo, J.; Jones, G. J. F.; Kluck, M.; and Magnini, B.), volume 3491 References of LNCS, pp. 512–521. Berlin: Springer. Helbig, H. (2006). Knowledge Representation and the Semantics of Natural Language. Berlin: Springer. Leveling, J. and Hartrumpf, S. (2008). Inferring location names for geographic information retrieval. In Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007 (edited by Peters, C.; Jijkoun, V.; Mandl, T.; Müller, H.; Oard, D. W.; Peñas, A.; Petras, V.; and Santos, D.), volume 5152 of LNCS, pp. 773–780. Berlin: Springer. J. Leveling, S. Hartrumpf GIRSA-WP 9/9