SlideShare ist ein Scribd-Unternehmen logo
1 von 22
Evaluating Entity Summarization
      Using a Game-Based Ground Truth
           Andreas Thalhammer¹, Magnus Knuth²,
                     and Harald Sack²




                             ¹ University of Innsbruck, Austria
13 Nov. 2012
ISWC 2012 Boston             ² Hasso Plattner Institute Potsdam, Germany
Google: “Get the best summary” [1]
  • Inglourious Basterds (Movie)
  • Freebase: 1279 triples
  • DBpedia: 217 triples


   • Google Knowledge Graph
     summary: 14 triples

13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   2
Entity Summarization
• First attempt towards a definition:
“... not just represent the main themes of the
original data, but rather, can best identify the
underlying entity” [2]

                       Is this a good definition?



13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   3
Entity Summarization (cont.)
   “A summary can be loosely defined as a text that is
   produced from one or more texts, that conveys
   important information in the original text(s), and
   that is no longer than half of the original text(s) and
   usually significantly less than that.” [3]

               A summary is
                 • short
                 • and conveys important information.

13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   4
Entity Summarization (cont.)
• Our (loose) definition:
    “Entity summarization is the task of producing a
    summary that conveys important facts about the
    entity while reducing the number of shown facts
    significantly.”




13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   5
The Problem: Evaluation
  • How do we make different summarization
    systems comparable?

           Sub-question:
         • How do we grasp the idea of “important
            facts”?



13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   6
Related Work
• RELIN: Relatedness and Informativeness-based
  Centrality for Entity Summarization [2]

      – Intrinsic: 24 users compiled summaries of 149
        entities (forming a gold standard)
        (Intersection-based similarity)

      – Extrinsic: 47 pairs of FB and DBpedia entities were
        selected (24 correct ones, 23 incorrect ones).
                      Users judge whether pairs are correct or not.

13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   7
Related Work (cont.)
• Towards exploratory video search using linked
  data [4]

      – Quantitative evaluation of heuristics
         Ground truth, containing 115 entities
        summarized by 72 users.
      – Precision/Recall similarity measure



13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   8
Related Work (cont.)
• It is hard to find participants.
• Generating summaries is a cumbersome
  process.
• Only a subset of property-value pairs are
  ranked by the users.
• Up to this point, none of the two evaluation
  datasets is publically available.


13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   9
Our Idea
• Important facts are commonly known
• Unimportant facts are rarely known

• How to find out?




                      Linked Data quiz game!
13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   10
Hypothesis
“A game-based ground truth is suitable for
evaluating the performance of summarization
approaches in the movie domain”

Assumption: implemented approaches correlate
with the game-based ground truth while random
summaries do not.


13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   11
Dataset
• 60 arbitrary selected movies from IMDb Top250
• RDF descriptions from Freebase
• Usage of a property white list
• Triple store: Ontotext’s OWLIM with OWL2-RL
  reasoning enabled.
• Property chains:
               <http://some-name.space/hasActor>
               <http://www.w3.org/2002/07/owl#propertyChainAxiom> (
               <http://rdf.freebase.com/ns/film.film.starring>
               <http://rdf.freebase.com/ns/film.performance.actor> ).
                 All data is available at: http://yovisto.com/labs/iswc2012
13 Nov. 2012    Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   12
WhoKnows?Movies!
            S                        P                        O
:The_Princess_Bride         prop:actor            :Billy_Crystal, ...
:Braveheart                 prop:actor            :Mel_Gibson, ...
:Pulp_Fiction               prop:actor            :John_Travolta .


  •      Question types:
                 - One-to-One
                 - One-to-N
  •      Questions are composed
         upside down:
                  ‘Object is the property of subject1,
                  subject2, subject3’
                     Play the game at: http://bit.ly/WhoKnowsMovies
  13 Nov. 2012      Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   13
Frequency == Importance ???
                                                                      word       upper          lower

• Information retrieval:                                         frequency       cut-off        cut-off



      – Luhn (1958):
        “resolving power of words” [5]
                                                                                                          ranking by
                                                                                                          word frequency

• Game supports half-knowledge in general
      – e.g. which movie was released 1994?
        Monsters, Inc. – Pulp Fiction – Casablanca
      – ... but the human brain performs better with
            pictures (actors), sounds (film music), ...
13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston             14
Evaluated Systems
• UBES (Usage-based Entity Summarization) [5]
      – Combine Freebase with HetRec2011 MovieLens2k [6]
      – Use item-based collaborative filtering to form
        neighborhoods for each movie
      – Find out which property-value pairs a movie shares
        with its neighbors
      – Use a TF-IDF related weighting scheme
                                    Bob          Alice         Marc          Elena         John     Mary
                Pulp Fiction            1            0             1              0            1      1
                Heat                    0            0             1              1            0      0
                Kill Bill               1            0             1              0            1      0
13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston          15
Evaluated Systems (cont.)
  • GKG (Google Knowledge Graph) [1]
        – Enables semi-automatic transformation to Freebase
/search?hl=en&q=quentin+tarantino&
stick=H4sIAAAAAAAAAONgVuLQz9U3MLM0zgEA_
sQyxwwAAAA&
sa=X&ei=FnjTT7rXN8jftAaAhPWIDw&
ved=0CKwBEJsTKAA

        – base64 + gzip
         /m/0693l
                        http://www.freebase.com/view/m/0693l
                        redirects to:
                        http://www.freebase.com/view/en/quentin_tarantino
  13 Nov. 2012    Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   16
Results
• 690 sessions, 8308 questions
• 217 players (135 players played only once)
• 2314 of 2829 triples were played more than 3
  times




13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   17
Result: Kendall’s τ
• Property ranking:



• Feature (property-value) ranking:




13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   18
Conclusion
• The results indicate that a game-based ground
  truth is suitable for evaluating entity
  summarization.

• The current dataset is too sparse to make valid
  assumptions about the importance of single
  facts.


13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   19
Future Work
•    Increase the number of players
•    Score the exclusion principle
•    Increase the number of movies
•    Application to additional domains
•    Publish new versions of the evaluation dataset
     on a regular basis



13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   20
Questions?

                              Help collecting data:
                  http://bit.ly/WhoKnowsMovies


                                              Andreas Thalhammer (andreas.thalhammer@sti2.at)
                                              Magnus Knuth (magnus.knuth@hpi.uni-potsdam.de)
                                                   Harald Sack (harald.sack@hpi.uni-potsdam.de)

13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   21
References
     [1] Singhal, A.: Introducing the knowledge graph: things, not strings (2012),
          http://googleblog.blogspot.com/2012/05/introducing-knowledge-graph-things-not.html
     [2] Cheng, G., Tran, T., Qu, Y.: RELIN: Relatedness and Informativeness-Based Centrality for Entity
          Summarization. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N.,
          Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 114–129. Springer, Heidelberg
          (2011)
     [3] Dragomir R. Radev, Eduard Hovy, and Kathleen McKeown. 2002. Introduction to the special
          issue on summarization. Comput. Linguist. 28, 4 (December 2002), 399-408.
          DOI=10.1162/089120102762671927 http://dx.doi.org/10.1162/089120102762671927
     [4] Waitelonis, J., Sack, H.: Towards exploratory video search using linked data. Multimedia Tools
          and Applications 59, 645–672 (2012), 10.1007/s11042-011-0733-1
     [5] Thalhammer, A., Toma, I., Roa-Valverde, A.J., Fensel, D.: Leveraging usage data for linked data
          movie entity summarization. In: Proc. of the 2nd Int. Ws. on Usage Analysis and the Web of
          Data (USEWOD 2012) co-located with WWW 2012, Lyon, France, vol. abs/1204.2718 (2012)
     [6] Cantador, I., Brusilovsky, P., Kuflik, T.: 2nd ws. on information heterogeneity and fusion in
          recommender systems (hetrec 2011). In: Proc. of 5th ACM Conf. on Recommender systems,
          RecSys 2011. ACM, New York (2011)



13 Nov. 2012      Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   22

Weitere ähnliche Inhalte

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Kürzlich hochgeladen (20)

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 

Empfohlen

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Empfohlen (20)

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 

Evaluating Entity Summarization Using a Game-Based Ground Truth

  • 1. Evaluating Entity Summarization Using a Game-Based Ground Truth Andreas Thalhammer¹, Magnus Knuth², and Harald Sack² ¹ University of Innsbruck, Austria 13 Nov. 2012 ISWC 2012 Boston ² Hasso Plattner Institute Potsdam, Germany
  • 2. Google: “Get the best summary” [1] • Inglourious Basterds (Movie) • Freebase: 1279 triples • DBpedia: 217 triples • Google Knowledge Graph summary: 14 triples 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 2
  • 3. Entity Summarization • First attempt towards a definition: “... not just represent the main themes of the original data, but rather, can best identify the underlying entity” [2] Is this a good definition? 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 3
  • 4. Entity Summarization (cont.) “A summary can be loosely defined as a text that is produced from one or more texts, that conveys important information in the original text(s), and that is no longer than half of the original text(s) and usually significantly less than that.” [3] A summary is • short • and conveys important information. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 4
  • 5. Entity Summarization (cont.) • Our (loose) definition: “Entity summarization is the task of producing a summary that conveys important facts about the entity while reducing the number of shown facts significantly.” 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 5
  • 6. The Problem: Evaluation • How do we make different summarization systems comparable? Sub-question: • How do we grasp the idea of “important facts”? 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 6
  • 7. Related Work • RELIN: Relatedness and Informativeness-based Centrality for Entity Summarization [2] – Intrinsic: 24 users compiled summaries of 149 entities (forming a gold standard) (Intersection-based similarity) – Extrinsic: 47 pairs of FB and DBpedia entities were selected (24 correct ones, 23 incorrect ones).  Users judge whether pairs are correct or not. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 7
  • 8. Related Work (cont.) • Towards exploratory video search using linked data [4] – Quantitative evaluation of heuristics  Ground truth, containing 115 entities summarized by 72 users. – Precision/Recall similarity measure 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 8
  • 9. Related Work (cont.) • It is hard to find participants. • Generating summaries is a cumbersome process. • Only a subset of property-value pairs are ranked by the users. • Up to this point, none of the two evaluation datasets is publically available. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 9
  • 10. Our Idea • Important facts are commonly known • Unimportant facts are rarely known • How to find out?  Linked Data quiz game! 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 10
  • 11. Hypothesis “A game-based ground truth is suitable for evaluating the performance of summarization approaches in the movie domain” Assumption: implemented approaches correlate with the game-based ground truth while random summaries do not. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 11
  • 12. Dataset • 60 arbitrary selected movies from IMDb Top250 • RDF descriptions from Freebase • Usage of a property white list • Triple store: Ontotext’s OWLIM with OWL2-RL reasoning enabled. • Property chains: <http://some-name.space/hasActor> <http://www.w3.org/2002/07/owl#propertyChainAxiom> ( <http://rdf.freebase.com/ns/film.film.starring> <http://rdf.freebase.com/ns/film.performance.actor> ). All data is available at: http://yovisto.com/labs/iswc2012 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 12
  • 13. WhoKnows?Movies! S P O :The_Princess_Bride prop:actor :Billy_Crystal, ... :Braveheart prop:actor :Mel_Gibson, ... :Pulp_Fiction prop:actor :John_Travolta . • Question types: - One-to-One - One-to-N • Questions are composed upside down: ‘Object is the property of subject1, subject2, subject3’ Play the game at: http://bit.ly/WhoKnowsMovies 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 13
  • 14. Frequency == Importance ??? word upper lower • Information retrieval: frequency cut-off cut-off – Luhn (1958): “resolving power of words” [5] ranking by word frequency • Game supports half-knowledge in general – e.g. which movie was released 1994? Monsters, Inc. – Pulp Fiction – Casablanca – ... but the human brain performs better with pictures (actors), sounds (film music), ... 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 14
  • 15. Evaluated Systems • UBES (Usage-based Entity Summarization) [5] – Combine Freebase with HetRec2011 MovieLens2k [6] – Use item-based collaborative filtering to form neighborhoods for each movie – Find out which property-value pairs a movie shares with its neighbors – Use a TF-IDF related weighting scheme Bob Alice Marc Elena John Mary Pulp Fiction 1 0 1 0 1 1 Heat 0 0 1 1 0 0 Kill Bill 1 0 1 0 1 0 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 15
  • 16. Evaluated Systems (cont.) • GKG (Google Knowledge Graph) [1] – Enables semi-automatic transformation to Freebase /search?hl=en&q=quentin+tarantino& stick=H4sIAAAAAAAAAONgVuLQz9U3MLM0zgEA_ sQyxwwAAAA& sa=X&ei=FnjTT7rXN8jftAaAhPWIDw& ved=0CKwBEJsTKAA – base64 + gzip  /m/0693l http://www.freebase.com/view/m/0693l redirects to: http://www.freebase.com/view/en/quentin_tarantino 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 16
  • 17. Results • 690 sessions, 8308 questions • 217 players (135 players played only once) • 2314 of 2829 triples were played more than 3 times 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 17
  • 18. Result: Kendall’s τ • Property ranking: • Feature (property-value) ranking: 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 18
  • 19. Conclusion • The results indicate that a game-based ground truth is suitable for evaluating entity summarization. • The current dataset is too sparse to make valid assumptions about the importance of single facts. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 19
  • 20. Future Work • Increase the number of players • Score the exclusion principle • Increase the number of movies • Application to additional domains • Publish new versions of the evaluation dataset on a regular basis 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 20
  • 21. Questions? Help collecting data: http://bit.ly/WhoKnowsMovies Andreas Thalhammer (andreas.thalhammer@sti2.at) Magnus Knuth (magnus.knuth@hpi.uni-potsdam.de) Harald Sack (harald.sack@hpi.uni-potsdam.de) 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 21
  • 22. References [1] Singhal, A.: Introducing the knowledge graph: things, not strings (2012), http://googleblog.blogspot.com/2012/05/introducing-knowledge-graph-things-not.html [2] Cheng, G., Tran, T., Qu, Y.: RELIN: Relatedness and Informativeness-Based Centrality for Entity Summarization. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N., Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 114–129. Springer, Heidelberg (2011) [3] Dragomir R. Radev, Eduard Hovy, and Kathleen McKeown. 2002. Introduction to the special issue on summarization. Comput. Linguist. 28, 4 (December 2002), 399-408. DOI=10.1162/089120102762671927 http://dx.doi.org/10.1162/089120102762671927 [4] Waitelonis, J., Sack, H.: Towards exploratory video search using linked data. Multimedia Tools and Applications 59, 645–672 (2012), 10.1007/s11042-011-0733-1 [5] Thalhammer, A., Toma, I., Roa-Valverde, A.J., Fensel, D.: Leveraging usage data for linked data movie entity summarization. In: Proc. of the 2nd Int. Ws. on Usage Analysis and the Web of Data (USEWOD 2012) co-located with WWW 2012, Lyon, France, vol. abs/1204.2718 (2012) [6] Cantador, I., Brusilovsky, P., Kuflik, T.: 2nd ws. on information heterogeneity and fusion in recommender systems (hetrec 2011). In: Proc. of 5th ACM Conf. on Recommender systems, RecSys 2011. ACM, New York (2011) 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 22