SlideShare ist ein Scribd-Unternehmen logo
1 von 12
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
gathering gold standard annotations for relation extraction	

Crowd Truth
Harnessing Disagreement in
Crowdsourcing
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Gold Standard
Assumption
• typically in cognitive systems
• for each annotated instance there is a single right answer
• gold standard quality can be measured in inter-annotator
agreement
Let them disagree?
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Hypothesis
Annotator disagreement is not noise, but signal.
Not a problem to overcome but a source of information for machines
Artificially restricting humans does not help machines to learn.
They will learn better from diversity
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Position
disagreement is a sign of
intrinsic vagueness & ambiguity in human understanding
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Approach Principles
1.  Tolerate, capture & exploit disagreement
2.  Understand it by a space of possibilities (frequencies & similarities)
3.  Score the machine output based on where it falls in this space
4.  Adapt to new annotation tasks
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Relation Extraction
crowdsourcing gold standard data
Relations overlap in meaning
Sentences are vague and ambiguous
Experts have different interpretations
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Feeling the way the CHEST expands (PALPATION), can identify areas of
the lung that are full of fluid.
?PALPATIONIs CHEST related to
diagnose location associated
with
is_a otherpart_of
0 0 02 3 0 0 0 1 0 0 44 1
?CONJUNCTIVITISHYPERAEMIA related toIs
0 0 0 1 0 0 0 013 0 0 0 0 0
symptomcause
Redness (HYPERAEMIA), irritation (chemosis) and watering (epiphora)
of the eyes are symptoms common to all forms of CONJUNCTIVITIS.
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Harnessing Disagreement
• Sentence-relation score: core crowd truth metric for relation extraction, measured for each relation on
each sentence as the cosine of the unit vector for relation with sentence vector
• Sentence clarity: for each sentence - max relation score for that sentence. If all the workers selected the
same relation for a sentence, the max score is 1, indicating a clear sentence
• Relation similarity: pairwise conditional probability that if relation Ri is annotated in a sentence, Rj is as
well. Indicates how confusable the linguistic expression of two relations are
• Relation ambiguity: max relation similarity for a relation. If a relation is clear it has low score
• Relation clarity: max sentence-relation score for a relation over all sentences. If a relation has a high
clarity score, it means that it is at least possible to express the relation clearly
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
The Dark Side of Crowdsourcing
Disagreement
• spammers generate disagreement for the wrong reasons
• most spam detection requires gold standard
• Worker-sentence disagreement: the average of all the cosines between each
worker’s sentence vector and the full sentence vector (minus that worker).
Indicates how much a worker disagrees with the crowd on a sentence basis
• Worker-worker disagreement: a pairwise confusion matrix between workers
and the average agreement across the matrix for each worker. Indicates
whether there are consistently like-minded workers
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Questions?

Weitere ähnliche Inhalte

Andere mochten auch

Keynote at SMAP2012: Personalized Access to TV Content
Keynote at SMAP2012: Personalized Access to TV ContentKeynote at SMAP2012: Personalized Access to TV Content
Keynote at SMAP2012: Personalized Access to TV Content
Lora Aroyo
 
AGORA Project: Final Review 2012
AGORA Project: Final Review 2012AGORA Project: Final Review 2012
AGORA Project: Final Review 2012
Lora Aroyo
 

Andere mochten auch (8)

Keynote at SMAP2012: Personalized Access to TV Content
Keynote at SMAP2012: Personalized Access to TV ContentKeynote at SMAP2012: Personalized Access to TV Content
Keynote at SMAP2012: Personalized Access to TV Content
 
Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Agora User Committee Meeting 2013
Agora User Committee Meeting 2013
 
SealincMedia Accurator Demos
SealincMedia Accurator DemosSealincMedia Accurator Demos
SealincMedia Accurator Demos
 
AGORA Project: Final Review 2012
AGORA Project: Final Review 2012AGORA Project: Final Review 2012
AGORA Project: Final Review 2012
 
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
 
Europeana Tech 2011
Europeana Tech 2011Europeana Tech 2011
Europeana Tech 2011
 
Stitch by Stitch: Annotating Fashion at the Rijksmuseum
Stitch by Stitch: Annotating Fashion at the RijksmuseumStitch by Stitch: Annotating Fashion at the Rijksmuseum
Stitch by Stitch: Annotating Fashion at the Rijksmuseum
 
DIVE+: Explorative Search for Digital Humanities
DIVE+: Explorative Search for Digital HumanitiesDIVE+: Explorative Search for Digital Humanities
DIVE+: Explorative Search for Digital Humanities
 

Ähnlich wie WebSci2013 Harnessing Disagreement in Crowdsourcing

RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
Zachary Schendel
 
Recsys Presentation
Recsys PresentationRecsys Presentation
Recsys Presentation
Neal Lathia
 

Ähnlich wie WebSci2013 Harnessing Disagreement in Crowdsourcing (20)

Dartmouth 2018 writing assessment presentation Les Perelman
  Dartmouth 2018 writing assessment presentation Les Perelman  Dartmouth 2018 writing assessment presentation Les Perelman
Dartmouth 2018 writing assessment presentation Les Perelman
 
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffnL6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
 
School Essay Essays Format. Online assignment writing service.
School Essay Essays Format. Online assignment writing service.School Essay Essays Format. Online assignment writing service.
School Essay Essays Format. Online assignment writing service.
 
Communities of Trust - from regulation to cooperation
Communities of Trust - from regulation to cooperationCommunities of Trust - from regulation to cooperation
Communities of Trust - from regulation to cooperation
 
the relevance theory- pragmatics
the relevance theory- pragmaticsthe relevance theory- pragmatics
the relevance theory- pragmatics
 
kiranppt-170704170919 (1).pdf
kiranppt-170704170919 (1).pdfkiranppt-170704170919 (1).pdf
kiranppt-170704170919 (1).pdf
 
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
 
Sample Self Evaluation Essay.pdf
Sample Self Evaluation Essay.pdfSample Self Evaluation Essay.pdf
Sample Self Evaluation Essay.pdf
 
Cbse Class 7 English Essay
Cbse Class 7 English EssayCbse Class 7 English Essay
Cbse Class 7 English Essay
 
Essay On Exam Stress. Online assignment writing service.
Essay On Exam Stress. Online assignment writing service.Essay On Exam Stress. Online assignment writing service.
Essay On Exam Stress. Online assignment writing service.
 
Example Of Event Report Essay
Example Of Event Report EssayExample Of Event Report Essay
Example Of Event Report Essay
 
Recsys Presentation
Recsys PresentationRecsys Presentation
Recsys Presentation
 
Xmas Writing Paper
Xmas Writing PaperXmas Writing Paper
Xmas Writing Paper
 
CrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
CrowdTruth Tutorial: Using the Crowd to Understand AmbiguityCrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
CrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
 
Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16
Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16
Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16
 
Semantic Patterns for Sentiment Analysis of Twitter
Semantic Patterns for Sentiment Analysis of TwitterSemantic Patterns for Sentiment Analysis of Twitter
Semantic Patterns for Sentiment Analysis of Twitter
 
Dialogue based Meaning Negotiation
Dialogue based Meaning NegotiationDialogue based Meaning Negotiation
Dialogue based Meaning Negotiation
 
Puppy Writing Stationary Writing, Puppies, Words
Puppy Writing Stationary Writing, Puppies, WordsPuppy Writing Stationary Writing, Puppies, Words
Puppy Writing Stationary Writing, Puppies, Words
 
IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.
IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.
IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.
 
2000 Word Essay How Long Introduction. Online assignment writing service.
2000 Word Essay How Long Introduction. Online assignment writing service.2000 Word Essay How Long Introduction. Online assignment writing service.
2000 Word Essay How Long Introduction. Online assignment writing service.
 

Mehr von Lora Aroyo

Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Lora Aroyo
 
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort ZoneMy ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
Lora Aroyo
 

Mehr von Lora Aroyo (20)

NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdfNeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
 
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine LearningCATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
 
Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)
 
Data excellence: Better data for better AI
Data excellence: Better data for better AIData excellence: Better data for better AI
Data excellence: Better data for better AI
 
CHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH SymposiumCHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH Symposium
 
Semantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP DemonstratorSemantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP Demonstrator
 
The Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked DataThe Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked Data
 
Keynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @RijksmuseumKeynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @Rijksmuseum
 
FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18
 
Understanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithmsUnderstanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithms
 
StorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & MachinesStorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & Machines
 
Data Science with Humans in the Loop
Data Science with Humans in the LoopData Science with Humans in the Loop
Data Science with Humans in the Loop
 
Digital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora AroyoDigital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora Aroyo
 
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
 
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
 
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort ZoneMy ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
 
Data Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityData Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden University
 
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New SearchSXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
 
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital AgeEuropeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
 
"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to Snapchat"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to Snapchat
 

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 

WebSci2013 Harnessing Disagreement in Crowdsourcing

  • 1. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo gathering gold standard annotations for relation extraction Crowd Truth Harnessing Disagreement in Crowdsourcing
  • 2. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Gold Standard Assumption • typically in cognitive systems • for each annotated instance there is a single right answer • gold standard quality can be measured in inter-annotator agreement Let them disagree?
  • 3. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Hypothesis Annotator disagreement is not noise, but signal. Not a problem to overcome but a source of information for machines Artificially restricting humans does not help machines to learn. They will learn better from diversity
  • 4. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Position disagreement is a sign of intrinsic vagueness & ambiguity in human understanding
  • 5. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Approach Principles 1.  Tolerate, capture & exploit disagreement 2.  Understand it by a space of possibilities (frequencies & similarities) 3.  Score the machine output based on where it falls in this space 4.  Adapt to new annotation tasks
  • 6. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Relation Extraction crowdsourcing gold standard data Relations overlap in meaning Sentences are vague and ambiguous Experts have different interpretations
  • 7. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
  • 8. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Feeling the way the CHEST expands (PALPATION), can identify areas of the lung that are full of fluid. ?PALPATIONIs CHEST related to diagnose location associated with is_a otherpart_of 0 0 02 3 0 0 0 1 0 0 44 1 ?CONJUNCTIVITISHYPERAEMIA related toIs 0 0 0 1 0 0 0 013 0 0 0 0 0 symptomcause Redness (HYPERAEMIA), irritation (chemosis) and watering (epiphora) of the eyes are symptoms common to all forms of CONJUNCTIVITIS.
  • 9. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
  • 10. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Harnessing Disagreement • Sentence-relation score: core crowd truth metric for relation extraction, measured for each relation on each sentence as the cosine of the unit vector for relation with sentence vector • Sentence clarity: for each sentence - max relation score for that sentence. If all the workers selected the same relation for a sentence, the max score is 1, indicating a clear sentence • Relation similarity: pairwise conditional probability that if relation Ri is annotated in a sentence, Rj is as well. Indicates how confusable the linguistic expression of two relations are • Relation ambiguity: max relation similarity for a relation. If a relation is clear it has low score • Relation clarity: max sentence-relation score for a relation over all sentences. If a relation has a high clarity score, it means that it is at least possible to express the relation clearly
  • 11. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo The Dark Side of Crowdsourcing Disagreement • spammers generate disagreement for the wrong reasons • most spam detection requires gold standard • Worker-sentence disagreement: the average of all the cosines between each worker’s sentence vector and the full sentence vector (minus that worker). Indicates how much a worker disagrees with the crowd on a sentence basis • Worker-worker disagreement: a pairwise confusion matrix between workers and the average agreement across the matrix for each worker. Indicates whether there are consistently like-minded workers
  • 12. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Questions?