This paper describes the University of Sheffield's submission to the SemEval 2016 Twitter Stance Detection weakly supervised task (SemEval 2016 Task 6, Subtask B). In stance detection, the goal is to classify the stance of a tweet towards a target as "favor", "against", or "none". In Subtask B, the targets in the test data are different from the targets in the training data, thus rendering the task more challenging but also more realistic.
To address the lack of target-specific training data, we use a large set of unlabelled tweets containing all targets and train a bag-of-words autoencoder to learn how to produce feature representations of tweets. These feature representations are then used to train a logistic regression classifier on labelled tweets, with additional features such as an indicator of whether the target is contained in the tweet. Our submitted run on the test data achieved an F1 of 0.3270.
Paper: http://isabelleaugenstein.github.io/papers/SemEval2016-Stance.pdf
USFD at SemEval-2016 Task 6: Any-Target Stance Detection on Twitter with Autoencoders
Isabelle Augenstein, Andreas Vlachos, Kalina Bontcheva
i.augenstein@ucl.ac.uk, {a.vlachos | k.bontcheva}@sheffield.ac.uk
Stance Detection Subtask B
Classify attitude of tweet towards target as “favor”, “against”, “none”
Tweet: “No more Hillary Clinton” Target: Donald Trump Stance: FAVOR
Subtask A training targets: Climate Change is a Real Concern, Feminist
Movement, Atheism, Legalization of Abortion, Hillary Clinton
Subtask B testing target: Donald Trump
Challenges
• Labelled data not available for the test target
• Manual labelling of training data not allowed
• Target does not always appear in tweet
Feature Extraction
• Aut-twe: auto-encoded tweet, 100d feature vector
• targetInTweet: is (shortened) target contained in tweet
• Good indicator for non-neutral stance
• Other features tested (not used for final run): WordNet-
Affect gazetteers, emoticon detection
• Baselines: bag of words, word2vec (trained on same data
as autoencoder)
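The targetInTweet indicator can be sketched as follows. This is a hypothetical reconstruction for illustration, not the released code; the function name and the word-overlap shortening are assumptions.

```python
# Hypothetical sketch of the targetInTweet feature: 1.0 if any word of the
# (shortened) target appears in the tweet, else 0.0. Names are illustrative.

def target_in_tweet(tweet_tokens, target):
    """Binary indicator: does the tweet mention the target?"""
    # Shorten the target to its lowercased words,
    # e.g. "Donald Trump" -> {"donald", "trump"}
    target_words = {w.lower() for w in target.split()}
    tweet_words = {t.lower() for t in tweet_tokens}
    return 1.0 if target_words & tweet_words else 0.0

print(target_in_tweet(["No", "more", "Hillary", "Clinton"], "Hillary Clinton"))  # 1.0
print(target_in_tweet(["No", "more", "Hillary", "Clinton"], "Donald Trump"))     # 0.0
```

This single extra dimension is concatenated with the 100d Aut-twe vector before classification.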
Results
[Figure: Model comparison bar charts, Macro F1 (0–0.45) on Hillary Clinton (dev) and Donald Trump (test), comparing BoW, BoW+inTwe, Word2Vec, Aut-twe, and Aut-twe+inTwe]
Conclusions
• It is important to detect whether the target is mentioned in the tweet
• Hillary Clinton: 0.4538 F1 (inTwe) vs 0.3243 F1 (not inTwe)
• Donald Trump: 0.3745 F1 (inTwe) vs 0.2377 F1 (not inTwe)
• Autoencoder can help to detect stance towards unseen targets
• Developing a method for new targets without labelled training
data is challenging: what works on the dev set does not always
carry over to the test set
• Future work: better incorporate the target for stance detection
Acknowledgements
This work was partially supported by the European Union, grant agreement
No. 611233 PHEME (http://www.pheme.eu)
Data
• 5 628 labelled train tweets about Subtask A
targets
• 1 278 about Hillary Clinton, used for dev
• 278 013 unlabelled Donald Trump tweets
• 395 212 collected unlabelled tweets about all
targets
• Keywords: hillary, clinton, trump, climate,
femini, aborti
• 707 Donald Trump test tweets
Preprocessing
• Phrase detection: Train phrase detection model on unlabelled
+labelled tweets, e.g. “donald”, “trump” → “donald trump”
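The phrase-scoring rule from the cited Mikolov et al. (2013) paper can be sketched in a few lines; the counts, `delta` discount, and `threshold` below are illustrative, not the values used in the system.

```python
# Sketch of Mikolov et al. (2013) phrase detection:
# score(a, b) = (count(ab) - delta) / (count(a) * count(b)),
# and bigrams scoring above a threshold become phrases like "donald_trump".
from collections import Counter

def find_phrases(sentences, delta=1, threshold=0.1):
    unigrams, bigrams = Counter(), Counter()
    for sent in sentences:
        unigrams.update(sent)
        bigrams.update(zip(sent, sent[1:]))
    phrases = set()
    for (a, b), n_ab in bigrams.items():
        score = (n_ab - delta) / (unigrams[a] * unigrams[b])
        if score > threshold:
            phrases.add((a, b))
    return phrases

sents = [["donald", "trump", "2016"]] * 5 + [["vote", "trump"]]
# "donald" and "trump" co-occur consistently, so they score as a phrase
print(("donald", "trump") in find_phrases(sents))  # True
```

In practice the model is trained on the unlabelled+labelled tweets, and detected phrases are joined with an underscore before vectorisation.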
Autoencoder
• Bag-of-words autoencoder, using the 50 000 most
frequent words
• trained on unlabelled+labelled tweets
• Input vector: dimensionality 50 000. For each word
in vocabulary, does tweet contain the word or not
• One hidden layer (size 100), output size 100
• Trained encoder is applied to labelled train and
test data to obtain 100d features, decoder not used
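A minimal pure-Python sketch of such a bag-of-words autoencoder is below. Dimensions are shrunk (vocabulary 6, hidden size 3) from the poster's 50 000 and 100 for illustration, the real system used a neural-network library, and the sigmoid units, squared-error loss, and learning rate here are assumptions.

```python
# Minimal bag-of-words autoencoder sketch (illustrative dimensions and
# hyperparameters; the submitted system used 50 000d inputs and 100d codes).
import math
import random

random.seed(0)

V, H = 6, 3    # vocabulary size, hidden size
LR = 0.5       # learning rate (illustrative)

# encoder weights W1 (V x H), decoder weights W2 (H x V)
W1 = [[random.uniform(-0.1, 0.1) for _ in range(H)] for _ in range(V)]
W2 = [[random.uniform(-0.1, 0.1) for _ in range(V)] for _ in range(H)]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def encode(x):
    return [sigmoid(sum(x[i] * W1[i][j] for i in range(V))) for j in range(H)]

def decode(h):
    return [sigmoid(sum(h[j] * W2[j][i] for j in range(H))) for i in range(V)]

def train_step(x):
    h = encode(x)
    y = decode(h)
    # backprop of squared reconstruction error through sigmoid units
    d_out = [(y[i] - x[i]) * y[i] * (1 - y[i]) for i in range(V)]
    d_hid = [sum(d_out[i] * W2[j][i] for i in range(V)) * h[j] * (1 - h[j])
             for j in range(H)]
    for j in range(H):
        for i in range(V):
            W2[j][i] -= LR * d_out[i] * h[j]
    for i in range(V):
        for j in range(H):
            W1[i][j] -= LR * d_hid[j] * x[i]

# binary bag-of-words vectors: does the tweet contain each vocabulary word?
tweets = [[1, 1, 0, 0, 1, 0], [0, 0, 1, 1, 0, 1]]
for _ in range(200):
    for x in tweets:
        train_step(x)

# only the trained encoder is reused downstream; the decoder is discarded
features = encode(tweets[0])
print(len(features))  # 3
```

As on the poster, only `encode` is applied to the labelled train and test tweets to produce the classifier features.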
Model                      Macro F1
Majority class (official)  0.2972
SVM n-grams (official)     0.2843
BoW                        0.3453
Aut-twe (submitted)        0.3307
References
• Code: https://github.com/sheffieldnlp/stance-semeval2016
• Phrases: Mikolov et al. (2013). Distributed Representations
of Words and Phrases and Their Compositionality. NIPS.
Tweets
“No more Hillary Clinton”, “Donald Trump”, “FAVOR”
Preprocessing: [“No”, “more”, “Hillary_Clinton”]
Autoencoder Training
[america: 0, …, Hillary_Clinton: 1] 50 000d input
[0, 0, …, 1] 100d hidden layer
[0, 1, …, 1] 100d output layer
Feature Extraction
Autoencoder features: [0, 1, …, 1]   inTwe: 0
Logistic Regression Model
Predictions
“#voteTrump (…)”, “Donald Trump”, “FAVOR”
“youre fired (…)” “Donald Trump”, “AGAINST”
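The final step of the pipeline above, training a logistic regression classifier on the concatenated autoencoder and inTwe features, can be sketched as follows. For brevity this is a binary favor-vs-against classifier in pure Python with made-up 3d features; the actual system used 100d features and three stance classes, and all numbers below are illustrative.

```python
# Sketch of the classification step: logistic regression over concatenated
# [autoencoder features, inTwe indicator]. Binary (FAVOR vs AGAINST) and
# 3d features here for brevity; all feature values are made up.
import math
import random

random.seed(0)

def predict_prob(w, b, x):
    """P(FAVOR | x) under a logistic model."""
    z = b + sum(wi * xi for wi, xi in zip(w, x))
    return 1.0 / (1.0 + math.exp(-z))

def train_logreg(X, y, lr=0.5, epochs=200):
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        for x, t in zip(X, y):
            g = predict_prob(w, b, x) - t   # gradient of the log loss
            b -= lr * g
            w = [wi - lr * g * xi for wi, xi in zip(w, x)]
    return w, b

# each row: 3 autoencoder features (100d in the paper) + inTwe indicator
X = [[0.9, 0.1, 0.2, 1.0],   # FAVOR, target mentioned
     [0.8, 0.2, 0.1, 1.0],   # FAVOR, target mentioned
     [0.1, 0.9, 0.8, 0.0],   # AGAINST, target absent
     [0.2, 0.8, 0.9, 0.0]]   # AGAINST, target absent
y = [1, 1, 0, 0]             # 1 = FAVOR, 0 = AGAINST

w, b = train_logreg(X, y)
print(predict_prob(w, b, [0.85, 0.15, 0.2, 1.0]) > 0.5)  # True
```

The trained model is then applied to the Donald Trump test tweets to produce the stance predictions shown above.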