SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
GRÉGOIRE BUREL, PAUL MULLHOLAND, HARITH ALANI 
Knowledge Media Institute, The Open University, Milton Keynes, UK.


WWW’16, Montréal, Canada.
11th April 2016
Structural Normalisation Methods for
Improving Best Answer Identification in
Question Answering Communities
Outline
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 
Publications
2	
Identifying Best Answers in Q&A Communities
-  Question Answering Communities
-  Automatic Best Answer Identification
-  Reference Model
Structural Normalisation
-  No Normalisation Vs. Structural Normalisation
-  Normalisation Methods
-  Normalisation Method Selection
Best Answer Identification with Structural Normalisation
-  Models Comparison
-  Features Comparison
Conclusions and Future Work
Q&A Communities
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 
Publications
Question
Answer #1
Answer #2
...
Answer #n
QuestionThread
3	
Q&A communities are communities
composed of askers and answerers looking
for solutions to particular issues.

When looking for answers, users need to
identify if similar questions have already been
answered correctly and see if a best answer
exists.

Unfortunately not all questions have labelled
best answers (43.2% of questions do not have
labelled best answers).

Existing works that have focused on best
answer identification have also generally
ignored the structure of Q&A websites while
designing feature normalisation methods.


?
!
-
Q&A Communities
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 
Publications
Question
Answer #1
Answer #2
...
Answer #n
QuestionThread
4	
Q&A communities are communities
composed of askers and answerers looking
for solutions to particular issues.

When looking for answers, users need to
identify if similar questions have already been
answered correctly and see if a best answer
exists.

Unfortunately not all questions have labelled
best answers (43.2% of questions do not have
labelled best answers).

Existing works that have focused on best
answer identification have also generally
ignored the structure of Q&A websites while
designing feature normalisation methods.


?
!
-
Q&A communities are communities
composed of askers and answerers looking
for solutions to particular issues.

When looking for answers, users need to
identify if similar questions have already been
answered correctly and see if a best answer
exists.

Unfortunately not all questions have labelled
best answers (43.2% of questions do not have
labelled best answers).

Existing works that have focused on best
answer identification have also generally
ignored the structure of Q&A websites for
designing feature normalisation methods.


Q&A Communities
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 
Publications
Question
Answer #1
Answer #2
...
Answer #n
QuestionThread
?
!
-
5
Feature-based Best Answers Identification and Reference Model 
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 
6	
In order to identify best answers features
are extracted and associated with each
answers and a binary classifier is trained
(Burel et al. 2012 , 2013, 2014):
-  Such features are divided into User, Content
and Thread features.
-  30 features are used for the reference model.
-  The Alternating Decision Tree algorithm is
used.
Reference Model Results:
-  The reference model achieve an F1 of 0.817
on average. 
-  Thread features in particular score ratios are
highly related to best answers.
What methods could be used for
improving such model??
Feature-based Best Answers Identification and Reference Model 
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 
7	
In order to identify best answers features are
extracted and associated with each answers
and a binary classifier is trained (Burel et al.
2012 , 2013, 2014):
-  Such features are divided into User, Content and
Thread features.
-  30 features are used for the reference model.
-  The Alternating Decision Tree algorithm is used.
Reference Model Results:
-  The reference model achieve an F1 of 0.817 on
average. 
-  Thread features in particular score ratios are
highly related to best answers.
What methods could be used for improving
such model??
Contributions
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 
8	
Does the thread-like structure of Q&A communities can help the automatic
identification of best answers ?

-  Introduce the concept of structural normalisation, a method for
normalising features based on the structure of Q&A communities.
-  Present 3 different features normalisation methods: 1) Min/Max
normalisation; 2) Order Normalisation, and; 3) Normalised Order
Normalisation.
-  Identify the best normalisation method for automatically identifying best
answers.
-  Compare different models and features sets and show that structural
normalisation improves best answer identification.
?
Structural Normalisation / No Normalisation

Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 
9	
Question
Answer #1
Answer #2
...
Answer #n
QuestionThread
Question
Answer #1
Answer #2
...
Answer #nQuestionThread
Question
Answer #1
Answer #2
...
Answer #nQuestionThread
Question
Answer #1
Answer #2
...
Answer #nQuestionThread
Question
Answer #1
Answer #2
...
Answer #nQuestionThread
Standard classifiers such as the reference model compare all the answers even if
they are not part of the same thread.

Thread-wise normalisation normalise features based on their relation within a
thread.
!
Structural Normalisation / No Normalisation

Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 
10	
Question
Answer #1
Answer #2
...
Answer #n
QuestionThread
Question
Answer #1
Answer #2
...
Answer #nQuestionThread
Question
Answer #1
Answer #2
...
Answer #nQuestionThread
Question
Answer #1
Answer #2
...
Answer #nQuestionThread
Question
Answer #1
Answer #2
...
Answer #nQuestionThread
Standard classifiers such as the reference model compare all the answers even if
they are not part of the same thread.

Thread-wise normalisation normalise features based on their relation within a
thread.
!
Normalisation Methods

Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
Min/Max Normalisation
(Thread-wise min/max)
Normalise features within a thread using
the minimum and maximum value of a
particular feature.
1
 2
Normalised Order
Normalisation
Same as Order Normalisation but
the values are normalised between 0
and 1.
3
11	
Order Normalisation
Generalises Gkotsis et al. (2014)
approach. Features within a thread
are ranked according to their value.
The lowest value is transformed to
the length of the thread while the
highest value is transformed to 1.
Normalisation Methods

Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
Min/Max Normalisation
(Thread-wise min/max)
Normalise features within a thread using
the minimum and maximum value of a
particular feature.
1
Order Normalisation
Generalises Gkotsis et al. (2014)
approach. Features within a thread
are ranked according to their value.
The lowest value is transformed to
the length of the thread while the
highest value is transformed to 1.
2
Normalised Order
Normalisation
Same as Order Normalisation but
the values are normalised between 0
and 1.
3
12	
Adaptive Feature Normalisation
Not all features need to be normalised. For instance, number of question views does not change within a question thread
but does across questions. If for a given feature the variance is zero and remains constant between all the threads, the
feature is not normalised. Otherwise, the feature is normalised with one of the previous functions.

!
Normalisation Method Selection
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
13	
?
 What normalisation method is the most useful ?
Approach
1.  Calculate the average Information Gain (IG) of each feature associated with each
normalisation method.
2.  Select the normalisation method that has the highest average IG across all the datasets.
Normalisation Method Selection
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
14	
?
 What normalisation method is the most useful ?
Approach
1.  Calculate the average Information Gain (IG) of each feature associated with each
normalisation method.
2.  Select the normalisation method that has the highest average IG across all the datasets.
Best Answers Identification Models Comparisons
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
15
Best Answers Identification Models Comparisons
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
16	
General Results
-  Thread normalisation approaches improves best answer identification (p =
2.817e − 05). 
-  On average, an increase in F1 of +5.3% is observed compared to the non
normalised models.
Baseline Models Results
-  In general, thread normalisation approaches improves best answer
identification: On average F1 = 0.718 (+26.8%) .
-  The longuest answers within a thread is more likely to be associated with a
best answers.
Core Features Models Results
-  Unsurprisingly, there is no increase in F1 for thread features but there is a
significant increase for other features sets (+20% F1).
-  Content features are more important within a thread whereas user features
are better for discriminating best answers globally.

Extended Features Models Results
-  When score are present, normalisation does not improve as score ratio
already perform very well.
8
0
1
Best Answers Identification Models Comparisons
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
17	
General Results
-  Thread normalisation approaches improves best answer identification (p =
2.817e − 05). 
-  On average, an increase in F1 of +5.3% is observed compared to the non
normalised models.
Baseline Models Results
-  In general, thread normalisation approaches improves best answer
identification: On average F1 = 0.718 (+26.8%) .
-  The longuest answers within a thread is more likely to be associated with a
best answers.
Core Features Models Results
-  Unsurprisingly, there is no increase in F1 for thread features but there is a
significant increase for other features sets (+20% F1).
-  Content features are more important within a thread whereas user features
are better for discriminating best answers globally.

Extended Features Models Results
-  When score are present, normalisation does not improve as score ratio
already perform very well.
8
0
1
Best Answers Identification Models Comparisons
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
18	
General Results
-  Thread normalisation approaches improves best answer identification (p =
2.817e − 05). 
-  On average, an increase in F1 of +5.3% is observed compared to the non
normalised models.
Baseline Models Results
-  In general, thread normalisation approaches improves best answer
identification: On average F1 = 0.718 (+26.8%) .
-  The longuest answers within a thread is more likely to be associated with a
best answers.
Core Features Models Results
-  Unsurprisingly, there is no increase in F1 for thread features but there is a
significant increase for other features sets (+20% F1).
-  Content features are more important within a thread whereas user features
are better for discriminating best answers globally.

Extended Features Models Results
-  When score are present, normalisation does not improve as score ratio
already perform very well.
8
0
1
Best Answers Identification Models Comparisons
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
19	
General Results
-  Thread normalisation approaches improves best answer identification (p =
2.817e − 05). 
-  On average, an increase in F1 of +5.3% is observed compared to the non
normalised models.
Baseline Models Results
-  In general, thread normalisation approaches improves best answer
identification: On average F1 = 0.718 (+26.8%) .
-  The longuest answers within a thread is more likely to be associated with a
best answers.
Core Features Models Results
-  Unsurprisingly, there is no increase in F1 for thread features but there is a
significant increase for other features sets (+20% F1).
-  Content features are more important within a thread whereas user features
are better for discriminating best answers globally.

Extended Features Models Results
-  When score are present, normalisation does not improve as score ratio
already performs very well.
8
0
1
Features Importance
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
20	
?
Approach
1.  Calculate the Information Gain (IG) of each feature for each dataset with and without
normalisation.
2.  Order the features by IG across all the datasets. The higher the IG the more important the
feature.
What features are the most useful ? How does order normalisation impact feature
importance?
Features Importance- Results
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
21
Features Importance - Results
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
22	
General Results
-  As previously observed, scores based features are the most important best answer
predictors.
-  Similarly, user features are more important than content features when features are not
normalised. When features are normalised the difference diminishes.

Features Comparison
-  Number of comments is associated with best answers when normalised. Therefore, the
relative amount of comments may be a good indicator of best answers. 
-  The relative number of answer for a user is an important indicator for identifying best
answers (i.e. good answers comes from users that post a lot of answers)
-  Answers with relatively original content are more likely to be linked with best answers.
-  Relative topic reputation is not really important meaning that this feature only helps when
distinguishing best answers in a global context.
Conclusions and Future Work
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
Conclusions
-  Compared to the reference models,
structural approaches improve automatic
best answer identification by around +5%
F1.
-  Thread-normalisation change the
importance of features (e.g. content length)
-  Score features are overwhelming
important.
-  Structural methods may be used
successfully for other classification tasks
where analysed communities are highly
structured.

23	
1
Conclusions and Future Work
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
Conclusions
-  Compared to the baseline models,
structural approaches improve automatic
best answer identification by around +5%
F1.
-  Thread-normalisation change the
importance of features (e.g. content length)
-  Score features are overwhelming
important.
-  Structural methods may be used
successfully for other classification tasks
where analysed communities are highly
structured.

24	
1
 2
Future/Current Work
-  Evaluate other types of normalisation (e.g.
signed feature ratios)
-  Consider other structural optimisation
methods (e.g. Learning-to-Rank models)
-  Consider applicability to other type of
communities
Questions and Discussion
		
@
Email: g.burel@open.ac.uk
Twitter: @evhart
Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 
25

Weitere ähnliche Inhalte

Was ist angesagt?

FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSISFEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSISmlaij
 
IRJET- User Behavior Analysis on Social Media Data using Sentiment Analysis o...
IRJET- User Behavior Analysis on Social Media Data using Sentiment Analysis o...IRJET- User Behavior Analysis on Social Media Data using Sentiment Analysis o...
IRJET- User Behavior Analysis on Social Media Data using Sentiment Analysis o...IRJET Journal
 
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...ijiert bestjournal
 
Query aware determinization of uncertain objects
Query aware determinization of uncertain objectsQuery aware determinization of uncertain objects
Query aware determinization of uncertain objectsCloudTechnologies
 
Developing a Secured Recommender System in Social Semantic Network
Developing a Secured Recommender System in Social Semantic NetworkDeveloping a Secured Recommender System in Social Semantic Network
Developing a Secured Recommender System in Social Semantic NetworkTamer Rezk
 
Exploring social influence via posterior effect of word of-mouth
Exploring social influence via posterior effect of word of-mouthExploring social influence via posterior effect of word of-mouth
Exploring social influence via posterior effect of word of-mouthmoresmile
 
DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...
DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...
DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...Università degli Studi di Milano-Bicocca
 
DeepSearch_Project_Report
DeepSearch_Project_ReportDeepSearch_Project_Report
DeepSearch_Project_ReportUrjit Patel
 
IRJET- Classifying Twitter Data in Multiple Classes based on Sentiment Class ...
IRJET- Classifying Twitter Data in Multiple Classes based on Sentiment Class ...IRJET- Classifying Twitter Data in Multiple Classes based on Sentiment Class ...
IRJET- Classifying Twitter Data in Multiple Classes based on Sentiment Class ...IRJET Journal
 
Proceedings Template - WORD
Proceedings Template - WORDProceedings Template - WORD
Proceedings Template - WORDbutest
 
Efficiency Improvement of Neutrality-Enhanced Recommendation
Efficiency Improvement of Neutrality-Enhanced RecommendationEfficiency Improvement of Neutrality-Enhanced Recommendation
Efficiency Improvement of Neutrality-Enhanced RecommendationToshihiro Kamishima
 
Sentiment analysis of tweets using Neural Networks
Sentiment analysis of tweets using Neural NetworksSentiment analysis of tweets using Neural Networks
Sentiment analysis of tweets using Neural NetworksAdrián Palacios Corella
 
IMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATION
IMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATIONIMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATION
IMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATIONadeij1
 
Internship project presentation_final_upload
Internship project presentation_final_uploadInternship project presentation_final_upload
Internship project presentation_final_uploadSuraj Rathore
 
Model-based Approaches for Independence-Enhanced Recommendation
Model-based Approaches for Independence-Enhanced RecommendationModel-based Approaches for Independence-Enhanced Recommendation
Model-based Approaches for Independence-Enhanced RecommendationToshihiro Kamishima
 
Sentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data MiningSentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data MiningIRJET Journal
 
Inferring Kinship Cues from Facial Image Pairs
Inferring Kinship Cues from Facial Image PairsInferring Kinship Cues from Facial Image Pairs
Inferring Kinship Cues from Facial Image Pairssherinmm
 

Was ist angesagt? (19)

FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSISFEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS
 
Temporal based Recommendation System
Temporal based Recommendation SystemTemporal based Recommendation System
Temporal based Recommendation System
 
My experiment
My experimentMy experiment
My experiment
 
IRJET- User Behavior Analysis on Social Media Data using Sentiment Analysis o...
IRJET- User Behavior Analysis on Social Media Data using Sentiment Analysis o...IRJET- User Behavior Analysis on Social Media Data using Sentiment Analysis o...
IRJET- User Behavior Analysis on Social Media Data using Sentiment Analysis o...
 
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
 
Query aware determinization of uncertain objects
Query aware determinization of uncertain objectsQuery aware determinization of uncertain objects
Query aware determinization of uncertain objects
 
Developing a Secured Recommender System in Social Semantic Network
Developing a Secured Recommender System in Social Semantic NetworkDeveloping a Secured Recommender System in Social Semantic Network
Developing a Secured Recommender System in Social Semantic Network
 
Exploring social influence via posterior effect of word of-mouth
Exploring social influence via posterior effect of word of-mouthExploring social influence via posterior effect of word of-mouth
Exploring social influence via posterior effect of word of-mouth
 
DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...
DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...
DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...
 
DeepSearch_Project_Report
DeepSearch_Project_ReportDeepSearch_Project_Report
DeepSearch_Project_Report
 
IRJET- Classifying Twitter Data in Multiple Classes based on Sentiment Class ...
IRJET- Classifying Twitter Data in Multiple Classes based on Sentiment Class ...IRJET- Classifying Twitter Data in Multiple Classes based on Sentiment Class ...
IRJET- Classifying Twitter Data in Multiple Classes based on Sentiment Class ...
 
Proceedings Template - WORD
Proceedings Template - WORDProceedings Template - WORD
Proceedings Template - WORD
 
Efficiency Improvement of Neutrality-Enhanced Recommendation
Efficiency Improvement of Neutrality-Enhanced RecommendationEfficiency Improvement of Neutrality-Enhanced Recommendation
Efficiency Improvement of Neutrality-Enhanced Recommendation
 
Sentiment analysis of tweets using Neural Networks
Sentiment analysis of tweets using Neural NetworksSentiment analysis of tweets using Neural Networks
Sentiment analysis of tweets using Neural Networks
 
IMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATION
IMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATIONIMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATION
IMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATION
 
Internship project presentation_final_upload
Internship project presentation_final_uploadInternship project presentation_final_upload
Internship project presentation_final_upload
 
Model-based Approaches for Independence-Enhanced Recommendation
Model-based Approaches for Independence-Enhanced RecommendationModel-based Approaches for Independence-Enhanced Recommendation
Model-based Approaches for Independence-Enhanced Recommendation
 
Sentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data MiningSentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data Mining
 
Inferring Kinship Cues from Facial Image Pairs
Inferring Kinship Cues from Facial Image PairsInferring Kinship Cues from Facial Image Pairs
Inferring Kinship Cues from Facial Image Pairs
 

Andere mochten auch

Veracity ESWC2010 Poster
Veracity ESWC2010 PosterVeracity ESWC2010 Poster
Veracity ESWC2010 PosterGregoire Burel
 
Attention-Streams Recommendations
Attention-Streams RecommendationsAttention-Streams Recommendations
Attention-Streams RecommendationsGregoire Burel
 
A Question of Complexity - Measuring the Maturity of Online Enquiry Communities
A Question of Complexity - Measuring the Maturity of Online Enquiry CommunitiesA Question of Complexity - Measuring the Maturity of Online Enquiry Communities
A Question of Complexity - Measuring the Maturity of Online Enquiry CommunitiesGregoire Burel
 
Automatic Identification of Best Answers in Online Enquiry Communities
Automatic Identification of Best Answers in Online Enquiry CommunitiesAutomatic Identification of Best Answers in Online Enquiry Communities
Automatic Identification of Best Answers in Online Enquiry CommunitiesGregoire Burel
 
Monitoring, Discussing and Publishing Energy Consumption Data using EnergyUse
Monitoring, Discussing and Publishing Energy Consumption Data using EnergyUseMonitoring, Discussing and Publishing Energy Consumption Data using EnergyUse
Monitoring, Discussing and Publishing Energy Consumption Data using EnergyUseGregoire Burel
 
Quantising Contribution Effort in Online Communities
Quantising Contribution Effort in Online CommunitiesQuantising Contribution Effort in Online Communities
Quantising Contribution Effort in Online CommunitiesGregoire Burel
 
EnergyUse - A Collective Semantic Platform for Monitoring and Discussing Ener...
EnergyUse - A Collective Semantic Platform for Monitoring and Discussing Ener...EnergyUse - A Collective Semantic Platform for Monitoring and Discussing Ener...
EnergyUse - A Collective Semantic Platform for Monitoring and Discussing Ener...Gregoire Burel
 
Asterid: Linked Data Asterisms
Asterid: Linked Data AsterismsAsterid: Linked Data Asterisms
Asterid: Linked Data AsterismsGregoire Burel
 

Andere mochten auch (8)

Veracity ESWC2010 Poster
Veracity ESWC2010 PosterVeracity ESWC2010 Poster
Veracity ESWC2010 Poster
 
Attention-Streams Recommendations
Attention-Streams RecommendationsAttention-Streams Recommendations
Attention-Streams Recommendations
 
A Question of Complexity - Measuring the Maturity of Online Enquiry Communities
A Question of Complexity - Measuring the Maturity of Online Enquiry CommunitiesA Question of Complexity - Measuring the Maturity of Online Enquiry Communities
A Question of Complexity - Measuring the Maturity of Online Enquiry Communities
 
Automatic Identification of Best Answers in Online Enquiry Communities
Automatic Identification of Best Answers in Online Enquiry CommunitiesAutomatic Identification of Best Answers in Online Enquiry Communities
Automatic Identification of Best Answers in Online Enquiry Communities
 
Monitoring, Discussing and Publishing Energy Consumption Data using EnergyUse
Monitoring, Discussing and Publishing Energy Consumption Data using EnergyUseMonitoring, Discussing and Publishing Energy Consumption Data using EnergyUse
Monitoring, Discussing and Publishing Energy Consumption Data using EnergyUse
 
Quantising Contribution Effort in Online Communities
Quantising Contribution Effort in Online CommunitiesQuantising Contribution Effort in Online Communities
Quantising Contribution Effort in Online Communities
 
EnergyUse - A Collective Semantic Platform for Monitoring and Discussing Ener...
EnergyUse - A Collective Semantic Platform for Monitoring and Discussing Ener...EnergyUse - A Collective Semantic Platform for Monitoring and Discussing Ener...
EnergyUse - A Collective Semantic Platform for Monitoring and Discussing Ener...
 
Asterid: Linked Data Asterisms
Asterid: Linked Data AsterismsAsterid: Linked Data Asterisms
Asterid: Linked Data Asterisms
 

Ähnlich wie Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities

Predicting Answering Behaviour in Online Question Answering Communities
Predicting Answering Behaviour in Online Question Answering CommunitiesPredicting Answering Behaviour in Online Question Answering Communities
Predicting Answering Behaviour in Online Question Answering CommunitiesGregoire Burel
 
IRJET- Analysis of Question and Answering Recommendation System
IRJET-  	  Analysis of Question and Answering Recommendation SystemIRJET-  	  Analysis of Question and Answering Recommendation System
IRJET- Analysis of Question and Answering Recommendation SystemIRJET Journal
 
Paper id 28201441
Paper id 28201441Paper id 28201441
Paper id 28201441IJRAT
 
The Role of Families and the Community Proposal Template (N.docx
The Role of Families and the Community Proposal Template  (N.docxThe Role of Families and the Community Proposal Template  (N.docx
The Role of Families and the Community Proposal Template (N.docxssusera34210
 
Reputation Model Based on Rating Data and Application in Recommender Systems
Reputation Model Based on Rating Data and Application in Recommender SystemsReputation Model Based on Rating Data and Application in Recommender Systems
Reputation Model Based on Rating Data and Application in Recommender SystemsAhmad Jawdat
 
Lecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionLecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionPerumalPitchandi
 
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...IRJET Journal
 
Towards Automatic Analysis of Online Discussions among Hong Kong Students
Towards Automatic Analysis of Online Discussions among Hong Kong StudentsTowards Automatic Analysis of Online Discussions among Hong Kong Students
Towards Automatic Analysis of Online Discussions among Hong Kong StudentsCITE
 
Tutorial on Coreference Resolution
Tutorial on Coreference Resolution Tutorial on Coreference Resolution
Tutorial on Coreference Resolution Anirudh Jayakumar
 
Efficient instant fuzzy search with proximity ranking
Efficient instant fuzzy search with proximity rankingEfficient instant fuzzy search with proximity ranking
Efficient instant fuzzy search with proximity rankingShakas Technologies
 
Query recommendation papers
Query recommendation papersQuery recommendation papers
Query recommendation papersAshish Kulkarni
 
Online Learning to Rank
Online Learning to RankOnline Learning to Rank
Online Learning to Rankewhuang3
 
How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...Jeromy Anglim
 
SUM sem 3 Mb0050 – research methodology
SUM sem 3  Mb0050 – research methodologySUM sem 3  Mb0050 – research methodology
SUM sem 3 Mb0050 – research methodologyak007420
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISrathnaarul
 

Ähnlich wie Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities (20)

Predicting Answering Behaviour in Online Question Answering Communities
Predicting Answering Behaviour in Online Question Answering CommunitiesPredicting Answering Behaviour in Online Question Answering Communities
Predicting Answering Behaviour in Online Question Answering Communities
 
IRJET- Analysis of Question and Answering Recommendation System
IRJET-  	  Analysis of Question and Answering Recommendation SystemIRJET-  	  Analysis of Question and Answering Recommendation System
IRJET- Analysis of Question and Answering Recommendation System
 
Paper id 28201441
Paper id 28201441Paper id 28201441
Paper id 28201441
 
A Review on Novel Scoring System for Identify Accurate Answers for Factoid Qu...
A Review on Novel Scoring System for Identify Accurate Answers for Factoid Qu...A Review on Novel Scoring System for Identify Accurate Answers for Factoid Qu...
A Review on Novel Scoring System for Identify Accurate Answers for Factoid Qu...
 
The Role of Families and the Community Proposal Template (N.docx
The Role of Families and the Community Proposal Template  (N.docxThe Role of Families and the Community Proposal Template  (N.docx
The Role of Families and the Community Proposal Template (N.docx
 
Reputation Model Based on Rating Data and Application in Recommender Systems
Reputation Model Based on Rating Data and Application in Recommender SystemsReputation Model Based on Rating Data and Application in Recommender Systems
Reputation Model Based on Rating Data and Application in Recommender Systems
 
Lecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionLecture Notes on Recommender System Introduction
Lecture Notes on Recommender System Introduction
 
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
 
Towards Automatic Analysis of Online Discussions among Hong Kong Students
Towards Automatic Analysis of Online Discussions among Hong Kong StudentsTowards Automatic Analysis of Online Discussions among Hong Kong Students
Towards Automatic Analysis of Online Discussions among Hong Kong Students
 
Tutorial on Coreference Resolution
Tutorial on Coreference Resolution Tutorial on Coreference Resolution
Tutorial on Coreference Resolution
 
Efficient instant fuzzy search with proximity ranking
Efficient instant fuzzy search with proximity rankingEfficient instant fuzzy search with proximity ranking
Efficient instant fuzzy search with proximity ranking
 
Query recommendation papers
Query recommendation papersQuery recommendation papers
Query recommendation papers
 
4 de47584
4 de475844 de47584
4 de47584
 
Online Learning to Rank
Online Learning to RankOnline Learning to Rank
Online Learning to Rank
 
How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...
 
Query expansion
Query expansionQuery expansion
Query expansion
 
Report
ReportReport
Report
 
SUM sem 3 Mb0050 – research methodology
SUM sem 3  Mb0050 – research methodologySUM sem 3  Mb0050 – research methodology
SUM sem 3 Mb0050 – research methodology
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSIS
 
Mb0050 research methodology
Mb0050 research methodologyMb0050 research methodology
Mb0050 research methodology
 

Mehr von Gregoire Burel

Monitoring, Understanding, and Influencing the Co-Spread of COVID-19 Misinfor...
Monitoring, Understanding, and Influencing the Co-Spread of COVID-19 Misinfor...Monitoring, Understanding, and Influencing the Co-Spread of COVID-19 Misinfor...
Monitoring, Understanding, and Influencing the Co-Spread of COVID-19 Misinfor...Gregoire Burel
 
Monitoring, Understanding and Influencing the Co-Spread of COVID-19 Misinform...
Monitoring, Understanding and Influencing the Co-Spread of COVID-19 Misinform...Monitoring, Understanding and Influencing the Co-Spread of COVID-19 Misinform...
Monitoring, Understanding and Influencing the Co-Spread of COVID-19 Misinform...Gregoire Burel
 
Monitoring and Understanding the Co-Spread of COVID-19 Misinformation and Fac...
Monitoring and Understanding the Co-Spread of COVID-19 Misinformation and Fac...Monitoring and Understanding the Co-Spread of COVID-19 Misinformation and Fac...
Monitoring and Understanding the Co-Spread of COVID-19 Misinformation and Fac...Gregoire Burel
 
Co-Spread of Misinformation and Fact-Checking Content during the Covid-19 Pan...
Co-Spread of Misinformation and Fact-Checking Content during the Covid-19 Pan...Co-Spread of Misinformation and Fact-Checking Content during the Covid-19 Pan...
Co-Spread of Misinformation and Fact-Checking Content during the Covid-19 Pan...Gregoire Burel
 
Crisis Event Extraction Service (CREES) – Automatic Detection and Classificat...
Crisis Event Extraction Service (CREES) – Automatic Detection and Classificat...Crisis Event Extraction Service (CREES) – Automatic Detection and Classificat...
Crisis Event Extraction Service (CREES) – Automatic Detection and Classificat...Gregoire Burel
 
Semantic Wide and Deep Learning for Detecting Crisis-Information Categories o...
Semantic Wide and Deep Learning for Detecting Crisis-Information Categories o...Semantic Wide and Deep Learning for Detecting Crisis-Information Categories o...
Semantic Wide and Deep Learning for Detecting Crisis-Information Categories o...Gregoire Burel
 
DoRES — A Three-tier Ontology for Modelling Crises in the Digital Age
DoRES — A Three-tier Ontology for Modelling Crises in the Digital AgeDoRES — A Three-tier Ontology for Modelling Crises in the Digital Age
DoRES — A Three-tier Ontology for Modelling Crises in the Digital AgeGregoire Burel
 
Sparks O3 Browser: Augmenting the Web with Semantic Overlays
Sparks O3 Browser: Augmenting the Web with Semantic OverlaysSparks O3 Browser: Augmenting the Web with Semantic Overlays
Sparks O3 Browser: Augmenting the Web with Semantic OverlaysGregoire Burel
 

Mehr von Gregoire Burel (8)

Monitoring, Understanding, and Influencing the Co-Spread of COVID-19 Misinfor...
Monitoring, Understanding, and Influencing the Co-Spread of COVID-19 Misinfor...Monitoring, Understanding, and Influencing the Co-Spread of COVID-19 Misinfor...
Monitoring, Understanding, and Influencing the Co-Spread of COVID-19 Misinfor...
 
Monitoring, Understanding and Influencing the Co-Spread of COVID-19 Misinform...
Monitoring, Understanding and Influencing the Co-Spread of COVID-19 Misinform...Monitoring, Understanding and Influencing the Co-Spread of COVID-19 Misinform...
Monitoring, Understanding and Influencing the Co-Spread of COVID-19 Misinform...
 
Monitoring and Understanding the Co-Spread of COVID-19 Misinformation and Fac...
Monitoring and Understanding the Co-Spread of COVID-19 Misinformation and Fac...Monitoring and Understanding the Co-Spread of COVID-19 Misinformation and Fac...
Monitoring and Understanding the Co-Spread of COVID-19 Misinformation and Fac...
 
Co-Spread of Misinformation and Fact-Checking Content during the Covid-19 Pan...
Co-Spread of Misinformation and Fact-Checking Content during the Covid-19 Pan...Co-Spread of Misinformation and Fact-Checking Content during the Covid-19 Pan...
Co-Spread of Misinformation and Fact-Checking Content during the Covid-19 Pan...
 
Crisis Event Extraction Service (CREES) – Automatic Detection and Classificat...
Crisis Event Extraction Service (CREES) – Automatic Detection and Classificat...Crisis Event Extraction Service (CREES) – Automatic Detection and Classificat...
Crisis Event Extraction Service (CREES) – Automatic Detection and Classificat...
 
Semantic Wide and Deep Learning for Detecting Crisis-Information Categories o...
Semantic Wide and Deep Learning for Detecting Crisis-Information Categories o...Semantic Wide and Deep Learning for Detecting Crisis-Information Categories o...
Semantic Wide and Deep Learning for Detecting Crisis-Information Categories o...
 
DoRES — A Three-tier Ontology for Modelling Crises in the Digital Age
DoRES — A Three-tier Ontology for Modelling Crises in the Digital AgeDoRES — A Three-tier Ontology for Modelling Crises in the Digital Age
DoRES — A Three-tier Ontology for Modelling Crises in the Digital Age
 
Sparks O3 Browser: Augmenting the Web with Semantic Overlays
Sparks O3 Browser: Augmenting the Web with Semantic OverlaysSparks O3 Browser: Augmenting the Web with Semantic Overlays
Sparks O3 Browser: Augmenting the Web with Semantic Overlays
 

Kürzlich hochgeladen

ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGILLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGIThomas Poetter
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxellehsormae
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 

Kürzlich hochgeladen (20)

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGILLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptx
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 

Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities

  • 1. GRÉGOIRE BUREL, PAUL MULLHOLAND, HARITH ALANI Knowledge Media Institute, The Open University, Milton Keynes, UK. WWW’16, Montréal, Canada. 11th April 2016 Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities
  • 2. Outline Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities Publications 2 Identifying Best Answers in Q&A Communities -  Question Answering Communities -  Automatic Best Answer Identification -  Reference Model Structural Normalisation -  No Normalisation Vs. Structural Normalisation -  Normalisation Methods -  Normalisation Method Selection Best Answer Identification with Structural Normalisation -  Models Comparison -  Features Comparison Conclusions and Future Work
  • 3. Q&A Communities Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities Publications Question Answer #1 Answer #2 ... Answer #n QuestionThread 3 Q&A communities are communities composed of askers and answerers looking for solutions to particular issues. When looking for answers, users need to identify if similar questions have already been answered correctly and see if a best answer exists. Unfortunately not all questions have labelled best answers (43.2% of questions do not have labelled best answers). Existing works that have focused on best answer identification have also generally ignored the structure of Q&A websites while designing feature normalisation methods. ? ! -
  • 4. Q&A Communities Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities Publications Question Answer #1 Answer #2 ... Answer #n QuestionThread 4 Q&A communities are communities composed of askers and answerers looking for solutions to particular issues. When looking for answers, users need to identify if similar questions have already been answered correctly and see if a best answer exists. Unfortunately not all questions have labelled best answers (43.2% of questions do not have labelled best answers). Existing works that have focused on best answer identification have also generally ignored the structure of Q&A websites while designing feature normalisation methods. ? ! -
  • 5. Q&A communities are communities composed of askers and answerers looking for solutions to particular issues. When looking for answers, users need to identify if similar questions have already been answered correctly and see if a best answer exists. Unfortunately not all questions have labelled best answers (43.2% of questions do not have labelled best answers). Existing works that have focused on best answer identification have also generally ignored the structure of Q&A websites for designing feature normalisation methods. Q&A Communities Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities Publications Question Answer #1 Answer #2 ... Answer #n QuestionThread ? ! - 5
  • 6. Feature-based Best Answers Identification and Reference Model Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 6 In order to identify best answers features are extracted and associated with each answers and a binary classifier is trained (Burel et al. 2012 , 2013, 2014): -  Such features are divided into User, Content and Thread features. -  30 features are used for the reference model. -  The Alternating Decision Tree algorithm is used. Reference Model Results: -  The reference model achieve an F1 of 0.817 on average. -  Thread features in particular score ratios are highly related to best answers. What methods could be used for improving such model??
  • 7. Feature-based Best Answers Identification and Reference Model Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 7 In order to identify best answers features are extracted and associated with each answers and a binary classifier is trained (Burel et al. 2012 , 2013, 2014): -  Such features are divided into User, Content and Thread features. -  30 features are used for the reference model. -  The Alternating Decision Tree algorithm is used. Reference Model Results: -  The reference model achieve an F1 of 0.817 on average. -  Thread features in particular score ratios are highly related to best answers. What methods could be used for improving such model??
  • 8. Contributions Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 8 Does the thread-like structure of Q&A communities can help the automatic identification of best answers ? -  Introduce the concept of structural normalisation, a method for normalising features based on the structure of Q&A communities. -  Present 3 different features normalisation methods: 1) Min/Max normalisation; 2) Order Normalisation, and; 3) Normalised Order Normalisation. -  Identify the best normalisation method for automatically identifying best answers. -  Compare different models and features sets and show that structural normalisation improves best answer identification. ?
  • 9. Structural Normalisation / No Normalisation Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 9 Question Answer #1 Answer #2 ... Answer #n QuestionThread Question Answer #1 Answer #2 ... Answer #nQuestionThread Question Answer #1 Answer #2 ... Answer #nQuestionThread Question Answer #1 Answer #2 ... Answer #nQuestionThread Question Answer #1 Answer #2 ... Answer #nQuestionThread Standard classifiers such as the reference model compare all the answers even if they are not part of the same thread. Thread-wise normalisation normalise features based on their relation within a thread. !
  • 10. Structural Normalisation / No Normalisation Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 10 Question Answer #1 Answer #2 ... Answer #n QuestionThread Question Answer #1 Answer #2 ... Answer #nQuestionThread Question Answer #1 Answer #2 ... Answer #nQuestionThread Question Answer #1 Answer #2 ... Answer #nQuestionThread Question Answer #1 Answer #2 ... Answer #nQuestionThread Standard classifiers such as the reference model compare all the answers even if they are not part of the same thread. Thread-wise normalisation normalise features based on their relation within a thread. !
  • 11. Normalisation Methods Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities Min/Max Normalisation (Thread-wise min/max) Normalise features within a thread using the minimum and maximum value of a particular feature. 1 2 Normalised Order Normalisation Same as Order Normalisation but the values are normalised between 0 and 1. 3 11 Order Normalisation Generalises Gkotsis et al. (2014) approach. Features within a thread are ranked according to their value. The lowest value is transformed to the length of the thread while the highest value is transformed to 1.
  • 12. Normalisation Methods Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities Min/Max Normalisation (Thread-wise min/max) Normalise features within a thread using the minimum and maximum value of a particular feature. 1 Order Normalisation Generalises Gkotsis et al. (2014) approach. Features within a thread are ranked according to their value. The lowest value is transformed to the length of the thread while the highest value is transformed to 1. 2 Normalised Order Normalisation Same as Order Normalisation but the values are normalised between 0 and 1. 3 12 Adaptive Feature Normalisation Not all features need to be normalised. For instance, number of question views does not change within a question thread but does across questions. If for a given feature the variance is zero and remains constant between all the threads, the feature is not normalised. Otherwise, the feature is normalised with one of the previous functions. !
  • 13. Normalisation Method Selection Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 13 ? What normalisation method is the most useful ? Approach 1.  Calculate the average Information Gain (IG) of each feature associated with each normalisation method. 2.  Select the normalisation method that has the highest average IG across all the datasets.
  • 14. Normalisation Method Selection Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 14 ? What normalisation method is the most useful ? Approach 1.  Calculate the average Information Gain (IG) of each feature associated with each normalisation method. 2.  Select the normalisation method that has the highest average IG across all the datasets.
  • 15. Best Answers Identification Models Comparisons Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 15
  • 16. Best Answers Identification Models Comparisons Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 16 General Results -  Thread normalisation approaches improves best answer identification (p = 2.817e − 05). -  On average, an increase in F1 of +5.3% is observed compared to the non normalised models. Baseline Models Results -  In general, thread normalisation approaches improves best answer identification: On average F1 = 0.718 (+26.8%) . -  The longuest answers within a thread is more likely to be associated with a best answers. Core Features Models Results -  Unsurprisingly, there is no increase in F1 for thread features but there is a significant increase for other features sets (+20% F1). -  Content features are more important within a thread whereas user features are better for discriminating best answers globally. Extended Features Models Results -  When score are present, normalisation does not improve as score ratio already perform very well. 8 0 1
  • 17. Best Answers Identification Models Comparisons Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 17 General Results -  Thread normalisation approaches improves best answer identification (p = 2.817e − 05). -  On average, an increase in F1 of +5.3% is observed compared to the non normalised models. Baseline Models Results -  In general, thread normalisation approaches improves best answer identification: On average F1 = 0.718 (+26.8%) . -  The longuest answers within a thread is more likely to be associated with a best answers. Core Features Models Results -  Unsurprisingly, there is no increase in F1 for thread features but there is a significant increase for other features sets (+20% F1). -  Content features are more important within a thread whereas user features are better for discriminating best answers globally. Extended Features Models Results -  When score are present, normalisation does not improve as score ratio already perform very well. 8 0 1
  • 18. Best Answers Identification Models Comparisons Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 18 General Results -  Thread normalisation approaches improves best answer identification (p = 2.817e − 05). -  On average, an increase in F1 of +5.3% is observed compared to the non normalised models. Baseline Models Results -  In general, thread normalisation approaches improves best answer identification: On average F1 = 0.718 (+26.8%) . -  The longuest answers within a thread is more likely to be associated with a best answers. Core Features Models Results -  Unsurprisingly, there is no increase in F1 for thread features but there is a significant increase for other features sets (+20% F1). -  Content features are more important within a thread whereas user features are better for discriminating best answers globally. Extended Features Models Results -  When score are present, normalisation does not improve as score ratio already perform very well. 8 0 1
  • 19. Best Answers Identification Models Comparisons Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 19 General Results -  Thread normalisation approaches improves best answer identification (p = 2.817e − 05). -  On average, an increase in F1 of +5.3% is observed compared to the non normalised models. Baseline Models Results -  In general, thread normalisation approaches improves best answer identification: On average F1 = 0.718 (+26.8%) . -  The longuest answers within a thread is more likely to be associated with a best answers. Core Features Models Results -  Unsurprisingly, there is no increase in F1 for thread features but there is a significant increase for other features sets (+20% F1). -  Content features are more important within a thread whereas user features are better for discriminating best answers globally. Extended Features Models Results -  When score are present, normalisation does not improve as score ratio already performs very well. 8 0 1
  • 20. Features Importance Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 20 ? Approach 1.  Calculate the Information Gain (IG) of each feature for each dataset with and without normalisation. 2.  Order the features by IG across all the datasets. The higher the IG the more important the feature. What features are the most useful ? How does order normalisation impact feature importance?
  • 21. Features Importance- Results Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 21
  • 22. Features Importance - Results Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 22 General Results -  As previously observed, scores based features are the most important best answer predictors. -  Similarly, user features are more important than content features when features are not normalised. When features are normalised the difference diminishes. Features Comparison -  Number of comments is associated with best answers when normalised. Therefore, the relative amount of comments may be a good indicator of best answers. -  The relative number of answer for a user is an important indicator for identifying best answers (i.e. good answers comes from users that post a lot of answers) -  Answers with relatively original content are more likely to be linked with best answers. -  Relative topic reputation is not really important meaning that this feature only helps when distinguishing best answers in a global context.
  • 23. Conclusions and Future Work Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities Conclusions -  Compared to the reference models, structural approaches improve automatic best answer identification by around +5% F1. -  Thread-normalisation change the importance of features (e.g. content length) -  Score features are overwhelming important. -  Structural methods may be used successfully for other classification tasks where analysed communities are highly structured. 23 1
  • 24. Conclusions and Future Work Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities Conclusions -  Compared to the baseline models, structural approaches improve automatic best answer identification by around +5% F1. -  Thread-normalisation change the importance of features (e.g. content length) -  Score features are overwhelming important. -  Structural methods may be used successfully for other classification tasks where analysed communities are highly structured. 24 1 2 Future/Current Work -  Evaluate other types of normalisation (e.g. signed feature ratios) -  Consider other structural optimisation methods (e.g. Learning-to-Rank models) -  Consider applicability to other type of communities
  • 25. Questions and Discussion @ Email: g.burel@open.ac.uk Twitter: @evhart Structural Normalisation Methods for Improving Best Answer Identification in Question Answering Communities 25