SlideShare ist ein Scribd-Unternehmen logo
1 von 41
Downloaden Sie, um offline zu lesen
ECIR 2016, PADUA, ITALY
EFFICIENT PSEUDO-RELEVANCE FEEDBACK
METHODS FOR COLLABORATIVE FILTERING
RECOMMENDATION
Daniel Valcarce, Javier Parapar, Álvaro Barreiro
@dvalcarce @jparapar @AlvaroBarreiroG
Information Retrieval Lab
@IRLab_UDC
University of A Coruña
Spain
Outline
1. Pseudo-Relevance Feedback (PRF)
2. Collaborative Filtering (CF)
3. PRF Methods for CF
4. Experiments
5. Conclusions and Future Work
1/28
PSEUDO-RELEVANCE FEEDBACK (PRF)
Pseudo-Relevance Feedback (I)
Pseudo-Relevance Feedback provides an automatic method for
query expansion:
Assumes that the top retrieved documents with the
original query are relevant (pseudo-relevant set).
The query is expanded with the most representative terms
from this set.
The expanded query is expected to yield better results than
the original one.
3/28
Pseudo-Relevance Feedback (II)
Information need
4/28
Pseudo-Relevance Feedback (II)
Information need
query
4/28
Pseudo-Relevance Feedback (II)
Information need
query Retrieval
System
4/28
Pseudo-Relevance Feedback (II)
Information need
query Retrieval
System
4/28
Pseudo-Relevance Feedback (II)
Information need
query Retrieval
System
4/28
Pseudo-Relevance Feedback (II)
Information need
query Retrieval
System
4/28
Pseudo-Relevance Feedback (II)
Information need
query Retrieval
System
Query
Expansion
expanded
query
4/28
Pseudo-Relevance Feedback (II)
Information need
query Retrieval
System
Query
Expansion
expanded
query
4/28
Pseudo-Relevance Feedback (III)
Some popular PRF approaches:
Based on Rocchio’s model
(Rocchio, 1971 & Carpineto et al., ACM TOIS 2001)
Relevance-Based Language Models
(Lavrenko & Croft, SIGIR 2001)
Divergence Minimization Model
(Zhai & LaïŹ€erty, SIGIR 2006)
Mixture Models
(Tao & Zhai, SIGIR 2006)
5/28
COLLABORATIVE FILTERING (CF)
Recommender Systems
Notation:
The set of users U
The set of items I
The rating that the user u gave to the item i is ru,i
The set of items rated by user u is denoted by Iu
The set of users that rated item i is denoted by Ui
The neighbourhood of user u is denoted by Vu
Top-N recommendation: create a ranked list containing
relevant and unknown items for each user u ∈ U.
7/28
Collaborative Filtering (I)
Collaborative Filtering (CF) employs the past interaction
between users and items to generate recommendations.
Idea: If this user who is similar to you likes this item, maybe you will
also like it.
DiïŹ€erent input data:
Explicit feedback: ratings, reviews...
Implicit feedback: clicks, purchases...
Perhaps the most popular approach to recommendation given
the increasing amount of information about users.
8/28
Collaborative Filtering (II)
Collaborative Filtering (CF) techniques can be classiïŹed in:
Model-based methods: learn a predictive model from the
user-item ratings.
◩ Matrix factorisation (e.g., SVD)
Neighbourhood-based (or memory-based) methods:
compute recommendations using directly part of the
ratings.
◩ k-NN approaches
9/28
PRF METHODS FOR CF
PRF for CF
PRF CF
User’s query User’s proïŹle
mostˆ1,populatedˆ2,stateˆ2 Titanicˆ2,Avatarˆ3,Matrixˆ5
Documents
Neighbours
Terms
Items
11/28
Previous Work on Adapting PRF Methods to CF
Relevance-Based Language Models
Originally devised for PRF (Lavrenko & Croft, SIGIR 2001).
Adapted to CF (Parapar et al., Inf. Process. Manage. 2013).
Two models: RM1 and RM2.
High precision ïŹgures in recommendation.
12/28
Previous Work on Adapting PRF Methods to CF
Relevance-Based Language Models
Originally devised for PRF (Lavrenko & Croft, SIGIR 2001).
Adapted to CF (Parapar et al., Inf. Process. Manage. 2013).
Two models: RM1 and RM2.
High precision ïŹgures in recommendation.
... but high computational cost!
RM1 : p(i|Ru) ∝
v∈Vu
p(v) p(i|v)
j∈Iu
p(j|v)
RM2 : p(i|Ru) ∝ p(i)
j∈Iu v∈Vu
p(i|v) p(v)
p(i)
p(j|v)
12/28
Our Proposals based on Rocchio’s Framework
Rocchio’s Weights
pRocchio(i|u)
v∈Vu
rv,i
|Vu|
Robertson Selection Value g
pRSV(i|u)
v∈Vu
rv,i
|Vu|
p(i|Vu)
CHI-2 g
pCHI−2(i|u)
p(i|Vu) − p(i|C)
2
p(i|C)
Kullback–Leibler Divergence
pKLD(i|u) p(i|Vu) log
p(i|Vu)
p(i|C)
13/28
Our Proposals based on Rocchio’s Framework
Rocchio’s Weights
pRocchio(i|u)
v∈Vu
rv,i
|Vu|
Robertson Selection Value g
pRSV(i|u)
v∈Vu
rv,i
|Vu|
p(i|Vu)
CHI-2 g
pCHI−2(i|u)
p(i|Vu) − p(i|C)
2
p(i|C)
Kullback–Leibler Divergence
pKLD(i|u) p(i|Vu) log
p(i|Vu)
p(i|C)
13/28
Our Proposals based on Rocchio’s Framework
Rocchio’s Weights
pRocchio(i|u)
v∈Vu
rv,i
|Vu|
Robertson Selection Value g
pRSV(i|u)
v∈Vu
rv,i
|Vu|
p(i|Vu)
CHI-2 g
pCHI−2(i|u)
p(i|Vu) − p(i|C)
2
p(i|C)
Kullback–Leibler Divergence
pKLD(i|u) p(i|Vu) log
p(i|Vu)
p(i|C)
13/28
Probability Estimation
Maximum Likelihood Estimate under a Multinomial
Distribution over the ratings:
pmle(i|Vu)
v∈Vu
rv,i
v∈Vu , j∈I rv,j
pmle(i|C)
u∈U ru,i
u∈U, j∈I ru,j
14/28
Neighbourhood Length Normalisation (I)
Neighbourhoods are computed using clustering algorithms:
Hard clustering: every user is in only one cluster. Clusters
may have diïŹ€erent sizes. Example: k-means.
Soft clustering: each user has its own neighbours. When
we set k to a high value, we may ïŹnd diïŹ€erent amounts of
neighbours. Example: k-NN.
15/28
Neighbourhood Length Normalisation (I)
Neighbourhoods are computed using clustering algorithms:
Hard clustering: every user is in only one cluster. Clusters
may have diïŹ€erent sizes. Example: k-means.
Soft clustering: each user has its own neighbours. When
we set k to a high value, we may ïŹnd diïŹ€erent amounts of
neighbours. Example: k-NN.
Idea: consider the variability of the neighbourhood lengths:
Big neighbourhoods is equivalent to a query with a lot of
results: the collection model is closed to the target user.
Small neighbourhoods implies that neighbours are highly
speciïŹc: the collection is very diïŹ€erent from the target user.
15/28
Neighbourhood Length Normalisation (II)
We bias the MLE to perform neighbourhood length
normalisation:
pnmle(i|Vu)
rank 1
|Vu|
v∈Vu
rv,i
v∈Vu , j∈I rv,j
pnmle(i|C)
rank 1
|U|
u∈U ru,i
u∈U, j∈I ru,j
16/28
EXPERIMENTS
Experimental settings
Baselines:
UB: traditional user-based neighbourhood approach.
SVD: matrix factorisation.
UIR-Item: probabilistic approach.
RM1 and RM2: Relevance-Based Language Models.
Our algorithms:
Rocchio’s Weights (RW)
Robertson Selection Value (RSV)
CHI-2
Kullback-Leibler Divergence (KLD)
18/28
Efficiency
0.01
0.1
1
10
ML 100k ML 1M ML 10M
recommendationtimeperuser(s)
dataset
UIR
RM1
RM2
SVD++
RSV
UB
RW
CHI-2
KLD
19/28
Accuracy (nDCG@10)
Algorithm ML 100k ML 1M R3-Yahoo! LibraryThing
UB 0.0468 0.0313 0.0108 0.0055b
SVD 0.0936a 0.0608a 0.0101 0.0015
UIR-Item 0.2188ab 0.1795abd 0.0174abd 0.0673abd
RM1 0.2473abc 0.1402ab 0.0146ab 0.0444ab
RM2 0.3323abcd 0.1992abd 0.0207abcd 0.0957abcd
Rocchio’s Weights 0.2604abcd 0.1557abd 0.0194abcd 0.0892abcd
RSV 0.2604abcd 0.1557abd 0.0194abcd 0.0892abcd
KLD
MLE 0.2693abcd 0.1264ab 0.0197abcd 0.1576abcde
NMLE 0.3120abcd 0.1546ab 0.0201abcd 0.1101abcde
CHI-2
MLE 0.0777a 0.0709ab 0.0149ab 0.0939abcd
NMLE 0.3220abcd 0.1419ab 0.0204abcd 0.1459abcde
Table: Values of nDCG@10. Pink = best algorithm. Blue = not
signiïŹcantly diïŹ€erent to the best (Wilcoxon two-sided p < 0.01). 20/28
Diversity (Gini@10)
Algorithm ML 100k ML 1M R3-Yahoo! LibraryThing
UIR-Item 0.0124 0.0050 0.0137 0.0005
RM2 0.0256 0.0069 0.0207 0.0019
CHI-2 NMLE 0.0450 0.0106 0.0506 0.0539
Table: Values of the complement of Gini index at 10. Pink = best
algorithm.
21/28
Novelty (MSI@10)
Algorithm ML 100k ML 1M R3-Yahoo! LibraryThing
UIR-Item 5.2337e 8.3713e 3.7186e 17.1229e
RM2 6.8273c 8.9481c 4.9618c 19.27343c
CHI-2 NMLE 8.1711ec 10.0043ec 7.5555ec 8.8563
Table: Values of Mean Self-Information at 10. Pink = best algorithm.
22/28
Trade-off Accuracy-Diversity
0.06
0.07
0.08
0.09
0.10
0.11
0.12
0.13
200 300 400 500 600 700 800 900
G–(Gini,nDCG)
k
RM2
CHI-2 NMLE
Figure: G-measure of nDCG@10 and Gini@10 on MovieLens 100k
varying the number of neighbours k using Pearson’s correlation
similarity.
23/28
Trade-off Accuracy-Novelty
0.9
1.0
1.1
1.2
1.3
1.4
1.5
1.6
1.7
1.8
1.9
2.0
200 300 400 500 600 700 800 900
G–(MSI,nDCG)
k
RM2
CHI-2 NMLE
Figure: G-measure of nDCG@10 and MSI@10 on MovieLens 100k
varying the number of neighbours k using Pearson’s correlation
similarity.
24/28
CONCLUSIONS AND FUTURE WORK
Conclusions
We proposed to use fast PRF methods (Rocchio’s Weigths, RSV,
KLD and CHI-2):
They are orders of magnitude faster than the Relevance
Models (up to 200x).
They generate quite accurate recommendations.
Good novelty and diversity ïŹgures with a better trade-oïŹ€
than RM2.
They lack of parameters (only clustering parameters).
26/28
Future Work
Other approaches for computing neighbourhoods:
Posterior Probability Clustering (a non-negative matrix
factorisation).
Normalised Cut (spectral clustering).
27/28
Future Work
Other approaches for computing neighbourhoods:
Posterior Probability Clustering (a non-negative matrix
factorisation).
Normalised Cut (spectral clustering).
Explore other PRF methods:
Divergence Minimization Models.
Mixture Models.
27/28
THANK YOU!
@DVALCARCE
http://www.dc.fi.udc.es/~dvalcarce

Weitere Àhnliche Inhalte

Was ist angesagt?

PPT
PPTPPT
PPTbutest
 
Ju3517011704
Ju3517011704Ju3517011704
Ju3517011704IJERA Editor
 
Traffic Classification using a Statistical Approach
Traffic Classification using a Statistical ApproachTraffic Classification using a Statistical Approach
Traffic Classification using a Statistical ApproachDenis Zuev
 
The effect of gamma value on support vector machine performance with differen...
The effect of gamma value on support vector machine performance with differen...The effect of gamma value on support vector machine performance with differen...
The effect of gamma value on support vector machine performance with differen...IJECEIAES
 
Collaborative filtering20081111
Collaborative filtering20081111Collaborative filtering20081111
Collaborative filtering20081111greatwnag
 
IEEE 2015 Java Projects
IEEE 2015 Java ProjectsIEEE 2015 Java Projects
IEEE 2015 Java ProjectsVijay Karan
 
Biclustering using Parallel Fuzzy Approach for Analysis of Microarray Gene Ex...
Biclustering using Parallel Fuzzy Approach for Analysis of Microarray Gene Ex...Biclustering using Parallel Fuzzy Approach for Analysis of Microarray Gene Ex...
Biclustering using Parallel Fuzzy Approach for Analysis of Microarray Gene Ex...CSCJournals
 
FYP presentation
FYP presentationFYP presentation
FYP presentationphilipyeung4
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
IEEE 2015 Java Projects
IEEE 2015 Java ProjectsIEEE 2015 Java Projects
IEEE 2015 Java ProjectsVijay Karan
 
An Examination of the Bloom Filter and its Application in Preventing Weak Pas...
An Examination of the Bloom Filter and its Application in Preventing Weak Pas...An Examination of the Bloom Filter and its Application in Preventing Weak Pas...
An Examination of the Bloom Filter and its Application in Preventing Weak Pas...Editor IJCATR
 
Packet Loss and Overlay Size Aware Broadcast in the Kademlia P2P System
Packet Loss and Overlay Size Aware Broadcast in the Kademlia P2P SystemPacket Loss and Overlay Size Aware Broadcast in the Kademlia P2P System
Packet Loss and Overlay Size Aware Broadcast in the Kademlia P2P SystemIDES Editor
 
Maximizing Efficiency Of multiple–Path Source Routing in Presence of Jammer
Maximizing Efficiency Of multiple–Path Source Routing in Presence of JammerMaximizing Efficiency Of multiple–Path Source Routing in Presence of Jammer
Maximizing Efficiency Of multiple–Path Source Routing in Presence of JammerIOSR Journals
 
Performance Analysis of Group-Blind Multiuser Detectors for Synchronous CDMA
Performance Analysis of Group-Blind Multiuser Detectors for Synchronous CDMAPerformance Analysis of Group-Blind Multiuser Detectors for Synchronous CDMA
Performance Analysis of Group-Blind Multiuser Detectors for Synchronous CDMAidescitation
 
Performance Analysis of Bfsk Multi-Hop Communication Systems Over K-Ό Fading ...
Performance Analysis of Bfsk Multi-Hop Communication Systems Over K-Ό Fading ...Performance Analysis of Bfsk Multi-Hop Communication Systems Over K-Ό Fading ...
Performance Analysis of Bfsk Multi-Hop Communication Systems Over K-Ό Fading ...ijwmn
 
Challenge of Image Retrieval, Brighton, 2000 1 ANVIL: a System for the Retrie...
Challenge of Image Retrieval, Brighton, 2000 1 ANVIL: a System for the Retrie...Challenge of Image Retrieval, Brighton, 2000 1 ANVIL: a System for the Retrie...
Challenge of Image Retrieval, Brighton, 2000 1 ANVIL: a System for the Retrie...Petros Tsonis
 

Was ist angesagt? (19)

PPT
PPTPPT
PPT
 
Ju3517011704
Ju3517011704Ju3517011704
Ju3517011704
 
Traffic Classification using a Statistical Approach
Traffic Classification using a Statistical ApproachTraffic Classification using a Statistical Approach
Traffic Classification using a Statistical Approach
 
The effect of gamma value on support vector machine performance with differen...
The effect of gamma value on support vector machine performance with differen...The effect of gamma value on support vector machine performance with differen...
The effect of gamma value on support vector machine performance with differen...
 
Collaborative filtering20081111
Collaborative filtering20081111Collaborative filtering20081111
Collaborative filtering20081111
 
IEEE 2015 Java Projects
IEEE 2015 Java ProjectsIEEE 2015 Java Projects
IEEE 2015 Java Projects
 
Dq24746750
Dq24746750Dq24746750
Dq24746750
 
Biclustering using Parallel Fuzzy Approach for Analysis of Microarray Gene Ex...
Biclustering using Parallel Fuzzy Approach for Analysis of Microarray Gene Ex...Biclustering using Parallel Fuzzy Approach for Analysis of Microarray Gene Ex...
Biclustering using Parallel Fuzzy Approach for Analysis of Microarray Gene Ex...
 
FYP presentation
FYP presentationFYP presentation
FYP presentation
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
IEEE 2015 Java Projects
IEEE 2015 Java ProjectsIEEE 2015 Java Projects
IEEE 2015 Java Projects
 
An Examination of the Bloom Filter and its Application in Preventing Weak Pas...
An Examination of the Bloom Filter and its Application in Preventing Weak Pas...An Examination of the Bloom Filter and its Application in Preventing Weak Pas...
An Examination of the Bloom Filter and its Application in Preventing Weak Pas...
 
Packet Loss and Overlay Size Aware Broadcast in the Kademlia P2P System
Packet Loss and Overlay Size Aware Broadcast in the Kademlia P2P SystemPacket Loss and Overlay Size Aware Broadcast in the Kademlia P2P System
Packet Loss and Overlay Size Aware Broadcast in the Kademlia P2P System
 
Maximizing Efficiency Of multiple–Path Source Routing in Presence of Jammer
Maximizing Efficiency Of multiple–Path Source Routing in Presence of JammerMaximizing Efficiency Of multiple–Path Source Routing in Presence of Jammer
Maximizing Efficiency Of multiple–Path Source Routing in Presence of Jammer
 
2009 spie hmm
2009 spie hmm2009 spie hmm
2009 spie hmm
 
Performance Analysis of Group-Blind Multiuser Detectors for Synchronous CDMA
Performance Analysis of Group-Blind Multiuser Detectors for Synchronous CDMAPerformance Analysis of Group-Blind Multiuser Detectors for Synchronous CDMA
Performance Analysis of Group-Blind Multiuser Detectors for Synchronous CDMA
 
Performance Analysis of Bfsk Multi-Hop Communication Systems Over K-Ό Fading ...
Performance Analysis of Bfsk Multi-Hop Communication Systems Over K-Ό Fading ...Performance Analysis of Bfsk Multi-Hop Communication Systems Over K-Ό Fading ...
Performance Analysis of Bfsk Multi-Hop Communication Systems Over K-Ό Fading ...
 
Challenge of Image Retrieval, Brighton, 2000 1 ANVIL: a System for the Retrie...
Challenge of Image Retrieval, Brighton, 2000 1 ANVIL: a System for the Retrie...Challenge of Image Retrieval, Brighton, 2000 1 ANVIL: a System for the Retrie...
Challenge of Image Retrieval, Brighton, 2000 1 ANVIL: a System for the Retrie...
 
9517cnc03
9517cnc039517cnc03
9517cnc03
 

Andere mochten auch

Insight Presentation
Insight PresentationInsight Presentation
Insight Presentationcompchem1
 
Adding morphological information to a connectionist Part-Of-Speech tagger
Adding morphological information  to a connectionist Part-Of-Speech taggerAdding morphological information  to a connectionist Part-Of-Speech tagger
Adding morphological information to a connectionist Part-Of-Speech taggerFrancisco Zamora-Martinez
 
When Recommenders Met Big Data: an Architectural Proposal and Evaluation [CER...
When Recommenders Met Big Data: an Architectural Proposal and Evaluation [CER...When Recommenders Met Big Data: an Architectural Proposal and Evaluation [CER...
When Recommenders Met Big Data: an Architectural Proposal and Evaluation [CER...Daniel Valcarce
 
Boys vs girls
Boys vs girlsBoys vs girls
Boys vs girlscompchem1
 
Boys are more intelligent than girls
Boys are more intelligent than girlsBoys are more intelligent than girls
Boys are more intelligent than girlsJawad Nasar Shah
 

Andere mochten auch (6)

Insight Presentation
Insight PresentationInsight Presentation
Insight Presentation
 
Adding morphological information to a connectionist Part-Of-Speech tagger
Adding morphological information  to a connectionist Part-Of-Speech taggerAdding morphological information  to a connectionist Part-Of-Speech tagger
Adding morphological information to a connectionist Part-Of-Speech tagger
 
When Recommenders Met Big Data: an Architectural Proposal and Evaluation [CER...
When Recommenders Met Big Data: an Architectural Proposal and Evaluation [CER...When Recommenders Met Big Data: an Architectural Proposal and Evaluation [CER...
When Recommenders Met Big Data: an Architectural Proposal and Evaluation [CER...
 
Relationships between boys and girls
Relationships between boys and girlsRelationships between boys and girls
Relationships between boys and girls
 
Boys vs girls
Boys vs girlsBoys vs girls
Boys vs girls
 
Boys are more intelligent than girls
Boys are more intelligent than girlsBoys are more intelligent than girls
Boys are more intelligent than girls
 

Ähnlich wie Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recommendation [ECIR '16 Slides]

Collaborative Filtering Survey
Collaborative Filtering SurveyCollaborative Filtering Survey
Collaborative Filtering Surveymobilizer1000
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)ijceronline
 
Learning Content and Usage Factors Simultaneously
Learning Content and Usage Factors SimultaneouslyLearning Content and Usage Factors Simultaneously
Learning Content and Usage Factors SimultaneouslyArnab Bhadury
 
A scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clusteringA scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clusteringAllenWu
 
PhD Consortium ADBIS presetation.
PhD Consortium ADBIS presetation.PhD Consortium ADBIS presetation.
PhD Consortium ADBIS presetation.Giuseppe Ricci
 
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...Daniel Valcarce
 
A Threshold Fuzzy Entropy Based Feature Selection: Comparative Study
A Threshold Fuzzy Entropy Based Feature Selection:  Comparative StudyA Threshold Fuzzy Entropy Based Feature Selection:  Comparative Study
A Threshold Fuzzy Entropy Based Feature Selection: Comparative StudyIJMER
 
Low rank models for recommender systems with limited preference information
Low rank models for recommender systems with limited preference informationLow rank models for recommender systems with limited preference information
Low rank models for recommender systems with limited preference informationEvgeny Frolov
 
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...Daniel Valcarce
 
Analytics of analytics pipelines: from optimising re-execution to general Dat...
Analytics of analytics pipelines:from optimising re-execution to general Dat...Analytics of analytics pipelines:from optimising re-execution to general Dat...
Analytics of analytics pipelines: from optimising re-execution to general Dat...Paolo Missier
 
An Efficient Frame Embedding Using Haar Wavelet Coefficients And Orthogonal C...
An Efficient Frame Embedding Using Haar Wavelet Coefficients And Orthogonal C...An Efficient Frame Embedding Using Haar Wavelet Coefficients And Orthogonal C...
An Efficient Frame Embedding Using Haar Wavelet Coefficients And Orthogonal C...IJERA Editor
 
Advances In Collaborative Filtering
Advances In Collaborative FilteringAdvances In Collaborative Filtering
Advances In Collaborative FilteringScott Donald
 
2. visualization in data mining
2. visualization in data mining2. visualization in data mining
2. visualization in data miningAzad public school
 
HOP-Rec_RecSys18
HOP-Rec_RecSys18HOP-Rec_RecSys18
HOP-Rec_RecSys18Matt Yang
 
IRJET- Boosting Response Aware Model-Based Collaborative Filtering
IRJET- Boosting Response Aware Model-Based Collaborative FilteringIRJET- Boosting Response Aware Model-Based Collaborative Filtering
IRJET- Boosting Response Aware Model-Based Collaborative FilteringIRJET Journal
 
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
[CIKM 2014] Deviation-Based Contextual SLIM RecommendersYONG ZHENG
 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsDinusha Dilanka
 
Data mining projects topics for java and dot net
Data mining projects topics for java and dot netData mining projects topics for java and dot net
Data mining projects topics for java and dot netredpel dot com
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix DatasetBen Mabey
 

Ähnlich wie Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recommendation [ECIR '16 Slides] (20)

Collaborative Filtering Survey
Collaborative Filtering SurveyCollaborative Filtering Survey
Collaborative Filtering Survey
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Learning Content and Usage Factors Simultaneously
Learning Content and Usage Factors SimultaneouslyLearning Content and Usage Factors Simultaneously
Learning Content and Usage Factors Simultaneously
 
A scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clusteringA scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clustering
 
PhD Consortium ADBIS presetation.
PhD Consortium ADBIS presetation.PhD Consortium ADBIS presetation.
PhD Consortium ADBIS presetation.
 
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
 
A Threshold Fuzzy Entropy Based Feature Selection: Comparative Study
A Threshold Fuzzy Entropy Based Feature Selection:  Comparative StudyA Threshold Fuzzy Entropy Based Feature Selection:  Comparative Study
A Threshold Fuzzy Entropy Based Feature Selection: Comparative Study
 
Low rank models for recommender systems with limited preference information
Low rank models for recommender systems with limited preference informationLow rank models for recommender systems with limited preference information
Low rank models for recommender systems with limited preference information
 
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...
 
Analytics of analytics pipelines: from optimising re-execution to general Dat...
Analytics of analytics pipelines:from optimising re-execution to general Dat...Analytics of analytics pipelines:from optimising re-execution to general Dat...
Analytics of analytics pipelines: from optimising re-execution to general Dat...
 
An Efficient Frame Embedding Using Haar Wavelet Coefficients And Orthogonal C...
An Efficient Frame Embedding Using Haar Wavelet Coefficients And Orthogonal C...An Efficient Frame Embedding Using Haar Wavelet Coefficients And Orthogonal C...
An Efficient Frame Embedding Using Haar Wavelet Coefficients And Orthogonal C...
 
50120140505004
5012014050500450120140505004
50120140505004
 
Advances In Collaborative Filtering
Advances In Collaborative FilteringAdvances In Collaborative Filtering
Advances In Collaborative Filtering
 
2. visualization in data mining
2. visualization in data mining2. visualization in data mining
2. visualization in data mining
 
HOP-Rec_RecSys18
HOP-Rec_RecSys18HOP-Rec_RecSys18
HOP-Rec_RecSys18
 
IRJET- Boosting Response Aware Model-Based Collaborative Filtering
IRJET- Boosting Response Aware Model-Based Collaborative FilteringIRJET- Boosting Response Aware Model-Based Collaborative Filtering
IRJET- Boosting Response Aware Model-Based Collaborative Filtering
 
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning Algorithms
 
Data mining projects topics for java and dot net
Data mining projects topics for java and dot netData mining projects topics for java and dot net
Data mining projects topics for java and dot net
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix Dataset
 

Mehr von Daniel Valcarce

Information Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slidesInformation Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slidesDaniel Valcarce
 
On the Robustness and Discriminative Power of IR Metrics for Top-N Recommenda...
On the Robustness and Discriminative Power of IR Metrics for Top-N Recommenda...On the Robustness and Discriminative Power of IR Metrics for Top-N Recommenda...
On the Robustness and Discriminative Power of IR Metrics for Top-N Recommenda...Daniel Valcarce
 
LiMe: Linear Methods for Pseudo-Relevance Feedback [SAC '18 Slides]
LiMe: Linear Methods for Pseudo-Relevance Feedback [SAC '18 Slides]LiMe: Linear Methods for Pseudo-Relevance Feedback [SAC '18 Slides]
LiMe: Linear Methods for Pseudo-Relevance Feedback [SAC '18 Slides]Daniel Valcarce
 
Computing Neighbourhoods with Language Models in a Collaborative Filtering Sc...
Computing Neighbourhoods with Language Models in a Collaborative Filtering Sc...Computing Neighbourhoods with Language Models in a Collaborative Filtering Sc...
Computing Neighbourhoods with Language Models in a Collaborative Filtering Sc...Daniel Valcarce
 
A Study of Smoothing Methods for Relevance-Based Language Modelling of Recomm...
A Study of Smoothing Methods for Relevance-Based Language Modelling of Recomm...A Study of Smoothing Methods for Relevance-Based Language Modelling of Recomm...
A Study of Smoothing Methods for Relevance-Based Language Modelling of Recomm...Daniel Valcarce
 
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...Daniel Valcarce
 
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...Daniel Valcarce
 
Language Models for Collaborative Filtering Neighbourhoods [ECIR '16 Slides]
Language Models for Collaborative Filtering Neighbourhoods [ECIR '16 Slides]Language Models for Collaborative Filtering Neighbourhoods [ECIR '16 Slides]
Language Models for Collaborative Filtering Neighbourhoods [ECIR '16 Slides]Daniel Valcarce
 

Mehr von Daniel Valcarce (8)

Information Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slidesInformation Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slides
 
On the Robustness and Discriminative Power of IR Metrics for Top-N Recommenda...
On the Robustness and Discriminative Power of IR Metrics for Top-N Recommenda...On the Robustness and Discriminative Power of IR Metrics for Top-N Recommenda...
On the Robustness and Discriminative Power of IR Metrics for Top-N Recommenda...
 
LiMe: Linear Methods for Pseudo-Relevance Feedback [SAC '18 Slides]
LiMe: Linear Methods for Pseudo-Relevance Feedback [SAC '18 Slides]LiMe: Linear Methods for Pseudo-Relevance Feedback [SAC '18 Slides]
LiMe: Linear Methods for Pseudo-Relevance Feedback [SAC '18 Slides]
 
Computing Neighbourhoods with Language Models in a Collaborative Filtering Sc...
Computing Neighbourhoods with Language Models in a Collaborative Filtering Sc...Computing Neighbourhoods with Language Models in a Collaborative Filtering Sc...
Computing Neighbourhoods with Language Models in a Collaborative Filtering Sc...
 
A Study of Smoothing Methods for Relevance-Based Language Modelling of Recomm...
A Study of Smoothing Methods for Relevance-Based Language Modelling of Recomm...A Study of Smoothing Methods for Relevance-Based Language Modelling of Recomm...
A Study of Smoothing Methods for Relevance-Based Language Modelling of Recomm...
 
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
 
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
 
Language Models for Collaborative Filtering Neighbourhoods [ECIR '16 Slides]
Language Models for Collaborative Filtering Neighbourhoods [ECIR '16 Slides]Language Models for Collaborative Filtering Neighbourhoods [ECIR '16 Slides]
Language Models for Collaborative Filtering Neighbourhoods [ECIR '16 Slides]
 

KĂŒrzlich hochgeladen

BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptDr. Soumendra Kumar Patra
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 

KĂŒrzlich hochgeladen (20)

BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 

Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recommendation [ECIR '16 Slides]

  • 1. ECIR 2016, PADUA, ITALY EFFICIENT PSEUDO-RELEVANCE FEEDBACK METHODS FOR COLLABORATIVE FILTERING RECOMMENDATION Daniel Valcarce, Javier Parapar, Álvaro Barreiro @dvalcarce @jparapar @AlvaroBarreiroG Information Retrieval Lab @IRLab_UDC University of A Coruña Spain
  • 2. Outline 1. Pseudo-Relevance Feedback (PRF) 2. Collaborative Filtering (CF) 3. PRF Methods for CF 4. Experiments 5. Conclusions and Future Work 1/28
  • 4. Pseudo-Relevance Feedback (I) Pseudo-Relevance Feedback provides an automatic method for query expansion: Assumes that the top retrieved documents with the original query are relevant (pseudo-relevant set). The query is expanded with the most representative terms from this set. The expanded query is expected to yield better results than the original one. 3/28
  • 7. Pseudo-Relevance Feedback (II) Information need query Retrieval System 4/28
  • 8. Pseudo-Relevance Feedback (II) Information need query Retrieval System 4/28
  • 9. Pseudo-Relevance Feedback (II) Information need query Retrieval System 4/28
  • 10. Pseudo-Relevance Feedback (II) Information need query Retrieval System 4/28
  • 11. Pseudo-Relevance Feedback (II) Information need query Retrieval System Query Expansion expanded query 4/28
  • 12. Pseudo-Relevance Feedback (II) Information need query Retrieval System Query Expansion expanded query 4/28
  • 13. Pseudo-Relevance Feedback (III) Some popular PRF approaches: Based on Rocchio’s model (Rocchio, 1971 & Carpineto et al., ACM TOIS 2001) Relevance-Based Language Models (Lavrenko & Croft, SIGIR 2001) Divergence Minimization Model (Zhai & LaïŹ€erty, SIGIR 2006) Mixture Models (Tao & Zhai, SIGIR 2006) 5/28
  • 15. Recommender Systems Notation: The set of users U The set of items I The rating that the user u gave to the item i is ru,i The set of items rated by user u is denoted by Iu The set of users that rated item i is denoted by Ui The neighbourhood of user u is denoted by Vu Top-N recommendation: create a ranked list containing relevant and unknown items for each user u ∈ U. 7/28
  • 16. Collaborative Filtering (I) Collaborative Filtering (CF) employs the past interaction between users and items to generate recommendations. Idea: If this user who is similar to you likes this item, maybe you will also like it. DiïŹ€erent input data: Explicit feedback: ratings, reviews... Implicit feedback: clicks, purchases... Perhaps the most popular approach to recommendation given the increasing amount of information about users. 8/28
  • 17. Collaborative Filtering (II) Collaborative Filtering (CF) techniques can be classiïŹed in: Model-based methods: learn a predictive model from the user-item ratings. ◩ Matrix factorisation (e.g., SVD) Neighbourhood-based (or memory-based) methods: compute recommendations using directly part of the ratings. ◩ k-NN approaches 9/28
  • 19. PRF for CF PRF CF User’s query User’s proïŹle mostˆ1,populatedˆ2,stateˆ2 Titanicˆ2,Avatarˆ3,Matrixˆ5 Documents Neighbours Terms Items 11/28
  • 20. Previous Work on Adapting PRF Methods to CF Relevance-Based Language Models Originally devised for PRF (Lavrenko & Croft, SIGIR 2001). Adapted to CF (Parapar et al., Inf. Process. Manage. 2013). Two models: RM1 and RM2. High precision ïŹgures in recommendation. 12/28
  • 21. Previous Work on Adapting PRF Methods to CF Relevance-Based Language Models Originally devised for PRF (Lavrenko & Croft, SIGIR 2001). Adapted to CF (Parapar et al., Inf. Process. Manage. 2013). Two models: RM1 and RM2. High precision ïŹgures in recommendation. ... but high computational cost! RM1 : p(i|Ru) ∝ v∈Vu p(v) p(i|v) j∈Iu p(j|v) RM2 : p(i|Ru) ∝ p(i) j∈Iu v∈Vu p(i|v) p(v) p(i) p(j|v) 12/28
  • 22. Our Proposals based on Rocchio’s Framework Rocchio’s Weights pRocchio(i|u) v∈Vu rv,i |Vu| Robertson Selection Value g pRSV(i|u) v∈Vu rv,i |Vu| p(i|Vu) CHI-2 g pCHI−2(i|u) p(i|Vu) − p(i|C) 2 p(i|C) Kullback–Leibler Divergence pKLD(i|u) p(i|Vu) log p(i|Vu) p(i|C) 13/28
  • 23. Our Proposals based on Rocchio’s Framework Rocchio’s Weights pRocchio(i|u) v∈Vu rv,i |Vu| Robertson Selection Value g pRSV(i|u) v∈Vu rv,i |Vu| p(i|Vu) CHI-2 g pCHI−2(i|u) p(i|Vu) − p(i|C) 2 p(i|C) Kullback–Leibler Divergence pKLD(i|u) p(i|Vu) log p(i|Vu) p(i|C) 13/28
  • 24. Our Proposals based on Rocchio’s Framework Rocchio’s Weights pRocchio(i|u) v∈Vu rv,i |Vu| Robertson Selection Value g pRSV(i|u) v∈Vu rv,i |Vu| p(i|Vu) CHI-2 g pCHI−2(i|u) p(i|Vu) − p(i|C) 2 p(i|C) Kullback–Leibler Divergence pKLD(i|u) p(i|Vu) log p(i|Vu) p(i|C) 13/28
  • 25. Probability Estimation Maximum Likelihood Estimate under a Multinomial Distribution over the ratings: pmle(i|Vu) v∈Vu rv,i v∈Vu , j∈I rv,j pmle(i|C) u∈U ru,i u∈U, j∈I ru,j 14/28
  • 26. Neighbourhood Length Normalisation (I) Neighbourhoods are computed using clustering algorithms: Hard clustering: every user is in only one cluster. Clusters may have diïŹ€erent sizes. Example: k-means. Soft clustering: each user has its own neighbours. When we set k to a high value, we may ïŹnd diïŹ€erent amounts of neighbours. Example: k-NN. 15/28
  • 27. Neighbourhood Length Normalisation (I) Neighbourhoods are computed using clustering algorithms: Hard clustering: every user is in only one cluster. Clusters may have diïŹ€erent sizes. Example: k-means. Soft clustering: each user has its own neighbours. When we set k to a high value, we may ïŹnd diïŹ€erent amounts of neighbours. Example: k-NN. Idea: consider the variability of the neighbourhood lengths: Big neighbourhoods is equivalent to a query with a lot of results: the collection model is closed to the target user. Small neighbourhoods implies that neighbours are highly speciïŹc: the collection is very diïŹ€erent from the target user. 15/28
  • 28. Neighbourhood Length Normalisation (II) We bias the MLE to perform neighbourhood length normalisation: pnmle(i|Vu) rank 1 |Vu| v∈Vu rv,i v∈Vu , j∈I rv,j pnmle(i|C) rank 1 |U| u∈U ru,i u∈U, j∈I ru,j 16/28
  • 30. Experimental settings Baselines: UB: traditional user-based neighbourhood approach. SVD: matrix factorisation. UIR-Item: probabilistic approach. RM1 and RM2: Relevance-Based Language Models. Our algorithms: Rocchio’s Weights (RW) Robertson Selection Value (RSV) CHI-2 Kullback-Leibler Divergence (KLD) 18/28
  • 31. Efficiency 0.01 0.1 1 10 ML 100k ML 1M ML 10M recommendationtimeperuser(s) dataset UIR RM1 RM2 SVD++ RSV UB RW CHI-2 KLD 19/28
  • 32. Accuracy (nDCG@10) Algorithm ML 100k ML 1M R3-Yahoo! LibraryThing UB 0.0468 0.0313 0.0108 0.0055b SVD 0.0936a 0.0608a 0.0101 0.0015 UIR-Item 0.2188ab 0.1795abd 0.0174abd 0.0673abd RM1 0.2473abc 0.1402ab 0.0146ab 0.0444ab RM2 0.3323abcd 0.1992abd 0.0207abcd 0.0957abcd Rocchio’s Weights 0.2604abcd 0.1557abd 0.0194abcd 0.0892abcd RSV 0.2604abcd 0.1557abd 0.0194abcd 0.0892abcd KLD MLE 0.2693abcd 0.1264ab 0.0197abcd 0.1576abcde NMLE 0.3120abcd 0.1546ab 0.0201abcd 0.1101abcde CHI-2 MLE 0.0777a 0.0709ab 0.0149ab 0.0939abcd NMLE 0.3220abcd 0.1419ab 0.0204abcd 0.1459abcde Table: Values of nDCG@10. Pink = best algorithm. Blue = not signiïŹcantly diïŹ€erent to the best (Wilcoxon two-sided p < 0.01). 20/28
  • 33. Diversity (Gini@10) Algorithm ML 100k ML 1M R3-Yahoo! LibraryThing UIR-Item 0.0124 0.0050 0.0137 0.0005 RM2 0.0256 0.0069 0.0207 0.0019 CHI-2 NMLE 0.0450 0.0106 0.0506 0.0539 Table: Values of the complement of Gini index at 10. Pink = best algorithm. 21/28
  • 34. Novelty (MSI@10) Algorithm ML 100k ML 1M R3-Yahoo! LibraryThing UIR-Item 5.2337e 8.3713e 3.7186e 17.1229e RM2 6.8273c 8.9481c 4.9618c 19.27343c CHI-2 NMLE 8.1711ec 10.0043ec 7.5555ec 8.8563 Table: Values of Mean Self-Information at 10. Pink = best algorithm. 22/28
  • 35. Trade-off Accuracy-Diversity 0.06 0.07 0.08 0.09 0.10 0.11 0.12 0.13 200 300 400 500 600 700 800 900 G–(Gini,nDCG) k RM2 CHI-2 NMLE Figure: G-measure of nDCG@10 and Gini@10 on MovieLens 100k varying the number of neighbours k using Pearson’s correlation similarity. 23/28
  • 36. Trade-off Accuracy-Novelty 0.9 1.0 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2.0 200 300 400 500 600 700 800 900 G–(MSI,nDCG) k RM2 CHI-2 NMLE Figure: G-measure of nDCG@10 and MSI@10 on MovieLens 100k varying the number of neighbours k using Pearson’s correlation similarity. 24/28
  • 38. Conclusions We proposed to use fast PRF methods (Rocchio’s Weigths, RSV, KLD and CHI-2): They are orders of magnitude faster than the Relevance Models (up to 200x). They generate quite accurate recommendations. Good novelty and diversity ïŹgures with a better trade-oïŹ€ than RM2. They lack of parameters (only clustering parameters). 26/28
  • 39. Future Work Other approaches for computing neighbourhoods: Posterior Probability Clustering (a non-negative matrix factorisation). Normalised Cut (spectral clustering). 27/28
  • 40. Future Work Other approaches for computing neighbourhoods: Posterior Probability Clustering (a non-negative matrix factorisation). Normalised Cut (spectral clustering). Explore other PRF methods: Divergence Minimization Models. Mixture Models. 27/28