Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS Poster]


Poster for the Doctoral Symposium paper in ACM RecSys 2015: Daniel Valcarce: Exploring Statistical Language Models for Recommender Systems. RecSys 2015: 375-378 http://doi.acm.org/10.1145/2792838.2796547

Exploring Statistical Language Models
for Recommender Systems
Daniel Valcarce
daniel.valcarce@udc.es – http://www.irlab.org
Information Retrieval Lab, Computer Science Department, University of A Coruña
Information Retrieval (IR)
Goal: Retrieve relevant documents according to the information need of a user.
Examples: Search engines.
Methods: They can be based on:
• Vector: Vector Space Model.
• Matrix factorisation: Latent Semantic Indexing.
• Probabilistic modelling: Language Models.
Information Filtering (IF)
Goal: Select relevant items from an information stream for a given user.
Examples: Spam filters, recommender systems.
Methods: Some Collaborative Filtering methods are:
• Vector: Pairwise similarities (cosine, Pearson, etc.).
• Matrix factorisation: SVD, NMF.
• Probabilistic modelling: LDA.
Overview
• Information Filtering (IF) and Information Retrieval (IR) are two sibling fields.
• Statistical Language Models are a successful technique in IR → Explore how to apply them to recommendation.
• We start by improving the current adaptation of Relevance-Based Language Models to Collaborative Filtering [1].
Relevance-Based Language Models
IR          RecSys
Query       Target user
Document    Neighbour
Term        Item
RM2: $p(i|R_u) \propto p(i) \prod_{j \in I_u} \sum_{v \in V_u} \frac{p(i|v)\, p(v)}{p(i)}\, p(j|v)$
• $I_u$ is the set of items rated by the user $u$.
• $V_u$ is the set of neighbours of the user $u$.
• $p(i|u)$ is computed by smoothing the maximum likelihood estimate.
• $p(i)$ and $p(v)$ are the item and user priors.
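The RM2 estimate above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `rm2_scores`, the dense matrix layout, and the choice to work in log space (to avoid underflow in the product over rated items) are all hypothetical. It assumes every probability is strictly positive, which smoothing is meant to guarantee.

```python
import numpy as np

def rm2_scores(p_item_given_user, neighbours, rated_items, p_item, p_user):
    """Log of the RM2 score (up to an additive constant) for every item.

    p_item_given_user: array (n_users, n_items) holding smoothed p(i|v)
    neighbours:        indices of the users in V_u
    rated_items:       indices of the items in I_u
    p_item, p_user:    item prior p(i) and user prior p(v), as 1-D arrays
    """
    P_v = p_item_given_user[neighbours]   # rows p(.|v) for v in V_u
    weights = p_user[neighbours]          # p(v) for v in V_u
    log_score = np.log(p_item)            # leading p(i) factor
    for j in rated_items:                 # product over j in I_u
        # sum over v of p(i|v) p(v) p(j|v), then divide by p(i)
        inner = (P_v * (weights * P_v[:, j])[:, None]).sum(axis=0) / p_item
        log_score += np.log(inner)
    return log_score
```

Items would then be ranked by descending score. Excluding the items the target user has already rated is an assumption about how the ranking is used; the code above leaves that step to the caller.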
Smoothing methods
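Smoothing deals with data sparsity and plays a similar role to the IDF using a background model [3]:
$p(i|C) = \frac{\sum_{v \in U} r_{v,i}}{\sum_{j \in I,\, v \in U} r_{v,j}}$
Jelinek-Mercer (JM): $p_\lambda(i|u) = (1 - \lambda)\, \frac{r_{u,i}}{\sum_{j \in I_u} r_{u,j}} + \lambda\, p(i|C)$
Dirichlet Priors (DP): $p_\mu(i|u) = \frac{r_{u,i} + \mu\, p(i|C)}{\mu + \sum_{j \in I_u} r_{u,j}}$
Absolute Discounting (AD): $p_\delta(i|u) = \frac{\max(r_{u,i} - \delta, 0) + \delta\, |I_u|\, p(i|C)}{\sum_{j \in I_u} r_{u,j}}$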
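The three smoothing methods can be written compactly over a rating matrix. The sketch below is illustrative only and rests on assumptions that are not in the poster: ratings are held in a dense NumPy array `R` of shape users × items with 0 meaning "not rated", every user has at least one rating, and the function names are hypothetical.

```python
import numpy as np

def collection_model(R):
    """Background model p(i|C): share of all rating mass that falls on item i."""
    return R.sum(axis=0) / R.sum()

def jelinek_mercer(R, lam):
    """Linear interpolation between the user's estimate and p(i|C)."""
    user_mass = R.sum(axis=1, keepdims=True)  # sum of r_{u,j} over I_u
    return (1.0 - lam) * R / user_mass + lam * collection_model(R)

def dirichlet_priors(R, mu):
    """Adds mu pseudo-ratings distributed according to p(i|C)."""
    user_mass = R.sum(axis=1, keepdims=True)
    return (R + mu * collection_model(R)) / (mu + user_mass)

def absolute_discounting(R, delta):
    """Subtracts delta from each rating and redistributes it via p(i|C)."""
    user_mass = R.sum(axis=1, keepdims=True)
    n_rated = (R > 0).sum(axis=1, keepdims=True)  # |I_u|
    discounted = np.maximum(R - delta, 0.0)
    return (discounted + delta * n_rated * collection_model(R)) / user_mass
```

Each function returns a users × items array whose rows are the smoothed distributions p(i|u). For JM and DP the rows always sum to 1; for AD they sum to 1 as long as delta does not exceed the smallest rating a user has given.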
Priors
Priors provide a principled way of introducing knowledge into
the recommender [2].
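User prior:
• Uniform (U): $p_U(u) = \frac{1}{|U|}$
• Linear (L): $p_L(u) = \frac{\sum_{i \in I_u} r_{u,i}}{\sum_{v \in U} \sum_{j \in I_v} r_{v,j}}$
Item prior:
• Uniform (U): $p_U(i) = \frac{1}{|I|}$
• Linear (L): $p_L(i) = \frac{\sum_{u \in U_i} r_{u,i}}{\sum_{j \in I} \sum_{v \in U_j} r_{v,j}}$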
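As a rough illustration, the uniform and linear priors reduce to simple row and column sums of the rating matrix. The sketch assumes a dense NumPy array `R` (users × items, 0 for unrated); that layout and the function names are assumptions made here, not part of the poster.

```python
import numpy as np

def uniform_prior(n):
    """Uniform prior: every user (or item) gets probability 1/n."""
    return np.full(n, 1.0 / n)

def linear_user_prior(R):
    """Linear user prior: the user's rating mass over the total rating mass."""
    return R.sum(axis=1) / R.sum()

def linear_item_prior(R):
    """Linear item prior: the item's rating mass over the total rating mass."""
    return R.sum(axis=0) / R.sum()
```

Under this layout the linear item prior is computed from the same sums as the background model p(i|C) used for smoothing.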
Experiments on MovieLens 100k
Algorithm     nDCG@10   Gini@10   MSI@10
SVD           0.0946    0.0109    14.6129
SVD++         0.1113    0.0126    14.9574
NNCosNgbr     0.1771    0.0344    16.8222
UIR-Item      0.2188    0.0124     5.2337
PureSVD       0.3595    0.1364    11.8841
RM2-JM        0.3175    0.0232     9.1087
RM2-DP        0.3274    0.0251     9.2181
RM2-AD        0.3296    0.0256     9.2409
RM2-AD-L-U    0.3423    0.0264     9.2004
Research directions
• Some techniques developed for solving IR problems can be effectively applied to recommendation.
• Probabilistic models from IR are competitive recommendation algorithms, although there is still room for improvement.
• Language Models provide an interpretable and principled way of generating recommendations.
• Using different priors [2] or clustering algorithms for the neighbourhoods [1] can improve RM2.
• We envision as future work the development of context-aware and hybrid recommendations under the Language Modelling framework.
Bibliography
[1] J. Parapar, A. Bellogín, P. Castells, and A. Barreiro. Relevance-Based Language Modelling for Recommender Systems. Information Processing & Management, 49(4):966–980, 2013.
[2] D. Valcarce, J. Parapar, and A. Barreiro. A Study of Priors for Relevance-Based Language Modelling of Recommender Systems. In RecSys ’15. ACM, 2015.
[3] D. Valcarce, J. Parapar, and A. Barreiro. A Study of Smoothing Methods for Relevance-Based Language Modelling of Recommender Systems. In ECIR ’15, volume 9022, pages 346–351. Springer, 2015.
RecSys 2015, 9th ACM Conference on Recommender Systems. 16 - 20 September, 2015, Vienna, Austria.
