Transfer learning in heterogeneous collaborative filtering domains

•Als PPTX, PDF herunterladen•

2 gefällt mir•3,369 views

Allen Wu

Bildung

2013/3/27
Transfer learning in
heterogeneous collaborative
filtering domains
Authors/ Weike Pan and Qiang Yang
Affiliation/ Dept. of CSE, Hong Kong University of Science and Technology
Source/ Journal of Artificial Intelligence (2013)
Presenter/ Allen Wu
1

Outline
• Introduction
• Heterogeneous collaborative filtering problems

2013/3/27
• Transfer by collective factorization
• Experimental results
• Conclusion

2

Introduction
• Data sparsity is a major challenge in collaborative filtering (CF).
• Overfitting can easily happen for prediction.

2013/3/27
• Some auxiliary data of the form “like” or “dislike” may be more
easily obtained.
• It’s more convenient for users to express preference.

• How do we take advantage of auxiliary knowledge to alleviate the
sparsity problem?

• Most existing transfer learning methods in CF consider auxiliary data from
several perspectives.
• User-side transfer, item-side transfer, knowledge-transfer. 3

Probabilistic Matrix Factorization
(NIPS’08)
•

2013/3/27
4

Social Recommendation (CIKM’08)
•

2013/3/27
5

Collective Matrix Factorization (KDD’08)
•

2013/3/27
6

CodeBook Transfer (IJCAI’09)
•

2013/3/27
7

Rating-matrix generative model (ICML’09)
• RMGM is derived and extended from FMM generative model,
which can be formulated as

2013/3/27
• The difference:
• It learns (U, V) and (U3, V3) alternatively.
• A soft indicator matrix is used. E.g., U [0, 1]n d.

8

Heterogeneous collaborative filtering
problems
• •

2013/3/27
9

Model formulation
• Assume a user u’s rating on an item i in the target data, rui, is
generated from

2013/3/27
• user-specific latent feature vector Uu 1 d, where u=1,…,n.

• item-specific latent feature vector Vi 1 d, where i=1,…,m.

• some data-dependent effect denoted as B d d.

12

Model formulation (Cont.)
• Likelihood:
• Prior:

2013/3/27
• Posterior Likelihood Prior (Bayesian inference)
• Log(Posterior)= Log(Likelihood Prior)

13

Learning U and V in CMTF
• Theorem 1. Given B and V, we can obtain the user-specific
latent matrix U in a closed form.

2013/3/27
16

Learning U and V in CSVD
•

2013/3/27
17

Learning U and V in CSVD
(Cont.)

2013/3/27
18

Evaluation metrics
• Summary of Data sets

2013/3/27
• Evaluation metrics

22

Baselines and parameter settings
•

2013/3/27
23

Performance of Moviepilot data

2013/3/27
24

Performance of Netfliex data

2013/3/27
25

Performance on Netflix at different
sparsity levels
• SCVD performs
better than CMTF in

2013/3/27
all cases.

26

Conclusion
• This paper investigate how to address the sparsity problem in
CF via a transfer learning solution.

2013/3/27
• The TCP framework is proposed to transfer knowledge from
auxiliary data to target data to alleviates the data sparsity.

• Experimental results show that TCP performs significantly
better than several state-of-the-art baseline algorithms.

• In the future, the “pure” cold-start problem for users without
any rating is needed to be addressed via transfer learning.
27

2013/3/27
Thank you for
listening.
Q&A

28

Empfohlen

Using support vector machine with a hybrid feature selection method to the st...lolokikipipi

Incremental collaborative filtering via evolutionary co clusteringAllen Wu

A scalable collaborative filtering framework based on co clusteringAllenWu

Co-clustering of multi-view datasets: a parallelizable approachAllen Wu

Organizing the classroom small group 1Michelle Martens-Dragalin, M.Ed.

Project-Based Learning Guided Lesson Study Improve the Achievement of Learnin...iosrjce

Maed 5040-5070-study of studies presentationcollin777

Avlm 2009 Guided Indep Learning Wimavlm2009avnet

Empfohlen

Using support vector machine with a hybrid feature selection method to the st...lolokikipipi

Incremental collaborative filtering via evolutionary co clusteringAllen Wu

A scalable collaborative filtering framework based on co clusteringAllenWu

Co-clustering of multi-view datasets: a parallelizable approachAllen Wu

Organizing the classroom small group 1Michelle Martens-Dragalin, M.Ed.

Project-Based Learning Guided Lesson Study Improve the Achievement of Learnin...iosrjce

Maed 5040-5070-study of studies presentationcollin777

Avlm 2009 Guided Indep Learning Wimavlm2009avnet

Packard Foundation Peer Learning GroupBeth Kanter

Peer To Peer Learning 10 7 09f1goodbuys

The effect of ability grouping on students’york1896

Teaching (and Learning) with Peer InstructionPeter Newbury

OER Peer Learning Web-Based ApplicationOpen Education Consortium

Peer-to-Peer learning technologies, Visualisation and the education around th...Grial - University of Salamanca

Curriculum developmentchristy Ador

How useful is self-supervised pretraining for Visual tasks?Seunghyun Hwang

Triangular Learner ModelLoc Nguyen

Pattern Recognition in Multiple Bike sharing Systems for comparabilityAthiq Ahamed

Declarative data analysisSouth West Data Meetup

Cikm 2013 - Beyond Data From User Information to Business ValueXavier Amatriain

Introduction to ΔQ and Network Performance Science (extracts)Martin Geddes

Lect1sujitkumar Sujit.Karande

Model-Based Testing: Concepts, Tools, and TechniquesTechWell

Principles of Data VisualizationEamonn Maguire

GRAPH-BASED RECOMMENDATION SYSTEMSyed Ebraiz Ali Chishti

A Graph Summarization: A Survey | Summarizing and understanding large graphsaftab alam

TELECOM_CHURN_PREDICTIAAAAAAAAAAAAAAAAAON[1].pptxGaganaGowda31

GDG Cloud Community Day 2022 - Managing data quality in Machine LearningSARADINDU SENGUPTA

Cold-Start Management with Cross-Domain Collaborative Filtering and TagsMatthias Braunhofer

Introduction to Data Analytics with RWei Zhong Toh

Weitere ähnliche Inhalte

Andere mochten auch

Packard Foundation Peer Learning GroupBeth Kanter

Peer To Peer Learning 10 7 09f1goodbuys

The effect of ability grouping on students’york1896

Teaching (and Learning) with Peer InstructionPeter Newbury

OER Peer Learning Web-Based ApplicationOpen Education Consortium

Peer-to-Peer learning technologies, Visualisation and the education around th...Grial - University of Salamanca

Curriculum developmentchristy Ador

Andere mochten auch (7)

Packard Foundation Peer Learning Group

Peer To Peer Learning 10 7 09f1

The effect of ability grouping on students’

Teaching (and Learning) with Peer Instruction

OER Peer Learning Web-Based Application

Peer-to-Peer learning technologies, Visualisation and the education around th...

Curriculum development

Ähnlich wie Transfer learning in heterogeneous collaborative filtering domains

How useful is self-supervised pretraining for Visual tasks?Seunghyun Hwang

Triangular Learner ModelLoc Nguyen

Pattern Recognition in Multiple Bike sharing Systems for comparabilityAthiq Ahamed

Declarative data analysisSouth West Data Meetup

Cikm 2013 - Beyond Data From User Information to Business ValueXavier Amatriain

Introduction to ΔQ and Network Performance Science (extracts)Martin Geddes

Lect1sujitkumar Sujit.Karande

Model-Based Testing: Concepts, Tools, and TechniquesTechWell

Principles of Data VisualizationEamonn Maguire

GRAPH-BASED RECOMMENDATION SYSTEMSyed Ebraiz Ali Chishti

A Graph Summarization: A Survey | Summarizing and understanding large graphsaftab alam

TELECOM_CHURN_PREDICTIAAAAAAAAAAAAAAAAAON[1].pptxGaganaGowda31

GDG Cloud Community Day 2022 - Managing data quality in Machine LearningSARADINDU SENGUPTA

Cold-Start Management with Cross-Domain Collaborative Filtering and TagsMatthias Braunhofer

Introduction to Data Analytics with RWei Zhong Toh

WorldCist 2013 - Behavior Assessment Framework Bernhard Klein

Kaggle Days Paris - Alberto Danese - ML InterpretabilityAlberto Danese

Analytic Dependency Loops in Architectural Models of Cyber-Physical SystemsIvan Ruchkin

TERM DEPOSIT SUBSCRIPTION PREDICTIONIRJET Journal

Building a business case and institutional policy on a 10Y research data mana...jiscdatapool

Ähnlich wie Transfer learning in heterogeneous collaborative filtering domains (20)

How useful is self-supervised pretraining for Visual tasks?

Triangular Learner Model

Pattern Recognition in Multiple Bike sharing Systems for comparability

Declarative data analysis

Cikm 2013 - Beyond Data From User Information to Business Value

Introduction to ΔQ and Network Performance Science (extracts)

Lect1

Model-Based Testing: Concepts, Tools, and Techniques

Principles of Data Visualization

GRAPH-BASED RECOMMENDATION SYSTEM

A Graph Summarization: A Survey | Summarizing and understanding large graphs

TELECOM_CHURN_PREDICTIAAAAAAAAAAAAAAAAAON[1].pptx

GDG Cloud Community Day 2022 - Managing data quality in Machine Learning

Cold-Start Management with Cross-Domain Collaborative Filtering and Tags

Introduction to Data Analytics with R

WorldCist 2013 - Behavior Assessment Framework

Kaggle Days Paris - Alberto Danese - ML Interpretability

Analytic Dependency Loops in Architectural Models of Cyber-Physical Systems

TERM DEPOSIT SUBSCRIPTION PREDICTION

Building a business case and institutional policy on a 10Y research data mana...

Kürzlich hochgeladen

Introduction to Nonprofit Accounting: The BasicsTechSoup

Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfDr Vijay Vishwakarma

Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh

Spatium Project Simulation student briefAssociation for Project Management

ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22

Food safety_Challenges food safety laboratories_.pdfSherif Taha

Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith

HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1

Interdisciplinary_Insights_Data_Collection_Methods.pptxPooja Bhuva

HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxEsquimalt MFRC

SOC 101 Demonstration of Learning Presentationcamerronhm

Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136

Single or Multiple melodic lines structuredhanjurrannsibayan2

Understanding Accommodations and ModificationsMJDuyan

Towards a code of practice for AI in AT.pptxJisc

How to Create and Manage Wizard in Odoo 17Celine George

Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Pooja Bhuva

SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur

Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417

Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University of Engineering & Technology, Jamshoro

Kürzlich hochgeladen (20)

Introduction to Nonprofit Accounting: The Basics

Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf

Micro-Scholarship, What it is, How can it help me.pdf

Spatium Project Simulation student brief

ICT Role in 21st Century Education & its Challenges.pptx

Food safety_Challenges food safety laboratories_.pdf

Fostering Friendships - Enhancing Social Bonds in the Classroom

HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx

Interdisciplinary_Insights_Data_Collection_Methods.pptx

HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx

SOC 101 Demonstration of Learning Presentation

Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...

Single or Multiple melodic lines structure

Understanding Accommodations and Modifications

Towards a code of practice for AI in AT.pptx

How to Create and Manage Wizard in Odoo 17

Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...

SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx

Unit-V; Pricing (Pharma Marketing Management).pptx

Mehran University Newsletter Vol-X, Issue-I, 2024

Transfer learning in heterogeneous collaborative filtering domains

1. 2013/3/27 Transfer learning in heterogeneous collaborative filtering domains Authors/ Weike Pan and Qiang Yang Affiliation/ Dept. of CSE, Hong Kong University of Science and Technology Source/ Journal of Artificial Intelligence (2013) Presenter/ Allen Wu 1

2. Outline • Introduction • Heterogeneous collaborative filtering problems 2013/3/27 • Transfer by collective factorization • Experimental results • Conclusion 2

3. Introduction • Data sparsity is a major challenge in collaborative filtering (CF). • Overfitting can easily happen for prediction. 2013/3/27 • Some auxiliary data of the form “like” or “dislike” may be more easily obtained. • It’s more convenient for users to express preference. • How do we take advantage of auxiliary knowledge to alleviate the sparsity problem? • Most existing transfer learning methods in CF consider auxiliary data from several perspectives. • User-side transfer, item-side transfer, knowledge-transfer. 3

4. Probabilistic Matrix Factorization (NIPS’08) • 2013/3/27 4

5. Social Recommendation (CIKM’08) • 2013/3/27 5

6. Collective Matrix Factorization (KDD’08) • 2013/3/27 6

7. CodeBook Transfer (IJCAI’09) • 2013/3/27 7

8. Rating-matrix generative model (ICML’09) • RMGM is derived and extended from FMM generative model, which can be formulated as 2013/3/27 • The difference: • It learns (U, V) and (U3, V3) alternatively. • A soft indicator matrix is used. E.g., U [0, 1]n d. 8

9. Heterogeneous collaborative filtering problems • • 2013/3/27 9

10. Challenges • 2013/3/27 10

11. Overview of solution • 2013/3/27 11

12. Model formulation • Assume a user u’s rating on an item i in the target data, rui, is generated from 2013/3/27 • user-specific latent feature vector Uu 1 d, where u=1,…,n. • item-specific latent feature vector Vi 1 d, where i=1,…,m. • some data-dependent effect denoted as B d d. 12

13. Model formulation (Cont.) • Likelihood: • Prior: 2013/3/27 • Posterior Likelihood Prior (Bayesian inference) • Log(Posterior)= Log(Likelihood Prior) 13

14. Model formulation • 2013/3/27 14

15. Learning the TCF 2013/3/27 15

16. Learning U and V in CMTF • Theorem 1. Given B and V, we can obtain the user-specific latent matrix U in a closed form. 2013/3/27 16

17. Learning U and V in CSVD • 2013/3/27 17

18. Learning U and V in CSVD (Cont.) 2013/3/27 18

19. • 2013/3/27 19

20. Algorithm of TCF 2013/3/27 20

21. Data sets • 2013/3/27 21

22. Evaluation metrics • Summary of Data sets 2013/3/27 • Evaluation metrics 22

23. Baselines and parameter settings • 2013/3/27 23

24. Performance of Moviepilot data 2013/3/27 24

25. Performance of Netfliex data 2013/3/27 25

26. Performance on Netflix at different sparsity levels • SCVD performs better than CMTF in 2013/3/27 all cases. 26

27. Conclusion • This paper investigate how to address the sparsity problem in CF via a transfer learning solution. 2013/3/27 • The TCP framework is proposed to transfer knowledge from auxiliary data to target data to alleviates the data sparsity. • Experimental results show that TCP performs significantly better than several state-of-the-art baseline algorithms. • In the future, the “pure” cold-start problem for users without any rating is needed to be addressed via transfer learning. 27

28. 2013/3/27 Thank you for listening. Q&A 28