Transfer learning in heterogeneous collaborative filtering domains
1. 2013/3/27
Transfer learning in
heterogeneous collaborative
filtering domains
Authors/ Weike Pan and Qiang Yang
Affiliation/ Dept. of CSE, Hong Kong University of Science and Technology
Source/ Journal of Artificial Intelligence (2013)
Presenter/ Allen Wu
1
3. Introduction
• Data sparsity is a major challenge in collaborative filtering (CF).
• Overfitting can easily happen for prediction.
2013/3/27
• Some auxiliary data of the form “like” or “dislike” may be more
easily obtained.
• It’s more convenient for users to express preference.
• How do we take advantage of auxiliary knowledge to alleviate the
sparsity problem?
• Most existing transfer learning methods in CF consider auxiliary data from
several perspectives.
• User-side transfer, item-side transfer, knowledge-transfer. 3
8. Rating-matrix generative model (ICML’09)
• RMGM is derived and extended from FMM generative model,
which can be formulated as
2013/3/27
• The difference:
• It learns (U, V) and (U3, V3) alternatively.
• A soft indicator matrix is used. E.g., U [0, 1]n d.
8
12. Model formulation
• Assume a user u’s rating on an item i in the target data, rui, is
generated from
2013/3/27
• user-specific latent feature vector Uu 1 d, where u=1,…,n.
• item-specific latent feature vector Vi 1 d, where i=1,…,m.
• some data-dependent effect denoted as B d d.
12
26. Performance on Netflix at different
sparsity levels
• SCVD performs
better than CMTF in
2013/3/27
all cases.
26
27. Conclusion
• This paper investigate how to address the sparsity problem in
CF via a transfer learning solution.
2013/3/27
• The TCP framework is proposed to transfer knowledge from
auxiliary data to target data to alleviates the data sparsity.
• Experimental results show that TCP performs significantly
better than several state-of-the-art baseline algorithms.
• In the future, the “pure” cold-start problem for users without
any rating is needed to be addressed via transfer learning.
27