4. Recommendation System
Answers the question:
What do I want next?
Very consumer-driven.
Must provide good results, or a user may
not trust the system in the future.
5. Collaborative Filtering
Base a user's recommendations on:
The user's own past history.
The history of like-minded users.
View the data as a product × user matrix.
Find a “neighborhood” of similar users
for that user.
Return the top-N recommendations from that
neighborhood, as sketched below.
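A minimal C++ sketch of that pipeline (all names are illustrative; the similarity measure is left abstract until the next slide):

#include <algorithm>
#include <vector>

using Ratings = std::vector<double>;  // one slot per product, 0 = unrated

// Any similarity measure (Pearson, cosine, ...); defined in the next sketch.
double similarity(const Ratings& a, const Ratings& b);

// Top-N products for `user`: score each product the user has not rated
// by the similarity-weighted ratings of the k most similar users.
std::vector<int> topN(const std::vector<Ratings>& users, int user, int k, int n)
{
    // 1. Rank all other users by similarity to form the neighborhood.
    std::vector<std::pair<double, int>> neighbors;
    for (int u = 0; u < (int)users.size(); ++u)
        if (u != user)
            neighbors.push_back({similarity(users[user], users[u]), u});
    std::sort(neighbors.rbegin(), neighbors.rend());
    if ((int)neighbors.size() > k) neighbors.resize(k);

    // 2. Score the user's unrated products from the neighborhood's ratings.
    std::vector<std::pair<double, int>> scores;
    for (int p = 0; p < (int)users[user].size(); ++p) {
        if (users[user][p] != 0) continue;  // already rated
        double s = 0;
        for (const auto& nb : neighbors) s += nb.first * users[nb.second][p];
        scores.push_back({s, p});
    }

    // 3. Return the N best-scoring products.
    std::sort(scores.rbegin(), scores.rend());
    if ((int)scores.size() > n) scores.resize(n);
    std::vector<int> result;
    for (const auto& sc : scores) result.push_back(sc.second);
    return result;
}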
6. Early Approaches
Goldberg et al. (1992), Using
collaborative filtering to weave an
information tapestry.
Konstan et al. (1997), Applying
collaborative filtering to Usenet news.
Use Pearson correlation or cosine similarity
as the measure of similarity to form
neighborhoods, as in the sketch below.
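A minimal Pearson implementation matching the `similarity` declaration above (a sketch; it correlates only the products both users have rated):

#include <cmath>
#include <vector>

// Pearson correlation over co-rated products (0 = unrated).
double similarity(const std::vector<double>& a, const std::vector<double>& b)
{
    double n = 0, sumA = 0, sumB = 0, sumAA = 0, sumBB = 0, sumAB = 0;
    for (size_t p = 0; p < a.size(); ++p) {
        if (a[p] == 0 || b[p] == 0) continue;   // skip non-overlapping products
        ++n;
        sumA += a[p];         sumB += b[p];
        sumAA += a[p] * a[p]; sumBB += b[p] * b[p];
        sumAB += a[p] * b[p];
    }
    if (n == 0) return 0;                 // no overlap: no correlation (sparsity!)
    double cov  = sumAB - sumA * sumB / n;
    double varA = sumAA - sumA * sumA / n;
    double varB = sumBB - sumB * sumB / n;
    if (varA <= 0 || varB <= 0) return 0; // a user rated everything the same
    return cov / std::sqrt(varA * varB);
}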
10. Early CF Challenges
Sparsity - No correlation between
users can be found, so coverage is
reduced.
Scalability - Nearest-neighbor
algorithms' computation time grows with
the number of products and users.
Synonymy - Similar products with
different names are never correlated.
17. Dimensionality Reduction
Latent Semantic Indexing (LSI)
An algorithm from the IR community (late
'80s to early '90s).
Addresses the problems of synonymy,
polysemy, sparsity, and scalability for
large datasets.
Reduces the dimensionality of a dataset
and captures the latent relationships.
Easily maps to CF!
18. Framing LSI for CF
Products X Users matrix instead of Terms X
Documents.
Netflix Dataset
480,189 users, 17,770 movies, only ~100 milion ratings.
17,770 X 480,189 matrix that is 99% sparse!
About 8.5 billion potential ratings.
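Checking those figures:
17,770 movies × 480,189 users = 8,532,958,530 ≈ 8.5 billion potential ratings
100,000,000 / 8,532,958,530 ≈ 1.2% of entries filled, i.e. ~99% empty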
19. SVD - The math behind LSI
Singular Value Decomposition
Any M × N matrix A of rank r can be
decomposed as:
A = U Σ V^T
U is an M × M orthogonal matrix.
V is an N × N orthogonal matrix.
Σ is an M × N diagonal matrix whose first r diagonal
entries are the nonzero singular values of A:
σ1 ≥ σ2 ≥ ... ≥ σr > σr+1 = ... = σn = 0
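A quick numerical check of the decomposition, sketched with the Eigen C++ library (not part of the original slides):

#include <iostream>
#include <Eigen/Dense>

int main()
{
    // A random 5 x 3 matrix standing in for a tiny ratings matrix.
    Eigen::MatrixXd A = Eigen::MatrixXd::Random(5, 3);

    // Thin SVD: A = U * Sigma * V^T.
    Eigen::JacobiSVD<Eigen::MatrixXd> svd(A, Eigen::ComputeThinU | Eigen::ComputeThinV);
    Eigen::VectorXd sigma = svd.singularValues();  // sorted, largest first

    // Reconstruction error should be ~0 (up to floating-point noise).
    Eigen::MatrixXd R = svd.matrixU() * sigma.asDiagonal() * svd.matrixV().transpose();
    std::cout << "singular values: " << sigma.transpose() << "\n"
              << "reconstruction error: " << (A - R).norm() << "\n";
}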
20. Related to eigenvalue
decomposition (PCA)
U is the orthonormal eigenspace of
AA^T. It spans the “column space”; its
columns are the left singular vectors.
V is the orthonormal eigenspace of
A^TA. It spans the “row space”; its
columns are the right singular vectors.
The singular values are the square roots of
the eigenvalues of A^TA (equivalently, AA^T).
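That relationship is easy to verify numerically (again a sketch with Eigen):

#include <iostream>
#include <Eigen/Dense>

int main()
{
    Eigen::MatrixXd A = Eigen::MatrixXd::Random(5, 3);

    // Singular values of A, squared...
    Eigen::JacobiSVD<Eigen::MatrixXd> svd(A);
    Eigen::VectorXd sigma = svd.singularValues();

    // ...match the eigenvalues of A^T A (which come back in ascending order).
    Eigen::SelfAdjointEigenSolver<Eigen::MatrixXd> eig(A.transpose() * A);
    std::cout << "sigma^2:            " << sigma.cwiseProduct(sigma).transpose() << "\n"
              << "eigenvalues of AtA: " << eig.eigenvalues().reverse().transpose() << "\n";
}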
21. Reducing Dimensionality
A_k = U_k Σ_k V_k^T
Keep only the k largest singular values and
the corresponding singular vectors.
A_k is the closest rank-k approximation to A:
it minimizes the Frobenius norm ||A − A_k||_F
over all rank-k matrices.
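A sketch of the truncation with Eigen, showing the Frobenius error shrink as k grows:

#include <iostream>
#include <Eigen/Dense>

// Best rank-k approximation: keep the k largest singular values/vectors.
Eigen::MatrixXd truncate(const Eigen::MatrixXd& A, int k)
{
    Eigen::JacobiSVD<Eigen::MatrixXd> svd(A, Eigen::ComputeThinU | Eigen::ComputeThinV);
    return svd.matrixU().leftCols(k)
         * svd.singularValues().head(k).asDiagonal()
         * svd.matrixV().leftCols(k).transpose();
}

int main()
{
    Eigen::MatrixXd A = Eigen::MatrixXd::Random(6, 4);
    for (int k = 1; k <= 4; ++k)  // error reaches ~0 at full rank
        std::cout << "k=" << k << "  ||A - A_k||_F = "
                  << (A - truncate(A, k)).norm() << "\n";
}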
22. Making Recommendations
Cosine similarity - a common way to find the neighborhood:
cos(i, j) = (i · j) / (||i||_2 ∗ ||j||_2)
Base recommendations on that neighborhood
and its users.
Can also predict a customer's rating of a product with
a simple dot product if the singular values are folded
into the singular vectors:
CP_prod = C_avg + U_k S_k^(1/2)(c) · S_k^(1/2) V_k^T(p)
where (c) selects the customer's row and (p) the
product's column.
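A sketch of that prediction with Eigen, assuming the matrix rows are customers and the columns are products (as in Sarwar et al.); the names are hypothetical, and a real system would first fill the sparse matrix with defaults:

#include <Eigen/Dense>

struct Model {
    Eigen::MatrixXd custFactors;  // U_k * S_k^(1/2): one row per customer
    Eigen::MatrixXd prodFactors;  // S_k^(1/2) * V_k^T: one column per product
    Eigen::VectorXd custAvg;      // C_avg: each customer's average rating
};

Model build(const Eigen::MatrixXd& A, int k)
{
    Eigen::JacobiSVD<Eigen::MatrixXd> svd(A, Eigen::ComputeThinU | Eigen::ComputeThinV);
    Eigen::VectorXd sqrtS = svd.singularValues().head(k).cwiseSqrt();
    return { svd.matrixU().leftCols(k) * sqrtS.asDiagonal(),
             sqrtS.asDiagonal() * svd.matrixV().leftCols(k).transpose(),
             A.rowwise().mean() };
}

// CP_prod = C_avg + U_k S_k^(1/2)(c) . S_k^(1/2) V_k^T(p)
double predict(const Model& m, int c, int p)
{
    return m.custAvg(c) + m.custFactors.row(c).dot(m.prodFactors.col(p));
}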
23. Challenges with SVD
Scalability - Once again, compute
time grows with the number of users
and products. O(m^3)
Offline stage.
Online stage.
Even doing the SVD computation offline
is not possible for large datasets.
Other methods are needed.
26. GHA for SVD
Gorrell (2006),GHA for Incremental SVD in
NLP
Based off of Sanger’s (1989) GHA for eigen
decomposition.
a
∆ci b
= ci · b(x − ∑ a a
(a · c j )c j )
j<i
b
∆ci a
= ci · a(b − ∑ b b
(b · c j )c j )
j<i
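A plain-C++ sketch of one such update for the first few singular-vector pairs (the names and the explicit learning rate are illustrative):

#include <vector>

using Vec = std::vector<double>;

double dot(const Vec& x, const Vec& y)
{
    double s = 0;
    for (size_t i = 0; i < x.size(); ++i) s += x[i] * y[i];
    return s;
}

// One observation: a row vector `a` and column vector `b` of the matrix.
// ca[i], cb[i] hold the current estimates of the i-th singular vector pair.
void ghaUpdate(std::vector<Vec>& ca, std::vector<Vec>& cb,
               const Vec& a, const Vec& b, double lrate)
{
    for (size_t i = 0; i < ca.size(); ++i) {
        // Residuals: remove the components already explained by c_j, j < i.
        Vec ra = a, rb = b;
        for (size_t j = 0; j < i; ++j) {
            double pa = dot(a, ca[j]), pb = dot(b, cb[j]);
            for (size_t t = 0; t < ra.size(); ++t) ra[t] -= pa * ca[j][t];
            for (size_t t = 0; t < rb.size(); ++t) rb[t] -= pb * cb[j][t];
        }
        // delta c_i^a = (c_i^b . b)(a - sum), and symmetrically for c_i^b.
        double ga = dot(cb[i], b), gb = dot(ca[i], a);
        for (size_t t = 0; t < ra.size(); ++t) ca[i][t] += lrate * ga * ra[t];
        for (size_t t = 0; t < rb.size(); ++t) cb[i][t] += lrate * gb * rb[t];
    }
}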
27. GHA extended by Funk
void train(int user, int movie, real rating)
{
    // One gradient step on a single known rating, for the feature
    // currently being trained; lrate is the learning rate.
    real err = lrate * (rating - predictRating(movie, user));
    real uv = userValue[user];  // cache so both updates see the old value
    userValue[user]   += err * movieValue[movie];
    movieValue[movie] += err * uv;
}
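A hypothetical driver continuing the slide's code: Funk trains one feature at a time, sweeping every known rating repeatedly before moving on (initFeature and foldIn are assumed helpers, not from the slides):

struct KnownRating { int user, movie; real value; };

void trainAll(const std::vector<KnownRating>& ratings,
              int numFeatures, int epochs)
{
    for (int f = 0; f < numFeatures; ++f) {   // one feature at a time
        initFeature(f);                       // assumed: reset userValue/movieValue
        for (int e = 0; e < epochs; ++e)      // sweep all known ratings
            for (const KnownRating& r : ratings)
                train(r.user, r.movie, r.value);
        foldIn(f);                            // assumed: bake feature f into predictRating
    }
}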
29. Summary
SVD provides an elegant and automatic
recommendation system that has the
potential to scale.
There are many different algorithms that
calculate, or at least approximate, the SVD;
they can be used in the offline stage for
websites that need CF.
Every dataset is different and requires
experimentation to get the best results.