Recommendations and Discovery at StumbleUpon

Recommendations and Discovery
at StumbleUpon
Sumanth Kolar,
Director, Engineering

@_5K

StumbleUpon’s Mission

Help users find content they did not expect to find
Be the best way to discover new
and interesting things from across
the Web.

How StumbleUpon works
1. Register 2. Tell us your interests 3. Start Stumbling and
rating web pages

We use your interests and behavior to
recommend new content for you!

Discovery is very different from search

Discovery at StumbleUpon Search
Serendipitous Intent driven
One at a time List of articles
Never repeats Always repeats
Constantly adapting Fixed results
Tailored for you Impersonal

There is a ongoing shift from search to discovery

StumbleUpon Overview
1 Users Automated
URL Index
Discovery Crawled

3

Ingestion
Pipeline Rec Engine
Yes
2
Pass
Sampling ?

What are the key challenges to
good recommendations?

Pillars of good recommendations
Understand who the user is and what he is
interested in.

Separate good content from the bad.

Explore various techniques for matching users
to content.

Learn from your recommendations.

User self reports topics of interest
Part of the sign up flow…

User’s Interest Graph

Italian Food/
User
Recipes Cooking

Cars

Vintage
Cars

Continually Enhance a User’s Interest Graph

Analyze user’s StumbleUpon history to expand on
interest preferences:
• Add/remove topics
• Follow/block particular domains


Leverage social network
data:
• Find friends & people
to follow
• Find content trending
in your social circles
• Find additional
interests


Mine internal StumbleUpon
rating and sharing data to
suggest other stumblers,
topics.

Enhanced Interest Graph

Friends
News

Italian Food/ Trending
User
Recipes Cooking

Cars
nasa.gov
Vintage
Cars 1x.com

Sampling

On average hundreds of URLs are ingested into the
StumbleUpon pipeline every minute.

• Sampling key goals:
1. Determine which URLs to sample and which to skip
completely
2. Examine sampling results to identify good URLs

• URL features used when sampling:
• Known domain performance(ratings, timespent)
• Content related features (#images, #ads, url length etc)
• User features of the discoverer (spammer vs trusted user)

Recommendations at StumbleUpon: Sampling

Classifier based on
User Feedback
Random Forest Vote Recommend (Timespent, Ratings)
Rating Timespent

Yes Good 35sec

Good 22sec
Webpage
Bad 15sec
Yes
No Yes
Good 45sec

Good 14sec
Yes
Good 28sec

Leveraging In-Network Experts

• Users who thumb-up good content and
thumb-down bad content
• For example
– Joe DiMaggio – Baseball
– Julia Child- Food/Cooking
– Da Vinci- Art and Architecture
• Ratings from Experts are more trustworthy
and earn more weight.

Non Expert Expert
P(Thumb Up | Page Quality) P(Thumb Up | Page Quality)

Page Quality
Page Quality
Recommendations at StumbleUpon: Experts

Challenge: User expectations are different

“I LOVE cars!” “Me too!”
-Anonymous Stumbler -Another Stumbler

Like-Minded Users

• Find users who like content
similar to the content you do
• Signals can be ratings, time
spent, interests, etc.
• Use the content they’ve liked

PLSI based like-minded

Vintage Cars
Action movies Astronomy
Astronomy Space Exploration
Robotics
Physics
Classic Movies

Movies
Cars Space
Neuroscience
Astronomy
Space Exploration
Science Comedy Movies

Like-Minded Users: Challenges Scaling

 Total Pairwise Similarity Calculations
= 50K users * 5 million users * 1K features
= 250 Trillion
 Probabilistic Latent Semantic Index (PLSI)
based similarity over 500 trillion calculations
 PLSI based similarity framework computes in
less than an hour

Grow User’s Interest Graph:
Implicit + Explicit

Experts Friends

Likeminded
Users News

User
Food/ Trending
Italian
Recipes Cooking

Cars nasa.gov

Vintage 1x.com
Cars

Different methods perform differently for
different users at different times
100%

75%
Trending
Follow
50% Bias domains
Experts
News
25%
Like-minded

0%
User 1 User 2 User 3 User 4 User 5

Two Main Signals from Recommendation

Rating Time Spent

Both present numerous challenges . . .

Ratings: volume decay

Users rate more during
their initial experience
# Ratings

Time

Why is this happening?

Time Spent

?
?
Images
Video Text
Images
Video
T5 sec
T3 sec T4 sec
T2 sec
T1 sec

• Ratings are sparse
• < 10% of recommendations have explicit ratings.
• Using time spent decide whether the stumble was skipped
• Timespent on videos is longer than images.
• Solution: Estimate p(Like | Timespent)
• Model based on user, content patterns

Challenges: Time spent on different devices

Stumble Bar
Median time spent per stumble

Mobile / Tablets
Installed plugin

5th percentile time spent per stumble

How do we know we are doing a
good job?

Extensive A/B Testing

AB Tests on metrics such as session
length, retention, rating behavior etc

0
2
6
8
10
14
16

4
12
Dec-08
Feb-09
Apr-09
Jun-09
Aug-09
Oct-09
+111% improvement!

Dec-09
Feb-10
Apr-10
Jun-10
Aug-10
Oct-10
Dec-10
Recent Months
Feb-11
Apr-11
Normalized Likes vs Dislikes

Jun-11
Aug-11
Oct-11
Dec-11
Feb-12
Apr-12
Measurable Improvements In Rec Quality

Jun-12
R² = 0.736

Aug-12

Many other interesting problems…

• Dupe detection
• Anti-spam
• News
• Topic classification
• Metrics, quality analysis
• Trending
• Search We are HIRING !!!
• User biases, mood
• Many more…

Recommendations and Discovery at StumbleUpon

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Andere mochten auch

Andere mochten auch (18)

Ähnlich wie Recommendations and Discovery at StumbleUpon

Ähnlich wie Recommendations and Discovery at StumbleUpon (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Recommendations and Discovery at StumbleUpon

Hinweis der Redaktion