SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
Music
Recommendation
Evan Casey and Erin Coughlan
Problem
● Million Song Dataset Challenge (Kaggle)
○ 110k Users, 1m+ unique songs

● Music Recommendation
○ Recommend songs for each user based on a larger
training set of user listening histories

● Winner - 0.17910 (17.9%)
● Benchmark - 0.02079 (2.1%)
Data
● Million Song Dataset
● Two subsets of 1000
users (random and
most active)
● Echonest API to get
metadata
Echonest Data
Metadata we
obtained:
●
●
●
●
●

Tempo
Danceability
Energy
Speech
Acousticness

Unavailable
metadata:
●
●
●
●
●

Genre
Artist popularity
Song popularity
Location
Year released
Previous Approaches
Dynamic K-Means:
● Kim et. al (6th Int’l Conference on ML)
● Li et. al (University of Michigan)

Item and user-based collaborative-filtering:
● Niu et. al (Stanford)
● Lu et. al (Stanford)
K-Means
K-Means for our Problem
Step 1:
K-Means from all
songs listened to by all
users
K-Means for our Problem
Step 2:
K-Means from user
listening history
K-Means for our Problem
Step 3:
Predict based on
location of user
centroids
Mean Average Precision
Predicted:

Actual:
Mean Average Precision
What are the Results?
All Metadata

0.00200326282427

Weighted Centroids

0.00375567272976

Multiple Centroids (2)

0.00364834470835

Modified Metadata

0.00994279218087

All Improvements

0.01008282844

More Data

0.00266295400221
Number of Clusters?
Collaborative Filtering
Shawn

1

4

1

3

4

8

9

Billy

8

8

Paul

2

3

4

2

4
Collaborative Filtering
User-based Collaborative Filtering
Step 1:
Obtain user history
profile
User-based Collaborative Filtering
Step 2:
Calculate similarity
between users to find
their nearest neighbors
User-based Collaborative Filtering
Step 3:
Compute weighted
average of the ratings
by the neighbors and
find the items with the
highest average
Implementation Details
Used Amazon EMR
with MRJob to
parallelize the
algorithm across
multiple machines

MRJob
What are the Results?
User Collaborative Filtering (1k Users)

0.008223545412

User Collaborative Filtering (10k Users)

0.012654713312

User Collaborative Filtering (110k Users)

0.112794360446
Compiled Results
Benchmark (1k Users)

0.0104030562401

K-means

0.01008282844

Benchmark (110k Users)

0.02079

User Collaborative Filtering

0.112794360446
Improvements?
● Ensemble techniques
● More metadata from echonest (genre, artist
popularity, etc.)
● MapReduce for k-means
Questions?
References
http://cs229.stanford.edu/proj2012/NiuYinZhang-MillionSongDatasetChallenge.
pdf
http://cs229.stanford.edu/proj2012/LuXiongLiuMusicRecommenderSystemUtilizingUsers%E2%80%
99ListeningHistoryandSocialNetworkInformation.pdf
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4457263&tag=1
http://www-personal.umich.edu/~yjli/content/projectreport.pdf
Github Repo
https://github.
com/erinkidd01/CS158FinalProject.git

Weitere ähnliche Inhalte

Ähnlich wie CS158: Final Project

Data science-2013-heekim
Data science-2013-heekimData science-2013-heekim
Data science-2013-heekim
Haklae Kim
 

Ähnlich wie CS158: Final Project (7)

The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
 
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
 
Big data and machine learning @ Spotify
Big data and machine learning @ SpotifyBig data and machine learning @ Spotify
Big data and machine learning @ Spotify
 
MediaEval 2018: AcousticBrainz Genre Task: Content-based Music Genre Recognit...
MediaEval 2018: AcousticBrainz Genre Task: Content-based Music Genre Recognit...MediaEval 2018: AcousticBrainz Genre Task: Content-based Music Genre Recognit...
MediaEval 2018: AcousticBrainz Genre Task: Content-based Music Genre Recognit...
 
[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music
 
A Unified Music Recommender System Using Listening Habits and Semantics of Tags
A Unified Music Recommender System Using Listening Habits and Semantics of TagsA Unified Music Recommender System Using Listening Habits and Semantics of Tags
A Unified Music Recommender System Using Listening Habits and Semantics of Tags
 
Data science-2013-heekim
Data science-2013-heekimData science-2013-heekim
Data science-2013-heekim
 

Kürzlich hochgeladen

Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
fonyou31
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
SoniaTolstoy
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Kürzlich hochgeladen (20)

Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 

CS158: Final Project