CS158: Final Project

Evan Casey

Problem
● Million Song Dataset Challenge (Kaggle)
○ 110k users, 1M+ unique songs

● Music Recommendation
○ Recommend songs for each user based on a larger training set of user listening histories

● Winner - 0.17910 (17.9%)
● Benchmark - 0.02079 (2.1%)

Data
● Million Song Dataset
● Two subsets of 1000 users (random and most active)
● Echonest API to get metadata

Echonest Data
Metadata we obtained:
● Tempo
● Danceability
● Energy
● Speech
● Acousticness

Unavailable metadata:
● Genre
● Artist popularity
● Song popularity
● Location
● Year released

Previous Approaches
Dynamic K-Means:
● Kim et al. (6th Int’l Conference on ML)
● Li et al. (University of Michigan)

Item and user-based collaborative filtering:
● Niu et al. (Stanford)
● Lu et al. (Stanford)

K-Means for our Problem
Step 1: K-Means from all songs listened to by all users
Step 2: K-Means from user listening history
Step 3: Predict based on location of user centroids
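
The three K-Means steps can be sketched roughly as follows. This is a minimal illustration, not the project's code: it assumes each song is a small numeric feature vector (for example danceability, energy, acousticness), uses plain Lloyd's algorithm, and ranks unheard songs by distance to the user's centroids. The slides do not spell out how the global clusters from Step 1 feed into the prediction, so the sketch only computes them. All names (`catalog`, `recommend`, the song ids and values) are hypothetical.

```python
import math
import random

def dist(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def mean(points):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's algorithm; returns k centroids."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [mean(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return centroids

def recommend(user_history, catalog, k=1, top_n=2):
    """Rank unheard songs by distance to the closest user centroid."""
    # Step 2: cluster only the songs in this user's listening history.
    user_centroids = kmeans([catalog[s] for s in user_history], k)
    # Step 3: songs closest to any user centroid come first.
    unheard = [s for s in catalog if s not in user_history]
    return sorted(
        unheard,
        key=lambda s: min(dist(catalog[s], c) for c in user_centroids),
    )[:top_n]

# Hypothetical catalog: song id -> [danceability, energy, acousticness]
catalog = {
    "a": [0.9, 0.8, 0.1],
    "b": [0.8, 0.9, 0.2],
    "c": [0.85, 0.85, 0.15],
    "d": [0.1, 0.2, 0.9],
    "e": [0.2, 0.1, 0.8],
}

# Step 1: cluster every song in the catalog.
song_centroids = kmeans(list(catalog.values()), k=2)

print(recommend(["a", "b"], catalog))
```

Passing `k=2` to `recommend` would correspond to the "Multiple Centroids (2)" variant named in the results, under the same assumptions.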

Mean Average Precision
[Figure: predicted song list compared with the actual song list]
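
For reference, mean average precision can be computed along these lines. This is a sketch of the usual definition (precision accumulated at each rank where a correct song appears, averaged per user, then averaged over users). The cutoff `k` and its default are assumptions, since the slides do not state the cutoff used, and the user and song ids are made up.

```python
def average_precision(predicted, actual, k=10):
    """Average precision of a ranked list, truncated at k."""
    actual_set = set(actual)
    if not actual_set:
        return 0.0
    hits = 0
    score = 0.0
    seen = set()
    for rank, song in enumerate(predicted[:k], start=1):
        # Count each correct song once, at the rank where it first appears.
        if song in actual_set and song not in seen:
            hits += 1
            score += hits / rank
        seen.add(song)
    return score / min(len(actual_set), k)

def mean_average_precision(predictions, truths, k=10):
    """Mean of per-user average precision over all users in `truths`."""
    users = list(truths)
    total = sum(
        average_precision(predictions.get(u, []), truths[u], k) for u in users
    )
    return total / len(users)

# Hypothetical example with two users.
predictions = {"u1": ["a", "x", "b"], "u2": ["z"]}
truths = {"u1": ["a", "b"], "u2": ["q"]}
print(mean_average_precision(predictions, truths))
```

In the example, user `u1` gets hits at ranks 1 and 3, so the average precision is (1/1 + 2/3) / 2, while `u2` gets none.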

What are the Results?
All Metadata: 0.00200326282427
Weighted Centroids: 0.00375567272976
Multiple Centroids (2): 0.00364834470835
Modified Metadata: 0.00994279218087
All Improvements: 0.01008282844
More Data: 0.00266295400221

Collaborative Filtering
[Table: per-user values as extracted; column positions were not preserved]
Shawn: 1 4 1 3 4 8 9
Billy: 8 8
Paul: 2 3 4 2 4

User-based Collaborative Filtering
Step 1: Obtain user history profile
Step 2: Calculate similarity between users to find their nearest neighbors
Step 3: Compute the weighted average of the neighbors' ratings and find the items with the highest average
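
A minimal sketch of user-based collaborative filtering, under stated assumptions: user profiles are dictionaries of song id to rating or play count, similarity is cosine similarity (the slides do not name the similarity measure), and already-heard songs are excluded from the output. The user names come from the slide above, but the song ids and values are invented for illustration.

```python
import math

def cosine(u, v):
    """Cosine similarity between two sparse song -> rating profiles."""
    shared = set(u) & set(v)
    dot = sum(u[s] * v[s] for s in shared)
    norm = (math.sqrt(sum(x * x for x in u.values()))
            * math.sqrt(sum(x * x for x in v.values())))
    return dot / norm if norm else 0.0

def recommend(target, profiles, n_neighbors=2, top_n=3):
    """Recommend unheard songs using the target's most similar users."""
    me = profiles[target]
    # Find the nearest neighbors by similarity.
    neighbors = sorted(
        ((cosine(me, p), name) for name, p in profiles.items()
         if name != target),
        reverse=True,
    )[:n_neighbors]
    # Similarity-weighted average of the neighbors' ratings.
    totals, weights = {}, {}
    for sim, name in neighbors:
        if sim <= 0:
            continue
        for song, rating in profiles[name].items():
            if song in me:
                continue
            totals[song] = totals.get(song, 0.0) + sim * rating
            weights[song] = weights.get(song, 0.0) + sim
    scores = {s: totals[s] / weights[s] for s in totals}
    # Highest average first; ties broken by song id.
    return sorted(scores, key=lambda s: (-scores[s], s))[:top_n]

# Hypothetical profiles: user -> {song id: rating or play count}
profiles = {
    "Shawn": {"s1": 1, "s2": 4, "s3": 3},
    "Billy": {"s2": 8, "s4": 8},
    "Paul": {"s1": 2, "s3": 4, "s5": 2},
}
print(recommend("Shawn", profiles))
```

With these values both Billy and Paul count as neighbors of Shawn, and the unheard songs are ranked by their weighted average rating.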

Implementation Details
Used Amazon EMR with MRJob to parallelize the algorithm across multiple machines
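
One way the similarity computation might be split into map and reduce phases is sketched below. It is a single-process stand-in written in plain Python and does not import mrjob. In mrjob the mapper and reducer would be methods on an `MRJob` subclass, and the framework would perform the grouping step across machines. The record layout and all names here are assumptions, not the project's actual job.

```python
from collections import defaultdict
from itertools import combinations

def mapper(song, listeners):
    """For one song, emit a partial dot product for every pair of listeners."""
    for (ua, ca), (ub, cb) in combinations(sorted(listeners.items()), 2):
        yield (ua, ub), ca * cb

def reducer(pair, values):
    """Sum the partial products to get the pair's full dot product."""
    yield pair, sum(values)

def run_job(records):
    """Local stand-in for the shuffle a MapReduce framework performs."""
    grouped = defaultdict(list)
    for song, listeners in records:
        for key, value in mapper(song, listeners):
            grouped[key].append(value)
    results = {}
    for key, values in grouped.items():
        for out_key, out_value in reducer(key, values):
            results[out_key] = out_value
    return results

# Hypothetical input: one record per song, with each listener's play count.
records = [
    ("s1", {"Shawn": 1, "Paul": 2}),
    ("s2", {"Shawn": 4, "Billy": 8}),
    ("s3", {"Shawn": 3, "Paul": 4}),
]
print(run_job(records))
```

Keying the map output by user pair means each reducer only needs the values for one pair, which is what lets the work spread across machines.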

What are the Results?
User Collaborative Filtering (1k Users): 0.008223545412
User Collaborative Filtering (10k Users): 0.012654713312
User Collaborative Filtering (110k Users): 0.112794360446

Compiled Results
Benchmark (1k Users): 0.0104030562401
K-means: 0.01008282844
Benchmark (110k Users): 0.02079
User Collaborative Filtering: 0.112794360446
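
To put the compiled numbers side by side, the ratios can be worked out directly. The values below are copied from the slides; the variable names are only for illustration.

```python
scores = {
    "Benchmark (1k Users)": 0.0104030562401,
    "K-means": 0.01008282844,
    "Benchmark (110k Users)": 0.02079,
    "User Collaborative Filtering": 0.112794360446,
}
winner = 0.17910  # winning score from the Problem slide

# K-means relative to the benchmark on the 1k-user subset.
kmeans_vs_benchmark = scores["K-means"] / scores["Benchmark (1k Users)"]

# Collaborative filtering relative to the full 110k-user benchmark.
cf_vs_benchmark = (scores["User Collaborative Filtering"]
                   / scores["Benchmark (110k Users)"])

# Collaborative filtering relative to the winning entry.
cf_vs_winner = scores["User Collaborative Filtering"] / winner

print(f"K-means / benchmark (1k): {kmeans_vs_benchmark:.2f}")
print(f"CF / benchmark (110k):    {cf_vs_benchmark:.2f}")
print(f"CF / winner:              {cf_vs_winner:.2f}")
```

K-means lands slightly below the 1k-user benchmark, while user collaborative filtering scores roughly 5.4 times the 110k-user benchmark and about 63% of the winning score.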

Improvements?
● Ensemble techniques
● More metadata from Echonest (genre, artist popularity, etc.)
● MapReduce for k-means

References
http://cs229.stanford.edu/proj2012/NiuYinZhang-MillionSongDatasetChallenge.pdf
http://cs229.stanford.edu/proj2012/LuXiongLiuMusicRecommenderSystemUtilizingUsers%E2%80%99ListeningHistoryandSocialNetworkInformation.pdf
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4457263&tag=1
http://www-personal.umich.edu/~yjli/content/projectreport.pdf

GitHub Repo
https://github.com/erinkidd01/CS158FinalProject.git

1. Music Recommendation Evan Casey and Erin Coughlan

13. Number of Clusters?

16. User-based Collaborative Filtering Step 1: Obtain user history profile

23. Questions?
