SlideShare ist ein Scribd-Unternehmen logo
1 von 39
Downloaden Sie, um offline zu lesen
USING SEARCH ENGINES FOR
 CLASSIFICATION: DOES IT
       STILL WORK?
    Sten Govaerts, Nik Corthaut, Erik Duval
• Our   problem

• Classification   using search engines

• The   setup

• The   evaluation

• Conclusion
TUNIFY
TUNIFY
TUNIFY
HOW DOES IT WORK?

• manually   annotated metadata

•5 music experts at Aristo Music and
 different consultants

• almost   80,000 songs

• but, not   enough...
PROBLEMS

• satisfying
         the music choice of all
 customers

  • retail
         and catering differ from you
    and me!

• new   markets

• react   fast on emerging music trends

• adding     the full Belgian library catalog
GENERATE THE METADATA

• from    different sources:

  • the audio signal
  • web sources
  • the Aristo database
  • attention metadata

• using   our metadata generation framework: SamgI
GENRE...




• our   master thesis looked at different ways to generate genre...
ONE APPROACH...

• M. Schedl, T. Pohle, P. Knees, G. Widmer, “Assigning
                                                   and
 Visualizing Music Genres by Web-based Co-occurrence
 Analysis”, Proceedings of the 7th International Conference on
 Music Information Retrieval, 2006, pp. 260-265.

• G. Geleijnse, J. Korst, "Web-based Artist
                                          Categorization",
 Proceedings of the 7th International Conference on Music
 Information Retrieval, 2006, pp. 266 - 271.
CLASSIFICATION WITH
  SEARCH ENGINES
     using co-occurrence
CLASSIFICATION WITH
  SEARCH ENGINES
     using co-occurrence
CLASSIFICATION WITH
  SEARCH ENGINES
     using co-occurrence

   Artist + Genre + Schema
CLASSIFICATION WITH
  SEARCH ENGINES
     using co-occurrence

   Artist + Genre + Schema
CLASSIFICATION WITH
  SEARCH ENGINES
     using co-occurrence

   Artist + Genre + Schema
CLASSIFICATION WITH
  SEARCH ENGINES
     using co-occurrence

   Artist + Genre + Schema
Using search engines for classification: does it still work?
Rock:      Jazz:




Blues:      Pop:




Country:   Metal:
Rock:              Jazz:




           0,013            0,013
Blues:              Pop:



           0,009            0,015
Country:           Metal:



           0,009            0,005
RESULTS

• master   thesis student’s results were much worse

• what   happened?

 • did   Google search result count change?

 • has   Google Search API different results?

 • is   the student’s implementation correct?
HOW TO EVALUATE THIS?


• re-run   the original experiment

  • evaluate   on the same data set: 1995 artists and 9 genres.

• different   search engines: Google,Yahoo! and Live! Search.

• over   time: 8 times over a period of 36 days.
THE DATA SET
       Blues   Country   Electronic
       Folk    Jazz      Metal
       Rap     Reggae    RnB
THE DATA SET
                  Blues   Country   Electronic
                  Folk    Jazz      Metal
                  Rap     Reggae    RnB
     10% 9%
   3%
  2%        12%
13%          5%
             4%


      41%
THE DATA SET
       Blues   Country   Electronic
       Folk    Jazz      Metal
       Rap     Reggae    RnB
Using search engines for classification: does it still work?
MOTION CHART



• http://hmdb.cs.kuleuven.be/muzik/gapminder.html
Using search engines for classification: does it still work?
Using search engines for classification: does it still work?
MORE FINE-GRAINED...

• 18   artists

• more  search engines: Google.co.uk/.fr/.be, uk/
 fr.search.yahoo.com

• twice   a day for 53 days

• 250,000    queries!
2 Pac            Rap
  Alan Lomax         Folk
  Art Pepper          Jazz
 Cradle of Filth     Metal
 David Parsons     Electronic
Desmond Dekker      Reggae
  Downpour           Metal
      IceT            Rap
  Jerry Butler       RnB
 Joy Lynn White    Country
 Louisiana Red       Blues
   Lou Rawls         RnB
   LTJ Bukem       Electronic
   Peter Tosh       Reggae
 Pinetop Smith        Jazz
Robert Johnson       Blues
  Roy Rogers       Country
 Steeleye Span       Folk
Using search engines for classification: does it still work?
MAIN SEARCH ENGINE
      RESULTS
REGIONAL GOOGLES
Using search engines for classification: does it still work?
WHAT TO USE?

• use   Google when it’s stable else rely on Yahoo!

• when    is it stable? test with a small set

  • some    artists get classified incorrectly on bad days

  • compare     the accuracy achieved with the test set to the
   average.
CONCLUSION

• still   works after 3 years

• Google      -> Yahoo! -> Live! Search

• why     does Google fluctuate?

•a generic version of an all purpose classifier is implemented in
  metadata generation framework
FUTURE WORK

• understand the performance
 differences of regional search
 engines

• use   alternative search engines

• tweak
      the genre taxonomy
 depending on the search engine
Q & A.
DEMO METADATA
              GENERATION



• http://ariadne.cs.kuleuven.be/samgi-service/

Weitere ähnliche Inhalte

Andere mochten auch

Coclustering Base Classification For Out Of Domain Documents
Coclustering Base Classification For Out Of Domain DocumentsCoclustering Base Classification For Out Of Domain Documents
Coclustering Base Classification For Out Of Domain Documentslau
 
DODDLE-OWL: A Domain Ontology Construction Tool with OWL
DODDLE-OWL: A Domain Ontology Construction Tool with OWLDODDLE-OWL: A Domain Ontology Construction Tool with OWL
DODDLE-OWL: A Domain Ontology Construction Tool with OWLTakeshi Morita
 
Search Engine, SEO and Google Algorithms
Search Engine, SEO and Google AlgorithmsSearch Engine, SEO and Google Algorithms
Search Engine, SEO and Google Algorithmsjhenrey1992
 
تعرف على برامج معالجة النصوص
تعرف على برامج معالجة النصوصتعرف على برامج معالجة النصوص
تعرف على برامج معالجة النصوصsuhaedu
 
From ISO to Implementation A framework for ECM Implementation
From ISO to Implementation  A framework for ECM ImplementationFrom ISO to Implementation  A framework for ECM Implementation
From ISO to Implementation A framework for ECM Implementationgbroadbent67
 
Grey-level Co-occurence features for salt texture classification
Grey-level Co-occurence features for salt texture classificationGrey-level Co-occurence features for salt texture classification
Grey-level Co-occurence features for salt texture classificationIgor Orlov
 
Machine Learning and Quran - The Meccan and Medinan Verses
Machine Learning and Quran - The Meccan and Medinan VersesMachine Learning and Quran - The Meccan and Medinan Verses
Machine Learning and Quran - The Meccan and Medinan VersesAbdul Baquee Muhammad Sharaf
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Dev Sahu
 

Andere mochten auch (9)

Coclustering Base Classification For Out Of Domain Documents
Coclustering Base Classification For Out Of Domain DocumentsCoclustering Base Classification For Out Of Domain Documents
Coclustering Base Classification For Out Of Domain Documents
 
DODDLE-OWL: A Domain Ontology Construction Tool with OWL
DODDLE-OWL: A Domain Ontology Construction Tool with OWLDODDLE-OWL: A Domain Ontology Construction Tool with OWL
DODDLE-OWL: A Domain Ontology Construction Tool with OWL
 
Search Engine, SEO and Google Algorithms
Search Engine, SEO and Google AlgorithmsSearch Engine, SEO and Google Algorithms
Search Engine, SEO and Google Algorithms
 
تعرف على برامج معالجة النصوص
تعرف على برامج معالجة النصوصتعرف على برامج معالجة النصوص
تعرف على برامج معالجة النصوص
 
Text Similarity
Text SimilarityText Similarity
Text Similarity
 
From ISO to Implementation A framework for ECM Implementation
From ISO to Implementation  A framework for ECM ImplementationFrom ISO to Implementation  A framework for ECM Implementation
From ISO to Implementation A framework for ECM Implementation
 
Grey-level Co-occurence features for salt texture classification
Grey-level Co-occurence features for salt texture classificationGrey-level Co-occurence features for salt texture classification
Grey-level Co-occurence features for salt texture classification
 
Machine Learning and Quran - The Meccan and Medinan Verses
Machine Learning and Quran - The Meccan and Medinan VersesMachine Learning and Quran - The Meccan and Medinan Verses
Machine Learning and Quran - The Meccan and Medinan Verses
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier
 

Ähnlich wie Using search engines for classification: does it still work?

Using mashup technology to improve findability
Using mashup technology to improve findabilityUsing mashup technology to improve findability
Using mashup technology to improve findabilitySten Govaerts
 
Drupal case study: ABC Dig Music
Drupal case study: ABC Dig MusicDrupal case study: ABC Dig Music
Drupal case study: ABC Dig MusicDavid Peterson
 
How Technology is Changing The Business of Culture: Configurable Culture and ...
How Technology is ChangingThe Business of Culture:Configurable Culture and ...How Technology is ChangingThe Business of Culture:Configurable Culture and ...
How Technology is Changing The Business of Culture: Configurable Culture and ...Aram Sinnreich
 
Random Forests R vs Python by Linda Uruchurtu
Random Forests R vs Python by Linda UruchurtuRandom Forests R vs Python by Linda Uruchurtu
Random Forests R vs Python by Linda UruchurtuPyData
 
Understanding Music Playlists
Understanding Music PlaylistsUnderstanding Music Playlists
Understanding Music PlaylistsKeunwoo Choi
 
Debriefing iron sheep
Debriefing iron sheepDebriefing iron sheep
Debriefing iron sheepmatthewzook
 
Info Design Process Book
Info Design Process BookInfo Design Process Book
Info Design Process BookShelby Albrecht
 
Music Hackday Boston - The Last.fm API
Music Hackday Boston - The Last.fm APIMusic Hackday Boston - The Last.fm API
Music Hackday Boston - The Last.fm APIdavidsingleton
 
Spotify Machine Learning Solution for Music Discovery
Spotify Machine Learning Solution for Music DiscoverySpotify Machine Learning Solution for Music Discovery
Spotify Machine Learning Solution for Music DiscoveryKarthik Murugesan
 
Machine Learning and Big Data for Music Discovery at Spotify
Machine Learning and Big Data for Music Discovery at SpotifyMachine Learning and Big Data for Music Discovery at Spotify
Machine Learning and Big Data for Music Discovery at SpotifyChing-Wei Chen
 
Results of Web-scale discovery: Data, discussions and decisions
Results of Web-scale discovery: Data, discussions and decisionsResults of Web-scale discovery: Data, discussions and decisions
Results of Web-scale discovery: Data, discussions and decisionsNASIG
 
World IA Day 2013 - Is Search IA?
World IA Day 2013 - Is Search IA?World IA Day 2013 - Is Search IA?
World IA Day 2013 - Is Search IA?Margaret Hanley
 
[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova musicNAVER D2
 
Metadata and Search Analysis: PBS.org
Metadata and Search Analysis: PBS.orgMetadata and Search Analysis: PBS.org
Metadata and Search Analysis: PBS.orgNikki Kerber
 
The future of search
The future of searchThe future of search
The future of searchKevin Stein
 
The Dissemination of Music Through Social Media
The Dissemination of Music Through Social MediaThe Dissemination of Music Through Social Media
The Dissemination of Music Through Social MediaJosh Crafter
 
Deezer - Big data as a streaming service
Deezer - Big data as a streaming serviceDeezer - Big data as a streaming service
Deezer - Big data as a streaming serviceJulie Knibbe
 
Music recognition
Music recognition Music recognition
Music recognition aaronloklok
 
Beyond The Bench Workshops
Beyond The Bench WorkshopsBeyond The Bench Workshops
Beyond The Bench WorkshopsBeyond The Bench
 

Ähnlich wie Using search engines for classification: does it still work? (20)

Using mashup technology to improve findability
Using mashup technology to improve findabilityUsing mashup technology to improve findability
Using mashup technology to improve findability
 
69 kuta
69 kuta69 kuta
69 kuta
 
Drupal case study: ABC Dig Music
Drupal case study: ABC Dig MusicDrupal case study: ABC Dig Music
Drupal case study: ABC Dig Music
 
How Technology is Changing The Business of Culture: Configurable Culture and ...
How Technology is ChangingThe Business of Culture:Configurable Culture and ...How Technology is ChangingThe Business of Culture:Configurable Culture and ...
How Technology is Changing The Business of Culture: Configurable Culture and ...
 
Random Forests R vs Python by Linda Uruchurtu
Random Forests R vs Python by Linda UruchurtuRandom Forests R vs Python by Linda Uruchurtu
Random Forests R vs Python by Linda Uruchurtu
 
Understanding Music Playlists
Understanding Music PlaylistsUnderstanding Music Playlists
Understanding Music Playlists
 
Debriefing iron sheep
Debriefing iron sheepDebriefing iron sheep
Debriefing iron sheep
 
Info Design Process Book
Info Design Process BookInfo Design Process Book
Info Design Process Book
 
Music Hackday Boston - The Last.fm API
Music Hackday Boston - The Last.fm APIMusic Hackday Boston - The Last.fm API
Music Hackday Boston - The Last.fm API
 
Spotify Machine Learning Solution for Music Discovery
Spotify Machine Learning Solution for Music DiscoverySpotify Machine Learning Solution for Music Discovery
Spotify Machine Learning Solution for Music Discovery
 
Machine Learning and Big Data for Music Discovery at Spotify
Machine Learning and Big Data for Music Discovery at SpotifyMachine Learning and Big Data for Music Discovery at Spotify
Machine Learning and Big Data for Music Discovery at Spotify
 
Results of Web-scale discovery: Data, discussions and decisions
Results of Web-scale discovery: Data, discussions and decisionsResults of Web-scale discovery: Data, discussions and decisions
Results of Web-scale discovery: Data, discussions and decisions
 
World IA Day 2013 - Is Search IA?
World IA Day 2013 - Is Search IA?World IA Day 2013 - Is Search IA?
World IA Day 2013 - Is Search IA?
 
[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music
 
Metadata and Search Analysis: PBS.org
Metadata and Search Analysis: PBS.orgMetadata and Search Analysis: PBS.org
Metadata and Search Analysis: PBS.org
 
The future of search
The future of searchThe future of search
The future of search
 
The Dissemination of Music Through Social Media
The Dissemination of Music Through Social MediaThe Dissemination of Music Through Social Media
The Dissemination of Music Through Social Media
 
Deezer - Big data as a streaming service
Deezer - Big data as a streaming serviceDeezer - Big data as a streaming service
Deezer - Big data as a streaming service
 
Music recognition
Music recognition Music recognition
Music recognition
 
Beyond The Bench Workshops
Beyond The Bench WorkshopsBeyond The Bench Workshops
Beyond The Bench Workshops
 

Mehr von Sten Govaerts

Learning Analytics at KULeuven by the team of Erik Duval
Learning Analytics at KULeuven by the team of Erik DuvalLearning Analytics at KULeuven by the team of Erik Duval
Learning Analytics at KULeuven by the team of Erik DuvalSten Govaerts
 
The Smart Device Specification for Remote Labs
The Smart Device Specification for Remote LabsThe Smart Device Specification for Remote Labs
The Smart Device Specification for Remote LabsSten Govaerts
 
Speakup - in NordiCHI2014 Personal or Social workshop.
Speakup - in NordiCHI2014 Personal or Social workshop.Speakup - in NordiCHI2014 Personal or Social workshop.
Speakup - in NordiCHI2014 Personal or Social workshop.Sten Govaerts
 
The Go-Lab project at the REACT Research Days
The Go-Lab project at the REACT Research DaysThe Go-Lab project at the REACT Research Days
The Go-Lab project at the REACT Research DaysSten Govaerts
 
Learning Analytics Dashboards
Learning Analytics DashboardsLearning Analytics Dashboards
Learning Analytics DashboardsSten Govaerts
 
Towards an online lab portal for inquiry-based STEM learning at school.
Towards an online lab portal for inquiry-based STEM learning at school.Towards an online lab portal for inquiry-based STEM learning at school.
Towards an online lab portal for inquiry-based STEM learning at school.Sten Govaerts
 
SpeakUp – A Mobile App Facilitating Audience Interaction
SpeakUp – A Mobile App Facilitating Audience InteractionSpeakUp – A Mobile App Facilitating Audience Interaction
SpeakUp – A Mobile App Facilitating Audience InteractionSten Govaerts
 
The Student Activity Meter
The Student Activity MeterThe Student Activity Meter
The Student Activity MeterSten Govaerts
 
Learning Dashboards & Learnscapes
Learning Dashboards & LearnscapesLearning Dashboards & Learnscapes
Learning Dashboards & LearnscapesSten Govaerts
 
An introduction to Git.
An introduction to Git.An introduction to Git.
An introduction to Git.Sten Govaerts
 
Quantified Self in the Multimedia course.
Quantified Self in the Multimedia course.Quantified Self in the Multimedia course.
Quantified Self in the Multimedia course.Sten Govaerts
 
Learning Analytics & Learnscapes.
Learning Analytics & Learnscapes.Learning Analytics & Learnscapes.
Learning Analytics & Learnscapes.Sten Govaerts
 
From Findability to Awareness: Metadata in Music and Technology Enhanced Lear...
From Findability to Awareness: Metadata in Music and Technology Enhanced Lear...From Findability to Awareness: Metadata in Music and Technology Enhanced Lear...
From Findability to Awareness: Metadata in Music and Technology Enhanced Lear...Sten Govaerts
 
The Student Activity Meter for Awareness and Self-reflection
The Student Activity Meter for Awareness and Self-reflectionThe Student Activity Meter for Awareness and Self-reflection
The Student Activity Meter for Awareness and Self-reflectionSten Govaerts
 
Learning Dashboards and Learnscapes
Learning Dashboards and LearnscapesLearning Dashboards and Learnscapes
Learning Dashboards and LearnscapesSten Govaerts
 
The Student Activity Meter for Awareness and Self-reflection
The Student Activity Meter for Awareness and Self-reflectionThe Student Activity Meter for Awareness and Self-reflection
The Student Activity Meter for Awareness and Self-reflectionSten Govaerts
 
Evaluating the Student Activity Meter: Two Case Studies.
Evaluating the Student Activity Meter: Two Case Studies.Evaluating the Student Activity Meter: Two Case Studies.
Evaluating the Student Activity Meter: Two Case Studies.Sten Govaerts
 
Towards Responsive Open Learning Environments: the ROLE Interoperability Fram...
Towards Responsive Open Learning Environments: the ROLE Interoperability Fram...Towards Responsive Open Learning Environments: the ROLE Interoperability Fram...
Towards Responsive Open Learning Environments: the ROLE Interoperability Fram...Sten Govaerts
 
Winter School defense simulation: Visualizing Activities for Self-reflection ...
Winter School defense simulation: Visualizing Activities for Self-reflection ...Winter School defense simulation: Visualizing Activities for Self-reflection ...
Winter School defense simulation: Visualizing Activities for Self-reflection ...Sten Govaerts
 

Mehr von Sten Govaerts (20)

Learning Analytics at KULeuven by the team of Erik Duval
Learning Analytics at KULeuven by the team of Erik DuvalLearning Analytics at KULeuven by the team of Erik Duval
Learning Analytics at KULeuven by the team of Erik Duval
 
The Smart Device Specification for Remote Labs
The Smart Device Specification for Remote LabsThe Smart Device Specification for Remote Labs
The Smart Device Specification for Remote Labs
 
Speakup - in NordiCHI2014 Personal or Social workshop.
Speakup - in NordiCHI2014 Personal or Social workshop.Speakup - in NordiCHI2014 Personal or Social workshop.
Speakup - in NordiCHI2014 Personal or Social workshop.
 
The Go-Lab project at the REACT Research Days
The Go-Lab project at the REACT Research DaysThe Go-Lab project at the REACT Research Days
The Go-Lab project at the REACT Research Days
 
Learning Analytics Dashboards
Learning Analytics DashboardsLearning Analytics Dashboards
Learning Analytics Dashboards
 
Towards an online lab portal for inquiry-based STEM learning at school.
Towards an online lab portal for inquiry-based STEM learning at school.Towards an online lab portal for inquiry-based STEM learning at school.
Towards an online lab portal for inquiry-based STEM learning at school.
 
SpeakUp – A Mobile App Facilitating Audience Interaction
SpeakUp – A Mobile App Facilitating Audience InteractionSpeakUp – A Mobile App Facilitating Audience Interaction
SpeakUp – A Mobile App Facilitating Audience Interaction
 
The Go-Lab portal
The Go-Lab portalThe Go-Lab portal
The Go-Lab portal
 
The Student Activity Meter
The Student Activity MeterThe Student Activity Meter
The Student Activity Meter
 
Learning Dashboards & Learnscapes
Learning Dashboards & LearnscapesLearning Dashboards & Learnscapes
Learning Dashboards & Learnscapes
 
An introduction to Git.
An introduction to Git.An introduction to Git.
An introduction to Git.
 
Quantified Self in the Multimedia course.
Quantified Self in the Multimedia course.Quantified Self in the Multimedia course.
Quantified Self in the Multimedia course.
 
Learning Analytics & Learnscapes.
Learning Analytics & Learnscapes.Learning Analytics & Learnscapes.
Learning Analytics & Learnscapes.
 
From Findability to Awareness: Metadata in Music and Technology Enhanced Lear...
From Findability to Awareness: Metadata in Music and Technology Enhanced Lear...From Findability to Awareness: Metadata in Music and Technology Enhanced Lear...
From Findability to Awareness: Metadata in Music and Technology Enhanced Lear...
 
The Student Activity Meter for Awareness and Self-reflection
The Student Activity Meter for Awareness and Self-reflectionThe Student Activity Meter for Awareness and Self-reflection
The Student Activity Meter for Awareness and Self-reflection
 
Learning Dashboards and Learnscapes
Learning Dashboards and LearnscapesLearning Dashboards and Learnscapes
Learning Dashboards and Learnscapes
 
The Student Activity Meter for Awareness and Self-reflection
The Student Activity Meter for Awareness and Self-reflectionThe Student Activity Meter for Awareness and Self-reflection
The Student Activity Meter for Awareness and Self-reflection
 
Evaluating the Student Activity Meter: Two Case Studies.
Evaluating the Student Activity Meter: Two Case Studies.Evaluating the Student Activity Meter: Two Case Studies.
Evaluating the Student Activity Meter: Two Case Studies.
 
Towards Responsive Open Learning Environments: the ROLE Interoperability Fram...
Towards Responsive Open Learning Environments: the ROLE Interoperability Fram...Towards Responsive Open Learning Environments: the ROLE Interoperability Fram...
Towards Responsive Open Learning Environments: the ROLE Interoperability Fram...
 
Winter School defense simulation: Visualizing Activities for Self-reflection ...
Winter School defense simulation: Visualizing Activities for Self-reflection ...Winter School defense simulation: Visualizing Activities for Self-reflection ...
Winter School defense simulation: Visualizing Activities for Self-reflection ...
 

Kürzlich hochgeladen

2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx
2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx
2024.03.23 What do successful readers do - Sandy Millin for PARK.pptxSandy Millin
 
CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...
CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...
CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...Nguyen Thanh Tu Collection
 
Education and training program in the hospital APR.pptx
Education and training program in the hospital APR.pptxEducation and training program in the hospital APR.pptx
Education and training program in the hospital APR.pptxraviapr7
 
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdfP4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdfYu Kanazawa / Osaka University
 
How to Add a many2many Relational Field in Odoo 17
How to Add a many2many Relational Field in Odoo 17How to Add a many2many Relational Field in Odoo 17
How to Add a many2many Relational Field in Odoo 17Celine George
 
Practical Research 1: Lesson 8 Writing the Thesis Statement.pptx
Practical Research 1: Lesson 8 Writing the Thesis Statement.pptxPractical Research 1: Lesson 8 Writing the Thesis Statement.pptx
Practical Research 1: Lesson 8 Writing the Thesis Statement.pptxKatherine Villaluna
 
UKCGE Parental Leave Discussion March 2024
UKCGE Parental Leave Discussion March 2024UKCGE Parental Leave Discussion March 2024
UKCGE Parental Leave Discussion March 2024UKCGE
 
The Stolen Bacillus by Herbert George Wells
The Stolen Bacillus by Herbert George WellsThe Stolen Bacillus by Herbert George Wells
The Stolen Bacillus by Herbert George WellsEugene Lysak
 
5 charts on South Africa as a source country for international student recrui...
5 charts on South Africa as a source country for international student recrui...5 charts on South Africa as a source country for international student recrui...
5 charts on South Africa as a source country for international student recrui...CaraSkikne1
 
Patient Counselling. Definition of patient counseling; steps involved in pati...
Patient Counselling. Definition of patient counseling; steps involved in pati...Patient Counselling. Definition of patient counseling; steps involved in pati...
Patient Counselling. Definition of patient counseling; steps involved in pati...raviapr7
 
Drug Information Services- DIC and Sources.
Drug Information Services- DIC and Sources.Drug Information Services- DIC and Sources.
Drug Information Services- DIC and Sources.raviapr7
 
DUST OF SNOW_BY ROBERT FROST_EDITED BY_ TANMOY MISHRA
DUST OF SNOW_BY ROBERT FROST_EDITED BY_ TANMOY MISHRADUST OF SNOW_BY ROBERT FROST_EDITED BY_ TANMOY MISHRA
DUST OF SNOW_BY ROBERT FROST_EDITED BY_ TANMOY MISHRATanmoy Mishra
 
Human-AI Co-Creation of Worked Examples for Programming Classes
Human-AI Co-Creation of Worked Examples for Programming ClassesHuman-AI Co-Creation of Worked Examples for Programming Classes
Human-AI Co-Creation of Worked Examples for Programming ClassesMohammad Hassany
 
Patterns of Written Texts Across Disciplines.pptx
Patterns of Written Texts Across Disciplines.pptxPatterns of Written Texts Across Disciplines.pptx
Patterns of Written Texts Across Disciplines.pptxMYDA ANGELICA SUAN
 
How to Add Existing Field in One2Many Tree View in Odoo 17
How to Add Existing Field in One2Many Tree View in Odoo 17How to Add Existing Field in One2Many Tree View in Odoo 17
How to Add Existing Field in One2Many Tree View in Odoo 17Celine George
 
Practical Research 1 Lesson 9 Scope and delimitation.pptx
Practical Research 1 Lesson 9 Scope and delimitation.pptxPractical Research 1 Lesson 9 Scope and delimitation.pptx
Practical Research 1 Lesson 9 Scope and delimitation.pptxKatherine Villaluna
 
The Singapore Teaching Practice document
The Singapore Teaching Practice documentThe Singapore Teaching Practice document
The Singapore Teaching Practice documentXsasf Sfdfasd
 

Kürzlich hochgeladen (20)

2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx
2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx
2024.03.23 What do successful readers do - Sandy Millin for PARK.pptx
 
CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...
CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...
CHUYÊN ĐỀ DẠY THÊM TIẾNG ANH LỚP 11 - GLOBAL SUCCESS - NĂM HỌC 2023-2024 - HK...
 
Education and training program in the hospital APR.pptx
Education and training program in the hospital APR.pptxEducation and training program in the hospital APR.pptx
Education and training program in the hospital APR.pptx
 
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdfP4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
 
Prelims of Kant get Marx 2.0: a general politics quiz
Prelims of Kant get Marx 2.0: a general politics quizPrelims of Kant get Marx 2.0: a general politics quiz
Prelims of Kant get Marx 2.0: a general politics quiz
 
How to Add a many2many Relational Field in Odoo 17
How to Add a many2many Relational Field in Odoo 17How to Add a many2many Relational Field in Odoo 17
How to Add a many2many Relational Field in Odoo 17
 
Practical Research 1: Lesson 8 Writing the Thesis Statement.pptx
Practical Research 1: Lesson 8 Writing the Thesis Statement.pptxPractical Research 1: Lesson 8 Writing the Thesis Statement.pptx
Practical Research 1: Lesson 8 Writing the Thesis Statement.pptx
 
UKCGE Parental Leave Discussion March 2024
UKCGE Parental Leave Discussion March 2024UKCGE Parental Leave Discussion March 2024
UKCGE Parental Leave Discussion March 2024
 
The Stolen Bacillus by Herbert George Wells
The Stolen Bacillus by Herbert George WellsThe Stolen Bacillus by Herbert George Wells
The Stolen Bacillus by Herbert George Wells
 
5 charts on South Africa as a source country for international student recrui...
5 charts on South Africa as a source country for international student recrui...5 charts on South Africa as a source country for international student recrui...
5 charts on South Africa as a source country for international student recrui...
 
Patient Counselling. Definition of patient counseling; steps involved in pati...
Patient Counselling. Definition of patient counseling; steps involved in pati...Patient Counselling. Definition of patient counseling; steps involved in pati...
Patient Counselling. Definition of patient counseling; steps involved in pati...
 
Drug Information Services- DIC and Sources.
Drug Information Services- DIC and Sources.Drug Information Services- DIC and Sources.
Drug Information Services- DIC and Sources.
 
DUST OF SNOW_BY ROBERT FROST_EDITED BY_ TANMOY MISHRA
DUST OF SNOW_BY ROBERT FROST_EDITED BY_ TANMOY MISHRADUST OF SNOW_BY ROBERT FROST_EDITED BY_ TANMOY MISHRA
DUST OF SNOW_BY ROBERT FROST_EDITED BY_ TANMOY MISHRA
 
Finals of Kant get Marx 2.0 : a general politics quiz
Finals of Kant get Marx 2.0 : a general politics quizFinals of Kant get Marx 2.0 : a general politics quiz
Finals of Kant get Marx 2.0 : a general politics quiz
 
Human-AI Co-Creation of Worked Examples for Programming Classes
Human-AI Co-Creation of Worked Examples for Programming ClassesHuman-AI Co-Creation of Worked Examples for Programming Classes
Human-AI Co-Creation of Worked Examples for Programming Classes
 
Patterns of Written Texts Across Disciplines.pptx
Patterns of Written Texts Across Disciplines.pptxPatterns of Written Texts Across Disciplines.pptx
Patterns of Written Texts Across Disciplines.pptx
 
How to Add Existing Field in One2Many Tree View in Odoo 17
How to Add Existing Field in One2Many Tree View in Odoo 17How to Add Existing Field in One2Many Tree View in Odoo 17
How to Add Existing Field in One2Many Tree View in Odoo 17
 
Personal Resilience in Project Management 2 - TV Edit 1a.pdf
Personal Resilience in Project Management 2 - TV Edit 1a.pdfPersonal Resilience in Project Management 2 - TV Edit 1a.pdf
Personal Resilience in Project Management 2 - TV Edit 1a.pdf
 
Practical Research 1 Lesson 9 Scope and delimitation.pptx
Practical Research 1 Lesson 9 Scope and delimitation.pptxPractical Research 1 Lesson 9 Scope and delimitation.pptx
Practical Research 1 Lesson 9 Scope and delimitation.pptx
 
The Singapore Teaching Practice document
The Singapore Teaching Practice documentThe Singapore Teaching Practice document
The Singapore Teaching Practice document
 

Using search engines for classification: does it still work?

  • 1. USING SEARCH ENGINES FOR CLASSIFICATION: DOES IT STILL WORK? Sten Govaerts, Nik Corthaut, Erik Duval
  • 2. • Our problem • Classification using search engines • The setup • The evaluation • Conclusion
  • 6. HOW DOES IT WORK? • manually annotated metadata •5 music experts at Aristo Music and different consultants • almost 80,000 songs • but, not enough...
  • 7. PROBLEMS • satisfying the music choice of all customers • retail and catering differ from you and me! • new markets • react fast on emerging music trends • adding the full Belgian library catalog
  • 8. GENERATE THE METADATA • from different sources: • the audio signal • web sources • the Aristo database • attention metadata • using our metadata generation framework: SamgI
  • 9. GENRE... • our master thesis looked at different ways to generate genre...
  • 10. ONE APPROACH... • M. Schedl, T. Pohle, P. Knees, G. Widmer, “Assigning and Visualizing Music Genres by Web-based Co-occurrence Analysis”, Proceedings of the 7th International Conference on Music Information Retrieval, 2006, pp. 260-265. • G. Geleijnse, J. Korst, "Web-based Artist Categorization", Proceedings of the 7th International Conference on Music Information Retrieval, 2006, pp. 266 - 271.
  • 11. CLASSIFICATION WITH SEARCH ENGINES using co-occurrence
  • 12. CLASSIFICATION WITH SEARCH ENGINES using co-occurrence
  • 13. CLASSIFICATION WITH SEARCH ENGINES using co-occurrence Artist + Genre + Schema
  • 14. CLASSIFICATION WITH SEARCH ENGINES using co-occurrence Artist + Genre + Schema
  • 15. CLASSIFICATION WITH SEARCH ENGINES using co-occurrence Artist + Genre + Schema
  • 16. CLASSIFICATION WITH SEARCH ENGINES using co-occurrence Artist + Genre + Schema
  • 18. Rock: Jazz: Blues: Pop: Country: Metal:
  • 19. Rock: Jazz: 0,013 0,013 Blues: Pop: 0,009 0,015 Country: Metal: 0,009 0,005
  • 20. RESULTS • master thesis student’s results were much worse • what happened? • did Google search result count change? • has Google Search API different results? • is the student’s implementation correct?
  • 21. HOW TO EVALUATE THIS? • re-run the original experiment • evaluate on the same data set: 1995 artists and 9 genres. • different search engines: Google,Yahoo! and Live! Search. • over time: 8 times over a period of 36 days.
  • 22. THE DATA SET Blues Country Electronic Folk Jazz Metal Rap Reggae RnB
  • 23. THE DATA SET Blues Country Electronic Folk Jazz Metal Rap Reggae RnB 10% 9% 3% 2% 12% 13% 5% 4% 41%
  • 24. THE DATA SET Blues Country Electronic Folk Jazz Metal Rap Reggae RnB
  • 29. MORE FINE-GRAINED... • 18 artists • more search engines: Google.co.uk/.fr/.be, uk/ fr.search.yahoo.com • twice a day for 53 days • 250,000 queries!
  • 30. 2 Pac Rap Alan Lomax Folk Art Pepper Jazz Cradle of Filth Metal David Parsons Electronic Desmond Dekker Reggae Downpour Metal IceT Rap Jerry Butler RnB Joy Lynn White Country Louisiana Red Blues Lou Rawls RnB LTJ Bukem Electronic Peter Tosh Reggae Pinetop Smith Jazz Robert Johnson Blues Roy Rogers Country Steeleye Span Folk
  • 35. WHAT TO USE? • use Google when it’s stable else rely on Yahoo! • when is it stable? test with a small set • some artists get classified incorrectly on bad days • compare the accuracy achieved with the test set to the average.
  • 36. CONCLUSION • still works after 3 years • Google -> Yahoo! -> Live! Search • why does Google fluctuate? •a generic version of an all purpose classifier is implemented in metadata generation framework
  • 37. FUTURE WORK • understand the performance differences of regional search engines • use alternative search engines • tweak the genre taxonomy depending on the search engine
  • 39. DEMO METADATA GENERATION • http://ariadne.cs.kuleuven.be/samgi-service/

Hinweis der Redaktion

  1. NOT the Southern African Media and Gender Institute.
  2. 1. MG is better than MS, a possible explanation is that style is a broader term than genre for music 2. Google outperforms Yahoo! & Live! 3. results fluctuate over time 4. technical issues with Yahoo! only a fraction of the artists are retrieved
  3. 1. the accuracy is not the exactly the same as for the large data set. but the overall trends are similar. 2. MG schema is still more accurate 3. Yahoo! MG is a very stable 4. Live! is still the worst and Google the best!
  4. 1. Yahoo! is very stable 2. Live is the worst, Google the best! 3. no noticable differences between Live and Bing. Bing was launched on 3 June. 4. On 29 July, collaboration between Bing and Yahoo
  5. 1. .com performs best! -> co.uk -> fr -> be 2. fr and be worse: maybe because genres are in english 3. one could also check if local artists are classified better
  6. correct: light incorrect: dark 1. yahoo most stable 2. google changes most often. 3. changing from correct to incorrect occurs most, but no clear pattern 4. Live seems to struggle with the same artists, one time they do it correctly, the next time wrong.