Audio fingerprinting and metadata
     correction with Python

           Alastair Porter


         November 21, 2011
Me

     Background in Computer Science
     Master's, McGill Music Tech
     Online
         http://github.com/alastair (20/28 music; 11 in Python)
         http://twitter.com/alastairporter
Python as a go-to language

     Quick for prototyping
     Use the same code in a production release
     Very handy for API access (thin wrapper around urllib2)
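
A minimal sketch of what such a thin wrapper could look like. It uses `urllib.request`, the Python 3 successor to `urllib2`. The `JsonApi` class, its method names, and the `example.org` URL are hypothetical and only illustrate the idea; they are not taken from the talk.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen


class JsonApi:
    """Tiny wrapper: keyword arguments become a GET query, the reply is decoded as JSON."""

    def __init__(self, base_url, user_agent="example-client/0.1", opener=urlopen):
        self.base_url = base_url.rstrip("/")
        self.user_agent = user_agent
        self.opener = opener  # injectable, so the wrapper can be exercised offline

    def build_url(self, path, **params):
        # Sort the parameters so the same call always yields the same URL.
        query = urlencode(sorted(params.items()))
        url = f"{self.base_url}/{path.lstrip('/')}"
        return f"{url}?{query}" if query else url

    def get(self, path, **params):
        request = Request(self.build_url(path, **params),
                          headers={"User-Agent": self.user_agent})
        with self.opener(request) as response:
            return json.loads(response.read().decode("utf-8"))


# Hypothetical endpoint; building the URL does not touch the network.
api = JsonApi("https://example.org/api/")
print(api.build_url("/search", query="abc", limit=5))
```

Only `get` performs a request, so everything else can be tried without a network connection.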
Music and Metadata

  The problem:
      People are really bad at naming music
      Inconsistent over releases


  The solution:
      Crowdsourcing
      Get info from as many trusted sources as possible
      Make renaming take no effort
MusicBrainz
Amazon
Amazon (Coverart)
Last.fm
Last.fm (Genre tags)
MusicBrainz
albumidentify




  http://github.com/albumidentify/albumidentify
MP3, FLAC, Ogg, CDs
Identification strategy

      If there's a CD TOC, use that (MusicBrainz lookup)
      If no match, use audio fingerprinting
      If no match, do a text lookup (artist/album)
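
The fallback order above can be sketched as a list of lookups tried in turn. The three `lookup_by_*` functions are hypothetical stand-ins that return canned values; this is not how albumidentify itself is written.

```python
def identify(release_dir, strategies):
    """Try each (name, lookup) pair in order and return the first match."""
    for name, lookup in strategies:
        match = lookup(release_dir)
        if match is not None:
            return name, match
    return None, None


# Hypothetical stand-ins for the three lookups named on the slide.
def lookup_by_toc(release_dir):
    return None  # pretend there is no CD table of contents


def lookup_by_fingerprint(release_dir):
    return "release-1234"  # pretend fingerprinting found a release


def lookup_by_text(release_dir):
    return "release-from-tags"


STRATEGIES = [
    ("toc", lookup_by_toc),
    ("fingerprint", lookup_by_fingerprint),
    ("text", lookup_by_text),
]

print(identify("/music/unknown-album", STRATEGIES))
```

Keeping the order in a list makes it easy to add or reorder strategies without touching the loop.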
Fingerprinting

     Converts an audio signal to a short sequence of numbers
     Smaller to compare than an entire file
     Perceptual features rather than byte comparison (works
     with different encodings)
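
A toy illustration of the idea, not the fingerprinter used in the talk: each frame of audio is reduced to its energy, and one bit records whether the energy rose from the previous frame. Halving the volume stands in for a different encoding here; real codecs alter samples in more complex ways, and real fingerprinters use richer spectral features. The function names and the frame size are assumptions.

```python
import math


def toy_fingerprint(samples, frame_size=256):
    """Return one bit per frame boundary: 1 if energy rose, 0 otherwise."""
    energies = []
    for start in range(0, len(samples) - frame_size + 1, frame_size):
        frame = samples[start:start + frame_size]
        energies.append(sum(s * s for s in frame))
    return [1 if later > earlier else 0
            for earlier, later in zip(energies, energies[1:])]


def hamming_distance(a, b):
    """Count positions where two equal-length fingerprints differ."""
    return sum(x != y for x, y in zip(a, b))


# Synthetic signal: a 440 Hz tone whose loudness swells and fades.
rate = 8000
signal = [math.sin(2 * math.pi * 440 * n / rate)
          * (1 + math.sin(2 * math.pi * 3 * n / rate))
          for n in range(rate)]
quieter = [0.5 * s for s in signal]  # same audio, different sample values

fp_a = toy_fingerprint(signal)
fp_b = toy_fingerprint(quieter)
print(len(signal), "samples ->", len(fp_a), "bits")
print("sample lists equal:", signal == quieter)
print("fingerprint distance:", hamming_distance(fp_a, fp_b))
```

The two sample lists differ value by value, yet the fingerprints can be compared cheaply and agree, which is the property the slide describes.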
Identification strategy

      Fingerprinting gives us a set of candidate tracks
      A track could be on many albums (original release, best of,
      mix album)
      Keep a list of what tracks we have for each album
      Once we fill all the slots for an album, success!
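
The slot-filling idea can be sketched as follows. The `AlbumMatcher` class, the album ids, and the track counts are hypothetical; in practice the candidates would come from the fingerprint lookup.

```python
from collections import defaultdict


class AlbumMatcher:
    """Track which slots of each candidate album have been filled so far."""

    def __init__(self, track_counts):
        # track_counts: {album_id: number_of_tracks}, hypothetical lookup data
        self.track_counts = track_counts
        self.filled = defaultdict(dict)  # album_id -> {track_number: filename}

    def add_candidates(self, filename, candidates):
        """Record one file's candidates, a list of (album_id, track_number).

        Returns the id of an album that is now complete, else None.
        """
        completed = None
        for album_id, track_number in candidates:
            slots = self.filled[album_id]
            slots.setdefault(track_number, filename)  # first file keeps the slot
            if completed is None and len(slots) == self.track_counts[album_id]:
                completed = album_id
        return completed


matcher = AlbumMatcher({"original": 3, "best-of": 12})
result = None
for filename, candidates in [
    ("a.mp3", [("original", 1), ("best-of", 4)]),
    ("b.mp3", [("original", 2)]),
    ("c.mp3", [("original", 3), ("best-of", 9)]),
]:
    result = matcher.add_candidates(filename, candidates) or result
print(result)
```

Here the three files complete the three-track album, while the twelve-track compilation stays partially filled and is not reported.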
Metadata strategy

     Text information from MusicBrainz
     Genre from Last.fm
     Image from Amazon (or folder.jpg)
     MusicBrainz tells us where these are (don't need to search)
     Save in every file (Text is cheap)
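
One way to combine the sources is a merge in which earlier sources win and empty values are skipped. This is a sketch under assumptions: the dictionaries hold made-up values standing in for what each service returns, and `merge_metadata` is a hypothetical helper.

```python
def merge_metadata(*sources):
    """Merge dicts left to right; earlier sources win, empty values are skipped."""
    merged = {}
    for source in sources:
        for key, value in source.items():
            if value not in (None, "") and key not in merged:
                merged[key] = value
    return merged


# Hypothetical values standing in for what each service returns.
from_musicbrainz = {"artist": "Example Artist", "album": "Example Album",
                    "year": "2011", "genre": ""}
from_lastfm = {"genre": "electronic"}
from_amazon = {"cover": "cover.jpg"}

tags = merge_metadata(from_musicbrainz, from_lastfm, from_amazon)
print(tags)
```

The same merged dictionary would then be written into every file of the album.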
Writing it all out

      Custom MP3/ID3 writer
      Ogg meta tags
      FLAC meta tags
      Name files
          Artist/Artist - Year - Album/01 - Artist - Track
      ReplayGain!
      Be a good citizen: Submit fingerprints to MusicBrainz
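
The naming pattern above can be sketched as a small path builder. The character replacement policy in `safe`, the two-digit track padding, and the file extension are assumptions; the slide only gives the pattern itself.

```python
import re
from pathlib import PurePosixPath


def safe(part):
    """Replace characters that are awkward in file names (an assumed policy)."""
    return re.sub(r'[\\/:*?"<>|]', "_", part).strip()


def target_path(artist, year, album, track_number, title, extension):
    """Build Artist/Artist - Year - Album/NN - Artist - Track.ext."""
    artist, album, title = safe(artist), safe(album), safe(title)
    folder = f"{artist} - {year} - {album}"
    filename = f"{track_number:02d} - {artist} - {title}.{extension}"
    return PurePosixPath(artist) / folder / filename


print(target_path("AC/DC", 1980, "Back in Black", 6, "Back in Black", "flac"))
```

Sanitising each part before joining keeps a slash in an artist name from creating an extra directory level.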
What's next

     New version of MusicBrainz
     New fingerprinter
     More metadata
     More metadata
Thanks

  More information:
      MusicBrainz: http://musicbrainz.org
      albumidentify:
      http://github.com/albumidentify/albumidentify
      More fingerprinting: http://acoustid.org,
      http://echoprint.me
      Last.fm

Mp25: Audio Fingerprinting and metadata correction with Python
