SlideShare ist ein Scribd-Unternehmen logo
1 von 12
Exploding information
•Recent studies show
that most of the stored
data is in the form of
multimedia.
•Large volume of
multimedia data makes
it difficult to handle it
manually
•Need to have an             1 hr of TV broadcast across the world is 100 Petabyte.
automatic method to
                            Source: http://www.sims.berkeley.edu/research/projects/how-much-
organize and use it                                info/summary.html#tv

appropriately.
Audio indexing
                                                    Audio classification - An
    Reason of choosing audio data
●
                                                    important step in building an
    for study
                                                    audio indexing system
         Easier to process
     –

                                                             An audio indexing system
         Contains significant information
     –

    Indexing – method of
●

    organizing data for further
    search and retrieval.
    Example – book indexing

    Audio Indexing – indexing
●

    non-text data using audio
    part of it
                                            Source: J. Makhoul et. al. “Speech and language technologies for audio
                                            indexing and retrieval”, in Proc. of the IEEE, 88(8), pp. 1338-1353, 2000.
Levels of information in audio signal
    Subsegmental information
●


        Related to excitation source characteristics
    –

    Segmental information
●


        Related to system / physiological characteristics
    –

    Suprasegmental information
●


        Related to behavioural characteristics of audio
    –
Missing component in existing
        approaches and it's importance
    Features derived based on spectral analysis
●


        Carry significant properties of audio data at segmental level
    –

        Miss information present at subsegmental, suprasegmental level
    –

    Perceptually significant information in linear prediction
●

    (LP) residual of signal
        Complimentary in nature to the spectral information
    –

        Suprasegmental information not being used in current systems
    –
EXPLORING
SUPRASEGMENTAL FEATURES
USING
LP RESIDUAL
FOR
AUDIO CLIP CLASSIFICATION


                                      B.Yegnanarayana
          Anvita Bajpai
                                         yegna@iiit.ac.in
         anvita@mailcity.com
                                       International Institute of
      Applied Research Group Satyam
                                        Information Technology
           Computer Services Ltd.,
                                              Hyderabad
                 Bangalore
Audio clip classification
    Closed set problem
●


    To classify a given audio clip in one of the following
●

    predefined categories
         Advertisement, Cartoon, Cricket, Football, News
     –

    Issues in audio clip classification
●

         Feature extraction
     –
              Effective representation of data to capture all significant properties of audio for
          ●

              the task
              Robust under various conditions
          ●


         Classification
     –
              Formulation of a distance measure and rule/models
          ●

                    Training a models for the task
                –
                    Testing – actual classification task
                –
                    Combining evidences from different systems
                –
Presence of audio-specific




                                                     Residual
                                          Original
information in LP residual

                             Aa_res.wav




        Aa1.wav




                              Aa1.wav
uprasegmental information in Hilbert
nvelope of LP residual of audio signal
Suprasegmental information in LP
   residual for audio clip classification




Autocorrelation samples of Hilbert envelope of LP residual for 5 audio classes
Statistics of autocorrelation sequence




Correction – here we have statistics of autocorrelation sequence peaks of HE (not LP residual)
Statistics of autocorrelation sequence
Summary & Conclusions
     Need to organize multimedia data because of its
●

    large volume and need in real-life applications


     Shown presence of audio-specific
●

    suprasegmental information in LP residual


     Need to explore methods to use the
●

    suprasegmental information as an additional
    evidence for the audio clip classification task

Weitere ähnliche Inhalte

Andere mochten auch (7)

Large Sub Surface Filter Design
Large Sub Surface Filter DesignLarge Sub Surface Filter Design
Large Sub Surface Filter Design
 
List of Schools South and East of Bangalore
List of Schools South and East of BangaloreList of Schools South and East of Bangalore
List of Schools South and East of Bangalore
 
Psychometric questionnaire
Psychometric questionnairePsychometric questionnaire
Psychometric questionnaire
 
Chemistry CV
Chemistry CVChemistry CV
Chemistry CV
 
Resume(Email) copy
Resume(Email) copyResume(Email) copy
Resume(Email) copy
 
NHS of Greater Cleveland.pptx
NHS of Greater Cleveland.pptxNHS of Greater Cleveland.pptx
NHS of Greater Cleveland.pptx
 
A life of inspiration - Mahatma Gandhi 1869-1948
A life of inspiration - Mahatma Gandhi 1869-1948A life of inspiration - Mahatma Gandhi 1869-1948
A life of inspiration - Mahatma Gandhi 1869-1948
 

Ähnlich wie Anvita Wisp 2007 Presentation

Anvita Ncvpripg 2008 Presentation
Anvita Ncvpripg 2008 PresentationAnvita Ncvpripg 2008 Presentation
Anvita Ncvpripg 2008 Presentation
guest6e7a1b1
 
Anvita Eusipco 2004
Anvita Eusipco 2004Anvita Eusipco 2004
Anvita Eusipco 2004
guest6e7a1b1
 
Tim Malthus_Towards standards for the exchange of field spectral datasets
Tim Malthus_Towards standards for the exchange of field spectral datasetsTim Malthus_Towards standards for the exchange of field spectral datasets
Tim Malthus_Towards standards for the exchange of field spectral datasets
TERN Australia
 
Anvita Audio Classification Presentation
Anvita Audio Classification PresentationAnvita Audio Classification Presentation
Anvita Audio Classification Presentation
guest6e7a1b1
 
The bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking ServersThe bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking Servers
Herbert Van de Sompel
 
Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic DatabaseTowards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Hilmar Lapp
 
MS Word file resumes16869r.doc.doc
MS Word file resumes16869r.doc.docMS Word file resumes16869r.doc.doc
MS Word file resumes16869r.doc.doc
butest
 
840 Santoso Perkembangan Tik Bab 2
840 Santoso Perkembangan Tik Bab 2840 Santoso Perkembangan Tik Bab 2
840 Santoso Perkembangan Tik Bab 2
guestdad76b5
 
840 Santoso Perkembangan Tik Bab 2
840 Santoso Perkembangan Tik Bab 2840 Santoso Perkembangan Tik Bab 2
840 Santoso Perkembangan Tik Bab 2
guestdad76b5
 

Ähnlich wie Anvita Wisp 2007 Presentation (20)

Anvita Ncvpripg 2008 Presentation
Anvita Ncvpripg 2008 PresentationAnvita Ncvpripg 2008 Presentation
Anvita Ncvpripg 2008 Presentation
 
Anvita Eusipco 2004
Anvita Eusipco 2004Anvita Eusipco 2004
Anvita Eusipco 2004
 
Anvita Eusipco 2004
Anvita Eusipco 2004Anvita Eusipco 2004
Anvita Eusipco 2004
 
Tim Malthus_Towards standards for the exchange of field spectral datasets
Tim Malthus_Towards standards for the exchange of field spectral datasetsTim Malthus_Towards standards for the exchange of field spectral datasets
Tim Malthus_Towards standards for the exchange of field spectral datasets
 
Deep Learning - Speaker Verification, Sound Event Detection
Deep Learning - Speaker Verification, Sound Event DetectionDeep Learning - Speaker Verification, Sound Event Detection
Deep Learning - Speaker Verification, Sound Event Detection
 
Anvita Audio Classification Presentation
Anvita Audio Classification PresentationAnvita Audio Classification Presentation
Anvita Audio Classification Presentation
 
The bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking ServersThe bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking Servers
 
Shaman Project Hemmje
Shaman Project  HemmjeShaman Project  Hemmje
Shaman Project Hemmje
 
Workshop on sparse image and signal processing
Workshop on sparse image and signal processingWorkshop on sparse image and signal processing
Workshop on sparse image and signal processing
 
Deep Learning for Automatic Speaker Recognition
Deep Learning for Automatic Speaker RecognitionDeep Learning for Automatic Speaker Recognition
Deep Learning for Automatic Speaker Recognition
 
2018 IEEE Big Data Cup Challenge - FEMH ​Voice Data Challenge
2018 IEEE Big Data Cup Challenge - FEMH ​Voice Data Challenge2018 IEEE Big Data Cup Challenge - FEMH ​Voice Data Challenge
2018 IEEE Big Data Cup Challenge - FEMH ​Voice Data Challenge
 
Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic DatabaseTowards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database
 
BITS: Basics of sequence databases
BITS: Basics of sequence databasesBITS: Basics of sequence databases
BITS: Basics of sequence databases
 
Aplications for machine learning in IoT
Aplications for machine learning in IoTAplications for machine learning in IoT
Aplications for machine learning in IoT
 
Improving Software Maintenance using Unsupervised Machine Learning techniques
Improving Software Maintenance using Unsupervised Machine Learning techniquesImproving Software Maintenance using Unsupervised Machine Learning techniques
Improving Software Maintenance using Unsupervised Machine Learning techniques
 
MS Word file resumes16869r.doc.doc
MS Word file resumes16869r.doc.docMS Word file resumes16869r.doc.doc
MS Word file resumes16869r.doc.doc
 
No specimen (software) left behind
No specimen (software) left behindNo specimen (software) left behind
No specimen (software) left behind
 
EPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data TalkEPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data Talk
 
840 Santoso Perkembangan Tik Bab 2
840 Santoso Perkembangan Tik Bab 2840 Santoso Perkembangan Tik Bab 2
840 Santoso Perkembangan Tik Bab 2
 
840 Santoso Perkembangan Tik Bab 2
840 Santoso Perkembangan Tik Bab 2840 Santoso Perkembangan Tik Bab 2
840 Santoso Perkembangan Tik Bab 2
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

Anvita Wisp 2007 Presentation

  • 1. Exploding information •Recent studies show that most of the stored data is in the form of multimedia. •Large volume of multimedia data makes it difficult to handle it manually •Need to have an 1 hr of TV broadcast across the world is 100 Petabyte. automatic method to Source: http://www.sims.berkeley.edu/research/projects/how-much- organize and use it info/summary.html#tv appropriately.
  • 2. Audio indexing Audio classification - An Reason of choosing audio data ● important step in building an for study audio indexing system Easier to process – An audio indexing system Contains significant information – Indexing – method of ● organizing data for further search and retrieval. Example – book indexing Audio Indexing – indexing ● non-text data using audio part of it Source: J. Makhoul et. al. “Speech and language technologies for audio indexing and retrieval”, in Proc. of the IEEE, 88(8), pp. 1338-1353, 2000.
  • 3. Levels of information in audio signal Subsegmental information ● Related to excitation source characteristics – Segmental information ● Related to system / physiological characteristics – Suprasegmental information ● Related to behavioural characteristics of audio –
  • 4. Missing component in existing approaches and it's importance Features derived based on spectral analysis ● Carry significant properties of audio data at segmental level – Miss information present at subsegmental, suprasegmental level – Perceptually significant information in linear prediction ● (LP) residual of signal Complimentary in nature to the spectral information – Suprasegmental information not being used in current systems –
  • 5. EXPLORING SUPRASEGMENTAL FEATURES USING LP RESIDUAL FOR AUDIO CLIP CLASSIFICATION B.Yegnanarayana Anvita Bajpai yegna@iiit.ac.in anvita@mailcity.com International Institute of Applied Research Group Satyam Information Technology Computer Services Ltd., Hyderabad Bangalore
  • 6. Audio clip classification Closed set problem ● To classify a given audio clip in one of the following ● predefined categories Advertisement, Cartoon, Cricket, Football, News – Issues in audio clip classification ● Feature extraction – Effective representation of data to capture all significant properties of audio for ● the task Robust under various conditions ● Classification – Formulation of a distance measure and rule/models ● Training a models for the task – Testing – actual classification task – Combining evidences from different systems –
  • 7. Presence of audio-specific Residual Original information in LP residual Aa_res.wav Aa1.wav Aa1.wav
  • 8. uprasegmental information in Hilbert nvelope of LP residual of audio signal
  • 9. Suprasegmental information in LP residual for audio clip classification Autocorrelation samples of Hilbert envelope of LP residual for 5 audio classes
  • 10. Statistics of autocorrelation sequence Correction – here we have statistics of autocorrelation sequence peaks of HE (not LP residual)
  • 12. Summary & Conclusions Need to organize multimedia data because of its ● large volume and need in real-life applications Shown presence of audio-specific ● suprasegmental information in LP residual Need to explore methods to use the ● suprasegmental information as an additional evidence for the audio clip classification task