SlideShare ist ein Scribd-Unternehmen logo
1 von 2
Downloaden Sie, um offline zu lesen
Speech Compression                                    Uncompressed audio data rates
• Recommended Reading: J. Harrington and S.                     • Voice: 8000samples/sec, 8bits/sample,
  Cassidy, “Techniques in Speech Acoustics”,                      mono
  Kluwer, 1999
                                                                                 = 64000bits/sec (64kbps)
• Contents
                                                                • CD: 44100samples/sec, 16bits/sample,
  – Uncompressed audio data rates
                                                                  stereo
  – ADPCM
  – SB-ADPCM
                                                                         =1411200bits/sec (~1.5Mbps)
  – LPC




  ADPCM (Adaptive Differential PCM)                                                    ADPCM
• Uses the statistical properties of human speech (=> not           Measured                                   Transmitted
  compatible with fax/modem signals)                                value                                      value
• Makes a prediction about the size of the next sample, based                                  Adaptive
  on previous info                                                                             quantiser
                                                                             -
• Transmitter then sends only the difference between real
  value and predicted value
                                                                                               Predictor
• Receiver uses the same prediction algorithm, together with
  the differences to reconstruct the speech data
• Enables the data rate to be reduced to 32kbps
• Used on international telephone links
• Specified in G.721, G.722, G.723, G.726, G.727




SB-APDCM (Sub-band ADPCM)                                                           SB-ADPCM
• Given 64kbps: ADPCM could produce                                                   Upper sub-band
                                                                                      ADPCM encoder
  better than toll voice quality (eg radio)
                                                                       Input 4-7KHz                  16kbps
                                                                                                              MUX
• Sub-bands are 0-4kHz (given 48kbps), 4-                              filters
  7kHz (given 16kbps)                                                                 Lower sub-band
                                                                                      ADPCM encoder
• Low band contains more audio energy, high                                        50Hz-4KHz         48kbps
  band contains intelligibility info.
• Standardised in G.722                                          Analogue                                      Digital signal
                                                                 signal in                                     out




                                                                                                                                1
Linear Predictive Coding (LPC)                                                        LPC
• Introduced in the 1960s                                      • coefficients (‘a’s) correspond to those of a vocal
• nth signal sample is represented as a linear                   tract filter and the error signal (‘e’) corresponds to
  combination of the previous p samples, plus a                  a source signal
  residual representing the prediction error:                  • Source signal will approximate either a voiced
                                                                 signal (which looks like a series of impulses) or a
x(n) = a1x(n-1) + a2x(n-2) + … + apx(n-p) + e(n)
                                                                 white noise source
                                                               • So, LPC involves “exciting” a source signal with a
• If the error (‘e’) is small enough, we can just
  transmit the coefficients (‘a’s)                               vocal tract filter




            Impulses and Filters                                        LPC – Autocorrelation
                                                               • Minimise the error signal by choosing optimal
                                                                 coefficients (‘a’s)
                                                               • Use the autocorrelation criteria (aka root mean
                                                                 squared criterion):




                                                                                                      for 1<=j<=p,
                                                               where R is the autocorrelation of x(n) defined as
                                                                R(i) = E[x(n)x(n-i)]




  LPC – Solving the autocorrelation
                                                                                       LPC
              formula
• In matrix form the equation can be written as                • Used in:
                      R*a=r
                                                                  – GSM (Groupe Speciale Mobile) (Residual
  where the autocorrelation matrix R is a symmetric Toeplitz        Pulse Excited-LPC) (13kbps)
  matrix with elements ri,j = R(i - j), vector r is the
  autocorrelation vector rj = R(j), and vector a is the           – LD-CELP (Low-Delay Code Excited Linear
  parameter vector of ai                                            Prediction) (G.728) (16kbps)
• An algorithm by N. Levinson (proposed in 1947) and              – CS-ACELP (Conjugate Structure-Algebraic
  modified by J. Durbin (in 1959) recursively calculates the        CELP) (G.729) (8kbps)
  solution to the Toeplitz matrix.
• GSM coder uses an integer version of the Schur recursion        – MP-MLQ (Multi Pulse – Maximum Likelihood
  (1917)                                                            Quantisation) (G.723.1) (6.3kbps)…




                                                                                                                          2

Weitere ähnliche Inhalte

Was ist angesagt?

Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compressionneeraj9217
 
Audio Morphing for Percussive Sound Generation
Audio Morphing for Percussive Sound GenerationAudio Morphing for Percussive Sound Generation
Audio Morphing for Percussive Sound Generationa3labdsp
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPCDisha Modi
 
Digital Transmission Fundamentals
Digital Transmission FundamentalsDigital Transmission Fundamentals
Digital Transmission FundamentalsAisu
 
Lect2 up400 (100329)
Lect2 up400 (100329)Lect2 up400 (100329)
Lect2 up400 (100329)aicdesign
 
A BICMOS chipset for a DVB-H front-end receiver
A BICMOS chipset for a DVB-H front-end receiverA BICMOS chipset for a DVB-H front-end receiver
A BICMOS chipset for a DVB-H front-end receiverRFIC-IUMA
 
iDiff 2008 conference #04 IP-Racine FSSG
iDiff 2008 conference #04 IP-Racine   FSSGiDiff 2008 conference #04 IP-Racine   FSSG
iDiff 2008 conference #04 IP-Racine FSSGBenoit Michel
 
Data bit rate_by_abhishek_wadhwa
Data bit rate_by_abhishek_wadhwaData bit rate_by_abhishek_wadhwa
Data bit rate_by_abhishek_wadhwaAbhishek Wadhwa
 
Introduction To Video Compression
Introduction To Video CompressionIntroduction To Video Compression
Introduction To Video Compressionguestdd7ccca
 
Baud rate is the number of change in signal
Baud rate is the number of change in signalBaud rate is the number of change in signal
Baud rate is the number of change in signalAbhishek Pathak
 
Application of fourier series
Application of fourier seriesApplication of fourier series
Application of fourier seriesGirish Dhareshwar
 
3. digital transmission fundamentals
3. digital transmission fundamentals3. digital transmission fundamentals
3. digital transmission fundamentalsRovin Valencia
 
L'explication à propos du règlement sonore
L'explication à propos du règlement sonoreL'explication à propos du règlement sonore
L'explication à propos du règlement sonoreLeSoir.be
 

Was ist angesagt? (19)

Line coding
Line codingLine coding
Line coding
 
Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compression
 
Audio Morphing for Percussive Sound Generation
Audio Morphing for Percussive Sound GenerationAudio Morphing for Percussive Sound Generation
Audio Morphing for Percussive Sound Generation
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPC
 
Digital Transmission Fundamentals
Digital Transmission FundamentalsDigital Transmission Fundamentals
Digital Transmission Fundamentals
 
Lect2 up400 (100329)
Lect2 up400 (100329)Lect2 up400 (100329)
Lect2 up400 (100329)
 
Presentation ct
Presentation ctPresentation ct
Presentation ct
 
3D Spatial Response
3D Spatial Response3D Spatial Response
3D Spatial Response
 
A BICMOS chipset for a DVB-H front-end receiver
A BICMOS chipset for a DVB-H front-end receiverA BICMOS chipset for a DVB-H front-end receiver
A BICMOS chipset for a DVB-H front-end receiver
 
Digital Audio
Digital AudioDigital Audio
Digital Audio
 
iDiff 2008 conference #04 IP-Racine FSSG
iDiff 2008 conference #04 IP-Racine   FSSGiDiff 2008 conference #04 IP-Racine   FSSG
iDiff 2008 conference #04 IP-Racine FSSG
 
Data bit rate_by_abhishek_wadhwa
Data bit rate_by_abhishek_wadhwaData bit rate_by_abhishek_wadhwa
Data bit rate_by_abhishek_wadhwa
 
CivcomIntelDuoBinary-TODC-OFC2006
CivcomIntelDuoBinary-TODC-OFC2006CivcomIntelDuoBinary-TODC-OFC2006
CivcomIntelDuoBinary-TODC-OFC2006
 
Introduction To Video Compression
Introduction To Video CompressionIntroduction To Video Compression
Introduction To Video Compression
 
Baud rate is the number of change in signal
Baud rate is the number of change in signalBaud rate is the number of change in signal
Baud rate is the number of change in signal
 
Application of fourier series
Application of fourier seriesApplication of fourier series
Application of fourier series
 
Chap 5
Chap 5Chap 5
Chap 5
 
3. digital transmission fundamentals
3. digital transmission fundamentals3. digital transmission fundamentals
3. digital transmission fundamentals
 
L'explication à propos du règlement sonore
L'explication à propos du règlement sonoreL'explication à propos du règlement sonore
L'explication à propos du règlement sonore
 

Ähnlich wie Z24 4 Speech Compression

Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxKIRUTHIKAAR2
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxKIRUTHIKAAR2
 
Lpc vocoder implemented by using matlab
Lpc vocoder implemented by using matlabLpc vocoder implemented by using matlab
Lpc vocoder implemented by using matlabchakravarthy Gopi
 
Ncc2004 ofdm tutorial part ii-apal
Ncc2004 ofdm tutorial   part ii-apalNcc2004 ofdm tutorial   part ii-apal
Ncc2004 ofdm tutorial part ii-apalArpan Pal
 
Acoustic echo cancellation
Acoustic echo cancellationAcoustic echo cancellation
Acoustic echo cancellationchintanajoshi
 
Software-defined white-space cognitive systems: implementation of the spectru...
Software-defined white-space cognitive systems: implementation of the spectru...Software-defined white-space cognitive systems: implementation of the spectru...
Software-defined white-space cognitive systems: implementation of the spectru...CSP Scarl
 
Analog mixed vlsi notes
Analog mixed vlsi notesAnalog mixed vlsi notes
Analog mixed vlsi notesathiulla
 
Carrier to Noise Versus Signal to Noise.ppt
Carrier to Noise Versus Signal to Noise.pptCarrier to Noise Versus Signal to Noise.ppt
Carrier to Noise Versus Signal to Noise.pptAbdulMaalik17
 
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.pptZeyadAlabsy
 
05 signal encodingtechniques
05 signal encodingtechniques05 signal encodingtechniques
05 signal encodingtechniquesOrbay Yeşil
 
An audio quality evaluation of digital radio system
An audio quality evaluation of digital radio systemAn audio quality evaluation of digital radio system
An audio quality evaluation of digital radio systemRojith Thomas
 

Ähnlich wie Z24 4 Speech Compression (20)

add9.5.ppt
add9.5.pptadd9.5.ppt
add9.5.ppt
 
30 CHL PCM PDH SDH BY SKG
30 CHL PCM PDH SDH BY SKG30 CHL PCM PDH SDH BY SKG
30 CHL PCM PDH SDH BY SKG
 
Speech coding techniques
Speech coding techniquesSpeech coding techniques
Speech coding techniques
 
DC_PPT.pptx
DC_PPT.pptxDC_PPT.pptx
DC_PPT.pptx
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptx
 
Harmonic speech coding
Harmonic speech codingHarmonic speech coding
Harmonic speech coding
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptx
 
Lpc vocoder implemented by using matlab
Lpc vocoder implemented by using matlabLpc vocoder implemented by using matlab
Lpc vocoder implemented by using matlab
 
Ncc2004 ofdm tutorial part ii-apal
Ncc2004 ofdm tutorial   part ii-apalNcc2004 ofdm tutorial   part ii-apal
Ncc2004 ofdm tutorial part ii-apal
 
Acoustic echo cancellation
Acoustic echo cancellationAcoustic echo cancellation
Acoustic echo cancellation
 
Arithmetic Coding
Arithmetic CodingArithmetic Coding
Arithmetic Coding
 
Sampling
SamplingSampling
Sampling
 
Software-defined white-space cognitive systems: implementation of the spectru...
Software-defined white-space cognitive systems: implementation of the spectru...Software-defined white-space cognitive systems: implementation of the spectru...
Software-defined white-space cognitive systems: implementation of the spectru...
 
Analog mixed vlsi notes
Analog mixed vlsi notesAnalog mixed vlsi notes
Analog mixed vlsi notes
 
A1mpeg12 2004
A1mpeg12 2004A1mpeg12 2004
A1mpeg12 2004
 
Carrier to Noise Versus Signal to Noise.ppt
Carrier to Noise Versus Signal to Noise.pptCarrier to Noise Versus Signal to Noise.ppt
Carrier to Noise Versus Signal to Noise.ppt
 
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
 
Logsv2
Logsv2Logsv2
Logsv2
 
05 signal encodingtechniques
05 signal encodingtechniques05 signal encodingtechniques
05 signal encodingtechniques
 
An audio quality evaluation of digital radio system
An audio quality evaluation of digital radio systemAn audio quality evaluation of digital radio system
An audio quality evaluation of digital radio system
 

Mehr von anithabalaprabhu (20)

Shannon Fano
Shannon FanoShannon Fano
Shannon Fano
 
Ch 04 Arithmetic Coding ( P P T)
Ch 04  Arithmetic  Coding ( P P T)Ch 04  Arithmetic  Coding ( P P T)
Ch 04 Arithmetic Coding ( P P T)
 
Compression
CompressionCompression
Compression
 
Datacompression1
Datacompression1Datacompression1
Datacompression1
 
Speech Compression
Speech CompressionSpeech Compression
Speech Compression
 
Dictor
DictorDictor
Dictor
 
Dictionary Based Compression
Dictionary Based CompressionDictionary Based Compression
Dictionary Based Compression
 
Module 4 Arithmetic Coding
Module 4 Arithmetic CodingModule 4 Arithmetic Coding
Module 4 Arithmetic Coding
 
Ch 04 Arithmetic Coding (Ppt)
Ch 04 Arithmetic Coding (Ppt)Ch 04 Arithmetic Coding (Ppt)
Ch 04 Arithmetic Coding (Ppt)
 
Compression Ii
Compression IiCompression Ii
Compression Ii
 
06 Arithmetic 1
06 Arithmetic 106 Arithmetic 1
06 Arithmetic 1
 
Lassy
LassyLassy
Lassy
 
Compression Ii
Compression IiCompression Ii
Compression Ii
 
Lossy
LossyLossy
Lossy
 
Planning
PlanningPlanning
Planning
 
Lossless
LosslessLossless
Lossless
 
Losseless
LosselessLosseless
Losseless
 
Lec32
Lec32Lec32
Lec32
 
Lec5 Compression
Lec5 CompressionLec5 Compression
Lec5 Compression
 
Huffman Student
Huffman StudentHuffman Student
Huffman Student
 

Kürzlich hochgeladen

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 

Kürzlich hochgeladen (20)

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

Z24 4 Speech Compression

  • 1. Speech Compression Uncompressed audio data rates • Recommended Reading: J. Harrington and S. • Voice: 8000samples/sec, 8bits/sample, Cassidy, “Techniques in Speech Acoustics”, mono Kluwer, 1999 = 64000bits/sec (64kbps) • Contents • CD: 44100samples/sec, 16bits/sample, – Uncompressed audio data rates stereo – ADPCM – SB-ADPCM =1411200bits/sec (~1.5Mbps) – LPC ADPCM (Adaptive Differential PCM) ADPCM • Uses the statistical properties of human speech (=> not Measured Transmitted compatible with fax/modem signals) value value • Makes a prediction about the size of the next sample, based Adaptive on previous info quantiser - • Transmitter then sends only the difference between real value and predicted value Predictor • Receiver uses the same prediction algorithm, together with the differences to reconstruct the speech data • Enables the data rate to be reduced to 32kbps • Used on international telephone links • Specified in G.721, G.722, G.723, G.726, G.727 SB-APDCM (Sub-band ADPCM) SB-ADPCM • Given 64kbps: ADPCM could produce Upper sub-band ADPCM encoder better than toll voice quality (eg radio) Input 4-7KHz 16kbps MUX • Sub-bands are 0-4kHz (given 48kbps), 4- filters 7kHz (given 16kbps) Lower sub-band ADPCM encoder • Low band contains more audio energy, high 50Hz-4KHz 48kbps band contains intelligibility info. • Standardised in G.722 Analogue Digital signal signal in out 1
  • 2. Linear Predictive Coding (LPC) LPC • Introduced in the 1960s • coefficients (‘a’s) correspond to those of a vocal • nth signal sample is represented as a linear tract filter and the error signal (‘e’) corresponds to combination of the previous p samples, plus a a source signal residual representing the prediction error: • Source signal will approximate either a voiced signal (which looks like a series of impulses) or a x(n) = a1x(n-1) + a2x(n-2) + … + apx(n-p) + e(n) white noise source • So, LPC involves “exciting” a source signal with a • If the error (‘e’) is small enough, we can just transmit the coefficients (‘a’s) vocal tract filter Impulses and Filters LPC – Autocorrelation • Minimise the error signal by choosing optimal coefficients (‘a’s) • Use the autocorrelation criteria (aka root mean squared criterion): for 1<=j<=p, where R is the autocorrelation of x(n) defined as R(i) = E[x(n)x(n-i)] LPC – Solving the autocorrelation LPC formula • In matrix form the equation can be written as • Used in: R*a=r – GSM (Groupe Speciale Mobile) (Residual where the autocorrelation matrix R is a symmetric Toeplitz Pulse Excited-LPC) (13kbps) matrix with elements ri,j = R(i - j), vector r is the autocorrelation vector rj = R(j), and vector a is the – LD-CELP (Low-Delay Code Excited Linear parameter vector of ai Prediction) (G.728) (16kbps) • An algorithm by N. Levinson (proposed in 1947) and – CS-ACELP (Conjugate Structure-Algebraic modified by J. Durbin (in 1959) recursively calculates the CELP) (G.729) (8kbps) solution to the Toeplitz matrix. • GSM coder uses an integer version of the Schur recursion – MP-MLQ (Multi Pulse – Maximum Likelihood (1917) Quantisation) (G.723.1) (6.3kbps)… 2