SlideShare ist ein Scribd-Unternehmen logo
1 von 37
Audio Compression
M.K.H.Gunasekara – AS2010377
J.A.S. Fernando – AS2010362
CSC 362 1.5 Seminar I
Department of Statistics and Computer Science
Overview
• Introduction
• Pulse Code Modulation
• µ - law
• A - law
• Differential Pulse Code Modulation
• Adaptive Differential Pulse Code Modulation
• Application
• MPEG Audio Compression
• MPEG Encoder Architecture
• MPEG Audio Compression Algorithm
• MPEG Layers
• MPEG Layers Applications
• Comparison of MPEG audio compression standards
• Summary
• References
Audio Compression - AS2010377 2
Introduction
• A process that is useful for conserving both
transmission bandwidth and storage space.
• The various audio compression techniques
offer different levels of complexity,
compressed audio quality, and amount
of data compression.
Audio Compression - AS2010377
3
Audio Compression
Audio Compression - AS2010377 4
Lossy Audio Compression
• AAC
• ADPCM
• ATRAC
• Dolby AC-3
• MP2
• MP3
Lossless Audio Compression
• Apple Lossless
• apt-X Lossless
• Audio Lossless Coding
• Direct Stream Transfer (DST)
• Dolby TrueHD
• DTS-HD Master Audio
• Free Lossless Audio Codec
4
What is PCM ?
• PCM is a method used to digitally represent
sampled analog signals.
• It is the standard form of digital audio in
computers, Compact Discs, digital telephony
and other digital audio applications.
Audio Compression - AS2010377 5
5
How PCM works ?
• Sampling
• Quantizing
• Encoding
Audio Compression - AS2010377 6
Figure 01 : Block Diagram of Pulse Code Modulation
6
Sampling
• Sampling theory (Nyquist)
– If input signal has maximum frequency (bandwidth) f,
sampling frequency must be at least 2f
– With a low-pass filter to interpolate between samples,
the input signal can be fully reconstructed
Example : highest frequency of telephone voice channel is 3.4 kHz
Sampling rate ≥ 2 x 3.4
≥ 6.8 kHz
Hence sample rate is 8 kHz
Audio Compression - AS2010377 7
7
Sampling
• Ideal - an impulse at each sampling instant
• Natural - a pulse of short width with varying amplitude
• Flattop - sample and hold, like natural but with single
amplitude value
Audio Compression - AS2010377 8
Figure 02 : Sampling Methods
8
Quantization
• The samples are divided into many discrete levels.
Then each sample is numbered according to their
corresponding level.
• Quantization error - the coded signal is an
approximation of the actual amplitude value.
Figure 03 : Linear Quantization Figure 04 : Non Linear QuantizationAudio Compression - AS2010377 9
9
Encoding
• After quantizing the corresponding level it is
to be represented in some manner.
• We represent levels in binary format.
• Representation of level 50 = 110010
Figure 05 : Representation of sample signal
Audio Compression - AS2010377 10
10
Figure 07- Circuit diagram of pulse code modulation
Figure 06- Block diagram of pulse code modulation
Audio Compression - AS2010377 11
11
What is Âľ law ?
• µ -law encoding is a form of logarithmic
quantization or companding.
• It's based on the observation that many
signals are statistically more likely to be near a
low signal level than a high signal level.
• Most quality codecs (including the
Sparcstation's audio codec) use Âľ -law
encoded samples.
Audio Compression - AS2010377 12
12
What is A law ?
• A-Law compression is extremely similar to
Mu-Law compression
• If the absolute value of the source sample is
less than 256, the 16-bit sample is simply
shifted down 4 bits and converted to an 8-bit
value, thus losing the top 4 bits in the process.
Audio Compression - AS2010377 13
13
A law and Âľ law
Both methods compress 2:1 ratio.
• µ -Law has a larger dynamic range compared to
A-law
• µ -Law has worse distortion with small signals
compared to A-law
• µ -Law is used in North-America and Japan while
A-law is commonly used in Europe
• A-law takes precedence over µ -law with
international calls
Audio Compression - AS2010377 14
14
Differential PCM
• Differential pulse-code modulation is a signal
encoder that uses the baseline of pulse-code
modulation but adds some functionalities
based on the prediction of the samples of the
signal.
Audio Compression - AS2010377 15
15
Adaptive Differential PCM
• ADPCM, commonly termed as a form of
compression, is a more efficient way of storing
waveforms than 16-bit or 8-bit PCM.
• ADPCM stores the value differences between
two adjacent PCM samples and makes some
assumptions that allow data reduction.
• Because of these assumptions low frequencies
are properly reproduced
Audio Compression - AS2010377 16
16
Adaptive Differential PCM
Audio Compression - AS2010377 17
• An ADPCM algorithm is used to map a series of 8
bit Âľ-law (or a-law) PCM samples into a series of
4 bit ADPCM samples. In this way, the capacity of
the line is doubled.
• Some ADPCM techniques are used in Voice over
IP communications.
• ADPCM was also used by Interactive Multimedia
Association for development of legacy audio
codec known as ADPCM DVI, IMA ADPCM or DVI4
17
Applications
Audio
Compression
Format
Algorithm Bit Rate Bits per sample Implementations
G.711 Companding A-
law ; Âľ-law ; PCM
64 kbit/s 8 bit FFmpeg, Ekiga,
Asterisk (PBX)
G.711.1 A-law ; Âľ-law 64, 80, 96 kbit/s 16 bit VoIP
G.721 ADPCM 32 kbit/s 13 bit
G.722 ADPCM 64 kbit/s 14 bit QuickTime,
RealPlayer
Audio Compression - AS2010377 18
18
Wrap - up
• Techniques to compress general digital audio
signals include Âľ -law and adaptive differential
pulse code modulation.
• These approaches apply low-complexity, low-
compression, and medium audio quality
algorithms to audio signals.
Audio Compression - AS2010377 19
19
MPEG Audio Compression
• Motion Picture Experts Group
• An ISO standard for high-fidelity audio compression.
• A lossy compression.
• But can achieve transparent, perceptually lossless
compression.
• Compresses without regard to the source of the audio
data.
• Removes distortion and other features that are
imperceptible to the human ear.
• 6-to-1 compression ratio.
20Audio Compression - AS2010362
20
MPEG Audio Compression
• Four channel modes
- Monophonic
- Dual monophonic
- Stereo
- Joint- stereo
• Has 3 distinct layers for compression
21Audio Compression - AS2010362
21
MPEG Encoder Architecture
Audio
Input Encoded
Bit Stream
Time to
Frequency
Mapping
Filter bank
Bit Allocation,
Quantize and
coding
Psychoacoustic
Model
Bit stream
formatting
22Audio Compression - AS2010362
22
MPEG Audio Compression
Algorithm
• The input audio stream passes through a filter
bank.
Filter Bank
- Divides the input into multiple subbands.
- Relatively simple and provide good time
resolution
23Audio Compression - AS2010362
23
MPEG Audio Compression
Algorithm …
• The input audio stream simultaneously passes
through a psychoacoustic model.
The Psychoacoustic Model
- The key component of the MPEG encoder
that enables its high performance.
- Analyzes the audio signal and computes the
amount of noise masking that is available as a
function of frequency.
24Audio Compression - AS2010362
24
MPEG Audio Compression
Algorithm...
• The bit or noise allocation block uses the signal-
to-mask ratios to decide how to apportion the total
number of code bits available for the quantization
of the subband signals to minimize the audibility
of the quantization noise.
• Finally, the last block takes the representation of
the quantized audio samples and formats the data
into a decodable bit stream.
25Audio Compression - AS2010362
25
MPEG Layers
• Offers three compatible layers.
- Layer I
- Layer II
- Layer III
• Each succeeding layer able to understand the
lower layers.
• Each succeeding layer offers more complexity
and better compression for a given level of audio
quality.
26Audio Compression - AS2010362
26
MPEG Audio Layer I
27
• Uses the basic filter bank found in all layers.
• This filter bank divides the audio signal into 32
constant-width frequency bands.
• Bit rates greater than 128 kbits/s per channel.
Audio Compression - AS2010362
27
MPEG Audio Layer II
28
• A simple enhancement of Layer I.
• It improves compression performance by
coding data in larger groups.
• Bit rates of 128 kbits/s per channel.
• Improves performance by representing the bit
allocation, the scale factor values, and the
quantized samples.
Audio Compression - AS2010362
28
MPEG Audio Layer III
– Widely popular under the name Mp3
– The most complex with the best audio quality
– Used for bit rates of 64 kbits/s per channel.
• Compensates for some filter bank deficiencies
by
processing the filter outputs with a modified
discrete cosine transform (MDCT).
29Audio Compression - AS2010362
29
MPEG Audio Layer III Filter Bank
Processing
30Audio Compression - AS2010362
30
MPEG Audio Layer III
enhancements
31
• Alias reduction
• Nonuniform quantization
• Entropy coding of data values
• Use of a bit reservoir
• Noise allocation instead of bit allocation
Audio Compression - AS2010362
31
MPEG Layers Applications
32
Layer Application
I Digital audio cassette
II Digital audio
broadcast
III CD quality
Audio Compression - AS2010362
32
Comparison of MPEG audio
compression standards
33
Standard Audio Sampling
rate(kHz)
Compressed bit
rate (kbits/sec)
Channels
MPEG - 1 Layer I 32, 44.1, 48 32- 448 1- 2 channels
MPEG - 1 Layer I 32, 44.1, 48 32 – 384 1-2 channels
MPEG - 1 Layer III 32, 44.1, 48 32 – 320 1-2 channels
MPEG - 2 Layer I 32, 44.1, 48 32 - 448 1 – 5.1 channels
16, 22.05, 24 32 - 256
MPEG - 2 Layer II 32, 44.1, 48 32 - 384 1 – 5.1 channels
16, 22.05, 24 8 - 160
MPEG - 2 Layer III 32, 44.1, 48 32 - 384 1 – 5.1 channels
16, 22.05, 24 8 - 160
Audio Compression - AS2010362
33
Comparison of MPEG audio
compression standards
34
• MPEG 2 AAC
- more efficient than the previous MPEG audio standards
- supports from 1 to 48 channels at sampling rates of 8 to
96 kHz
• MPEG 4 AAC
- similar to the MPEG-2 AAC standard, with some minor
changes
- supports the compression of natural audio at bit rates
ranging from 2 up to 64 kb/s.
Audio Compression - AS2010362
34
Summary
35
• The MPEG audio standard has three layers of
successive complexity for improve compression
performance.
• MPEG audio has been extended in following
ways
- Multichannel audio support
- Lower, compressed audio rates
- Lower audio sampling rates
Audio Compression - AS2010362
35
References
36
• Introduction To Data Compression ,Third Edition
,Khalid Sayood
• B. Cavagndo , J. Bier, Introduction to Digital Audio
Compression,
Available:
http://rizal.blog.undip.ac.id/files/2009/07/introd
uctiontodigitalaudiocompression.pdf
• Davis Yen Pen, Digital Audio Compression,
Available:
www.ece.jcu.edu.au/subjects/ee3700/resources/
Digital_Audio_Compression.pdf
Audio Compression - AS2010362
36
37
THANK YOU
Audio Compression - AS2010377 & AS2010362
37

Weitere ähnliche Inhalte

Was ist angesagt?

Video Compression
Video CompressionVideo Compression
Video CompressionShreyash Patel
 
Video compression
Video compressionVideo compression
Video compressionnnmaurya
 
MPEG-1 Part 2 Video Encoding
MPEG-1 Part 2 Video EncodingMPEG-1 Part 2 Video Encoding
MPEG-1 Part 2 Video EncodingChristian Kehl
 
Audio compression
Audio compressionAudio compression
Audio compressionSahil Garg
 
Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compressionneeraj9217
 
Audio encoding principles
Audio encoding principlesAudio encoding principles
Audio encoding principlesPhillip Doyle
 
H261
H261H261
H261Videoguy
 
Compression: Video Compression (MPEG and others)
Compression: Video Compression (MPEG and others)Compression: Video Compression (MPEG and others)
Compression: Video Compression (MPEG and others)danishrafiq
 
Fundamentals of Data compression
Fundamentals of Data compressionFundamentals of Data compression
Fundamentals of Data compressionM.k. Praveen
 
Data Compression (Lossy and Lossless)
Data Compression (Lossy and Lossless)Data Compression (Lossy and Lossless)
Data Compression (Lossy and Lossless)Project Student
 
Lossless predictive coding in Digital Image Processing
Lossless predictive coding in Digital Image ProcessingLossless predictive coding in Digital Image Processing
Lossless predictive coding in Digital Image Processingpriyadharshini murugan
 
MPEG/Audio Compression
MPEG/Audio CompressionMPEG/Audio Compression
MPEG/Audio CompressionDaniel Brewster
 
Lecture# 7 midi file format
Lecture#  7 midi file formatLecture#  7 midi file format
Lecture# 7 midi file formatMr SMAK
 
Ppt on audio file formats
Ppt on audio file formatsPpt on audio file formats
Ppt on audio file formatsIshank Ranjan
 
H263.ppt
H263.pptH263.ppt
H263.pptVideoguy
 

Was ist angesagt? (20)

Video Compression
Video CompressionVideo Compression
Video Compression
 
Video compression
Video compressionVideo compression
Video compression
 
MPEG-1 Part 2 Video Encoding
MPEG-1 Part 2 Video EncodingMPEG-1 Part 2 Video Encoding
MPEG-1 Part 2 Video Encoding
 
Audio compression
Audio compressionAudio compression
Audio compression
 
Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compression
 
Audio encoding principles
Audio encoding principlesAudio encoding principles
Audio encoding principles
 
H261
H261H261
H261
 
Compression: Video Compression (MPEG and others)
Compression: Video Compression (MPEG and others)Compression: Video Compression (MPEG and others)
Compression: Video Compression (MPEG and others)
 
Mp3
Mp3Mp3
Mp3
 
Fundamentals of Data compression
Fundamentals of Data compressionFundamentals of Data compression
Fundamentals of Data compression
 
Data Compression (Lossy and Lossless)
Data Compression (Lossy and Lossless)Data Compression (Lossy and Lossless)
Data Compression (Lossy and Lossless)
 
Lossless predictive coding in Digital Image Processing
Lossless predictive coding in Digital Image ProcessingLossless predictive coding in Digital Image Processing
Lossless predictive coding in Digital Image Processing
 
MPEG/Audio Compression
MPEG/Audio CompressionMPEG/Audio Compression
MPEG/Audio Compression
 
Lecture# 7 midi file format
Lecture#  7 midi file formatLecture#  7 midi file format
Lecture# 7 midi file format
 
Ppt on audio file formats
Ppt on audio file formatsPpt on audio file formats
Ppt on audio file formats
 
Data compression
Data compressionData compression
Data compression
 
H263.ppt
H263.pptH263.ppt
H263.ppt
 
Codecs
CodecsCodecs
Codecs
 
Multimedia compression
Multimedia compressionMultimedia compression
Multimedia compression
 
What is mp3
What is mp3What is mp3
What is mp3
 

Ähnlich wie Audio compression

Audio Compression_2023.pptx
Audio Compression_2023.pptxAudio Compression_2023.pptx
Audio Compression_2023.pptxzulhelmanz
 
A1mpeg12 2004
A1mpeg12 2004A1mpeg12 2004
A1mpeg12 2004Thiago Skiba
 
J03502050055
J03502050055J03502050055
J03502050055theijes
 
Compression of digital voice and video
Compression of digital voice and videoCompression of digital voice and video
Compression of digital voice and videosangusajjan
 
Compression presentation 415 (1)
Compression presentation 415 (1)Compression presentation 415 (1)
Compression presentation 415 (1)Godo Dodo
 
Mm01 a vformat
Mm01 a vformatMm01 a vformat
Mm01 a vformatgotovikas
 
simple video compression
simple video compression simple video compression
simple video compression LaLit DuBey
 
Basics of audio coding
Basics of audio codingBasics of audio coding
Basics of audio codingsakshij91
 
Digital Video And Compression
Digital Video And CompressionDigital Video And Compression
Digital Video And CompressionRobert Burk
 
lect10-mpeg1.ppt
lect10-mpeg1.pptlect10-mpeg1.ppt
lect10-mpeg1.pptssuser6606eb
 
Multimedia Compression and Communication
Multimedia Compression and CommunicationMultimedia Compression and Communication
Multimedia Compression and CommunicationBenesh Selvanesan
 
Video Coding Standard
Video Coding StandardVideo Coding Standard
Video Coding StandardVideoguy
 

Ähnlich wie Audio compression (20)

Multimedia.pdf
Multimedia.pdfMultimedia.pdf
Multimedia.pdf
 
Audio Compression_2023.pptx
Audio Compression_2023.pptxAudio Compression_2023.pptx
Audio Compression_2023.pptx
 
A1mpeg12 2004
A1mpeg12 2004A1mpeg12 2004
A1mpeg12 2004
 
J03502050055
J03502050055J03502050055
J03502050055
 
Multimedia Object - Audio
Multimedia Object - AudioMultimedia Object - Audio
Multimedia Object - Audio
 
Compression of digital voice and video
Compression of digital voice and videoCompression of digital voice and video
Compression of digital voice and video
 
Digital audio
Digital audioDigital audio
Digital audio
 
Compression
CompressionCompression
Compression
 
Compression presentation 415 (1)
Compression presentation 415 (1)Compression presentation 415 (1)
Compression presentation 415 (1)
 
Mm01 a vformat
Mm01 a vformatMm01 a vformat
Mm01 a vformat
 
add9.5.ppt
add9.5.pptadd9.5.ppt
add9.5.ppt
 
SPEECH CODING
SPEECH CODINGSPEECH CODING
SPEECH CODING
 
simple video compression
simple video compression simple video compression
simple video compression
 
Basics of audio coding
Basics of audio codingBasics of audio coding
Basics of audio coding
 
Digital Video And Compression
Digital Video And CompressionDigital Video And Compression
Digital Video And Compression
 
DATA COMMUNICATUION Unit 5
DATA COMMUNICATUION  Unit 5DATA COMMUNICATUION  Unit 5
DATA COMMUNICATUION Unit 5
 
lect10-mpeg1.ppt
lect10-mpeg1.pptlect10-mpeg1.ppt
lect10-mpeg1.ppt
 
Multimedia Compression and Communication
Multimedia Compression and CommunicationMultimedia Compression and Communication
Multimedia Compression and Communication
 
Video Coding Standard
Video Coding StandardVideo Coding Standard
Video Coding Standard
 
Base band transmission
Base band transmissionBase band transmission
Base band transmission
 

Mehr von Madhawa Gunasekara

Customer interface - Business Ontology Model
Customer interface - Business Ontology ModelCustomer interface - Business Ontology Model
Customer interface - Business Ontology ModelMadhawa Gunasekara
 
Research: Automatic Diabetic Retinopathy Detection
Research: Automatic Diabetic Retinopathy DetectionResearch: Automatic Diabetic Retinopathy Detection
Research: Automatic Diabetic Retinopathy DetectionMadhawa Gunasekara
 
How to prepare Title and Abstract for Research Articles
How to prepare Title and Abstract for Research ArticlesHow to prepare Title and Abstract for Research Articles
How to prepare Title and Abstract for Research ArticlesMadhawa Gunasekara
 
Low cost self driven car system
Low cost self driven car systemLow cost self driven car system
Low cost self driven car systemMadhawa Gunasekara
 

Mehr von Madhawa Gunasekara (8)

Evolutionary Computing
Evolutionary ComputingEvolutionary Computing
Evolutionary Computing
 
Customer interface - Business Ontology Model
Customer interface - Business Ontology ModelCustomer interface - Business Ontology Model
Customer interface - Business Ontology Model
 
Radial Basis Function
Radial Basis FunctionRadial Basis Function
Radial Basis Function
 
Semiotics final
Semiotics finalSemiotics final
Semiotics final
 
Research: Automatic Diabetic Retinopathy Detection
Research: Automatic Diabetic Retinopathy DetectionResearch: Automatic Diabetic Retinopathy Detection
Research: Automatic Diabetic Retinopathy Detection
 
How to prepare Title and Abstract for Research Articles
How to prepare Title and Abstract for Research ArticlesHow to prepare Title and Abstract for Research Articles
How to prepare Title and Abstract for Research Articles
 
Radix sorting
Radix sortingRadix sorting
Radix sorting
 
Low cost self driven car system
Low cost self driven car systemLow cost self driven car system
Low cost self driven car system
 

KĂźrzlich hochgeladen

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 

KĂźrzlich hochgeladen (20)

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 

Audio compression

  • 1. Audio Compression M.K.H.Gunasekara – AS2010377 J.A.S. Fernando – AS2010362 CSC 362 1.5 Seminar I Department of Statistics and Computer Science
  • 2. Overview • Introduction • Pulse Code Modulation • Âľ - law • A - law • Differential Pulse Code Modulation • Adaptive Differential Pulse Code Modulation • Application • MPEG Audio Compression • MPEG Encoder Architecture • MPEG Audio Compression Algorithm • MPEG Layers • MPEG Layers Applications • Comparison of MPEG audio compression standards • Summary • References Audio Compression - AS2010377 2
  • 3. Introduction • A process that is useful for conserving both transmission bandwidth and storage space. • The various audio compression techniques offer different levels of complexity, compressed audio quality, and amount of data compression. Audio Compression - AS2010377 3
  • 4. Audio Compression Audio Compression - AS2010377 4 Lossy Audio Compression • AAC • ADPCM • ATRAC • Dolby AC-3 • MP2 • MP3 Lossless Audio Compression • Apple Lossless • apt-X Lossless • Audio Lossless Coding • Direct Stream Transfer (DST) • Dolby TrueHD • DTS-HD Master Audio • Free Lossless Audio Codec 4
  • 5. What is PCM ? • PCM is a method used to digitally represent sampled analog signals. • It is the standard form of digital audio in computers, Compact Discs, digital telephony and other digital audio applications. Audio Compression - AS2010377 5 5
  • 6. How PCM works ? • Sampling • Quantizing • Encoding Audio Compression - AS2010377 6 Figure 01 : Block Diagram of Pulse Code Modulation 6
  • 7. Sampling • Sampling theory (Nyquist) – If input signal has maximum frequency (bandwidth) f, sampling frequency must be at least 2f – With a low-pass filter to interpolate between samples, the input signal can be fully reconstructed Example : highest frequency of telephone voice channel is 3.4 kHz Sampling rate ≥ 2 x 3.4 ≥ 6.8 kHz Hence sample rate is 8 kHz Audio Compression - AS2010377 7 7
  • 8. Sampling • Ideal - an impulse at each sampling instant • Natural - a pulse of short width with varying amplitude • Flattop - sample and hold, like natural but with single amplitude value Audio Compression - AS2010377 8 Figure 02 : Sampling Methods 8
  • 9. Quantization • The samples are divided into many discrete levels. Then each sample is numbered according to their corresponding level. • Quantization error - the coded signal is an approximation of the actual amplitude value. Figure 03 : Linear Quantization Figure 04 : Non Linear QuantizationAudio Compression - AS2010377 9 9
  • 10. Encoding • After quantizing the corresponding level it is to be represented in some manner. • We represent levels in binary format. • Representation of level 50 = 110010 Figure 05 : Representation of sample signal Audio Compression - AS2010377 10 10
  • 11. Figure 07- Circuit diagram of pulse code modulation Figure 06- Block diagram of pulse code modulation Audio Compression - AS2010377 11 11
  • 12. What is Âľ law ? • Âľ -law encoding is a form of logarithmic quantization or companding. • It's based on the observation that many signals are statistically more likely to be near a low signal level than a high signal level. • Most quality codecs (including the Sparcstation's audio codec) use Âľ -law encoded samples. Audio Compression - AS2010377 12 12
  • 13. What is A law ? • A-Law compression is extremely similar to Mu-Law compression • If the absolute value of the source sample is less than 256, the 16-bit sample is simply shifted down 4 bits and converted to an 8-bit value, thus losing the top 4 bits in the process. Audio Compression - AS2010377 13 13
  • 14. A law and Âľ law Both methods compress 2:1 ratio. • Âľ -Law has a larger dynamic range compared to A-law • Âľ -Law has worse distortion with small signals compared to A-law • Âľ -Law is used in North-America and Japan while A-law is commonly used in Europe • A-law takes precedence over Âľ -law with international calls Audio Compression - AS2010377 14 14
  • 15. Differential PCM • Differential pulse-code modulation is a signal encoder that uses the baseline of pulse-code modulation but adds some functionalities based on the prediction of the samples of the signal. Audio Compression - AS2010377 15 15
  • 16. Adaptive Differential PCM • ADPCM, commonly termed as a form of compression, is a more efficient way of storing waveforms than 16-bit or 8-bit PCM. • ADPCM stores the value differences between two adjacent PCM samples and makes some assumptions that allow data reduction. • Because of these assumptions low frequencies are properly reproduced Audio Compression - AS2010377 16 16
  • 17. Adaptive Differential PCM Audio Compression - AS2010377 17 • An ADPCM algorithm is used to map a series of 8 bit Âľ-law (or a-law) PCM samples into a series of 4 bit ADPCM samples. In this way, the capacity of the line is doubled. • Some ADPCM techniques are used in Voice over IP communications. • ADPCM was also used by Interactive Multimedia Association for development of legacy audio codec known as ADPCM DVI, IMA ADPCM or DVI4 17
  • 18. Applications Audio Compression Format Algorithm Bit Rate Bits per sample Implementations G.711 Companding A- law ; Âľ-law ; PCM 64 kbit/s 8 bit FFmpeg, Ekiga, Asterisk (PBX) G.711.1 A-law ; Âľ-law 64, 80, 96 kbit/s 16 bit VoIP G.721 ADPCM 32 kbit/s 13 bit G.722 ADPCM 64 kbit/s 14 bit QuickTime, RealPlayer Audio Compression - AS2010377 18 18
  • 19. Wrap - up • Techniques to compress general digital audio signals include Âľ -law and adaptive differential pulse code modulation. • These approaches apply low-complexity, low- compression, and medium audio quality algorithms to audio signals. Audio Compression - AS2010377 19 19
  • 20. MPEG Audio Compression • Motion Picture Experts Group • An ISO standard for high-fidelity audio compression. • A lossy compression. • But can achieve transparent, perceptually lossless compression. • Compresses without regard to the source of the audio data. • Removes distortion and other features that are imperceptible to the human ear. • 6-to-1 compression ratio. 20Audio Compression - AS2010362 20
  • 21. MPEG Audio Compression • Four channel modes - Monophonic - Dual monophonic - Stereo - Joint- stereo • Has 3 distinct layers for compression 21Audio Compression - AS2010362 21
  • 22. MPEG Encoder Architecture Audio Input Encoded Bit Stream Time to Frequency Mapping Filter bank Bit Allocation, Quantize and coding Psychoacoustic Model Bit stream formatting 22Audio Compression - AS2010362 22
  • 23. MPEG Audio Compression Algorithm • The input audio stream passes through a filter bank. Filter Bank - Divides the input into multiple subbands. - Relatively simple and provide good time resolution 23Audio Compression - AS2010362 23
  • 24. MPEG Audio Compression Algorithm … • The input audio stream simultaneously passes through a psychoacoustic model. The Psychoacoustic Model - The key component of the MPEG encoder that enables its high performance. - Analyzes the audio signal and computes the amount of noise masking that is available as a function of frequency. 24Audio Compression - AS2010362 24
  • 25. MPEG Audio Compression Algorithm... • The bit or noise allocation block uses the signal- to-mask ratios to decide how to apportion the total number of code bits available for the quantization of the subband signals to minimize the audibility of the quantization noise. • Finally, the last block takes the representation of the quantized audio samples and formats the data into a decodable bit stream. 25Audio Compression - AS2010362 25
  • 26. MPEG Layers • Offers three compatible layers. - Layer I - Layer II - Layer III • Each succeeding layer able to understand the lower layers. • Each succeeding layer offers more complexity and better compression for a given level of audio quality. 26Audio Compression - AS2010362 26
  • 27. MPEG Audio Layer I 27 • Uses the basic filter bank found in all layers. • This filter bank divides the audio signal into 32 constant-width frequency bands. • Bit rates greater than 128 kbits/s per channel. Audio Compression - AS2010362 27
  • 28. MPEG Audio Layer II 28 • A simple enhancement of Layer I. • It improves compression performance by coding data in larger groups. • Bit rates of 128 kbits/s per channel. • Improves performance by representing the bit allocation, the scale factor values, and the quantized samples. Audio Compression - AS2010362 28
  • 29. MPEG Audio Layer III – Widely popular under the name Mp3 – The most complex with the best audio quality – Used for bit rates of 64 kbits/s per channel. • Compensates for some filter bank deficiencies by processing the filter outputs with a modified discrete cosine transform (MDCT). 29Audio Compression - AS2010362 29
  • 30. MPEG Audio Layer III Filter Bank Processing 30Audio Compression - AS2010362 30
  • 31. MPEG Audio Layer III enhancements 31 • Alias reduction • Nonuniform quantization • Entropy coding of data values • Use of a bit reservoir • Noise allocation instead of bit allocation Audio Compression - AS2010362 31
  • 32. MPEG Layers Applications 32 Layer Application I Digital audio cassette II Digital audio broadcast III CD quality Audio Compression - AS2010362 32
  • 33. Comparison of MPEG audio compression standards 33 Standard Audio Sampling rate(kHz) Compressed bit rate (kbits/sec) Channels MPEG - 1 Layer I 32, 44.1, 48 32- 448 1- 2 channels MPEG - 1 Layer I 32, 44.1, 48 32 – 384 1-2 channels MPEG - 1 Layer III 32, 44.1, 48 32 – 320 1-2 channels MPEG - 2 Layer I 32, 44.1, 48 32 - 448 1 – 5.1 channels 16, 22.05, 24 32 - 256 MPEG - 2 Layer II 32, 44.1, 48 32 - 384 1 – 5.1 channels 16, 22.05, 24 8 - 160 MPEG - 2 Layer III 32, 44.1, 48 32 - 384 1 – 5.1 channels 16, 22.05, 24 8 - 160 Audio Compression - AS2010362 33
  • 34. Comparison of MPEG audio compression standards 34 • MPEG 2 AAC - more efficient than the previous MPEG audio standards - supports from 1 to 48 channels at sampling rates of 8 to 96 kHz • MPEG 4 AAC - similar to the MPEG-2 AAC standard, with some minor changes - supports the compression of natural audio at bit rates ranging from 2 up to 64 kb/s. Audio Compression - AS2010362 34
  • 35. Summary 35 • The MPEG audio standard has three layers of successive complexity for improve compression performance. • MPEG audio has been extended in following ways - Multichannel audio support - Lower, compressed audio rates - Lower audio sampling rates Audio Compression - AS2010362 35
  • 36. References 36 • Introduction To Data Compression ,Third Edition ,Khalid Sayood • B. Cavagndo , J. Bier, Introduction to Digital Audio Compression, Available: http://rizal.blog.undip.ac.id/files/2009/07/introd uctiontodigitalaudiocompression.pdf • Davis Yen Pen, Digital Audio Compression, Available: www.ece.jcu.edu.au/subjects/ee3700/resources/ Digital_Audio_Compression.pdf Audio Compression - AS2010362 36
  • 37. 37 THANK YOU Audio Compression - AS2010377 & AS2010362 37

Hinweis der Redaktion

  1. These simple techniques have been largely rendered obsolete by modern transform-based audio compression techniques.
  2. Therefore, it makes more sense to have more quantization points near a low level than a high level.