SlideShare ist ein Scribd-Unternehmen logo
1 von 5
Downloaden Sie, um offline zu lesen
HARMONIC VECTOR QUANTIZATION
Volodya Grancharov, Sigurdur Sverrisson, Erik Norvell, Tomas Toftgård,
Jonas Svedberg, and Harald Pobloth
SMN, Ericsson Research, Ericsson AB
164 80, Stockholm, Sweden
ABSTRACT
Audio coding of harmonic signals is a challenging task for
conventional MDCT coding schemes. In this paper we introduce a
novel algorithm for improved transform coding of harmonic audio.
The algorithm does not deploy the conventional scheme of
splitting the input signal into a spectrum envelope and a residual,
but models the spectral peak regions. Test results indicate that the
presented algorithm outperforms the conventional coding concept.
Index Terms— Audio coding, MDCT, VQ
1. INTRODUCTION
Transform coding is one main technology used to compress and
transmit audio signals. Typically the Modified Discrete Cosine
Transform (MDCT) [3] is used and the vector of MDCT
coefficients from one frame of audio is split into multiple bands of
pre-defined width. Then the energy in each band is calculated and
quantized. These energies are used to produce a residual vector by
scaling down the MDCT vector. Unfortunately, this commonly
used concept does not work well for transform coding of harmonic
audio signals, e.g. single instruments. The reason is that
normalization with band energies does not result in sufficiently
“flat” residual vectors, and the residual coding scheme cannot
represent the dynamics in the residual vector. Moreover, modeling
the pitch in the MDCT domain is a cumbersome task due to the
mix of time- and frequency domain information.
The presented algorithm provides an alternative audio coding
model that can efficiently process harmonic audio signals. The
main concept is that the MDCT vector of low-frequencies (LF) is
not split in envelope and residual, but instead spectral peaks are
directly extracted and quantized together with neighboring MDCT
bins. Low energy regions, outside the spectral peaks neighborhood,
are not coded but noise-filled at the decoder. In this way, the
conventional coding model: “spectrum envelope and residual” is
replaced with the concept of: “spectral peaks and noise-floor”.
2. MDCT FRAMEWORK
The presented Harmonic Vector Quantization (HVQ) algorithm is
integrated into the harmonic part of the EVS codec [1] to deal with
stationary harmonic content. It is targeted for the sample rates of
32 and 48 kHz, and bit rates of 24.4 and 32 kb/s. Similar to the
framework in [2], it operates on overlapping input audio frames
)(nx but here it uses an alternative asymmetric window [1], which
Figure 1. Example of a MDCT vector according to the HVQ
model. The two spectral peaks and the corresponding surrounding
coefficients are accurately coded. The remaining bits are allocated
to LF content, and the rest of the vector is noise-filled.
has a smaller overlap compared to the standard sinusoid window
and hence a reduced algorithmic delay. The windowed, time-
aliased input )(~ nx is transformed to MDCT coefficients )(kX
using DCTIV:
,
2
1
2
1
cos)(~)(
1
0






















K
n
K
knnxkX

1,...,0  Kk .(1)
Here K is the size of the MDCT vector ( 640K for 32 kHz and
960K for 48 kHz sampled input) and k is the transform
coefficient index. The lower 224 coefficients (up to 5.6 kHz), for
24.4 kb/s, and 320 coefficients (up to 8 kHz), for 32 kb/s, are
coded by the introduced HVQ scheme. This lower spectrum range
is encoded without a band structure, but relies on identification of
spectral peaks to be encoded. The high-frequency (HF) range of
the MDCT spectrum is partitioned into bands, where each band
b is associated with a norm factor )(bN . These norm factors are
quantized and transmitted to the decoder [1]. The decoding method
includes spectral filling which populates the non-coded LF parts of
the spectrum, as well as the entire HF part, which is scaled up with
the set of norm factors.
3. HVQ
The signal model used in HVQ, to code the LF transform
coefficients, is illustrated in Figure 1. The MDCT vector
corresponding to a particular audio signal frame is assumed to be
composed of prominent spectral peaks. These peaks and their
surroundings are accurately coded, as described in sections 3.2 and
3.3. The number of peaks and consequently the number of bits
used to code the peak regions vary with time. The remaining bits
Coded Peak I
Noise-FillCoded LF
Coded Peak II
not used for peak coding are used to directly code non-peak LF
MDCT coefficients, as low frequencies are of higher perceptual
importance. This is described in section 3.4. The remaining parts
of the MDCT vector are noise-filled, as illustrated in section 3.5.
Thus, the major algorithmic steps at the HVQ encoder are: detect
and code spectral peaks regions, code LF spectral coefficients (the
size of the coded region depends on the number of remaining bits
after peak coding), code noise-floor gains for spectral coefficients
outside the peaks regions and code HF norm factors to be used
with the HF noise-fill.
3.1. Classification
As the HVQ concept is applied on harmonic signals, the first
essential algorithmic step is signal classification, i.e. to decide on
the activation of the harmonic mode. Since that decision is
dependent on the spectral peak structure, the peak selection is
performed at the same step. First, the instantaneous noise-
level )(kEne and peak-level )(kEpe are estimated from the
absolute values of the transform coefficients )(kX . The noise-
level is calculated as:
  ,1,,0,)(1)1()(  LkkXkEkE nene  (2)
with 224L at 24.4 kb/s and 320L at 32 kb/s and where


 
 .
otherwise6472.0
)1()(if9578.0 kEkX ne
 (3)
Similarly, the peak-level is calculated as:
  ,1,,0,)(1)1()(  LkkXkEkE pepe  (4)
where
,
otherwise8029.0
)1()(if4223.0


 

kEkX pe
 (5)
and both )1(neE and )1(peE are initialized to 800. Per-band
averages of noise-level )(bEne and peak-level )(bEpe are
calculated by averaging the corresponding instantaneous level in
bands of 32 bins. The number of bands is 7B (5.6 kHz) at 24.4
kb/s and 10B (8 kHz) at 32 kb/s.
These noise and peak-level averages are used to derive three
variables, used by the decision logic. The first variable is the
number of detected peaks peaksN . A threshold for selecting peak
candidates is calculated as:
).(
)(
)(
)(
88.0
bE
bE
bE
b pe
ne
pe








 (6)
Absolute values of the transform coefficients  kX are compared
to the threshold )(b , the    bkX  form a vector of peak
candidates. The threshold is adjusted on both sides of the peaks in
the previous frame, with }21,5.0,25.0,5.0,21{ around
each peak position from the previous frame. This stabilizes the
peak selection over frames.
Elements from the peaks candidate vector are extracted in
decreasing order of peak amplitude, until the maximum number of
peaks (17 at 24.4 kb/s and 23 for 32 kb/s) is reached. This
procedure results in a set of peaksN spectral peaks.
The second variable used in the classification logic is sharpN ,
calculated as the number of bands for which 9)( bSharpne ,
where the measure of frequency sharpness per-band )(bSharpne is
the peak to noise-floor ratio in each band, defined as:
,
)(
)(xam
)(
bE
bX
bSharp
ne
ne  (7)
where )(bX is the set of all coefficients in the band b .
The third variable sharpD is calculated as:
 .9)(
1
0




B
b
nesharp bSharpD (8)
The decision to switch to HVQ mode is based on comparing the
above defined variables to the thresholds in Table 1.
If ssharp TN  , ppeaks TN  , and dsharp TD  the EVS codec will
process the current frame in HVQ mode.
Rate s p d
24.4 kb/s 4 20 22
32 kb/s 7 23 22
Table 1. Thresholds for HVQ mode decision.
3.2. Peak gains and positions
The essence of the HVQ algorithm is in explicit coding of spectral
peak positions and amplitudes. The peaks amplitudes )(mGp ,
where m is the peak index, are scalar quantized in a logarithmic
domain to form the quantized peak gains )(ˆ mGp and differentially
coded by 5 bits. The codewords are additionally Huffman coded
[4]. Since the HVQ is used to code harmonic signals the peak
positions mp will typically be equally spaced. However, some
frames have an irregular harmonic structure or the peak picker does
not select all the peaks in the harmonic structure. This leads to
irregular peak locations. Therefore, two methods are used to code
the peak positions efficiently.
The first method is Delta and Huffman coding. Due to the
constraint that peaks cannot be closer than 2 MDCT bins
(otherwise they are considered as one peak), Deltas are defined as:
Mpp mmm   11 , (9)
where 3M . The lack of small deltas reduces the size of the
Huffman table. Additionally, the largest delta in the table is
51max
 . An alternative sparse coding scheme is used
if max
 . The reduced table size make the Huffman coding
efficient.
The second method is based on the sparse coding algorithm
described in [5]. The entire vector of the peak locations is coded,
forming another vector, with 1 indicating peak presence and 0 no
peak. This vector is then in a first layer divided into sub-vectors of
length 5. The elements of the sub-vectors are OR-ed and the
concatenation of the results of the OR operation for each first layer
vector form another vector (the second layer). Each bit in this
second layer indicates presence or absence of peaks in the 5-dim
sub-vector from the first layer. In this way only the 5-dim sub-
vectors from the first layer that are not indicated as all-zero by the
Sub-vector Index
10000 000
01000 001
00100 010
00010 011
00001 100
10010 101
10001 110
01001 111
Table 2. The set of 5-dim vectors corresponding to the possible
peaks positions are indexed with 3-bit codewords.
second layer have to be transmitted. The second layer is always
transmitted. Prior to transmission the non-zero sub-vectors from
the first layer are mapped to exploit the fact that peaks cannot be
closer than 2 positions, and not all 5-dim vector combinations are
therefore possible.
For example, the positional vector {01000, 00000, 00000,
00100}, with commas added to increase readability, is compressed
to {1001, 001, 010}. The decoder reads from the bitstream the
layer 2 vector 1001. These 4 bits indicate that what will follow in
the bitstream is a description of the 1st and the 4th group, while the
2nd and the 3rd have to be filled-in with zeroes. The peak
positions for the 1st and the 4th group are indicated by 3bit
indices, and extracted from Table 2.
The decision logic for selection of the peak position coding
scheme operates as follows: if max
)max(  , then the sparse
coding is selected, otherwise the coding scheme that uses fewer
bits is chosen.
3.3. Vector quantization of peak regions
After the peak gain and position quantization, a neighborhood of 4
MDCT coefficients around each peak is quantized. The peak
region coefficients (the peak itself and two neighboring bins on
each side) are all scaled down by the quantized peak gain  mGp
ˆ .
In this way the central bin is scaled to unit amplitude, while the
surrounding 4 bins are normalized in relation to the central one.
The shape vector mS of the peak region, centered at bin k is
defined as:
 
 
 .)2()1()1()2(
ˆ
1
 kXkXkXkX
mG
k
p
mS (10)
These shape vectors are quantized by means of a classified and
structured vector quantizer (VQ) [6], with a trained codebook (CB)
[7] with the 4-dim codevectors:
 .)3()2()1()0( iiiii
uuuuu (11)
The numbers of peak regions vary over frames, which leads to
large variation in complexity due to different number of CB
searches. For audio codecs there are generally constraints on the
computational complexity and the memory usage. A large number
of CB searches would then limit the CB size and that is not
desirable as the quantization error will increase.
To keep the complexity nearly constant, while achieving low
quantization error, the search for each mS is performed in a
structured CB, with dynamically selected offset and size of the
search region. The starting point for the search is determined by an
Number of
coded peaks
Number of vectors
searched in codebook
24.4kb/s 32kb/s
23 - 128
22 - 134
21 - 141
20 - 149
19 - 158
18 - 168
17 128 179
16 136 192
15 145 206
14 155 224
13 167 244
12 181 256
11 197 
10 217 
9 241 
8 256 
  
1 256 256
Table 3. Search space for 24.4 and 32 kb/s, as a function of the
number of spectral peaks for the current audio frame.
initial classification of the input shape vector, while the length of
the search region depends on the number of shape vectors to
quantize. The CB is classified into two classes, with the centroids
0
C and 1
C , and ordered such that the codewords iu closest to
0
C and most distant to 1
C are in one side of the CB, while the
codewords closest to 1
C and most distant to 0
C are clustered in
the other side of the CB. Additionally it is noted that the shape
vectors exhibit certain symmetries (the MDCT coefficients on both
sides of the spectral peak have similar statistics). Therefore half of
the codevectors can be represented by a flipped version of the
other half, which reduces the memory usage as the flipped part
does not have to be pre-stored. The search is performed both
among the codevectors i
u as well as the flipped codevectors:
 .)0()1()2()3( iiiii
f uuuuu (12)
Thus the searched CB (both physically stored and virtual) can be
seen as clustered into four classes with
centroids 0
C , 1
C , 0
fC and 1
fC .
The following algorithmic steps are executed for every input
shape vector mS ; First the minimum mean squared distance
between the input vector and the four
centroids, 0
C , 1
C , 0
fC and 1
fC is used to determine the starting
point of the search and the CB orientation. Then the search space
is dynamically adjusted such that when larger numbers of peaks are
to be quantized in the current frame, the search space is reduced to
limit the maximum complexity according to Table 3. The
maximum search space (full-search) is used with 8 peaks or less at
24.4 kb/s, and 12 peaks or less at 32 kb/s.
Since the codevectors in the CB are sorted by the distance
between each codevector and the centroids, the search procedure
goes first over the set of vectors that is likely to contain the best
match. This property leads to minimum performance loss even
though the search space is dynamically adjusted. This trade-off
between complexity and performance is illustrated in section 4.1.
3.4. Allocation of remaining bits
As the number of spectral peaks varies in time, the presented
coding scheme will by its nature result in a variable bitrate. Bits
available after peak coding are used to directly code LF MDCT
coefficients. The target coefficients )(kX , outside of the peak
regions, are concatenated into a vector )( ckZ which is partitioned
into bands of 24 coefficients. The coding always starts at the
beginning of the MDCT vector, e.g. if the available bits allow
coding of only one 24-dim band, then only the coefficients
23,,0),( cc kkZ will be coded. The number of bands zN
in )( ckZ that will be coded depends on the remaining number of
bits availR after peak coding, and is determined by:
   ,0,30modsignxam 





 maxavail
max
avail
z RR
R
R
N (13)
where 80maxR at 24.4 kb/s and 95maxR at 32 kb/s. The
resulting vector )( ckZ is encoded with a gain-shape VQ with 5
bits per gain and PVQ encoded [8] shape vectors.
3.5. High-frequency regions and noise-fill
The non-coded coefficients below 5.6 kHz for 24.4 kb/s and 8 kHz
for 32 kb/s are grouped into two sections and filled with random
noise, using noise floor gains derived from the per-band noise-
level averages described in section 3.1. The average of the
elements of the first half of neE gives the noise-floor gain )0(neG ,
while the second half of average noise levels forms )1(neG . These
norm factors are quantized and transmitted to the decoder.
The frequency part above 5.6 kHz for 24.4 kb/s and 8 kHz for
32 kb/s is reconstructed by means of coded HF gain factors )(bN
and spectral filling codebook generated from the coded LF part. In
the process of creating the noise-fill codebook, the LF peak
positions are also recorded. Exploiting that knowledge, the applied
envelope gains are modified based on the presence of a peak in the
fill codebook according to:
 





otherwise.,)1(),(),1(nim
bandinpeak),1(1.0)(8.0)1(1.0
bNbNbN
bbNbNbN
Nb (14)
The purpose of the gain modification is to avoid that a peak is
significantly attenuated if it happens that the corresponding gain
comes from a band without any peaks. Additionally, a noise-like
structure in the noise-fill codebook should not be amplified by
applying a strong gain that is calculated from an original band that
contained one or more peaks.
4. EVALUATION
This section consists of complexity measurements confirming the
efficiency of the proposed dynamic search space adjustment, as
well as subjective evaluation of the HVQ coding mode by mean of
an ABX test [9]. Further details on formal perceptual evaluation by
means of MOS [10] tests of the EVS codec can be found in the
selection tests, available at [11].
4.1. Complexity measurements
The computational complexity in this section is measured
according to [12] and was obtained from a simulation over 1312
harmonic signal frames. The data contained on average of 19.4
peaks per frame with a variance of 7.1 (minimum 12 and maximum
23 peaks). As illustrated in Table 4 the use of adaptive search in
the classified structured codebook gives significant complexity
reduction with maintained accuracy. The adaptive search reduces
the maximum complexity with 45% while the average SNR
decreases by only 0.2%.
Search
Full Adaptive
Average SNR [dB] 8.743 8.723
Max complexity [WMOPS] 4.857 2.638
Table 4. Average SNR and maximum computational
complexity for the shape VQ with full and adaptive search.
4.2. Subjective evaluation
Subjective evaluation of the introduced algorithm has been
performed in terms of a 5 level ABX test. After being presented
with the reference signal and a pair of A and B samples in random
order, the listeners had to choose among the following options: B
much better (-2), B slightly better (-1), A and B are the same (0), A
slightly better (+1), A much better (+2). The systems under test
were the EVS codec with active HVQ module (option A) and the
EVS baseline with HVQ module deactivated (option B). The test
consisted of 10 harmonic items, and the votes from 7 experienced
listeners are presented in Table 5. The average results are presented
in Figure 2.
Vote -2 -1 0 +1 +2
Count 3 24 42 50 21
Table 5. Results from the subjective ABX test indicate clear
preference for the EVS codec with active HVQ module (positive
scores), compared to EVS with deactivated HVQ module (negative
scores).
Figure 2. Mean score and 95% confidence interval indicate
statistically significant preference of HVQ module in the ABX test.
5. CONCLUSIONS
By means of a subjective test we have demonstrated that the
proposed concept of a harmonic audio coding model provides
superior perceptual quality over the conventional coding scheme.
Complexity measurements confirm that the introduced adaptive
VQ concept provides significant complexity reduction while
maintaining quantization errors at the level of exhaustive search.
These subjective and objective measurements show that the HVQ
improves coding of harmonic audio signals.
6. REFERENCES
[1] 3GPP TS 26.445, “Codec for Enhanced Voice Services
(EVS); Detailed algorithmic description,” 2014.
[2] ITU-T Rec. G.719, “Low-complexity full-band audio coding
for high-quality conversational applications,” 2008.
[3] H. S. Malvar, “Signal Processing with Lapped Transforms,”
Norwood, MA: Artech House, 1992.
[4] D. Huffman, “A Method for the Construction of Minimum-
Redundancy Codes,” Proc. IRE 40 (9), 1952.
[5] D. Salomon, G. Motta, and D. Bryant, “Sparse Strings” in
“Handbook of Data Compression,” Springer, 2010.
[6] A. Gersho and R. Gray, “Vector Quantization and Signal
Compression,” Springer, 1991.
[7] Y. Linde, A. Burzo, and R. Gray, “An algorithm for vector
quantizer design,” IEEE Trans. Commun. COM-28, 1980.
[8] T. Fischer, “A Pyramid Vector quantizer,” IEEE Transact.
Inf. Theory, IT-32 (4), 1986.
[9] D. Clark, “High-Resolution Subjective Testing Using a
Double-Blind Comparator,” J. Audio Eng. Soc. 30 (5), 1982.
[10] ITU-T Rec. P.800, “Methods for Subjective Determination
of Transmission Quality,” 1996.
[11] 3GPP Tdoc S4-141065, “Report of the Global Analysis Lab
for the EVS Selection Phase,” Dynastat Inc., 2014.
[12] ITU-T Rec. G.191, “Software Tool Library 2009 User’s
Manual,” Geneva, 2009.

Weitere ähnliche Inhalte

Was ist angesagt?

Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...
Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...
Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...CSCJournals
 
Multiuser MIMO Vector Perturbation Precoding
Multiuser MIMO Vector Perturbation PrecodingMultiuser MIMO Vector Perturbation Precoding
Multiuser MIMO Vector Perturbation Precodingadeelrazi
 
Optimum Receiver corrupted by AWGN Channel
Optimum Receiver corrupted by AWGN ChannelOptimum Receiver corrupted by AWGN Channel
Optimum Receiver corrupted by AWGN ChannelAWANISHKUMAR84
 
Performance Analysis of M-ary Optical CDMA in Presence of Chromatic Dispersion
Performance Analysis of M-ary Optical CDMA in Presence of Chromatic DispersionPerformance Analysis of M-ary Optical CDMA in Presence of Chromatic Dispersion
Performance Analysis of M-ary Optical CDMA in Presence of Chromatic DispersionIDES Editor
 
BANDWIDTH UTILIZATION
BANDWIDTH UTILIZATIONBANDWIDTH UTILIZATION
BANDWIDTH UTILIZATIONAvijeet Negel
 
ADDITTIVE WHITE GAUSIAN NOIS ( AWGN)
ADDITTIVE WHITE GAUSIAN NOIS ( AWGN)ADDITTIVE WHITE GAUSIAN NOIS ( AWGN)
ADDITTIVE WHITE GAUSIAN NOIS ( AWGN)mohammedalimahdi
 
Paper id 25201478
Paper id 25201478Paper id 25201478
Paper id 25201478IJRAT
 
Cooperative partial transmit sequence for papr reduction in space frequency b...
Cooperative partial transmit sequence for papr reduction in space frequency b...Cooperative partial transmit sequence for papr reduction in space frequency b...
Cooperative partial transmit sequence for papr reduction in space frequency b...IAEME Publication
 
A Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
A Threshold Enhancement Technique for Chaotic On-Off Keying SchemeA Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
A Threshold Enhancement Technique for Chaotic On-Off Keying SchemeCSCJournals
 
EE402B Radio Systems and Personal Communication Networks notes
EE402B Radio Systems and Personal Communication Networks notesEE402B Radio Systems and Personal Communication Networks notes
EE402B Radio Systems and Personal Communication Networks notesHaris Hassan
 
Ch04
Ch04Ch04
Ch04H K
 
Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...
Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...
Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...ijcnac
 
A novel and efficient mixed-signal compressed sensing for wide-band cognitive...
A novel and efficient mixed-signal compressed sensing for wide-band cognitive...A novel and efficient mixed-signal compressed sensing for wide-band cognitive...
A novel and efficient mixed-signal compressed sensing for wide-band cognitive...Polytechnique Montreal
 

Was ist angesagt? (20)

Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...
Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...
Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...
 
Modulation techniques matlab_code
Modulation techniques matlab_codeModulation techniques matlab_code
Modulation techniques matlab_code
 
Multiuser MIMO Vector Perturbation Precoding
Multiuser MIMO Vector Perturbation PrecodingMultiuser MIMO Vector Perturbation Precoding
Multiuser MIMO Vector Perturbation Precoding
 
Optimum Receiver corrupted by AWGN Channel
Optimum Receiver corrupted by AWGN ChannelOptimum Receiver corrupted by AWGN Channel
Optimum Receiver corrupted by AWGN Channel
 
Ch4 1 v1
Ch4 1 v1Ch4 1 v1
Ch4 1 v1
 
wavelet packets
wavelet packetswavelet packets
wavelet packets
 
Sistec ppt
Sistec pptSistec ppt
Sistec ppt
 
Performance Analysis of M-ary Optical CDMA in Presence of Chromatic Dispersion
Performance Analysis of M-ary Optical CDMA in Presence of Chromatic DispersionPerformance Analysis of M-ary Optical CDMA in Presence of Chromatic Dispersion
Performance Analysis of M-ary Optical CDMA in Presence of Chromatic Dispersion
 
Masters Report 1
Masters Report 1Masters Report 1
Masters Report 1
 
BANDWIDTH UTILIZATION
BANDWIDTH UTILIZATIONBANDWIDTH UTILIZATION
BANDWIDTH UTILIZATION
 
ADDITTIVE WHITE GAUSIAN NOIS ( AWGN)
ADDITTIVE WHITE GAUSIAN NOIS ( AWGN)ADDITTIVE WHITE GAUSIAN NOIS ( AWGN)
ADDITTIVE WHITE GAUSIAN NOIS ( AWGN)
 
Paper id 25201478
Paper id 25201478Paper id 25201478
Paper id 25201478
 
Cooperative partial transmit sequence for papr reduction in space frequency b...
Cooperative partial transmit sequence for papr reduction in space frequency b...Cooperative partial transmit sequence for papr reduction in space frequency b...
Cooperative partial transmit sequence for papr reduction in space frequency b...
 
A Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
A Threshold Enhancement Technique for Chaotic On-Off Keying SchemeA Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
A Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
 
EE402B Radio Systems and Personal Communication Networks notes
EE402B Radio Systems and Personal Communication Networks notesEE402B Radio Systems and Personal Communication Networks notes
EE402B Radio Systems and Personal Communication Networks notes
 
A0530106
A0530106A0530106
A0530106
 
Ch04
Ch04Ch04
Ch04
 
Waveform Coding
Waveform CodingWaveform Coding
Waveform Coding
 
Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...
Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...
Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...
 
A novel and efficient mixed-signal compressed sensing for wide-band cognitive...
A novel and efficient mixed-signal compressed sensing for wide-band cognitive...A novel and efficient mixed-signal compressed sensing for wide-band cognitive...
A novel and efficient mixed-signal compressed sensing for wide-band cognitive...
 

Andere mochten auch

Ericsson Sustainability and Corporate Responsibility Report 2012
Ericsson Sustainability and Corporate Responsibility Report 2012Ericsson Sustainability and Corporate Responsibility Report 2012
Ericsson Sustainability and Corporate Responsibility Report 2012Ericsson
 
Ericsson ConsumerLab: Personal Information Economy
   Ericsson ConsumerLab: Personal Information Economy   Ericsson ConsumerLab: Personal Information Economy
Ericsson ConsumerLab: Personal Information EconomyEricsson
 
Winning the Game
Winning the GameWinning the Game
Winning the GameEricsson
 
The sharing economy
The sharing economyThe sharing economy
The sharing economyEricsson
 
Ericsson Review: HSPA evolution for future mobile-broadband needs
Ericsson Review: HSPA evolution for future mobile-broadband needsEricsson Review: HSPA evolution for future mobile-broadband needs
Ericsson Review: HSPA evolution for future mobile-broadband needsEricsson
 
MDCT audio coding with pulse vector quantizers
MDCT audio coding with pulse vector quantizersMDCT audio coding with pulse vector quantizers
MDCT audio coding with pulse vector quantizersEricsson
 
Ericsson ConsumerLab: Connected lifestyles’ expectations identified
Ericsson ConsumerLab: Connected lifestyles’ expectations identifiedEricsson ConsumerLab: Connected lifestyles’ expectations identified
Ericsson ConsumerLab: Connected lifestyles’ expectations identifiedEricsson
 
Internet of Things from a Networked Society perspective
Internet of Things from a Networked Society perspectiveInternet of Things from a Networked Society perspective
Internet of Things from a Networked Society perspectiveEricsson
 
Consumers and the Internet of Things
Consumers and the Internet of ThingsConsumers and the Internet of Things
Consumers and the Internet of ThingsEricsson
 
API Market Demand
API Market DemandAPI Market Demand
API Market DemandEricsson
 
Ericsson Review: The benefits of self-organizing backhaul networks
Ericsson Review: The benefits of self-organizing backhaul networksEricsson Review: The benefits of self-organizing backhaul networks
Ericsson Review: The benefits of self-organizing backhaul networksEricsson
 
Ericsson Technology Review, issue #1, 2016
Ericsson Technology Review, issue #1, 2016Ericsson Technology Review, issue #1, 2016
Ericsson Technology Review, issue #1, 2016Ericsson
 
Scaling Internet of Things
Scaling Internet of ThingsScaling Internet of Things
Scaling Internet of ThingsEricsson
 
Ericsson ConsumerLab: The one-click ideal - Presentation
Ericsson ConsumerLab: The one-click ideal - PresentationEricsson ConsumerLab: The one-click ideal - Presentation
Ericsson ConsumerLab: The one-click ideal - PresentationEricsson
 

Andere mochten auch (14)

Ericsson Sustainability and Corporate Responsibility Report 2012
Ericsson Sustainability and Corporate Responsibility Report 2012Ericsson Sustainability and Corporate Responsibility Report 2012
Ericsson Sustainability and Corporate Responsibility Report 2012
 
Ericsson ConsumerLab: Personal Information Economy
   Ericsson ConsumerLab: Personal Information Economy   Ericsson ConsumerLab: Personal Information Economy
Ericsson ConsumerLab: Personal Information Economy
 
Winning the Game
Winning the GameWinning the Game
Winning the Game
 
The sharing economy
The sharing economyThe sharing economy
The sharing economy
 
Ericsson Review: HSPA evolution for future mobile-broadband needs
Ericsson Review: HSPA evolution for future mobile-broadband needsEricsson Review: HSPA evolution for future mobile-broadband needs
Ericsson Review: HSPA evolution for future mobile-broadband needs
 
MDCT audio coding with pulse vector quantizers
MDCT audio coding with pulse vector quantizersMDCT audio coding with pulse vector quantizers
MDCT audio coding with pulse vector quantizers
 
Ericsson ConsumerLab: Connected lifestyles’ expectations identified
Ericsson ConsumerLab: Connected lifestyles’ expectations identifiedEricsson ConsumerLab: Connected lifestyles’ expectations identified
Ericsson ConsumerLab: Connected lifestyles’ expectations identified
 
Internet of Things from a Networked Society perspective
Internet of Things from a Networked Society perspectiveInternet of Things from a Networked Society perspective
Internet of Things from a Networked Society perspective
 
Consumers and the Internet of Things
Consumers and the Internet of ThingsConsumers and the Internet of Things
Consumers and the Internet of Things
 
API Market Demand
API Market DemandAPI Market Demand
API Market Demand
 
Ericsson Review: The benefits of self-organizing backhaul networks
Ericsson Review: The benefits of self-organizing backhaul networksEricsson Review: The benefits of self-organizing backhaul networks
Ericsson Review: The benefits of self-organizing backhaul networks
 
Ericsson Technology Review, issue #1, 2016
Ericsson Technology Review, issue #1, 2016Ericsson Technology Review, issue #1, 2016
Ericsson Technology Review, issue #1, 2016
 
Scaling Internet of Things
Scaling Internet of ThingsScaling Internet of Things
Scaling Internet of Things
 
Ericsson ConsumerLab: The one-click ideal - Presentation
Ericsson ConsumerLab: The one-click ideal - PresentationEricsson ConsumerLab: The one-click ideal - Presentation
Ericsson ConsumerLab: The one-click ideal - Presentation
 

Ähnlich wie Harmonic vector quantization

EBDSS Max Research Report - Final
EBDSS  Max  Research Report - FinalEBDSS  Max  Research Report - Final
EBDSS Max Research Report - FinalMax Robertson
 
Modulation technology
Modulation technologyModulation technology
Modulation technologyPei-Che Chang
 
An identification of the tolerable time-interleaved analog-todigital convert...
An identification of the tolerable time-interleaved analog-todigital convert...An identification of the tolerable time-interleaved analog-todigital convert...
An identification of the tolerable time-interleaved analog-todigital convert...IJECEIAES
 
Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...
Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...
Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...ijsrd.com
 
Optical Spatial Modulation with Transmitter-Receiver Alignments
Optical Spatial Modulation with Transmitter-Receiver AlignmentsOptical Spatial Modulation with Transmitter-Receiver Alignments
Optical Spatial Modulation with Transmitter-Receiver AlignmentsMarwan Hammouda
 
Iaetsd performance analysis of multicarrier ds-cdma
Iaetsd performance analysis of multicarrier ds-cdmaIaetsd performance analysis of multicarrier ds-cdma
Iaetsd performance analysis of multicarrier ds-cdmaIaetsd Iaetsd
 
Introduction to communication system lecture5
Introduction to communication system lecture5Introduction to communication system lecture5
Introduction to communication system lecture5Jumaan Ally Mohamed
 
Efficient pu mode decision and motion estimation for h.264 avc to hevc transc...
Efficient pu mode decision and motion estimation for h.264 avc to hevc transc...Efficient pu mode decision and motion estimation for h.264 avc to hevc transc...
Efficient pu mode decision and motion estimation for h.264 avc to hevc transc...sipij
 
Performance bounds for unequally punctured terminated convolutional codes
Performance bounds for unequally punctured terminated convolutional codesPerformance bounds for unequally punctured terminated convolutional codes
Performance bounds for unequally punctured terminated convolutional codeseSAT Journals
 
Vibration source identification caused by bearing faults based on SVD-EMD-ICA
Vibration source identification caused by bearing faults based on SVD-EMD-ICAVibration source identification caused by bearing faults based on SVD-EMD-ICA
Vibration source identification caused by bearing faults based on SVD-EMD-ICAIJRES Journal
 
Investigation of repeated blasts at Aitik mine using waveform cross correlation
Investigation of repeated blasts at Aitik mine using waveform cross correlationInvestigation of repeated blasts at Aitik mine using waveform cross correlation
Investigation of repeated blasts at Aitik mine using waveform cross correlationIvan Kitov
 
_Digital-Modulation.pdf
_Digital-Modulation.pdf_Digital-Modulation.pdf
_Digital-Modulation.pdfSoyallRobi
 
Analysis of LDPC Codes under Wi-Max IEEE 802.16e
Analysis of LDPC Codes under Wi-Max IEEE 802.16eAnalysis of LDPC Codes under Wi-Max IEEE 802.16e
Analysis of LDPC Codes under Wi-Max IEEE 802.16eIJERD Editor
 

Ähnlich wie Harmonic vector quantization (20)

EBDSS Max Research Report - Final
EBDSS  Max  Research Report - FinalEBDSS  Max  Research Report - Final
EBDSS Max Research Report - Final
 
Modulation technology
Modulation technologyModulation technology
Modulation technology
 
t23notes
t23notest23notes
t23notes
 
9517cnc05
9517cnc059517cnc05
9517cnc05
 
Mk3621242127
Mk3621242127Mk3621242127
Mk3621242127
 
An identification of the tolerable time-interleaved analog-todigital convert...
An identification of the tolerable time-interleaved analog-todigital convert...An identification of the tolerable time-interleaved analog-todigital convert...
An identification of the tolerable time-interleaved analog-todigital convert...
 
7 227 2005
7 227 20057 227 2005
7 227 2005
 
Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...
Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...
Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...
 
Optical Spatial Modulation with Transmitter-Receiver Alignments
Optical Spatial Modulation with Transmitter-Receiver AlignmentsOptical Spatial Modulation with Transmitter-Receiver Alignments
Optical Spatial Modulation with Transmitter-Receiver Alignments
 
Cdma and 3 g
Cdma and 3 gCdma and 3 g
Cdma and 3 g
 
Iaetsd performance analysis of multicarrier ds-cdma
Iaetsd performance analysis of multicarrier ds-cdmaIaetsd performance analysis of multicarrier ds-cdma
Iaetsd performance analysis of multicarrier ds-cdma
 
Introduction to communication system lecture5
Introduction to communication system lecture5Introduction to communication system lecture5
Introduction to communication system lecture5
 
Bf4102414417
Bf4102414417Bf4102414417
Bf4102414417
 
Efficient pu mode decision and motion estimation for h.264 avc to hevc transc...
Efficient pu mode decision and motion estimation for h.264 avc to hevc transc...Efficient pu mode decision and motion estimation for h.264 avc to hevc transc...
Efficient pu mode decision and motion estimation for h.264 avc to hevc transc...
 
Performance bounds for unequally punctured terminated convolutional codes
Performance bounds for unequally punctured terminated convolutional codesPerformance bounds for unequally punctured terminated convolutional codes
Performance bounds for unequally punctured terminated convolutional codes
 
Vibration source identification caused by bearing faults based on SVD-EMD-ICA
Vibration source identification caused by bearing faults based on SVD-EMD-ICAVibration source identification caused by bearing faults based on SVD-EMD-ICA
Vibration source identification caused by bearing faults based on SVD-EMD-ICA
 
ASK,FSK and M-PSK using Matlab
ASK,FSK and M-PSK using MatlabASK,FSK and M-PSK using Matlab
ASK,FSK and M-PSK using Matlab
 
Investigation of repeated blasts at Aitik mine using waveform cross correlation
Investigation of repeated blasts at Aitik mine using waveform cross correlationInvestigation of repeated blasts at Aitik mine using waveform cross correlation
Investigation of repeated blasts at Aitik mine using waveform cross correlation
 
_Digital-Modulation.pdf
_Digital-Modulation.pdf_Digital-Modulation.pdf
_Digital-Modulation.pdf
 
Analysis of LDPC Codes under Wi-Max IEEE 802.16e
Analysis of LDPC Codes under Wi-Max IEEE 802.16eAnalysis of LDPC Codes under Wi-Max IEEE 802.16e
Analysis of LDPC Codes under Wi-Max IEEE 802.16e
 

Mehr von Ericsson

Ericsson Technology Review: Versatile Video Coding explained – the future of ...
Ericsson Technology Review: Versatile Video Coding explained – the future of ...Ericsson Technology Review: Versatile Video Coding explained – the future of ...
Ericsson Technology Review: Versatile Video Coding explained – the future of ...Ericsson
 
Ericsson Technology Review: issue 2, 2020
 Ericsson Technology Review: issue 2, 2020 Ericsson Technology Review: issue 2, 2020
Ericsson Technology Review: issue 2, 2020Ericsson
 
Ericsson Technology Review: Integrated access and backhaul – a new type of wi...
Ericsson Technology Review: Integrated access and backhaul – a new type of wi...Ericsson Technology Review: Integrated access and backhaul – a new type of wi...
Ericsson Technology Review: Integrated access and backhaul – a new type of wi...Ericsson
 
Ericsson Technology Review: Critical IoT connectivity: Ideal for time-critica...
Ericsson Technology Review: Critical IoT connectivity: Ideal for time-critica...Ericsson Technology Review: Critical IoT connectivity: Ideal for time-critica...
Ericsson Technology Review: Critical IoT connectivity: Ideal for time-critica...Ericsson
 
Ericsson Technology Review: 5G evolution: 3GPP releases 16 & 17 overview (upd...
Ericsson Technology Review: 5G evolution: 3GPP releases 16 & 17 overview (upd...Ericsson Technology Review: 5G evolution: 3GPP releases 16 & 17 overview (upd...
Ericsson Technology Review: 5G evolution: 3GPP releases 16 & 17 overview (upd...Ericsson
 
Ericsson Technology Review: The future of cloud computing: Highly distributed...
Ericsson Technology Review: The future of cloud computing: Highly distributed...Ericsson Technology Review: The future of cloud computing: Highly distributed...
Ericsson Technology Review: The future of cloud computing: Highly distributed...Ericsson
 
Ericsson Technology Review: Optimizing UICC modules for IoT applications
Ericsson Technology Review: Optimizing UICC modules for IoT applicationsEricsson Technology Review: Optimizing UICC modules for IoT applications
Ericsson Technology Review: Optimizing UICC modules for IoT applicationsEricsson
 
Ericsson Technology Review: issue 1, 2020
Ericsson Technology Review: issue 1, 2020Ericsson Technology Review: issue 1, 2020
Ericsson Technology Review: issue 1, 2020Ericsson
 
Ericsson Technology Review: 5G BSS: Evolving BSS to fit the 5G economy
Ericsson Technology Review: 5G BSS: Evolving BSS to fit the 5G economyEricsson Technology Review: 5G BSS: Evolving BSS to fit the 5G economy
Ericsson Technology Review: 5G BSS: Evolving BSS to fit the 5G economyEricsson
 
Ericsson Technology Review: 5G migration strategy from EPS to 5G system
Ericsson Technology Review: 5G migration strategy from EPS to 5G systemEricsson Technology Review: 5G migration strategy from EPS to 5G system
Ericsson Technology Review: 5G migration strategy from EPS to 5G systemEricsson
 
Ericsson Technology Review: Creating the next-generation edge-cloud ecosystem
Ericsson Technology Review: Creating the next-generation edge-cloud ecosystemEricsson Technology Review: Creating the next-generation edge-cloud ecosystem
Ericsson Technology Review: Creating the next-generation edge-cloud ecosystemEricsson
 
Ericsson Technology Review: Issue 2/2019
Ericsson Technology Review: Issue 2/2019Ericsson Technology Review: Issue 2/2019
Ericsson Technology Review: Issue 2/2019Ericsson
 
Ericsson Technology Review: Spotlight on the Internet of Things
Ericsson Technology Review: Spotlight on the Internet of ThingsEricsson Technology Review: Spotlight on the Internet of Things
Ericsson Technology Review: Spotlight on the Internet of ThingsEricsson
 
Ericsson Technology Review - Technology Trends 2019
Ericsson Technology Review - Technology Trends 2019Ericsson Technology Review - Technology Trends 2019
Ericsson Technology Review - Technology Trends 2019Ericsson
 
Ericsson Technology Review: Driving transformation in the automotive and road...
Ericsson Technology Review: Driving transformation in the automotive and road...Ericsson Technology Review: Driving transformation in the automotive and road...
Ericsson Technology Review: Driving transformation in the automotive and road...Ericsson
 
SD-WAN Orchestration
SD-WAN OrchestrationSD-WAN Orchestration
SD-WAN OrchestrationEricsson
 
Ericsson Technology Review: 5G-TSN integration meets networking requirements ...
Ericsson Technology Review: 5G-TSN integration meets networking requirements ...Ericsson Technology Review: 5G-TSN integration meets networking requirements ...
Ericsson Technology Review: 5G-TSN integration meets networking requirements ...Ericsson
 
Ericsson Technology Review: Meeting 5G latency requirements with inactive state
Ericsson Technology Review: Meeting 5G latency requirements with inactive stateEricsson Technology Review: Meeting 5G latency requirements with inactive state
Ericsson Technology Review: Meeting 5G latency requirements with inactive stateEricsson
 
Ericsson Technology Review: Cloud-native application design in the telecom do...
Ericsson Technology Review: Cloud-native application design in the telecom do...Ericsson Technology Review: Cloud-native application design in the telecom do...
Ericsson Technology Review: Cloud-native application design in the telecom do...Ericsson
 
Ericsson Technology Review: Service exposure: a critical capability in a 5G w...
Ericsson Technology Review: Service exposure: a critical capability in a 5G w...Ericsson Technology Review: Service exposure: a critical capability in a 5G w...
Ericsson Technology Review: Service exposure: a critical capability in a 5G w...Ericsson
 

Mehr von Ericsson (20)

Ericsson Technology Review: Versatile Video Coding explained – the future of ...
Ericsson Technology Review: Versatile Video Coding explained – the future of ...Ericsson Technology Review: Versatile Video Coding explained – the future of ...
Ericsson Technology Review: Versatile Video Coding explained – the future of ...
 
Ericsson Technology Review: issue 2, 2020
 Ericsson Technology Review: issue 2, 2020 Ericsson Technology Review: issue 2, 2020
Ericsson Technology Review: issue 2, 2020
 
Ericsson Technology Review: Integrated access and backhaul – a new type of wi...
Ericsson Technology Review: Integrated access and backhaul – a new type of wi...Ericsson Technology Review: Integrated access and backhaul – a new type of wi...
Ericsson Technology Review: Integrated access and backhaul – a new type of wi...
 
Ericsson Technology Review: Critical IoT connectivity: Ideal for time-critica...
Ericsson Technology Review: Critical IoT connectivity: Ideal for time-critica...Ericsson Technology Review: Critical IoT connectivity: Ideal for time-critica...
Ericsson Technology Review: Critical IoT connectivity: Ideal for time-critica...
 
Ericsson Technology Review: 5G evolution: 3GPP releases 16 & 17 overview (upd...
Ericsson Technology Review: 5G evolution: 3GPP releases 16 & 17 overview (upd...Ericsson Technology Review: 5G evolution: 3GPP releases 16 & 17 overview (upd...
Ericsson Technology Review: 5G evolution: 3GPP releases 16 & 17 overview (upd...
 
Ericsson Technology Review: The future of cloud computing: Highly distributed...
Ericsson Technology Review: The future of cloud computing: Highly distributed...Ericsson Technology Review: The future of cloud computing: Highly distributed...
Ericsson Technology Review: The future of cloud computing: Highly distributed...
 
Ericsson Technology Review: Optimizing UICC modules for IoT applications
Ericsson Technology Review: Optimizing UICC modules for IoT applicationsEricsson Technology Review: Optimizing UICC modules for IoT applications
Ericsson Technology Review: Optimizing UICC modules for IoT applications
 
Ericsson Technology Review: issue 1, 2020
Ericsson Technology Review: issue 1, 2020Ericsson Technology Review: issue 1, 2020
Ericsson Technology Review: issue 1, 2020
 
Ericsson Technology Review: 5G BSS: Evolving BSS to fit the 5G economy
Ericsson Technology Review: 5G BSS: Evolving BSS to fit the 5G economyEricsson Technology Review: 5G BSS: Evolving BSS to fit the 5G economy
Ericsson Technology Review: 5G BSS: Evolving BSS to fit the 5G economy
 
Ericsson Technology Review: 5G migration strategy from EPS to 5G system
Ericsson Technology Review: 5G migration strategy from EPS to 5G systemEricsson Technology Review: 5G migration strategy from EPS to 5G system
Ericsson Technology Review: 5G migration strategy from EPS to 5G system
 
Ericsson Technology Review: Creating the next-generation edge-cloud ecosystem
Ericsson Technology Review: Creating the next-generation edge-cloud ecosystemEricsson Technology Review: Creating the next-generation edge-cloud ecosystem
Ericsson Technology Review: Creating the next-generation edge-cloud ecosystem
 
Ericsson Technology Review: Issue 2/2019
Ericsson Technology Review: Issue 2/2019Ericsson Technology Review: Issue 2/2019
Ericsson Technology Review: Issue 2/2019
 
Ericsson Technology Review: Spotlight on the Internet of Things
Ericsson Technology Review: Spotlight on the Internet of ThingsEricsson Technology Review: Spotlight on the Internet of Things
Ericsson Technology Review: Spotlight on the Internet of Things
 
Ericsson Technology Review - Technology Trends 2019
Ericsson Technology Review - Technology Trends 2019Ericsson Technology Review - Technology Trends 2019
Ericsson Technology Review - Technology Trends 2019
 
Ericsson Technology Review: Driving transformation in the automotive and road...
Ericsson Technology Review: Driving transformation in the automotive and road...Ericsson Technology Review: Driving transformation in the automotive and road...
Ericsson Technology Review: Driving transformation in the automotive and road...
 
SD-WAN Orchestration
SD-WAN OrchestrationSD-WAN Orchestration
SD-WAN Orchestration
 
Ericsson Technology Review: 5G-TSN integration meets networking requirements ...
Ericsson Technology Review: 5G-TSN integration meets networking requirements ...Ericsson Technology Review: 5G-TSN integration meets networking requirements ...
Ericsson Technology Review: 5G-TSN integration meets networking requirements ...
 
Ericsson Technology Review: Meeting 5G latency requirements with inactive state
Ericsson Technology Review: Meeting 5G latency requirements with inactive stateEricsson Technology Review: Meeting 5G latency requirements with inactive state
Ericsson Technology Review: Meeting 5G latency requirements with inactive state
 
Ericsson Technology Review: Cloud-native application design in the telecom do...
Ericsson Technology Review: Cloud-native application design in the telecom do...Ericsson Technology Review: Cloud-native application design in the telecom do...
Ericsson Technology Review: Cloud-native application design in the telecom do...
 
Ericsson Technology Review: Service exposure: a critical capability in a 5G w...
Ericsson Technology Review: Service exposure: a critical capability in a 5G w...Ericsson Technology Review: Service exposure: a critical capability in a 5G w...
Ericsson Technology Review: Service exposure: a critical capability in a 5G w...
 

Kürzlich hochgeladen

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 

Kürzlich hochgeladen (20)

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 

Harmonic vector quantization

  • 1. HARMONIC VECTOR QUANTIZATION Volodya Grancharov, Sigurdur Sverrisson, Erik Norvell, Tomas Toftgård, Jonas Svedberg, and Harald Pobloth SMN, Ericsson Research, Ericsson AB 164 80, Stockholm, Sweden ABSTRACT Audio coding of harmonic signals is a challenging task for conventional MDCT coding schemes. In this paper we introduce a novel algorithm for improved transform coding of harmonic audio. The algorithm does not deploy the conventional scheme of splitting the input signal into a spectrum envelope and a residual, but models the spectral peak regions. Test results indicate that the presented algorithm outperforms the conventional coding concept. Index Terms— Audio coding, MDCT, VQ 1. INTRODUCTION Transform coding is one main technology used to compress and transmit audio signals. Typically the Modified Discrete Cosine Transform (MDCT) [3] is used and the vector of MDCT coefficients from one frame of audio is split into multiple bands of pre-defined width. Then the energy in each band is calculated and quantized. These energies are used to produce a residual vector by scaling down the MDCT vector. Unfortunately, this commonly used concept does not work well for transform coding of harmonic audio signals, e.g. single instruments. The reason is that normalization with band energies does not result in sufficiently “flat” residual vectors, and the residual coding scheme cannot represent the dynamics in the residual vector. Moreover, modeling the pitch in the MDCT domain is a cumbersome task due to the mix of time- and frequency domain information. The presented algorithm provides an alternative audio coding model that can efficiently process harmonic audio signals. The main concept is that the MDCT vector of low-frequencies (LF) is not split in envelope and residual, but instead spectral peaks are directly extracted and quantized together with neighboring MDCT bins. Low energy regions, outside the spectral peaks neighborhood, are not coded but noise-filled at the decoder. In this way, the conventional coding model: “spectrum envelope and residual” is replaced with the concept of: “spectral peaks and noise-floor”. 2. MDCT FRAMEWORK The presented Harmonic Vector Quantization (HVQ) algorithm is integrated into the harmonic part of the EVS codec [1] to deal with stationary harmonic content. It is targeted for the sample rates of 32 and 48 kHz, and bit rates of 24.4 and 32 kb/s. Similar to the framework in [2], it operates on overlapping input audio frames )(nx but here it uses an alternative asymmetric window [1], which Figure 1. Example of a MDCT vector according to the HVQ model. The two spectral peaks and the corresponding surrounding coefficients are accurately coded. The remaining bits are allocated to LF content, and the rest of the vector is noise-filled. has a smaller overlap compared to the standard sinusoid window and hence a reduced algorithmic delay. The windowed, time- aliased input )(~ nx is transformed to MDCT coefficients )(kX using DCTIV: , 2 1 2 1 cos)(~)( 1 0                       K n K knnxkX  1,...,0  Kk .(1) Here K is the size of the MDCT vector ( 640K for 32 kHz and 960K for 48 kHz sampled input) and k is the transform coefficient index. The lower 224 coefficients (up to 5.6 kHz), for 24.4 kb/s, and 320 coefficients (up to 8 kHz), for 32 kb/s, are coded by the introduced HVQ scheme. This lower spectrum range is encoded without a band structure, but relies on identification of spectral peaks to be encoded. The high-frequency (HF) range of the MDCT spectrum is partitioned into bands, where each band b is associated with a norm factor )(bN . These norm factors are quantized and transmitted to the decoder [1]. The decoding method includes spectral filling which populates the non-coded LF parts of the spectrum, as well as the entire HF part, which is scaled up with the set of norm factors. 3. HVQ The signal model used in HVQ, to code the LF transform coefficients, is illustrated in Figure 1. The MDCT vector corresponding to a particular audio signal frame is assumed to be composed of prominent spectral peaks. These peaks and their surroundings are accurately coded, as described in sections 3.2 and 3.3. The number of peaks and consequently the number of bits used to code the peak regions vary with time. The remaining bits Coded Peak I Noise-FillCoded LF Coded Peak II
  • 2. not used for peak coding are used to directly code non-peak LF MDCT coefficients, as low frequencies are of higher perceptual importance. This is described in section 3.4. The remaining parts of the MDCT vector are noise-filled, as illustrated in section 3.5. Thus, the major algorithmic steps at the HVQ encoder are: detect and code spectral peaks regions, code LF spectral coefficients (the size of the coded region depends on the number of remaining bits after peak coding), code noise-floor gains for spectral coefficients outside the peaks regions and code HF norm factors to be used with the HF noise-fill. 3.1. Classification As the HVQ concept is applied on harmonic signals, the first essential algorithmic step is signal classification, i.e. to decide on the activation of the harmonic mode. Since that decision is dependent on the spectral peak structure, the peak selection is performed at the same step. First, the instantaneous noise- level )(kEne and peak-level )(kEpe are estimated from the absolute values of the transform coefficients )(kX . The noise- level is calculated as:   ,1,,0,)(1)1()(  LkkXkEkE nene  (2) with 224L at 24.4 kb/s and 320L at 32 kb/s and where      . otherwise6472.0 )1()(if9578.0 kEkX ne  (3) Similarly, the peak-level is calculated as:   ,1,,0,)(1)1()(  LkkXkEkE pepe  (4) where , otherwise8029.0 )1()(if4223.0      kEkX pe  (5) and both )1(neE and )1(peE are initialized to 800. Per-band averages of noise-level )(bEne and peak-level )(bEpe are calculated by averaging the corresponding instantaneous level in bands of 32 bins. The number of bands is 7B (5.6 kHz) at 24.4 kb/s and 10B (8 kHz) at 32 kb/s. These noise and peak-level averages are used to derive three variables, used by the decision logic. The first variable is the number of detected peaks peaksN . A threshold for selecting peak candidates is calculated as: ).( )( )( )( 88.0 bE bE bE b pe ne pe          (6) Absolute values of the transform coefficients  kX are compared to the threshold )(b , the    bkX  form a vector of peak candidates. The threshold is adjusted on both sides of the peaks in the previous frame, with }21,5.0,25.0,5.0,21{ around each peak position from the previous frame. This stabilizes the peak selection over frames. Elements from the peaks candidate vector are extracted in decreasing order of peak amplitude, until the maximum number of peaks (17 at 24.4 kb/s and 23 for 32 kb/s) is reached. This procedure results in a set of peaksN spectral peaks. The second variable used in the classification logic is sharpN , calculated as the number of bands for which 9)( bSharpne , where the measure of frequency sharpness per-band )(bSharpne is the peak to noise-floor ratio in each band, defined as: , )( )(xam )( bE bX bSharp ne ne  (7) where )(bX is the set of all coefficients in the band b . The third variable sharpD is calculated as:  .9)( 1 0     B b nesharp bSharpD (8) The decision to switch to HVQ mode is based on comparing the above defined variables to the thresholds in Table 1. If ssharp TN  , ppeaks TN  , and dsharp TD  the EVS codec will process the current frame in HVQ mode. Rate s p d 24.4 kb/s 4 20 22 32 kb/s 7 23 22 Table 1. Thresholds for HVQ mode decision. 3.2. Peak gains and positions The essence of the HVQ algorithm is in explicit coding of spectral peak positions and amplitudes. The peaks amplitudes )(mGp , where m is the peak index, are scalar quantized in a logarithmic domain to form the quantized peak gains )(ˆ mGp and differentially coded by 5 bits. The codewords are additionally Huffman coded [4]. Since the HVQ is used to code harmonic signals the peak positions mp will typically be equally spaced. However, some frames have an irregular harmonic structure or the peak picker does not select all the peaks in the harmonic structure. This leads to irregular peak locations. Therefore, two methods are used to code the peak positions efficiently. The first method is Delta and Huffman coding. Due to the constraint that peaks cannot be closer than 2 MDCT bins (otherwise they are considered as one peak), Deltas are defined as: Mpp mmm   11 , (9) where 3M . The lack of small deltas reduces the size of the Huffman table. Additionally, the largest delta in the table is 51max  . An alternative sparse coding scheme is used if max  . The reduced table size make the Huffman coding efficient. The second method is based on the sparse coding algorithm described in [5]. The entire vector of the peak locations is coded, forming another vector, with 1 indicating peak presence and 0 no peak. This vector is then in a first layer divided into sub-vectors of length 5. The elements of the sub-vectors are OR-ed and the concatenation of the results of the OR operation for each first layer vector form another vector (the second layer). Each bit in this second layer indicates presence or absence of peaks in the 5-dim sub-vector from the first layer. In this way only the 5-dim sub- vectors from the first layer that are not indicated as all-zero by the
  • 3. Sub-vector Index 10000 000 01000 001 00100 010 00010 011 00001 100 10010 101 10001 110 01001 111 Table 2. The set of 5-dim vectors corresponding to the possible peaks positions are indexed with 3-bit codewords. second layer have to be transmitted. The second layer is always transmitted. Prior to transmission the non-zero sub-vectors from the first layer are mapped to exploit the fact that peaks cannot be closer than 2 positions, and not all 5-dim vector combinations are therefore possible. For example, the positional vector {01000, 00000, 00000, 00100}, with commas added to increase readability, is compressed to {1001, 001, 010}. The decoder reads from the bitstream the layer 2 vector 1001. These 4 bits indicate that what will follow in the bitstream is a description of the 1st and the 4th group, while the 2nd and the 3rd have to be filled-in with zeroes. The peak positions for the 1st and the 4th group are indicated by 3bit indices, and extracted from Table 2. The decision logic for selection of the peak position coding scheme operates as follows: if max )max(  , then the sparse coding is selected, otherwise the coding scheme that uses fewer bits is chosen. 3.3. Vector quantization of peak regions After the peak gain and position quantization, a neighborhood of 4 MDCT coefficients around each peak is quantized. The peak region coefficients (the peak itself and two neighboring bins on each side) are all scaled down by the quantized peak gain  mGp ˆ . In this way the central bin is scaled to unit amplitude, while the surrounding 4 bins are normalized in relation to the central one. The shape vector mS of the peak region, centered at bin k is defined as:      .)2()1()1()2( ˆ 1  kXkXkXkX mG k p mS (10) These shape vectors are quantized by means of a classified and structured vector quantizer (VQ) [6], with a trained codebook (CB) [7] with the 4-dim codevectors:  .)3()2()1()0( iiiii uuuuu (11) The numbers of peak regions vary over frames, which leads to large variation in complexity due to different number of CB searches. For audio codecs there are generally constraints on the computational complexity and the memory usage. A large number of CB searches would then limit the CB size and that is not desirable as the quantization error will increase. To keep the complexity nearly constant, while achieving low quantization error, the search for each mS is performed in a structured CB, with dynamically selected offset and size of the search region. The starting point for the search is determined by an Number of coded peaks Number of vectors searched in codebook 24.4kb/s 32kb/s 23 - 128 22 - 134 21 - 141 20 - 149 19 - 158 18 - 168 17 128 179 16 136 192 15 145 206 14 155 224 13 167 244 12 181 256 11 197  10 217  9 241  8 256     1 256 256 Table 3. Search space for 24.4 and 32 kb/s, as a function of the number of spectral peaks for the current audio frame. initial classification of the input shape vector, while the length of the search region depends on the number of shape vectors to quantize. The CB is classified into two classes, with the centroids 0 C and 1 C , and ordered such that the codewords iu closest to 0 C and most distant to 1 C are in one side of the CB, while the codewords closest to 1 C and most distant to 0 C are clustered in the other side of the CB. Additionally it is noted that the shape vectors exhibit certain symmetries (the MDCT coefficients on both sides of the spectral peak have similar statistics). Therefore half of the codevectors can be represented by a flipped version of the other half, which reduces the memory usage as the flipped part does not have to be pre-stored. The search is performed both among the codevectors i u as well as the flipped codevectors:  .)0()1()2()3( iiiii f uuuuu (12) Thus the searched CB (both physically stored and virtual) can be seen as clustered into four classes with centroids 0 C , 1 C , 0 fC and 1 fC . The following algorithmic steps are executed for every input shape vector mS ; First the minimum mean squared distance between the input vector and the four centroids, 0 C , 1 C , 0 fC and 1 fC is used to determine the starting point of the search and the CB orientation. Then the search space is dynamically adjusted such that when larger numbers of peaks are to be quantized in the current frame, the search space is reduced to limit the maximum complexity according to Table 3. The maximum search space (full-search) is used with 8 peaks or less at 24.4 kb/s, and 12 peaks or less at 32 kb/s. Since the codevectors in the CB are sorted by the distance between each codevector and the centroids, the search procedure goes first over the set of vectors that is likely to contain the best match. This property leads to minimum performance loss even
  • 4. though the search space is dynamically adjusted. This trade-off between complexity and performance is illustrated in section 4.1. 3.4. Allocation of remaining bits As the number of spectral peaks varies in time, the presented coding scheme will by its nature result in a variable bitrate. Bits available after peak coding are used to directly code LF MDCT coefficients. The target coefficients )(kX , outside of the peak regions, are concatenated into a vector )( ckZ which is partitioned into bands of 24 coefficients. The coding always starts at the beginning of the MDCT vector, e.g. if the available bits allow coding of only one 24-dim band, then only the coefficients 23,,0),( cc kkZ will be coded. The number of bands zN in )( ckZ that will be coded depends on the remaining number of bits availR after peak coding, and is determined by:    ,0,30modsignxam        maxavail max avail z RR R R N (13) where 80maxR at 24.4 kb/s and 95maxR at 32 kb/s. The resulting vector )( ckZ is encoded with a gain-shape VQ with 5 bits per gain and PVQ encoded [8] shape vectors. 3.5. High-frequency regions and noise-fill The non-coded coefficients below 5.6 kHz for 24.4 kb/s and 8 kHz for 32 kb/s are grouped into two sections and filled with random noise, using noise floor gains derived from the per-band noise- level averages described in section 3.1. The average of the elements of the first half of neE gives the noise-floor gain )0(neG , while the second half of average noise levels forms )1(neG . These norm factors are quantized and transmitted to the decoder. The frequency part above 5.6 kHz for 24.4 kb/s and 8 kHz for 32 kb/s is reconstructed by means of coded HF gain factors )(bN and spectral filling codebook generated from the coded LF part. In the process of creating the noise-fill codebook, the LF peak positions are also recorded. Exploiting that knowledge, the applied envelope gains are modified based on the presence of a peak in the fill codebook according to:        otherwise.,)1(),(),1(nim bandinpeak),1(1.0)(8.0)1(1.0 bNbNbN bbNbNbN Nb (14) The purpose of the gain modification is to avoid that a peak is significantly attenuated if it happens that the corresponding gain comes from a band without any peaks. Additionally, a noise-like structure in the noise-fill codebook should not be amplified by applying a strong gain that is calculated from an original band that contained one or more peaks. 4. EVALUATION This section consists of complexity measurements confirming the efficiency of the proposed dynamic search space adjustment, as well as subjective evaluation of the HVQ coding mode by mean of an ABX test [9]. Further details on formal perceptual evaluation by means of MOS [10] tests of the EVS codec can be found in the selection tests, available at [11]. 4.1. Complexity measurements The computational complexity in this section is measured according to [12] and was obtained from a simulation over 1312 harmonic signal frames. The data contained on average of 19.4 peaks per frame with a variance of 7.1 (minimum 12 and maximum 23 peaks). As illustrated in Table 4 the use of adaptive search in the classified structured codebook gives significant complexity reduction with maintained accuracy. The adaptive search reduces the maximum complexity with 45% while the average SNR decreases by only 0.2%. Search Full Adaptive Average SNR [dB] 8.743 8.723 Max complexity [WMOPS] 4.857 2.638 Table 4. Average SNR and maximum computational complexity for the shape VQ with full and adaptive search. 4.2. Subjective evaluation Subjective evaluation of the introduced algorithm has been performed in terms of a 5 level ABX test. After being presented with the reference signal and a pair of A and B samples in random order, the listeners had to choose among the following options: B much better (-2), B slightly better (-1), A and B are the same (0), A slightly better (+1), A much better (+2). The systems under test were the EVS codec with active HVQ module (option A) and the EVS baseline with HVQ module deactivated (option B). The test consisted of 10 harmonic items, and the votes from 7 experienced listeners are presented in Table 5. The average results are presented in Figure 2. Vote -2 -1 0 +1 +2 Count 3 24 42 50 21 Table 5. Results from the subjective ABX test indicate clear preference for the EVS codec with active HVQ module (positive scores), compared to EVS with deactivated HVQ module (negative scores). Figure 2. Mean score and 95% confidence interval indicate statistically significant preference of HVQ module in the ABX test. 5. CONCLUSIONS By means of a subjective test we have demonstrated that the proposed concept of a harmonic audio coding model provides superior perceptual quality over the conventional coding scheme. Complexity measurements confirm that the introduced adaptive VQ concept provides significant complexity reduction while maintaining quantization errors at the level of exhaustive search. These subjective and objective measurements show that the HVQ improves coding of harmonic audio signals.
  • 5. 6. REFERENCES [1] 3GPP TS 26.445, “Codec for Enhanced Voice Services (EVS); Detailed algorithmic description,” 2014. [2] ITU-T Rec. G.719, “Low-complexity full-band audio coding for high-quality conversational applications,” 2008. [3] H. S. Malvar, “Signal Processing with Lapped Transforms,” Norwood, MA: Artech House, 1992. [4] D. Huffman, “A Method for the Construction of Minimum- Redundancy Codes,” Proc. IRE 40 (9), 1952. [5] D. Salomon, G. Motta, and D. Bryant, “Sparse Strings” in “Handbook of Data Compression,” Springer, 2010. [6] A. Gersho and R. Gray, “Vector Quantization and Signal Compression,” Springer, 1991. [7] Y. Linde, A. Burzo, and R. Gray, “An algorithm for vector quantizer design,” IEEE Trans. Commun. COM-28, 1980. [8] T. Fischer, “A Pyramid Vector quantizer,” IEEE Transact. Inf. Theory, IT-32 (4), 1986. [9] D. Clark, “High-Resolution Subjective Testing Using a Double-Blind Comparator,” J. Audio Eng. Soc. 30 (5), 1982. [10] ITU-T Rec. P.800, “Methods for Subjective Determination of Transmission Quality,” 1996. [11] 3GPP Tdoc S4-141065, “Report of the Global Analysis Lab for the EVS Selection Phase,” Dynastat Inc., 2014. [12] ITU-T Rec. G.191, “Software Tool Library 2009 User’s Manual,” Geneva, 2009.