In this paper, the use of optimal Karhunen-Loeve (KL) transform for quantization of speech line spectrum frequency (LSF) coefficients is studied. Both scalar quantizer (SQ) and vector quantizer (VQ) schemes are developed to encode efficiently the transform parameters after operating one or two-dimensional KL transform. Furthermore, the SQ schemes are also combined with entropy coding by using Huffman variable length coding (VLC). The basic idea in developing these schemes is utilizing the strong correlation of LSF parameters to reduce the bit rate for a given level of fidelity. Since the use of global statistics for generating the coding scheme may not be appropriate, we propose several adaptive KL transform systems (AKL) to encode the LSF parameters. The performance of all systems for different bit rates is investigated and adequate comparisons are made. It is shown that the proposed KL transform coding systems introduce as good as or better performance for both SQ and VQ in the examined bit rates compared to other methods in the field of LSF coding.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Copy
Laszlo LOIS, Hai Le VU, "Spectral Coding of Speech LSF Parameters Using Karhunen-Loeve Transform" in IEICE TRANSACTIONS on Fundamentals,
vol. E82-A, no. 10, pp. 2138-2146, October 1999, doi: .
Abstract: In this paper, the use of optimal Karhunen-Loeve (KL) transform for quantization of speech line spectrum frequency (LSF) coefficients is studied. Both scalar quantizer (SQ) and vector quantizer (VQ) schemes are developed to encode efficiently the transform parameters after operating one or two-dimensional KL transform. Furthermore, the SQ schemes are also combined with entropy coding by using Huffman variable length coding (VLC). The basic idea in developing these schemes is utilizing the strong correlation of LSF parameters to reduce the bit rate for a given level of fidelity. Since the use of global statistics for generating the coding scheme may not be appropriate, we propose several adaptive KL transform systems (AKL) to encode the LSF parameters. The performance of all systems for different bit rates is investigated and adequate comparisons are made. It is shown that the proposed KL transform coding systems introduce as good as or better performance for both SQ and VQ in the examined bit rates compared to other methods in the field of LSF coding.
URL: https://global.ieice.org/en_transactions/fundamentals/10.1587/e82-a_10_2138/_p
Copy
@ARTICLE{e82-a_10_2138,
author={Laszlo LOIS, Hai Le VU, },
journal={IEICE TRANSACTIONS on Fundamentals},
title={Spectral Coding of Speech LSF Parameters Using Karhunen-Loeve Transform},
year={1999},
volume={E82-A},
number={10},
pages={2138-2146},
abstract={In this paper, the use of optimal Karhunen-Loeve (KL) transform for quantization of speech line spectrum frequency (LSF) coefficients is studied. Both scalar quantizer (SQ) and vector quantizer (VQ) schemes are developed to encode efficiently the transform parameters after operating one or two-dimensional KL transform. Furthermore, the SQ schemes are also combined with entropy coding by using Huffman variable length coding (VLC). The basic idea in developing these schemes is utilizing the strong correlation of LSF parameters to reduce the bit rate for a given level of fidelity. Since the use of global statistics for generating the coding scheme may not be appropriate, we propose several adaptive KL transform systems (AKL) to encode the LSF parameters. The performance of all systems for different bit rates is investigated and adequate comparisons are made. It is shown that the proposed KL transform coding systems introduce as good as or better performance for both SQ and VQ in the examined bit rates compared to other methods in the field of LSF coding.},
keywords={},
doi={},
ISSN={},
month={October},}
Copy
TY - JOUR
TI - Spectral Coding of Speech LSF Parameters Using Karhunen-Loeve Transform
T2 - IEICE TRANSACTIONS on Fundamentals
SP - 2138
EP - 2146
AU - Laszlo LOIS
AU - Hai Le VU
PY - 1999
DO -
JO - IEICE TRANSACTIONS on Fundamentals
SN -
VL - E82-A
IS - 10
JA - IEICE TRANSACTIONS on Fundamentals
Y1 - October 1999
AB - In this paper, the use of optimal Karhunen-Loeve (KL) transform for quantization of speech line spectrum frequency (LSF) coefficients is studied. Both scalar quantizer (SQ) and vector quantizer (VQ) schemes are developed to encode efficiently the transform parameters after operating one or two-dimensional KL transform. Furthermore, the SQ schemes are also combined with entropy coding by using Huffman variable length coding (VLC). The basic idea in developing these schemes is utilizing the strong correlation of LSF parameters to reduce the bit rate for a given level of fidelity. Since the use of global statistics for generating the coding scheme may not be appropriate, we propose several adaptive KL transform systems (AKL) to encode the LSF parameters. The performance of all systems for different bit rates is investigated and adequate comparisons are made. It is shown that the proposed KL transform coding systems introduce as good as or better performance for both SQ and VQ in the examined bit rates compared to other methods in the field of LSF coding.
ER -