Generalized Cepstral Modeling of Degraded Speech and Its Application to Speech Enhancement

Toshio KANNO; Takao KOBAYASHI; Satoshi IMAI

Generalized Cepstral Modeling of Degraded Speech and Its Application to Speech Enhancement

Toshio KANNO, Takao KOBAYASHI, Satoshi IMAI

Full Text Views

0

Cite this

Summary :

This paper proposes a technique for estimating speech parameters in noisy environment. The technique uses a spectral model represented by generalized cepstrum and estimates the generalized cepstral coefficients from the speech which has been degraded by additive background noise. Parameter estimation is based on maximum a posteriori (MAP) estimation procedure. An iterative approach which has been formulated for all-pole modeling is applied to the generalized cepstral modeling. Generalized cepstral coefficients are obtained by an iterative procedure that consists of the unbiased estimation of log spectrum and noncausal Wiener filtering. Since the generalized cepstral model includes the all-pole model as a special case, the technique can be viewed as a generalization of the all-pole modeling based on MAP estimation. The proposed technique is applied to the enhancement of speech and several experimental results are also shown.

Publication: IEICE TRANSACTIONS on Fundamentals Vol.E76-A No.8 pp.1300-1307

Publication Date: 1993/08/25

Publicized

Online ISSN

DOI

Type of Manuscript: Special Section PAPER (Special Section of Papers Selected from the 7th Digital Signal Processing Symposium)

Category: Speech and Acoustic Signal Processing

Cite this

Copy

Toshio KANNO, Takao KOBAYASHI, Satoshi IMAI, "Generalized Cepstral Modeling of Degraded Speech and Its Application to Speech Enhancement" in IEICE TRANSACTIONS on Fundamentals, vol. E76-A, no. 8, pp. 1300-1307, August 1993, doi: .
Abstract: This paper proposes a technique for estimating speech parameters in noisy environment. The technique uses a spectral model represented by generalized cepstrum and estimates the generalized cepstral coefficients from the speech which has been degraded by additive background noise. Parameter estimation is based on maximum a posteriori (MAP) estimation procedure. An iterative approach which has been formulated for all-pole modeling is applied to the generalized cepstral modeling. Generalized cepstral coefficients are obtained by an iterative procedure that consists of the unbiased estimation of log spectrum and noncausal Wiener filtering. Since the generalized cepstral model includes the all-pole model as a special case, the technique can be viewed as a generalization of the all-pole modeling based on MAP estimation. The proposed technique is applied to the enhancement of speech and several experimental results are also shown.
URL: https://global.ieice.org/en_transactions/fundamentals/10.1587/e76-a_8_1300/_p

Copy

@ARTICLE{e76-a_8_1300,
author={Toshio KANNO, Takao KOBAYASHI, Satoshi IMAI, },
journal={IEICE TRANSACTIONS on Fundamentals},
title={Generalized Cepstral Modeling of Degraded Speech and Its Application to Speech Enhancement},
year={1993},
volume={E76-A},
number={8},
pages={1300-1307},
abstract={This paper proposes a technique for estimating speech parameters in noisy environment. The technique uses a spectral model represented by generalized cepstrum and estimates the generalized cepstral coefficients from the speech which has been degraded by additive background noise. Parameter estimation is based on maximum a posteriori (MAP) estimation procedure. An iterative approach which has been formulated for all-pole modeling is applied to the generalized cepstral modeling. Generalized cepstral coefficients are obtained by an iterative procedure that consists of the unbiased estimation of log spectrum and noncausal Wiener filtering. Since the generalized cepstral model includes the all-pole model as a special case, the technique can be viewed as a generalization of the all-pole modeling based on MAP estimation. The proposed technique is applied to the enhancement of speech and several experimental results are also shown.},
keywords={},
doi={},
ISSN={},
month={August},}

Copy

TY - JOUR
TI - Generalized Cepstral Modeling of Degraded Speech and Its Application to Speech Enhancement
T2 - IEICE TRANSACTIONS on Fundamentals
SP - 1300
EP - 1307
AU - Toshio KANNO
AU - Takao KOBAYASHI
AU - Satoshi IMAI
PY - 1993
DO -
JO - IEICE TRANSACTIONS on Fundamentals
SN -
VL - E76-A
IS - 8
JA - IEICE TRANSACTIONS on Fundamentals
Y1 - August 1993
AB - This paper proposes a technique for estimating speech parameters in noisy environment. The technique uses a spectral model represented by generalized cepstrum and estimates the generalized cepstral coefficients from the speech which has been degraded by additive background noise. Parameter estimation is based on maximum a posteriori (MAP) estimation procedure. An iterative approach which has been formulated for all-pole modeling is applied to the generalized cepstral modeling. Generalized cepstral coefficients are obtained by an iterative procedure that consists of the unbiased estimation of log spectrum and noncausal Wiener filtering. Since the generalized cepstral model includes the all-pole model as a special case, the technique can be viewed as a generalization of the all-pole modeling based on MAP estimation. The proposed technique is applied to the enhancement of speech and several experimental results are also shown.
ER -