Analysis-synthesis systems are generally constructed on a linear frequency scale, whereas the frequency resolution of the human auditory system is non-linear. It is therefore of interest to study analysis-synthesis systems on a non-linear frequency scale such as the Mel scale. It is also well known that the LSP analysis-synthesis method is superior to the LPC and PARCOR methods in frame rate and quantization characteristics. In this paper, we describe an LSP analysis-synthesis system on the Mel frequency scale. First, we propose a way to obtain LSP parameters on the Mel frequency scale (Mel LSP parameters) from the speech signal in the linear time domain. Next, we propose how to construct the analysis and synthesis filters in the linear time domain using the Mel LSP parameters. Furthermore, we combine this system with the ordinary LSP analysis-synthesis system to improve the quality of the synthetic speech. We carried out experiments to clarify the characteristics of the combined system. The test results show that, when the total prediction order is 10, the quality of synthetic speech from the combined system is higher than that from either the ordinary LSP system or the Mel LSP system. Further experiments confirm that the synthetic speech quality of the combined system is as good as that of the standard LSP system at prediction order 12.
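For readers unfamiliar with LSP parameters, the following is a minimal sketch of the ordinary (linear-frequency) conversion from LPC coefficients to line spectral frequencies. It is the textbook construction based on the sum and difference polynomials, not the Mel LSP procedure proposed in the paper, which the abstract does not detail. The function name `lpc_to_lsf` and the example coefficients are assumptions made for illustration.

```python
import numpy as np

def lpc_to_lsf(a):
    """Convert LPC coefficients [1, a1, ..., ap] of A(z) to line spectral
    frequencies in radians (textbook method, linear frequency scale).

    Hypothetical helper for illustration; assumes A(z) is minimum phase.
    """
    a = np.asarray(a, dtype=float)
    # Pad with one zero so that reversing the array corresponds to
    # z^-(p+1) * A(1/z).
    a_ext = np.concatenate([a, [0.0]])
    p_poly = a_ext + a_ext[::-1]   # symmetric (sum) polynomial P(z)
    q_poly = a_ext - a_ext[::-1]   # antisymmetric (difference) polynomial Q(z)
    roots = np.concatenate([np.roots(p_poly), np.roots(q_poly)])
    angles = np.angle(roots)
    # Keep one root of each conjugate pair and drop the trivial roots at
    # z = +1 and z = -1.
    eps = 1e-8
    return np.sort(angles[(angles > eps) & (angles < np.pi - eps)])

# Example: a second-order predictor with poles at 0.9 * exp(+/- j*pi/4).
a = [1.0, -2 * 0.9 * np.cos(np.pi / 4), 0.81]
print(lpc_to_lsf(a))
```

For a minimum-phase predictor of order p, this returns p frequencies in ascending order, with the roots of the two polynomials interlaced on the unit circle. A pair of closely spaced frequencies indicates a spectral resonance between them.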
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Shuuichi ARAI, Arata MIYAUCHI, Shinji OZAWA, "A LSP Analysis-Synthesis Method on Mel Frequency Scale Combined with Linear One," in IEICE TRANSACTIONS on transactions,
vol. E71-E, no. 7, pp. 648-653, July 1988.
URL: https://global.ieice.org/en_transactions/transactions/10.1587/e71-e_7_648/_p
@ARTICLE{e71-e_7_648,
author={Shuuichi ARAI and Arata MIYAUCHI and Shinji OZAWA},
journal={IEICE TRANSACTIONS on transactions},
title={A LSP Analysis-Synthesis Method on Mel Frequency Scale Combined with Linear One},
year={1988},
volume={E71-E},
number={7},
pages={648-653},
month={July},}
TY - JOUR
TI - A LSP Analysis-Synthesis Method on Mel Frequency Scale Combined with Linear One
T2 - IEICE TRANSACTIONS on transactions
SP - 648
EP - 653
AU - Shuuichi ARAI
AU - Arata MIYAUCHI
AU - Shinji OZAWA
PY - 1988
JO - IEICE TRANSACTIONS on transactions
VL - E71-E
IS - 7
JA - IEICE TRANSACTIONS on transactions
Y1 - July 1988
ER -