A speaker independent algorithm is given which automatically detects the most steady-state portion of a vowel (vowel center) from continuous speech. The algorithm first extracts the segments each of which contains a vowel and, if present, pre- and/or post-vocalic liquids and semivowels, and then locates the most steady-state portion of the segment. An advantage of the algorithm is its ability to distinguish the nasal and the intervocalic liquid and semivowel segments without relying upon the formant frequencies which have been used in most of the previous work of vowel segment detection procedure. This results in a computationally simple algorithm. The test on ten sentences spoken by each of two males and two females resulted in score of 93.2% correct vowel center localization.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Copy
Hideki KASUYA, Hisashi WAKITA, "Automatic Detection of Vowel Centers from Continuous Speech" in IEICE TRANSACTIONS on transactions,
vol. E64-E, no. 10, pp. 640-645, October 1981, doi: .
Abstract: A speaker independent algorithm is given which automatically detects the most steady-state portion of a vowel (vowel center) from continuous speech. The algorithm first extracts the segments each of which contains a vowel and, if present, pre- and/or post-vocalic liquids and semivowels, and then locates the most steady-state portion of the segment. An advantage of the algorithm is its ability to distinguish the nasal and the intervocalic liquid and semivowel segments without relying upon the formant frequencies which have been used in most of the previous work of vowel segment detection procedure. This results in a computationally simple algorithm. The test on ten sentences spoken by each of two males and two females resulted in score of 93.2% correct vowel center localization.
URL: https://global.ieice.org/en_transactions/transactions/10.1587/e64-e_10_640/_p
Copy
@ARTICLE{e64-e_10_640,
author={Hideki KASUYA, Hisashi WAKITA, },
journal={IEICE TRANSACTIONS on transactions},
title={Automatic Detection of Vowel Centers from Continuous Speech},
year={1981},
volume={E64-E},
number={10},
pages={640-645},
abstract={A speaker independent algorithm is given which automatically detects the most steady-state portion of a vowel (vowel center) from continuous speech. The algorithm first extracts the segments each of which contains a vowel and, if present, pre- and/or post-vocalic liquids and semivowels, and then locates the most steady-state portion of the segment. An advantage of the algorithm is its ability to distinguish the nasal and the intervocalic liquid and semivowel segments without relying upon the formant frequencies which have been used in most of the previous work of vowel segment detection procedure. This results in a computationally simple algorithm. The test on ten sentences spoken by each of two males and two females resulted in score of 93.2% correct vowel center localization.},
keywords={},
doi={},
ISSN={},
month={October},}
Copy
TY - JOUR
TI - Automatic Detection of Vowel Centers from Continuous Speech
T2 - IEICE TRANSACTIONS on transactions
SP - 640
EP - 645
AU - Hideki KASUYA
AU - Hisashi WAKITA
PY - 1981
DO -
JO - IEICE TRANSACTIONS on transactions
SN -
VL - E64-E
IS - 10
JA - IEICE TRANSACTIONS on transactions
Y1 - October 1981
AB - A speaker independent algorithm is given which automatically detects the most steady-state portion of a vowel (vowel center) from continuous speech. The algorithm first extracts the segments each of which contains a vowel and, if present, pre- and/or post-vocalic liquids and semivowels, and then locates the most steady-state portion of the segment. An advantage of the algorithm is its ability to distinguish the nasal and the intervocalic liquid and semivowel segments without relying upon the formant frequencies which have been used in most of the previous work of vowel segment detection procedure. This results in a computationally simple algorithm. The test on ten sentences spoken by each of two males and two females resulted in score of 93.2% correct vowel center localization.
ER -