This paper proposes a speech watermarking method based on the concept of formant tuning. The characteristic that formant tuning can improve the sound quality of synthesized speech was employed to achieve inaudibility for watermarking. In the proposed method, formants were first extracted with linear prediction (LP) analysis, and watermarks were then embedded by symmetrically controlling a pair of line spectral frequencies (LSFs) as formant tuning. We evaluated the proposed method in two kinds of experiments, on inaudibility and on robustness, in comparison with other methods. Inaudibility was evaluated with objective and subjective tests, and robustness was evaluated against speech codecs and speech processing. The results revealed that the proposed method could satisfy both the inaudibility and the robustness that are required for speech watermarking.
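The idea of symmetrically controlling a pair of LSFs can be illustrated with a minimal sketch. It is not the authors' implementation: the helper names (`lpc_to_lsf`, `lsf_to_lpc`, `tune_lsf_pair`), the `scale` parameter, the toy 4th-order filter, and the rule of narrowing the pair around its fixed midpoint are all assumptions made for illustration, and the conversion handles even LP orders only.

```python
import numpy as np

def lpc_to_lsf(a):
    """Convert LP coefficients [1, a1, ..., ap] (even p) to sorted LSFs in radians."""
    a_ext = np.append(np.asarray(a, dtype=float), 0.0)
    p_poly = a_ext + a_ext[::-1]   # symmetric (sum) polynomial
    q_poly = a_ext - a_ext[::-1]   # antisymmetric (difference) polynomial
    angles = np.concatenate([np.angle(np.roots(p_poly)),
                             np.angle(np.roots(q_poly))])
    eps = 1e-6                     # drops the trivial roots at z = 1 and z = -1
    return np.sort(angles[(angles > eps) & (angles < np.pi - eps)])

def lsf_to_lpc(lsf):
    """Rebuild LP coefficients from sorted, interlaced LSFs (even order)."""
    lsf = np.asarray(lsf, dtype=float)
    p_poly = np.array([1.0, 1.0])    # trivial factor (1 + z^-1)
    q_poly = np.array([1.0, -1.0])   # trivial factor (1 - z^-1)
    for w in lsf[0::2]:
        p_poly = np.convolve(p_poly, [1.0, -2.0 * np.cos(w), 1.0])
    for w in lsf[1::2]:
        q_poly = np.convolve(q_poly, [1.0, -2.0 * np.cos(w), 1.0])
    return (0.5 * (p_poly + q_poly))[:-1]   # last coefficient is zero

def tune_lsf_pair(lsf, k, scale):
    """Hypothetical tuning rule: shrink the gap between lsf[k] and lsf[k+1]
    by `scale` (0 < scale <= 1) while keeping their midpoint fixed."""
    if not 0.0 < scale <= 1.0:
        raise ValueError("scale must be in (0, 1]")
    out = np.array(lsf, dtype=float)
    center = 0.5 * (out[k] + out[k + 1])
    half_gap = 0.5 * (out[k + 1] - out[k]) * scale
    out[k], out[k + 1] = center - half_gap, center + half_gap
    return out

# Toy all-pole filter with two resonances (assumed values, not from the paper).
poles = [0.95 * np.exp(1j * 0.3), 0.95 * np.exp(-1j * 0.3),
         0.90 * np.exp(1j * 1.2), 0.90 * np.exp(-1j * 1.2)]
a = np.real(np.poly(poles))
lsf = lpc_to_lsf(a)
a_tuned = lsf_to_lpc(tune_lsf_pair(lsf, k=0, scale=0.8))
print("LSFs:", lsf)
print("tuned LP coefficients:", a_tuned)
```

Because the pair only moves inward around its midpoint, the LSFs stay ordered and interlaced, so the tuned filter remains stable. How a watermark bit maps to the amount of tuning is specified in the paper itself and is not reproduced here.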
Shengbei WANG
Japan Advanced Institute of Science and Technology
Masashi UNOKI
Japan Advanced Institute of Science and Technology
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Shengbei WANG, Masashi UNOKI, "Speech Watermarking Method Based on Formant Tuning" in IEICE TRANSACTIONS on Information,
vol. E98-D, no. 1, pp. 29-37, January 2015, doi: 10.1587/transinf.2014MUP0009.
Abstract: This paper proposes a speech watermarking method based on the concept of formant tuning. The characteristic that formant tuning can improve the sound quality of synthesized speech was employed to achieve inaudibility for watermarking. In the proposed method, formants were first extracted with linear prediction (LP) analysis, and watermarks were then embedded by symmetrically controlling a pair of line spectral frequencies (LSFs) as formant tuning. We evaluated the proposed method in two kinds of experiments, on inaudibility and on robustness, in comparison with other methods. Inaudibility was evaluated with objective and subjective tests, and robustness was evaluated against speech codecs and speech processing. The results revealed that the proposed method could satisfy both the inaudibility and the robustness that are required for speech watermarking.
URL: https://global.ieice.org/en_transactions/information/10.1587/transinf.2014MUP0009/_p
@ARTICLE{e98-d_1_29,
author={Shengbei WANG and Masashi UNOKI},
journal={IEICE TRANSACTIONS on Information},
title={Speech Watermarking Method Based on Formant Tuning},
year={2015},
volume={E98-D},
number={1},
pages={29-37},
abstract={This paper proposes a speech watermarking method based on the concept of formant tuning. The characteristic that formant tuning can improve the sound quality of synthesized speech was employed to achieve inaudibility for watermarking. In the proposed method, formants were first extracted with linear prediction (LP) analysis, and watermarks were then embedded by symmetrically controlling a pair of line spectral frequencies (LSFs) as formant tuning. We evaluated the proposed method in two kinds of experiments, on inaudibility and on robustness, in comparison with other methods. Inaudibility was evaluated with objective and subjective tests, and robustness was evaluated against speech codecs and speech processing. The results revealed that the proposed method could satisfy both the inaudibility and the robustness that are required for speech watermarking.},
keywords={},
doi={10.1587/transinf.2014MUP0009},
ISSN={1745-1361},
month={January},
}
TY  - JOUR
TI  - Speech Watermarking Method Based on Formant Tuning
T2  - IEICE TRANSACTIONS on Information
SP  - 29
EP  - 37
AU  - Shengbei WANG
AU  - Masashi UNOKI
PY  - 2015
DO  - 10.1587/transinf.2014MUP0009
JO  - IEICE TRANSACTIONS on Information
SN  - 1745-1361
VL  - E98-D
IS  - 1
JA  - IEICE TRANSACTIONS on Information
Y1  - January 2015
AB  - This paper proposes a speech watermarking method based on the concept of formant tuning. The characteristic that formant tuning can improve the sound quality of synthesized speech was employed to achieve inaudibility for watermarking. In the proposed method, formants were first extracted with linear prediction (LP) analysis, and watermarks were then embedded by symmetrically controlling a pair of line spectral frequencies (LSFs) as formant tuning. We evaluated the proposed method in two kinds of experiments, on inaudibility and on robustness, in comparison with other methods. Inaudibility was evaluated with objective and subjective tests, and robustness was evaluated against speech codecs and speech processing. The results revealed that the proposed method could satisfy both the inaudibility and the robustness that are required for speech watermarking.
ER  -