The search functionality is under construction.
The search functionality is under construction.

Label-Adversarial Jointly Trained Acoustic Word Embedding

Zhaoqi LI, Ta LI, Qingwei ZHAO, Pengyuan ZHANG

  • Full Text Views

    0

  • Cite this

Summary :

Query-by-example spoken term detection (QbE-STD) is a task of using speech queries to match utterances, and the acoustic word embedding (AWE) method of generating fixed-length representations for speech segments has shown high performance and efficiency in recent work. We propose an AWE training method using a label-adversarial network to reduce the interference information learned during AWE training. Experiments demonstrate that our method achieves significant improvements on multilingual and zero-resource test sets.

Publication
IEICE TRANSACTIONS on Information Vol.E105-D No.8 pp.1501-1505
Publication Date
2022/08/01
Publicized
2022/05/20
Online ISSN
1745-1361
DOI
10.1587/transinf.2022EDL8012
Type of Manuscript
LETTER
Category
Speech and Hearing

Authors

Zhaoqi LI
  Chinese Academy of Sciences,University of Chinese Academy of Sciences
Ta LI
  Chinese Academy of Sciences,University of Chinese Academy of Sciences
Qingwei ZHAO
  Chinese Academy of Sciences,University of Chinese Academy of Sciences
Pengyuan ZHANG
  Chinese Academy of Sciences,University of Chinese Academy of Sciences

Keyword