
Author Search Result

[Author] Takahiro OKU (2 hits)

  • Risk-Based Semi-Supervised Discriminative Language Modeling for Broadcast Transcription

    Akio KOBAYASHI  Takahiro OKU  Toru IMAI  Seiichi NAKAGAWA  

     
    PAPER-Speech and Hearing
    Vol: E95-D  No: 11  Page(s): 2674-2681

    This paper describes a new method for semi-supervised discriminative language modeling, designed to improve the robustness of a discriminative language model (LM) estimated from manually transcribed (labeled) data. The discriminative LM is implemented as a log-linear model that employs a set of linguistic features derived from word or phoneme sequences. The proposed semi-supervised discriminative modeling is formulated as a multi-objective programming (MOP) problem consisting of two objective functions, defined on labeled lattices and on automatic speech recognition (ASR) lattices serving as unlabeled data. The objectives are coherently designed around expected risks that reflect the word errors in the training data. The model is trained discriminatively and obtained as a solution to the MOP problem. In transcribing Japanese broadcast programs, the proposed method achieved a relative word error rate reduction of 6.3% compared with a conventional trigram LM.
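To illustrate the kind of objective the abstract describes, here is a minimal Python sketch of a log-linear LM score and an expected-risk objective over lattice hypotheses. All function names and the feature representation are hypothetical, and the paper's actual MOP formulation is not the simple scalarized weighted sum shown here; this is only a sketch of the two-risk structure under those assumptions.

```python
import math

def loglinear_score(features, weights):
    # Log-linear LM score: dot product of feature counts and feature weights.
    return sum(weights.get(f, 0.0) * c for f, c in features.items())

def expected_risk(lattice, weights):
    # lattice: list of (features, word_error_count) hypothesis pairs.
    # Hypothesis posteriors come from a softmax over log-linear scores;
    # the expected risk is the posterior-weighted word error count.
    scores = [loglinear_score(feats, weights) for feats, _ in lattice]
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return sum(e / z * err for e, (_, err) in zip(exps, lattice))

def combined_objective(labeled, unlabeled, weights, alpha=0.5):
    # Scalarized stand-in for the multi-objective formulation: a weighted
    # sum of expected risks on labeled lattices and ASR (unlabeled) lattices.
    r_lab = sum(expected_risk(lat, weights) for lat in labeled)
    r_unlab = sum(expected_risk(lat, weights) for lat in unlabeled)
    return alpha * r_lab + (1 - alpha) * r_unlab
```

With zero weights the posterior is uniform, so the expected risk of a lattice reduces to the mean error count of its hypotheses.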

  • Learning Speech Variability in Discriminative Acoustic Model Adaptation

    Shoei SATO  Takahiro OKU  Shinichi HOMMA  Akio KOBAYASHI  Toru IMAI  

     
    PAPER-Adaptation
    Vol: E93-D  No: 9  Page(s): 2370-2378

    We present a new discriminative method of acoustic model adaptation that deals with task-dependent speech variability. We focus on differences of expression and speaking style between tasks; the objective of the method is to improve the recognition accuracy of indistinctly pronounced phrases that depend on a speaking style. The adaptation appends subword models for variants of subwords frequently observed in the task. To find the task-dependent variants, low-confidence words are statistically selected, using their word lattices, from the higher-frequency words in the task's adaptation data. The HMM parameters of the subword models associated with these words are discriminatively trained using linear transforms with a minimum phoneme error (MPE) criterion. For the MPE training, a subword accuracy measure that discriminates between the variants and the originals is also investigated. In speech recognition experiments, the proposed adaptation with the subword variants reduced the word error rate by 12.0% relative in a Japanese conversational broadcast task.
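The selection step the abstract describes (frequent but low-confidence words picked from lattices as variant candidates) can be sketched as follows; the function name, thresholds, and the input format are hypothetical, and the paper's statistical selection criterion is not specified here beyond frequency and confidence.

```python
from collections import defaultdict

def select_variant_candidates(lattice_words, min_count=10, max_conf=0.7):
    # lattice_words: iterable of (word, posterior_confidence) pairs taken
    # from word lattices over the task's adaptation data.
    # Select words that occur frequently but are recognized with low average
    # confidence -- candidates for appending pronunciation-variant subword models.
    counts = defaultdict(int)
    conf_sums = defaultdict(float)
    for word, conf in lattice_words:
        counts[word] += 1
        conf_sums[word] += conf
    return sorted(
        w for w, c in counts.items()
        if c >= min_count and conf_sums[w] / c <= max_conf
    )
```

Frequent-but-confident words and rare words are both excluded, leaving only the words whose indistinct pronunciations the adaptation targets.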