The search functionality is under construction.
The search functionality is under construction.

Toward Human-Friendly ASR Systems: Recovering Capitalization and Punctuation for Vietnamese Text

Thi Thu HIEN NGUYEN, Thai BINH NGUYEN, Ngoc PHUONG PHAM, Quoc TRUONG DO, Tu LUC LE, Chi MAI LUONG

  • Full Text Views

    0

  • Cite this

Summary :

Speech recognition is a technique that recognizes words and sentences in audio form and converts them into text sentences. Currently, with the advancement of deep learning technologies, speech recognition has achieved very satisfactory results close to human abilities. However, there are still limitations in identification results such as lack of punctuation, capitalization, and standardized numerical data. Vietnamese also contains local words, homonyms, etc, which make it difficult to read and understand the identification results for users as well as to perform the next tasks in Natural Language Processing (NLP). In this paper, we propose to combine the transformer decoder with conditional random field (CRF) to restore punctuation and capitalization for the Vietnamese automatic speech recognition (ASR) output. By chunking input sentences and merging output sequences, it is possible to handle longer strings with greater accuracy. Experiments show that the method proposed in the Vietnamese post-speech recognition dataset delivers the best results.

Publication
IEICE TRANSACTIONS on Information Vol.E104-D No.8 pp.1195-1203
Publication Date
2021/08/01
Publicized
2021/05/25
Online ISSN
1745-1361
DOI
10.1587/transinf.2020BDP0005
Type of Manuscript
Special Section PAPER (Special Section on Computational Intelligence and Big Data for Scientific and Technological Resources and Services)
Category

Authors

Thi Thu HIEN NGUYEN
  Thai Nguyen University of Education
Thai BINH NGUYEN
  Vietnam Artificial Intelligence System
Ngoc PHUONG PHAM
  Vietnam Artificial Intelligence System
Quoc TRUONG DO
  Vietnam Artificial Intelligence System
Tu LUC LE
  Office of Hanoi People's Committee
Chi MAI LUONG
  University of Science and Technology of Hanoi

Keyword