The search functionality is under construction.
The search functionality is under construction.

Keyword Search Result

[Keyword] ACELP(6hit)

1-6hit
  • Switching Search Method for Pulse Assignment in ITU-T G.729D

    Fu-Kun CHEN  Yu-Ruei TSAI  

     
    LETTER-Speech and Hearing

      Vol:
    E91-D No:10
      Page(s):
    2532-2535

    In this paper, the simplified search designs for the stochastic codebook of algebraic code excited linear prediction (ACELP) for ITU-T G.729D speech coder are proposed. By using two search rounds and limiting the search range, the computational complexity of the proposed approach is only 6.25% of the full search method recommended by G.729D. In addition, the computational complexity of proposed approach is only 59% of the global pulse replacement search method recommended by G.729.1. Simulation results show that the coded speech quality evaluated by using the standard subjective and objective quality measurements is with perceptually negligible degradation.

  • Efficient Codebook Search Method for AMR Wideband Speech Codecs

    Hochong PARK  Younhee KIM  Jisang YOO  

     
    PAPER-Speech and Hearing

      Vol:
    E87-D No:8
      Page(s):
    2114-2120

    The AMR wideband speech codec was recently developed for high-quality wideband speech communications. Although it has an excellent performance due to expanded bandwidth of speech signal, it requires a huge amount of computation especially in codebook search. To solve this problem, this paper proposes an efficient codebook search method for AMR wideband codec. Starting from a poorly performing initial codevector, the proposed method enhances the performance of the codevector iteratively by exchanging the worst pulse in the codevector with a better one after evaluating the role of each pulse. Simulations show that the AMR wideband codec adopting the proposed codebook search method provides better performance with much less computational load than that using the standard method.

  • Complexity Scalability for ACELP and MP-MLQ Speech Coders

    Fu-Kun CHEN  Jar-Ferr YANG  Yu-Pin LIN  

     
    PAPER-Speech and Hearing

      Vol:
    E85-D No:1
      Page(s):
    255-263

    For multimedia communications, the computational scalability of a multimedia codec is required to match with different working platforms and integrated services of media sources. In this paper, two condensed stochastic codebook search approaches are proposed to progressively reduce the computation required for the algebraic code excited linear predictive (ACELP) and multi-pulse maximum likelihood quantization (MP-MLQ) coders. By reducing the candidates of the codebook before search procedure, the proposed methods can effectively diminish the computation required for the ITU-T G.723.1 dual rate speech coder. Simulation results show that the proposed methods can save over 50 percent for the stochastic codebook search with perceptually intangible degradation in speech quality.

  • Comparison of Two Speech and Audio Coders at 8 kb/s from the Viewpoints of Coding Scheme and Quality

    Nobuhiko KITAWAKI  Takehiro MORIYA  Takao KANEKO  Naoki IWAKAMI  

     
    PAPER-Media Management

      Vol:
    E81-B No:11
      Page(s):
    2007-2012

    Low bit-rate speech and audio codings are key technologies for multimedia communications. A number of coding scheme have been developed for various applications. In Internet application, good speech and audio quality at very low bit-rate (8-16 kb/s) is valuable. Two recently proposed speech and audio-coding schemes, CS-ACELP (Conjugate Structure Algebraic Code Excited Linear Prediction, standardized by the ITU-T in Recommendation G. 729) and TwinVQ (Transform-domain Weighted INterleave Vector Quantization, one of the candidates for MPEG-4 audio) were compared from the viewpoints of coding schemes and quality. Although there are significant differences in their basic structures and frame lengths, this paper describes that both use the same compression techniques, such as LPC (Linear Predictive Coding)-analysis pitch-period estimation and vector quantization. While CS-ACELP provides toll quality for speech at 8 kb/s, the quality it provides for music signals is insufficient. The TwinVQ transform coder is based on LPC and vector quantization and is also capable of operating at 8 kb/s. Evaluation of these two schemes in terms of their fundamental technologies, quality, delay, and complexity showed that the quality of TwinVQ for music signals is better than that of CS-ACELP, and that the quality of CS-ACELP is better for speech signals. Therefore, TwinVQ may be better suited for one-directional Internet applications, and CS-ACELP may be better for two-directional communication.

  • Efficient Encoding of Excitation Codes Using Trained Partial Algebraic Codebook

    Yun Keun LEE  Hwang Soo LEE  Robert M. GRAY  

     
    LETTER-Speech Processing and Acoustics

      Vol:
    E81-D No:6
      Page(s):
    612-615

    An efficient encoding method of excitation codes using a partial algebraic codebook (PAC) is proposed. Since the conventional algebraic code excited linear prediction (ACELP) encodes the positions and signs of all excitation pulses separately, the bits required for encoding excitation codes take a large portion of the total bit rate. Vector quantization (VQ) of the positions and signs of the excitation pulses results in a PAC. Using PAC instead of the full set of algebraic codes, we can reduce the bits required to encode the excitation codes while maintaining the output speech quality. An iterative training algorithm is proposed to obtain the suboptimal PAC by modifying the Lloyd algorithm. Simulation results show that considerable bit savings can be obtained with only a small amount of degradation in the segmental signal to noise ratio (SEGSNR).

  • A 6.4-kbit/s Variable-Bit-Rate Extension to the G.729 (CS-ACELP) Speech Coder

    Akitoshi KATAOKA  Sachiko KURIHARA  Shinji HAYASHI  

     
    PAPER-Speech Processing and Acoustics

      Vol:
    E80-D No:12
      Page(s):
    1183-1189

    This paper proposes a 6.4-kbit/s extension to G.729 (conjugate structure algebraic code excited linear prediction: CS-ACELP). Each G.729 module was investigated to determine which bits could be removed without hurting the speech quality, then two coders that have different bit allocations were designed. They have two different algebraic codebooks (a 10-bit algebraic codebook that has two pulses and an 11-bit algebraic codebook that has two or three pulses). This paper also proposes a conditional orthogonalized search for a fixed codebook to improve the speech quality. The conditional orthogonalized search chooses, one of two search methods (orthogonalized or non-orthogonalized) based on the optimum pitch gain. The quality of the two coders was evaluated using objective measurements (SNR and segmental SNR) and subjective ones (mean opinion score: MOS and a pair-comparison test). The selected coder was evaluated under practical conditions. Subjective test results have indicated that the quality of the proposed coder (10-ms frame length) is equivalent to that of the 6.3-kbit/s G.723.1 coder, which has a 30-ms frame length.