The search functionality is under construction.
The search functionality is under construction.

Author Search Result

[Author] Jorge TREVINO(2hit)

1-2hit
  • Enhancing Stereo Signals with High-Order Ambisonics Spatial Information Open Access

    Jorge TREVINO  Shuichi SAKAMOTO  Junfeng LI  Yôiti SUZUKI  

     
    INVITED PAPER

      Pubricized:
    2015/10/21
      Vol:
    E99-D No:1
      Page(s):
    41-49

    There is a strong push towards the ultra-realistic presentation of multimedia contents made possible by the latest advances in computational and signal processing technologies. Three-dimensional sound presentation is necessary to convey a natural and rich multimedia experience. Promising ways to achieve this include the sound field reproduction technique known as high-order Ambisonics (HOA). While these advanced methods are now within the capabilities of consumer-level processing systems, their adoption is hindered by the lack of contents. Production and coding of the audio components in multimedia focus on traditional formats such as stereophonic sound. Mainstream audio codecs and media such as CDs or DVDs do not support advanced, rich contents such as HOA encodings. To ameliorate this problem and speed up the adoption of spatial sound technologies, this paper proposes a novel way to downmix HOA contents into a stereo signal. The resulting data can be distributed using conventional methods such as audio CDs or as the audio component of an internet video stream. The results can be listened to using legacy stereo reproduction systems. However, they include spatial information encoded as the inter-channel level and phase differences. The proposed method consists of a downmixing filterbank which independently modulate inter-channel differences at each frequency bin. The proposal is evaluated using simple test signals and found to outperform conventional methods such as matrix-encoded surround and the Ambisonics UHJ format in terms of spatial resolution. The proposal can be coupled with a previously presented method to recover HOA signals from stereo recordings. The resulting system allows for the preservation of full-surround spatial information in ultra-realistic contents when they are transferred using a stereo stream. Simulation results show that a compatible decoder can accurately recover up to five HOA channels from a stereo signal (2nd order HOA data in the horizontal plane).

  • Sound Field Reproduction Using Ambisonics and Irregular Loudspeaker Arrays

    Jorge TREVINO  Takuma OKAMOTO  Yukio IWAYA  Yôiti SUZUKI  

     
    INVITED PAPER

      Vol:
    E97-A No:9
      Page(s):
    1832-1839

    Sound field reproduction systems seek to realistically convey 3D spatial audio by re-creating the sound pressure inside a region enclosing the listener. High-order Ambisonics (HOA), a sound field reproduction technology, is notable for defining a scalable encoding format that characterizes the sound field in a system-independent way. Sound fields sampled with a particular microphone array and encoded into the HOA format can be reproduced using any sound presentation device, typically a loudspeaker array, by using a HOA decoder. The HOA encoding format is based on the spherical harmonic decomposition; this makes it easier to design a decoder for large arrays of loudspeakers uniformly distributed over all directions. In practice, it is seldom possible to cover all directions with loudspeakers placed at regular angular intervals. An irregular array, one where the angular separation between adjacent loudspeakers is not constant, does not perform as well as a regular one when reproducing HOA due to the uneven sampling of the spherical harmonics. This paper briefly introduces the techniques used in HOA and advances a new approach to design HOA decoders for irregular loudspeaker arrays. The main difference between conventional methods and our proposal is the use of a new error metric: the radial derivative of the reconstruction error. Minimizing this metric leads to a smooth reproduction, accurate over a larger region than that achieved by conventional HOA decoders. We evaluate our proposal using the computer simulation of two 115-channel loudspeaker arrays: a regular and an irregular one. We find that our proposal results in a larger listening region when used to decode HOA for reproduction using the irregular array. On the other hand, applying our method matches the high-quality reproduction that can be attained with the regular array and conventional HOA decoders.