The frequency domain binaural model (FDBM) has been previously proposed to localize multiple sound sources. Since the method requires only two input signals and uses interaural phase and level differences caused by the diffraction generated by the head, flexibility in application is very high when the head is considered as an object. When an object is symmetric with respect to the two microphones, the performance of sound source localization is degraded, as a human being has front-back confusion due to the symmetry in a median plane. This paper proposes to reduce the degradation of performance on sound source localization by a combination of the microphone pair outputs using the FDBM. The proposed method is evaluated by applying to a security camera system, and the results showed performance improvement in sound source localization because of reducing the number of cones of confusion.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Copy
Yoshifumi CHISAKI, Toshimichi TAKADA, Masahiro NAGANISHI, Tsuyoshi USAGAWA, "Azimuthal and Elevation Localization Using Inter-Channel Phase and Level Differences for a Hemispheric Object" in IEICE TRANSACTIONS on Fundamentals,
vol. E91-A, no. 10, pp. 3059-3062, October 2008, doi: 10.1093/ietfec/e91-a.10.3059.
Abstract: The frequency domain binaural model (FDBM) has been previously proposed to localize multiple sound sources. Since the method requires only two input signals and uses interaural phase and level differences caused by the diffraction generated by the head, flexibility in application is very high when the head is considered as an object. When an object is symmetric with respect to the two microphones, the performance of sound source localization is degraded, as a human being has front-back confusion due to the symmetry in a median plane. This paper proposes to reduce the degradation of performance on sound source localization by a combination of the microphone pair outputs using the FDBM. The proposed method is evaluated by applying to a security camera system, and the results showed performance improvement in sound source localization because of reducing the number of cones of confusion.
URL: https://global.ieice.org/en_transactions/fundamentals/10.1093/ietfec/e91-a.10.3059/_p
Copy
@ARTICLE{e91-a_10_3059,
author={Yoshifumi CHISAKI, Toshimichi TAKADA, Masahiro NAGANISHI, Tsuyoshi USAGAWA, },
journal={IEICE TRANSACTIONS on Fundamentals},
title={Azimuthal and Elevation Localization Using Inter-Channel Phase and Level Differences for a Hemispheric Object},
year={2008},
volume={E91-A},
number={10},
pages={3059-3062},
abstract={The frequency domain binaural model (FDBM) has been previously proposed to localize multiple sound sources. Since the method requires only two input signals and uses interaural phase and level differences caused by the diffraction generated by the head, flexibility in application is very high when the head is considered as an object. When an object is symmetric with respect to the two microphones, the performance of sound source localization is degraded, as a human being has front-back confusion due to the symmetry in a median plane. This paper proposes to reduce the degradation of performance on sound source localization by a combination of the microphone pair outputs using the FDBM. The proposed method is evaluated by applying to a security camera system, and the results showed performance improvement in sound source localization because of reducing the number of cones of confusion.},
keywords={},
doi={10.1093/ietfec/e91-a.10.3059},
ISSN={1745-1337},
month={October},}
Copy
TY - JOUR
TI - Azimuthal and Elevation Localization Using Inter-Channel Phase and Level Differences for a Hemispheric Object
T2 - IEICE TRANSACTIONS on Fundamentals
SP - 3059
EP - 3062
AU - Yoshifumi CHISAKI
AU - Toshimichi TAKADA
AU - Masahiro NAGANISHI
AU - Tsuyoshi USAGAWA
PY - 2008
DO - 10.1093/ietfec/e91-a.10.3059
JO - IEICE TRANSACTIONS on Fundamentals
SN - 1745-1337
VL - E91-A
IS - 10
JA - IEICE TRANSACTIONS on Fundamentals
Y1 - October 2008
AB - The frequency domain binaural model (FDBM) has been previously proposed to localize multiple sound sources. Since the method requires only two input signals and uses interaural phase and level differences caused by the diffraction generated by the head, flexibility in application is very high when the head is considered as an object. When an object is symmetric with respect to the two microphones, the performance of sound source localization is degraded, as a human being has front-back confusion due to the symmetry in a median plane. This paper proposes to reduce the degradation of performance on sound source localization by a combination of the microphone pair outputs using the FDBM. The proposed method is evaluated by applying to a security camera system, and the results showed performance improvement in sound source localization because of reducing the number of cones of confusion.
ER -