We proposed a method for estimating sound source positions in 3D space by integrating the sound directions estimated by multiple microphone arrays and by taking advantage of reflection information. Two types of sources with different directivity properties (human speech and loudspeaker speech) were evaluated at different positions and orientations. Experimental results showed that the effectiveness of using reflection information depends on the position and orientation of the sound source relative to the arrays and walls, and on the source type. The use of reflection information increased the source position detection rate by 10% on average and by up to 60% in the best case.
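The abstract does not give the algorithm itself, so the following is only a minimal geometric sketch of the general idea, not the authors' method. It assumes each array reports a direction of arrival as a 3D ray, that walls are ideal planar mirrors, and that a reflected direction can be converted into an extra ray by mirroring the array across the wall (the image method). The position is then taken as the least-squares point closest to all rays. All function names (`mirror_ray`, `closest_point_to_rays`, `solve3`) and the room geometry are hypothetical.

```python
import math

def normalize(v):
    """Return v scaled to unit length."""
    n = math.sqrt(sum(c * c for c in v))
    return [c / n for c in v]

def mirror_ray(origin, direction, wall_point, wall_normal):
    """Mirror a ray across a planar wall (assumption: ideal specular wall).

    A direction observed via a wall reflection points at the image source.
    Mirroring the array position and that direction across the wall gives
    a ray that passes through the real source instead.
    """
    n = normalize(wall_normal)
    dist = sum((o - q) * c for o, q, c in zip(origin, wall_point, n))
    d_dot_n = sum(d * c for d, c in zip(direction, n))
    mirrored_origin = [o - 2 * dist * c for o, c in zip(origin, n)]
    mirrored_dir = [d - 2 * d_dot_n * c for d, c in zip(direction, n)]
    return mirrored_origin, mirrored_dir

def solve3(A, b):
    """Solve a 3x3 linear system by Gauss-Jordan elimination with pivoting."""
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("rays are (nearly) parallel; position undetermined")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(3):
            if r != col:
                f = M[r][col] / M[col][col]
                M[r] = [a - f * c for a, c in zip(M[r], M[col])]
    return [M[i][3] / M[i][i] for i in range(3)]

def closest_point_to_rays(rays):
    """Least-squares point minimizing squared distance to all rays (as lines).

    For each ray (p, d) with unit d, accumulate (I - d d^T) into A and
    (I - d d^T) p into b, then solve A x = b.
    """
    A = [[0.0] * 3 for _ in range(3)]
    b = [0.0] * 3
    for origin, direction in rays:
        d = normalize(direction)
        for i in range(3):
            for j in range(3):
                proj = (1.0 if i == j else 0.0) - d[i] * d[j]
                A[i][j] += proj
                b[i] += proj * origin[j]
    return solve3(A, b)

# Hypothetical setup: one ceiling array, one wall at y = 4 m.
source = [1.0, 2.0, 1.5]
array_pos = [0.0, 0.0, 2.5]
wall_point, wall_normal = [0.0, 4.0, 0.0], [0.0, 1.0, 0.0]

# Direct-path direction, and the direction toward the image source (1, 6, 1.5).
direct_dir = [s - a for s, a in zip(source, array_pos)]
reflected_dir = [1.0 - 0.0, 6.0 - 0.0, 1.5 - 2.5]

rays = [(array_pos, direct_dir),
        mirror_ray(array_pos, reflected_dir, wall_point, wall_normal)]
estimate = closest_point_to_rays(rays)
print(estimate)
```

In this toy setup a single array suffices because the reflection supplies a second, non-parallel ray. With real measurements the rays would not intersect exactly, and rays from additional arrays could simply be appended to `rays`.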
Carlos T. ISHI (ATR)
Jani EVEN (ATR)
Norihiro HAGITA (ATR)
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Carlos T. ISHI, Jani EVEN, Norihiro HAGITA, "Integration of Multiple Microphone Arrays and Use of Sound Reflections for 3D Localization of Sound Sources" in IEICE TRANSACTIONS on Fundamentals,
vol. E97-A, no. 9, pp. 1867-1874, September 2014, doi: 10.1587/transfun.E97.A.1867.
URL: https://global.ieice.org/en_transactions/fundamentals/10.1587/transfun.E97.A.1867/_p
@ARTICLE{e97-a_9_1867,
author={Carlos T. ISHI and Jani EVEN and Norihiro HAGITA},
journal={IEICE TRANSACTIONS on Fundamentals},
title={Integration of Multiple Microphone Arrays and Use of Sound Reflections for 3D Localization of Sound Sources},
year={2014},
volume={E97-A},
number={9},
pages={1867-1874},
abstract={We proposed a method for estimating sound source positions in 3D space by integrating sound directions estimated by multiple microphone arrays and taking advantage of reflection information. Two types of sources with different directivity properties (human speech and loudspeaker speech) were evaluated for different positions and orientations. Experimental results showed the effectiveness of using reflection information, depending on the position and orientation of the sound sources relative to the array, walls, and the source type. The use of reflection information increased the source position detection rates by 10% on average and up to 60% for the best case.},
keywords={},
doi={10.1587/transfun.E97.A.1867},
ISSN={1745-1337},
month={September},}
TY - JOUR
TI - Integration of Multiple Microphone Arrays and Use of Sound Reflections for 3D Localization of Sound Sources
T2 - IEICE TRANSACTIONS on Fundamentals
SP - 1867
EP - 1874
AU - Carlos T. ISHI
AU - Jani EVEN
AU - Norihiro HAGITA
PY - 2014
DO - 10.1587/transfun.E97.A.1867
JO - IEICE TRANSACTIONS on Fundamentals
SN - 1745-1337
VL - E97-A
IS - 9
JA - IEICE TRANSACTIONS on Fundamentals
Y1 - September 2014
AB - We proposed a method for estimating sound source positions in 3D space by integrating sound directions estimated by multiple microphone arrays and taking advantage of reflection information. Two types of sources with different directivity properties (human speech and loudspeaker speech) were evaluated for different positions and orientations. Experimental results showed the effectiveness of using reflection information, depending on the position and orientation of the sound sources relative to the array, walls, and the source type. The use of reflection information increased the source position detection rates by 10% on average and up to 60% for the best case.
ER -