Evaluation of localization precision by proposed quasi-spherical nested microphone array in combination with multiresolution adaptive steered response power

Dehghan Firoozabadi, Ali; Irarrazaval, Pablo; Adasme, Pablo; Zabala-Blanco, David; Azurdia-Meza, Cesar A.

Mostrar el registro sencillo de la publicación

dc.contributor.author	Dehghan Firoozabadi, Ali
dc.contributor.author	Irarrazaval, Pablo
dc.contributor.author	Adasme, Pablo
dc.contributor.author	Zabala-Blanco, David
dc.contributor.author	Azurdia-Meza, Cesar A.
dc.date.accessioned	2020-10-26T21:10:30Z
dc.date.available	2020-10-26T21:10:30Z
dc.date.issued	2020
dc.identifier.uri	http://repositorio.ucm.cl/handle/ucm/3126
dc.description.abstract	Multiple sound source localization in noisy and reverberant conditions is one of the important challenges in the speech signal processing. The aim of this article is three-dimensional sound source localization in undesirable scenarios. For the localization algorithms, the spatial aliasing is one of the destructive factors in reducing the accuracy. Firstly, a 3D quasispherical nested microphone array (QSNMA) is proposed for eliminating the spatial aliasing. Since the speech signal has the windowed-disjoint orthogonality property, the speech information differs in terms of the frequency bands. Then, the Gammatone filter bank is introduced for the speech subband processing. In the following, the multiresolution steered response power (SRP) algorithm is adaptively implemented on subbands with the phase transform (PHAT)/maximum likelihood (ML) weighted functions based on the levels of the noise and reverberation. The peaks of the multiresolution adaptive SRP (MASRP) algorithm are extracted in each subband based on the number of speakers for continuous time frames. Finally, the distribution of these peaks are calculated in each subband and they are merged by the use of weighted averaging method. The final 3D speakers locations are estimated by extracting the peaks in the final distribution. The proposed QSNMAMASRP(PHAT/ML) algorithm is evaluated on real and simulated data for 2 and 3 simultaneous speakers in noisy and reverberant conditions. The proposed method is compared with SRP-PHAT, spectral source model-deep neural network, and spherical harmonic temporal extension of multiple response model sparse Bayesian learning algorithms on different range of signal-to-noise ratio and reverberation time. The mean absolute estimation error, averaged standard deviation for absolute estimation error, and computational complexity results show the superiority of the proposed method.	es_CL
dc.language.iso	en	es_CL
dc.rights	Atribución-NoComercial-SinDerivadas 3.0 Chile	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/cl/	*
dc.source	Journal of electrical engineering, 71(3), 150-164	es_CL
dc.subject	Sound source localization	es_CL
dc.subject	Nested microphone array	es_CL
dc.subject	Subband processing	es_CL
dc.subject	Time delay estimation	es_CL
dc.subject	Filter bank	es_CL
dc.title	Evaluation of localization precision by proposed quasi-spherical nested microphone array in combination with multiresolution adaptive steered response power	es_CL
dc.type	Article	es_CL
dc.ucm.facultad	Facultad de Ciencias de la Ingeniería	es_CL
dc.ucm.indexacion	Scopus	es_CL
dc.ucm.indexacion	Isi	es_CL
dc.ucm.uri	www.sciendo.com/article/10.2478/jee-2020-0022	es_CL
dc.ucm.doi	doi.org/10.2478/jee-2020-0022	es_CL

Ficheros en la publicación

Ficheros	Tamaño	Formato	Ver
No hay ficheros asociados a esta publicación.

Esta publicación aparece en la(s) siguiente(s) colección(ones)

Artículos Científicos

Mostrar el registro sencillo de la publicación

Excepto si se señala otra cosa, la licencia de la publicación se describe como Atribución-NoComercial-SinDerivadas 3.0 Chile

Listar

Mi cuenta

Evaluation of localization precision by proposed quasi-spherical nested microphone array in combination with multiresolution adaptive steered response power

Ficheros en la publicación

Esta publicación aparece en la(s) siguiente(s) colección(ones)