Hostname: page-component-89b8bd64d-rbxfs Total loading time: 0 Render date: 2026-05-13T16:08:24.475Z Has data issue: false hasContentIssue false

A two-stage approach for passive sound source localization based on the SRP-PHAT algorithm

Published online by Cambridge University Press:  26 February 2020

M.A. Awad-Alla*
Affiliation:
Ain Shams University, Cairo, Egypt
Ahmed Hamdy
Affiliation:
Helwan University, Cairo, Egypt
Farid A. Tolbah
Affiliation:
Ain Shams University, Cairo, Egypt
Moatasem A. Shahin
Affiliation:
Badr University, Cairo, Egypt
M.A. Abdelaziz
Affiliation:
Ain Shams University, Cairo, Egypt
*
Corresponding author: M.A. Awad-Alla E-mail: matef70@yahoo.com

Abstract

This paper presents a different approach to tackle the Sound Source Localization (SSL) problem apply on a compact microphone array that can be mounted on top of a small moving robot in an indoor environment. Sound source localization approaches can be categorized into the three main categories; Time Difference of Arrival (TDOA), high-resolution subspace-based methods, and steered beamformer-based methods. Each method has its limitations according to the search or application requirements. Steered beamformer-based method will be used in this paper because it has proven to be robust to ambient noise and reverberation to a certain extent. The most successful and used algorithm of this method is the SRP-PHAT algorithm. The main limitation of SRP-PHAT algorithm is the computational burden resulting from the search process, this limitation comes from searching among all possible candidate locations in the searching space for the location that maximizes a certain function. The aim of this paper is to develop a computationally viable approach to find the coordinate location of a sound source with acceptable accuracy. The proposed approach comprises two stages: the first stage contracts the search space by estimating the Direction of Arrival (DoA) vector from the time difference of arrival with an addition of reasonable error coefficient around the vector to make sure that the sound source locates inside the estimated region, the second stage is to apply the SRP-PHAT algorithm to search only in this contracted region for the source location. The AV16.3 corpus was used to evaluate the proposed approach, extensive experiments have been carried out to verify the reliability of the approach. The results showed that the proposed approach was successful in obtaining good results compared to the conventional SRP-PHAT algorithm.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s), 2020
Figure 0

Fig. 1. Proposed localization scheme.

Figure 1

Fig. 2. Search boundary in the xy plane.

Figure 2

Fig. 3. Localization system's geometry.

Figure 3

Table 1. Microphone locations of AV16.3 first array.

Figure 4

Table 2. Algorithm input parameters.

Figure 5

Table 3. Variance of the RMSE.

Figure 6

Fig. 4. Results for location 1.

Figure 7

Fig. 5. Results for location 2.

Figure 8

Fig. 6. Results for location 3.

Figure 9

Fig. 7. Results for location 4.

Figure 10

Fig. 8. Results for location 5.

Figure 11

Fig. 9. Results for location 6.

Figure 12

Fig. 10. Results for location 7.

Figure 13

Fig. 11. Results for location 8.

Figure 14

Fig. 12. Results for location 9.

Figure 15

Fig. 13. Results for location 10.

Figure 16

Fig. 14. Results for location 11.

Figure 17

Fig. 15. Results for location 12.

Figure 18

Fig. 16. Results for location 13.

Figure 19

Fig. 17. Results for location 14.

Figure 20

Fig. 18. Results for location 15.

Figure 21

Fig. 19. Results for location 16.