
Asynchronous microphone arrays calibration and sound source tracking

Authors: Daobilige Su, Teresa Vidal-Calleja, Jaime Valls Miro

Published in: Autonomous Robots | Issue 2/2020

Published: 31 August 2019


Abstract

In this paper, we propose an optimisation method to solve the problem of sound source localisation and calibration of an asynchronous microphone array. The method is based on the graph-based formulation of the simultaneous localisation and mapping (SLAM) problem. In this formulation, a moving sound source is considered to be observed from a static microphone array. Traditional approaches to sound source localisation rely on accurately known geometrical information of the array and synchronous readings of the audio signals. Recent work relaxed these two requirements by estimating the temporal offset between pairs of microphones, based on the assumption that the clock timing of each microphone is exactly the same. This assumption requires the sound cards to be identically manufactured, which is not achievable in practice. Here, an approach is proposed to jointly estimate the array geometry, the time offset and the clock difference/drift rate of each microphone, together with the location of a moving sound source. In addition, an observability analysis of the system is performed to investigate the most suitable configuration for sound source localisation. Simulation and experimental results are presented that demonstrate the effectiveness of the proposed methodology.
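To make the jointly estimated quantities concrete, the following is a minimal sketch of how a time-difference-of-arrival (TDOA) reading could depend on the source location, the microphone geometry, a starting time offset and a clock drift rate. It assumes a linear clock model and a fixed speed of sound; the function name, parameter names, sign conventions and the value of `SPEED_OF_SOUND` are illustrative assumptions and are not taken from the paper.

```python
import math

# Assumed speed of sound in air (m/s); not a value taken from the paper.
SPEED_OF_SOUND = 343.0


def predicted_tdoa(source, mic_ref, mic_i, offset_i, drift_i, elapsed):
    """Predicted TDOA (s) of microphone i relative to a reference microphone.

    source, mic_ref, mic_i: coordinate tuples in metres
    offset_i: starting time offset of microphone i w.r.t. the reference (s)
    drift_i:  clock drift rate of microphone i w.r.t. the reference (s per s)
    elapsed:  time since the start of the recording (s)
    """
    # Geometric part: difference in propagation time to the two microphones.
    geometric = (math.dist(source, mic_i) - math.dist(source, mic_ref)) / SPEED_OF_SOUND
    # Asynchrony part: constant offset plus drift accumulated over time
    # (a linear clock model, assumed here for illustration).
    return geometric + offset_i + drift_i * elapsed
```

In a graph-based formulation, the difference between such a prediction and the measured TDOA would act as the error term of one edge between a source position and the microphone states. With a source equidistant from both microphones, the geometric part vanishes and the prediction reduces to the offset plus the accumulated drift.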


Note that the main difference from a standard landmark-pose SLAM system is that here all the microphones are “observed” at all times, whereas in a standard SLAM system only a subset of the landmarks is observed at any time. This allows the microphone array to be treated as a single landmark with a large state that contains the locations of all microphones. However, the same solution can be achieved if the microphones are considered independently.

Note that other motion models, such as a constant-velocity model, can be applied as long as they describe the motion of the sound source properly.
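As an illustration of the constant-velocity alternative mentioned in this note, the sketch below propagates a 2D source state and computes the residual that could serve as the error of a motion edge between consecutive source states. The state layout `(x, y, vx, vy)` and both function names are assumptions made for this example; they are not the paper's notation.

```python
def predict_constant_velocity(state, dt):
    """Propagate a 2D state (x, y, vx, vy) over dt seconds at constant velocity."""
    x, y, vx, vy = state
    # Position advances along the velocity; velocity itself is unchanged.
    return (x + vx * dt, y + vy * dt, vx, vy)


def motion_residual(prev_state, next_state, dt):
    """Difference between next_state and its constant-velocity prediction."""
    predicted = predict_constant_velocity(prev_state, dt)
    return tuple(n - p for n, p in zip(next_state, predicted))
```

A residual of zero means the two consecutive states agree exactly with the model; in an optimisation it would typically be weighted by an assumed process-noise covariance.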

Note that the uncertainties of the microphone locations are not directly related to the minimum eigenvalues of the sub-FIMs (sub-blocks of the Fisher information matrix) corresponding to individual microphones, for two reasons: (1) The minimum eigenvalue of the sub-FIM of a microphone relates not only to its location but also to its starting time offset and clock difference. (2) The uncertainty of a microphone's location depends on its distance to microphone 1, which is fixed at the origin of the coordinate frame as a reference. Microphones close to microphone 1 therefore have smaller uncertainties in their estimated locations. In comparison, the minimum eigenvalues of the sub-FIMs corresponding to individual microphones do not depend on the distance to microphone 1.
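The quantity discussed in this note can be sketched numerically. The example below builds a Fisher information matrix from a measurement Jacobian and returns the smallest eigenvalue of the diagonal sub-block belonging to one microphone. It assumes independent Gaussian measurement noise with a common standard deviation and uses NumPy; the function names and the state ordering passed in `indices` are hypothetical and chosen only for illustration.

```python
import numpy as np


def fisher_information(jacobian, noise_std):
    """FIM J^T Sigma^-1 J, assuming i.i.d. Gaussian noise with std noise_std."""
    jacobian = np.asarray(jacobian, dtype=float)
    return jacobian.T @ jacobian / noise_std**2


def min_eigenvalue_of_block(fim, indices):
    """Smallest eigenvalue of the diagonal sub-block of fim selected by indices.

    indices would list the state entries of one microphone, for example its
    location, starting time offset and clock difference (assumed ordering).
    """
    block = fim[np.ix_(indices, indices)]
    # eigvalsh returns the eigenvalues of a symmetric matrix in ascending order.
    return float(np.linalg.eigvalsh(block)[0])
```

Because the selected block mixes location, offset and clock entries, its smallest eigenvalue summarises the least informed direction across all of them rather than the location uncertainty alone, which is consistent with reason (1) in the note.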


Title: Asynchronous microphone arrays calibration and sound source tracking
Authors: Daobilige Su, Teresa Vidal-Calleja, Jaime Valls Miro
Publication date: 31 August 2019
Publisher: Springer US
Published in: Autonomous Robots / Issue 2/2020
Print ISSN: 0929-5593
Electronic ISSN: 1573-7527
DOI: https://doi.org/10.1007/s10514-019-09885-w
