Skip to main content
Erschienen in: Cognitive Computation 3/2017

07.03.2017

Real-time Audio Processing with a Cascade of Discrete-Time Delay Line-Based Reservoir Computers

verfasst von: Lars Keuninckx, Jan Danckaert, Guy Van der Sande

Erschienen in: Cognitive Computation | Ausgabe 3/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Background: Real-time processing of audio or audio-like signals is a promising research topic for the field of machine learning, with many potential applications in music and communications. We present a cascaded delay line reservoir computer capable of real-time audio processing on standard computing equipment, aimed at black-box system identification of nonlinear audio systems. The cascaded reservoir blocks use two-pole filtered virtual neurons to match their timescales to that of the target signals. The reservoir blocks receive both the global input signal and the target estimate from the previous block (local input). The units in the cascade are trained in a successive manner on a single input output training pair, such that a successively better approximation of the target is reached. A cascade of 5 dual-input reservoir blocks of 100 neurons each is trained to mimic the distortion of a measured guitar amplifier. This cascade outperforms both a single delay reservoir having the same total number of neurons as well as a cascade with only single-input blocks. We show that the presented structure is a viable platform for real-time audio applications on present-day computing hardware. A benefit of this structure is that it works directly from the audio samples as input, avoiding computationally intensive preprocessing.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
An Intel dual core running at 2.4 GHz, Dell Latitude E4300, built in 2009.
 
2
Also after 20 iterations of the RO algorithm, on P = 50 parameter tuples.
 
3
This is equivalent to setting the filter coefficients to b 0(n) = 1 and b 1(n) = b 2(n) = 0.
 
Literatur
1.
Zurück zum Zitat Jäger H. The echo state approach to analysing and training recurrent neural networks. Technical report, German National Research Center for Information Technology. 2001. Jäger H. The echo state approach to analysing and training recurrent neural networks. Technical report, German National Research Center for Information Technology. 2001.
2.
Zurück zum Zitat Maass W, Natschläger T, Markram H. Real-time computing without stable states: a new framework for neural computation based on perturbations. Neural Comput 2002;14(11):2531–2560.CrossRefPubMed Maass W, Natschläger T, Markram H. Real-time computing without stable states: a new framework for neural computation based on perturbations. Neural Comput 2002;14(11):2531–2560.CrossRefPubMed
3.
Zurück zum Zitat Verstraeten D, Schrauwen B, d’Haene M, Stroobandt D. An experimental unification of reservoir computing methods. Neural Netw 2007;20:391–403.CrossRefPubMed Verstraeten D, Schrauwen B, d’Haene M, Stroobandt D. An experimental unification of reservoir computing methods. Neural Netw 2007;20:391–403.CrossRefPubMed
4.
Zurück zum Zitat Appeltant L, Soriano M, Van der Sande G, Danckaert J, Massar S, Dambre J, Schrauwen B, Mirasso CR, Fischer I. Information processing using a single dynamical node as complex system. Nat Commun 2011;2:468.CrossRefPubMedPubMedCentral Appeltant L, Soriano M, Van der Sande G, Danckaert J, Massar S, Dambre J, Schrauwen B, Mirasso CR, Fischer I. Information processing using a single dynamical node as complex system. Nat Commun 2011;2:468.CrossRefPubMedPubMedCentral
5.
Zurück zum Zitat Brünner D., Soriano M, Mirasso CR, Fischer I. Parallel photonic information processing at gigabyte per second data rates using transient states. Nat Commun 2013;4:1364.CrossRefPubMedPubMedCentral Brünner D., Soriano M, Mirasso CR, Fischer I. Parallel photonic information processing at gigabyte per second data rates using transient states. Nat Commun 2013;4:1364.CrossRefPubMedPubMedCentral
6.
Zurück zum Zitat Duport F, Schneider B, Smerieri A, Haelterman M, Massar S. All-optical reservoir computing. Opt Exp 2012;20:22,783–22,795.CrossRef Duport F, Schneider B, Smerieri A, Haelterman M, Massar S. All-optical reservoir computing. Opt Exp 2012;20:22,783–22,795.CrossRef
7.
Zurück zum Zitat Soriano M, Ortín S, Keuninckx L, Appeltant L, Danckaert J, Pesquera L, der Sande GV. Delay based reservoir computing: Noise effects in a combined analog and digital implementation. IEEE TNNLS 2015;26(2):388–393. Soriano M, Ortín S, Keuninckx L, Appeltant L, Danckaert J, Pesquera L, der Sande GV. Delay based reservoir computing: Noise effects in a combined analog and digital implementation. IEEE TNNLS 2015;26(2):388–393.
8.
Zurück zum Zitat Larger L, Soriano M, Brünner D, Appeltant L, Gutierrez JM, Pesquera L, Mirasso CR, Fischer I. Photonic information processing beyond Turing: an opto-electronic implementation of reservoir computing. Opt Exp 2012;20:3241–3249.CrossRef Larger L, Soriano M, Brünner D, Appeltant L, Gutierrez JM, Pesquera L, Mirasso CR, Fischer I. Photonic information processing beyond Turing: an opto-electronic implementation of reservoir computing. Opt Exp 2012;20:3241–3249.CrossRef
10.
Zurück zum Zitat Holzmann G. Reservoir computing: a powerful black-box framework for nonlinear audio processing. In: Proceedings of the 12th International Conference on Digital Audio Effects (DAFx-09), pp 90–97. 2009. Holzmann G. Reservoir computing: a powerful black-box framework for nonlinear audio processing. In: Proceedings of the 12th International Conference on Digital Audio Effects (DAFx-09), pp 90–97. 2009.
11.
Zurück zum Zitat Scardapane S, Uncini A. Semi-supervised echo state networks for audio classification. Cognitive Computation, pp 1–11. 2016. Scardapane S, Uncini A. Semi-supervised echo state networks for audio classification. Cognitive Computation, pp 1–11. 2016.
12.
Zurück zum Zitat Liu X, Bao CC. Audio bandwidth extension using ensemble of recurrent neural networks. EURASIP J Audio Speech, Music Process 2016;2016(1):12.CrossRef Liu X, Bao CC. Audio bandwidth extension using ensemble of recurrent neural networks. EURASIP J Audio Speech, Music Process 2016;2016(1):12.CrossRef
13.
Zurück zum Zitat Triefenbach F, Jalalvand A, Demuynck K, Martens JP. Acoustic modeling with hierarchical reservoirs. IEEE Transactions on Audio. Speech Lang Process 2013;21(11):2439–2450.CrossRef Triefenbach F, Jalalvand A, Demuynck K, Martens JP. Acoustic modeling with hierarchical reservoirs. IEEE Transactions on Audio. Speech Lang Process 2013;21(11):2439–2450.CrossRef
14.
Zurück zum Zitat Grigoryeva L, Henriques J, Larger L, Ortega JP. Stochastic time series forecasting using time-delay reservoirs. Neural Netw 2014;55:59–71.CrossRefPubMed Grigoryeva L, Henriques J, Larger L, Ortega JP. Stochastic time series forecasting using time-delay reservoirs. Neural Netw 2014;55:59–71.CrossRefPubMed
18.
Zurück zum Zitat Tretter SA. Communication system design using DSP algorithms, 1st ed.: Springer; 2008. Tretter SA. Communication system design using DSP algorithms, 1st ed.: Springer; 2008.
19.
Zurück zum Zitat Pakarinen J, Yeh DT. A review of digital techniques for modelling vacuum-tube guitar amplifiers. Comput Music J 2009;33(2):85–100.CrossRef Pakarinen J, Yeh DT. A review of digital techniques for modelling vacuum-tube guitar amplifiers. Comput Music J 2009;33(2):85–100.CrossRef
20.
Zurück zum Zitat Yeh DT. Digital implementation of musical distorcion circuits by analysis and simulation. Ph.D. thesis, Helsinki University of Technology. 2009. Yeh DT. Digital implementation of musical distorcion circuits by analysis and simulation. Ph.D. thesis, Helsinki University of Technology. 2009.
24.
Zurück zum Zitat Appeltant L. Reservoir computing based on delay dynamical systems. Ph.D. thesis, Vrije Universiteit Brussel (VUB) Universitat de les Illes Balears. 2012. Appeltant L. Reservoir computing based on delay dynamical systems. Ph.D. thesis, Vrije Universiteit Brussel (VUB) Universitat de les Illes Balears. 2012.
25.
Zurück zum Zitat Mackey MC, Glass L. Oscillation and chaos in physiological control systems. Science 1977;197(4300):287–289.CrossRefPubMed Mackey MC, Glass L. Oscillation and chaos in physiological control systems. Science 1977;197(4300):287–289.CrossRefPubMed
27.
28.
Zurück zum Zitat Knapp C, Carter G. Generalized correlation methods for estimation of time delay. IEEE TASSP 1976;24(4):320–327.CrossRef Knapp C, Carter G. Generalized correlation methods for estimation of time delay. IEEE TASSP 1976;24(4):320–327.CrossRef
30.
Zurück zum Zitat Hauser H. Echo state networks with filter neurons and a delay and sum readout. Neural Netw 2009;23(2):244–256.PubMed Hauser H. Echo state networks with filter neurons and a delay and sum readout. Neural Netw 2009;23(2):244–256.PubMed
31.
Zurück zum Zitat III JOS. Introduction to digital filter theory with audio applications: W3K Publishing; 2007. III JOS. Introduction to digital filter theory with audio applications: W3K Publishing; 2007.
32.
Zurück zum Zitat Yates R, Lyons R. DSP tips and tricks: DC blocker algorithms. IEEE Sig Proc Mag 2008;25(2):132–134.CrossRef Yates R, Lyons R. DSP tips and tricks: DC blocker algorithms. IEEE Sig Proc Mag 2008;25(2):132–134.CrossRef
33.
Zurück zum Zitat Matyas J. Random optimization. Autom Remote Control 1965;26(2):246–253. Matyas J. Random optimization. Autom Remote Control 1965;26(2):246–253.
35.
Zurück zum Zitat Smith SW. The Scientist and Engineer’s Guide to Digital Signal Processing. 1st ed. San Diego, CA: California Technical Publishing; 1997. Smith SW. The Scientist and Engineer’s Guide to Digital Signal Processing. 1st ed. San Diego, CA: California Technical Publishing; 1997.
36.
Zurück zum Zitat Lennes M, Lehtokoski A, Alku P, Näätänen R. 1999. Acoustic, psychoacoustic and psychophysiological measures of distance in the Finnish vowel space. In: Proceedings ICPhS99, pp. 2465–2468. Lennes M, Lehtokoski A, Alku P, Näätänen R. 1999. Acoustic, psychoacoustic and psychophysiological measures of distance in the Finnish vowel space. In: Proceedings ICPhS99, pp. 2465–2468.
37.
Zurück zum Zitat Logan B, Salomon A. A music similarity function based on signal analysis. In: Proceedings International Symposium. Tokyo, Japan: Music Information Retrieval; 2001. Logan B, Salomon A. A music similarity function based on signal analysis. In: Proceedings International Symposium. Tokyo, Japan: Music Information Retrieval; 2001.
Metadaten
Titel
Real-time Audio Processing with a Cascade of Discrete-Time Delay Line-Based Reservoir Computers
verfasst von
Lars Keuninckx
Jan Danckaert
Guy Van der Sande
Publikationsdatum
07.03.2017
Verlag
Springer US
Erschienen in
Cognitive Computation / Ausgabe 3/2017
Print ISSN: 1866-9956
Elektronische ISSN: 1866-9964
DOI
https://doi.org/10.1007/s12559-017-9457-5

Weitere Artikel der Ausgabe 3/2017

Cognitive Computation 3/2017 Zur Ausgabe

Premium Partner