Skip to main content

2021 | OriginalPaper | Buchkapitel

Deep Dense and Convolutional Autoencoders for Machine Acoustic Anomaly Detection

verfasst von : Gabriel Coelho, Pedro Pereira, Luis Matos, Alexandrine Ribeiro, Eduardo C. Nunes, André Ferreira, Paulo Cortez, André Pilastri

Erschienen in: Artificial Intelligence Applications and Innovations

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Recently, there have been advances in using unsupervised learning methods for Acoustic Anomaly Detection (AAD). In this paper, we propose an improved version of two deep AutoEncoders (AE) for unsupervised AAD for six types of working machines, namely Dense and Convolutional AEs. A large set of computational experiments was held, showing that the two proposed deep autoencoders, when combined with a mel-spectrogram sound preprocessing, are quite competitive and outperform a recently proposed AE baseline. Overall, a high-quality class discrimination level was achieved, ranging from 72% to 92%.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Abadi, M., et al.: TensorFlow: large-scale machine learning on heterogeneous distributed systems (2016) Abadi, M., et al.: TensorFlow: large-scale machine learning on heterogeneous distributed systems (2016)
3.
Zurück zum Zitat An, J., Cho, S.: Variational autoencoder based anomaly detection using reconstruction probability. Special Lecture on IE, vol. 2, no. 1, pp. 1–18 (2015) An, J., Cho, S.: Variational autoencoder based anomaly detection using reconstruction probability. Special Lecture on IE, vol. 2, no. 1, pp. 1–18 (2015)
4.
Zurück zum Zitat Aurino, F., Folla, M., Gargiulo, F., Moscato, V., Picariello, A., Sansone, C.: One-class SVM based approach for detecting anomalous audio events. In: 2014 International Conference on Intelligent Networking and Collaborative Systems, pp. 145–151. IEEE (2014) Aurino, F., Folla, M., Gargiulo, F., Moscato, V., Picariello, A., Sansone, C.: One-class SVM based approach for detecting anomalous audio events. In: 2014 International Conference on Intelligent Networking and Collaborative Systems, pp. 145–151. IEEE (2014)
8.
Zurück zum Zitat Duman, T.B., Bayram, B., İnce, G.: Acoustic anomaly detection using convolutional autoencoders in industrial processes. In: Martínez Álvarez, F., Troncoso Lora, A., Sáez Muñoz, J.A., Quintián, H., Corchado, E. (eds.) SOCO 2019. AISC, vol. 950, pp. 432–442. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-20055-8_41CrossRef Duman, T.B., Bayram, B., İnce, G.: Acoustic anomaly detection using convolutional autoencoders in industrial processes. In: Martínez Álvarez, F., Troncoso Lora, A., Sáez Muñoz, J.A., Quintián, H., Corchado, E. (eds.) SOCO 2019. AISC, vol. 950, pp. 432–442. Springer, Cham (2020). https://​doi.​org/​10.​1007/​978-3-030-20055-8_​41CrossRef
9.
Zurück zum Zitat Farzad, A., Gulliver, T.A.: Unsupervised log message anomaly detection. ICT Exp. 6(3), 229–237 (2020)CrossRef Farzad, A., Gulliver, T.A.: Unsupervised log message anomaly detection. ICT Exp. 6(3), 229–237 (2020)CrossRef
11.
Zurück zum Zitat Hershey, S., et al.: CNN architectures for large-scale audio classification. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 131–135. IEEE (2017) Hershey, S., et al.: CNN architectures for large-scale audio classification. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 131–135. IEEE (2017)
12.
Zurück zum Zitat Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International Conference on Machine Learning, pp. 448–456. PMLR (2015) Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International Conference on Machine Learning, pp. 448–456. PMLR (2015)
13.
Zurück zum Zitat Jam, M.M., Sadjedi, H.: Identification of hearing disorder by multi-band entropy cepstrum extraction from infant’s cry. In: 2009 International Conference on Biomedical and Pharmaceutical Engineering, pp. 1–5 (2009) Jam, M.M., Sadjedi, H.: Identification of hearing disorder by multi-band entropy cepstrum extraction from infant’s cry. In: 2009 International Conference on Biomedical and Pharmaceutical Engineering, pp. 1–5 (2009)
14.
Zurück zum Zitat Kawaguchi, Y., Endo, T.: How can we detect anomalies from subsampled audio signals? In: 2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP), pp. 1–6. IEEE (2017) Kawaguchi, Y., Endo, T.: How can we detect anomalies from subsampled audio signals? In: 2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP), pp. 1–6. IEEE (2017)
15.
Zurück zum Zitat Kohlsdorf, D., Herzing, D., Starner, T.: An auto encoder for audio dolphin communication. In: 2020 International Joint Conference on Neural Networks (IJCNN), pp. 1–7. IEEE (2020) Kohlsdorf, D., Herzing, D., Starner, T.: An auto encoder for audio dolphin communication. In: 2020 International Joint Conference on Neural Networks (IJCNN), pp. 1–7. IEEE (2020)
16.
Zurück zum Zitat Koizumi, Y., et al.: Description and discussion on DCASE2020 challenge task2: unsupervised anomalous sound detection for machine condition monitoring. CoRR abs/2006.05822 (2020) Koizumi, Y., et al.: Description and discussion on DCASE2020 challenge task2: unsupervised anomalous sound detection for machine condition monitoring. CoRR abs/2006.05822 (2020)
17.
Zurück zum Zitat Koizumi, Y., Saito, S., Uematsu, H., Harada, N., Imoto, K.: ToyADMOS: a dataset of miniature-machine operating sounds for anomalous sound detection. In: 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 313–317. IEEE (2019). https://ieeexplore.ieee.org/document/8937164 Koizumi, Y., Saito, S., Uematsu, H., Harada, N., Imoto, K.: ToyADMOS: a dataset of miniature-machine operating sounds for anomalous sound detection. In: 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 313–317. IEEE (2019). https://​ieeexplore.​ieee.​org/​document/​8937164
18.
Zurück zum Zitat Koizumi, Y., Saito, S., Uematsu, H., Kawachi, Y., Harada, N.: Unsupervised detection of anomalous sound based on deep learning and the Neyman-Pearson Lemma. IEEE/ACM Trans. Audio Speech Lang. Process. 27(1), 212–224 (2018)CrossRef Koizumi, Y., Saito, S., Uematsu, H., Kawachi, Y., Harada, N.: Unsupervised detection of anomalous sound based on deep learning and the Neyman-Pearson Lemma. IEEE/ACM Trans. Audio Speech Lang. Process. 27(1), 212–224 (2018)CrossRef
19.
Zurück zum Zitat Koizumi, Y., Saito, S., Yamaguchi, M., Murata, S., Harada, N.: Batch uniformization for minimizing maximum anomaly score of DNN-based anomaly detection in sounds (2019) Koizumi, Y., Saito, S., Yamaguchi, M., Murata, S., Harada, N.: Batch uniformization for minimizing maximum anomaly score of DNN-based anomaly detection in sounds (2019)
20.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Adv. Neural. Inf. Process. Syst. 25, 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Adv. Neural. Inf. Process. Syst. 25, 1097–1105 (2012)
21.
Zurück zum Zitat Li, J., Dai, W., Metze, F., Qu, S., Das, S.: A comparison of deep learning methods for environmental sound detection. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 126–130. IEEE (2017) Li, J., Dai, W., Metze, F., Qu, S., Das, S.: A comparison of deep learning methods for environmental sound detection. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 126–130. IEEE (2017)
22.
Zurück zum Zitat Liu, Y., Zhuang, C., Lu, F.: Unsupervised two-stage anomaly detection (2021) Liu, Y., Zhuang, C., Lu, F.: Unsupervised two-stage anomaly detection (2021)
24.
Zurück zum Zitat Oh, D.Y., Yun, I.D.: Residual error based anomaly detection using auto-encoder in SMD machine sound. Sensors 18(5), 1308 (2018)CrossRef Oh, D.Y., Yun, I.D.: Residual error based anomaly detection using auto-encoder in SMD machine sound. Sensors 18(5), 1308 (2018)CrossRef
25.
Zurück zum Zitat Panfilenko, D., Poller, P., Sonntag, D., Zillner, S., Schneider, M.: BPMN for knowledge acquisition and anomaly handling in CPS for smart factories. In: 2016 IEEE 21st International Conference on Emerging Technologies and Factory Automation (ETFA), pp. 1–4. IEEE (2016) Panfilenko, D., Poller, P., Sonntag, D., Zillner, S., Schneider, M.: BPMN for knowledge acquisition and anomaly handling in CPS for smart factories. In: 2016 IEEE 21st International Conference on Emerging Technologies and Factory Automation (ETFA), pp. 1–4. IEEE (2016)
26.
Zurück zum Zitat Provotar, O.I., Linder, Y.M., Veres, M.M.: Unsupervised anomaly detection in time series using LSTM-based autoencoders. In: 2019 IEEE International Conference on Advanced Trends in Information Theory (ATIT), pp. 513–517. IEEE (2019) Provotar, O.I., Linder, Y.M., Veres, M.M.: Unsupervised anomaly detection in time series using LSTM-based autoencoders. In: 2019 IEEE International Conference on Advanced Trends in Information Theory (ATIT), pp. 513–517. IEEE (2019)
27.
Zurück zum Zitat Purohit, H., et al.: MIMII dataset: sound dataset for malfunctioning industrial machine investigation and inspection. In: Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop, DCASE 2019, pp. 209–213 (November 2019) Purohit, H., et al.: MIMII dataset: sound dataset for malfunctioning industrial machine investigation and inspection. In: Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop, DCASE 2019, pp. 209–213 (November 2019)
29.
Zurück zum Zitat Rovetta, S., Mnasri, Z., Masulli, F.: Detection of hazardous road events from audio streams: an ensemble outlier detection approach. In: 2020 IEEE Conference on Evolving and Adaptive Intelligent Systems (EAIS), pp. 1–6. IEEE (2020) Rovetta, S., Mnasri, Z., Masulli, F.: Detection of hazardous road events from audio streams: an ensemble outlier detection approach. In: 2020 IEEE Conference on Evolving and Adaptive Intelligent Systems (EAIS), pp. 1–6. IEEE (2020)
30.
Zurück zum Zitat Sharma, G., Umapathy, K., Krishnan, S.: Trends in audio signal feature extraction methods. Appl. Acoust. 158, 107020 (2020)CrossRef Sharma, G., Umapathy, K., Krishnan, S.: Trends in audio signal feature extraction methods. Appl. Acoust. 158, 107020 (2020)CrossRef
31.
Zurück zum Zitat Sonntag, D., Zillner, S., van der Smagt, P., Lörincz, A.: Overview of the CPS for smart factories project: deep learning, knowledge acquisition, anomaly detection and intelligent user interfaces. In: Jeschke, S., Brecher, C., Song, H., Rawat, D.B. (eds.) Industrial Internet of Things. SSWT, pp. 487–504. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-42559-7_19CrossRef Sonntag, D., Zillner, S., van der Smagt, P., Lörincz, A.: Overview of the CPS for smart factories project: deep learning, knowledge acquisition, anomaly detection and intelligent user interfaces. In: Jeschke, S., Brecher, C., Song, H., Rawat, D.B. (eds.) Industrial Internet of Things. SSWT, pp. 487–504. Springer, Cham (2017). https://​doi.​org/​10.​1007/​978-3-319-42559-7_​19CrossRef
32.
Zurück zum Zitat Tagawa, T., Tadokoro, Y., Yairi, T.: Structured denoising autoencoder for fault detection and analysis. In: Asian Conference on Machine Learning, pp. 96–111 (2015) Tagawa, T., Tadokoro, Y., Yairi, T.: Structured denoising autoencoder for fault detection and analysis. In: Asian Conference on Machine Learning, pp. 96–111 (2015)
33.
Zurück zum Zitat Torfi, A., Iranmanesh, S.M., Nasrabadi, N.M., Dawson, J.M.: 3D convolutional neural networks for cross audio-visual matching recognition. IEEE Access 5, 22081–22091 (2017)CrossRef Torfi, A., Iranmanesh, S.M., Nasrabadi, N.M., Dawson, J.M.: 3D convolutional neural networks for cross audio-visual matching recognition. IEEE Access 5, 22081–22091 (2017)CrossRef
Metadaten
Titel
Deep Dense and Convolutional Autoencoders for Machine Acoustic Anomaly Detection
verfasst von
Gabriel Coelho
Pedro Pereira
Luis Matos
Alexandrine Ribeiro
Eduardo C. Nunes
André Ferreira
Paulo Cortez
André Pilastri
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-79150-6_27

Premium Partner