
2021 | Original Paper | Book Chapter

Rescue Dog Action Recognition by Integrating Ego-Centric Video, Sound and Sensor Information

Written by: Yuta Ide, Tsuyohito Araki, Ryunosuke Hamada, Kazunori Ohno, Keiji Yanai

Published in: Pattern Recognition. ICPR International Workshops and Challenges

Publisher: Springer International Publishing


Abstract

A dog that assists rescue work at disaster sites such as earthquakes and landslides is called a “disaster rescue dog”, or simply a “rescue dog”. In Japan, where earthquakes occur frequently, a research project on “Cyber-Rescue” has been organized to make rescue activities more efficient. To analyze the activities of rescue dogs at disaster sites, the project developed “Cyber Dog Suits” equipped with sensors, a camera and a GPS. In this work, we recognize dog activities in ego-centric dog videos taken by the camera mounted on the Cyber Dog Suit. To this end, we propose an image/sound/sensor-based four-stream CNN for dog activity recognition that integrates sound and sensor signals as well as motion and appearance. We conducted experiments on multi-class activity categorization using the proposed method. The proposed method, which integrates appearance, motion, sound and sensor information, achieved the highest accuracy, 48.05%. This is relatively high for recognition on ego-centric videos.
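The four-stream idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how the streams are combined, so this example assumes a simple late fusion: each stream (appearance, motion, sound, sensor) is taken to have already produced a feature vector, the vectors are concatenated, and a single linear layer with softmax scores the activity classes. The feature sizes, the number of classes, the random weights and the names `fuse_streams` and `classify` are all hypothetical and chosen only for illustration; they are not the authors' architecture.

```python
import math
import random

# Stream names follow the abstract; everything else here is an assumption.
STREAMS = ("appearance", "motion", "sound", "sensor")


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_streams(features):
    """Concatenate per-stream feature vectors in a fixed stream order."""
    fused = []
    for name in STREAMS:
        fused.extend(features[name])
    return fused


def classify(features, weights, biases):
    """Apply one linear layer to the fused vector, then softmax over classes."""
    fused = fuse_streams(features)
    scores = [
        sum(w * x for w, x in zip(row, fused)) + b
        for row, b in zip(weights, biases)
    ]
    return softmax(scores)


# Hypothetical sizes: 4 features per stream, 3 activity classes.
rng = random.Random(0)
features = {name: [rng.uniform(-1, 1) for _ in range(4)] for name in STREAMS}
num_classes = 3
fused_size = 4 * len(STREAMS)
weights = [[rng.uniform(-0.5, 0.5) for _ in range(fused_size)]
           for _ in range(num_classes)]
biases = [0.0] * num_classes

probs = classify(features, weights, biases)
predicted = max(range(num_classes), key=lambda i: probs[i])
```

In a real system the feature vectors would come from trained per-stream networks and the fusion weights would be learned; the sketch only shows how four modalities can feed one multi-class decision.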


Footnotes
1
“Triage” means the process of deciding who receives medical treatment first, according to how seriously the person is injured.
 
Metadata
Title
Rescue Dog Action Recognition by Integrating Ego-Centric Video, Sound and Sensor Information
Written by
Yuta Ide
Tsuyohito Araki
Ryunosuke Hamada
Kazunori Ohno
Keiji Yanai
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-68796-0_23