Skip to main content

2021 | OriginalPaper | Buchkapitel

Understanding Event Boundaries for Egocentric Activity Recognition from Photo-Streams

verfasst von : Alejandro Cartas, Estefania Talavera, Petia Radeva, Mariella Dimiccoli

Erschienen in: Pattern Recognition. ICPR International Workshops and Challenges

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The recognition of human activities captured by a wearable photo-camera is especially suited for understanding the behavior of a person. However, it has received comparatively little attention with respect to activity recognition from fixed cameras. In this work, we propose to use segmented events from photo-streams as temporal boundaries to improve the performance of activity recognition. Furthermore, we robustly measure its effectiveness when images of the evaluated person have been seen during training, and when the person is completely unknown during testing. Experimental results show that leveraging temporal boundary information on pictures of seen people improves all classification metrics, particularly it improves the classification accuracy up to 85.73%.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Cartas, A., Luque, J., Radeva, P., Segura, C., Dimiccoli, M.: Seeing and hearing egocentric actions: How much can we learn? In: Proceedings of the IEEE International Conference on Computer Vision Workshops (2019) Cartas, A., Luque, J., Radeva, P., Segura, C., Dimiccoli, M.: Seeing and hearing egocentric actions: How much can we learn? In: Proceedings of the IEEE International Conference on Computer Vision Workshops (2019)
3.
Zurück zum Zitat de Jong, R.: Multimodal deep learning for the classification of human activity: radar and video data fusion for the classification of human activity (2019) de Jong, R.: Multimodal deep learning for the classification of human activity: radar and video data fusion for the classification of human activity (2019)
4.
Zurück zum Zitat Bolaños, M., Dimiccoli, M., Radeva, P.: Toward storytelling from visual lifelogging: an overview. IEEE Trans. Hum.-Mach. Syst. 47(1), 77–90 (2017) Bolaños, M., Dimiccoli, M., Radeva, P.: Toward storytelling from visual lifelogging: an overview. IEEE Trans. Hum.-Mach. Syst. 47(1), 77–90 (2017)
5.
Zurück zum Zitat Aghaei, M., Dimiccoli, M., Radeva, P.: All the people around me: face discovery in egocentric photo-streams. In: 2017 IEEE International Conference on Image Processing (ICIP), pp. 1342–1346. IEEE (2017) Aghaei, M., Dimiccoli, M., Radeva, P.: All the people around me: face discovery in egocentric photo-streams. In: 2017 IEEE International Conference on Image Processing (ICIP), pp. 1342–1346. IEEE (2017)
6.
Zurück zum Zitat Cartas, A., Radeva, P., Dimiccoli, M.: Activities of daily living monitoring via a wearable camera: toward real-world applications. IEEE Access 8, 77344–77363 (2020)CrossRef Cartas, A., Radeva, P., Dimiccoli, M.: Activities of daily living monitoring via a wearable camera: toward real-world applications. IEEE Access 8, 77344–77363 (2020)CrossRef
7.
Zurück zum Zitat Castro, D., et al.: Predicting daily activities from egocentric images using deep learning, pp. 75–82 (2015) Castro, D., et al.: Predicting daily activities from egocentric images using deep learning, pp. 75–82 (2015)
8.
Zurück zum Zitat Cartas, A., Dimiccoli, M., Radeva, P.: Batch-based activity recognition from egocentric photo-streams. In: Proceedings of the IEEE International Conference on Computer Vision Workshops, pp. 2347–2354 (2017) Cartas, A., Dimiccoli, M., Radeva, P.: Batch-based activity recognition from egocentric photo-streams. In: Proceedings of the IEEE International Conference on Computer Vision Workshops, pp. 2347–2354 (2017)
10.
Zurück zum Zitat Aghaei, M., Dimiccoli, M., Ferrer, C.C., Radeva, P.: Towards social pattern characterization in egocentric photo-streams. Comput. Vision Image Unders. 171, 104–117 (2018)CrossRef Aghaei, M., Dimiccoli, M., Ferrer, C.C., Radeva, P.: Towards social pattern characterization in egocentric photo-streams. Comput. Vision Image Unders. 171, 104–117 (2018)CrossRef
11.
Zurück zum Zitat Aimar, E.S., Radeva, P., Dimiccoli, M.: Social relation recognition in egocentric photostreams. In: 2019 IEEE International Conference on Image Processing (ICIP), pp. 3227–3231. IEEE (2019) Aimar, E.S., Radeva, P., Dimiccoli, M.: Social relation recognition in egocentric photostreams. In: 2019 IEEE International Conference on Image Processing (ICIP), pp. 3227–3231. IEEE (2019)
12.
Zurück zum Zitat Talavera, E., Leyva-Vallina, M., Sarker, M.K., Puig, D., Petkov, N., Radeva, P.: Hierarchical approach to classify food scenes in egocentric photo-streams. IEEE J. Biomed. Health Inf. 24, 866–877 (2019) Talavera, E., Leyva-Vallina, M., Sarker, M.K., Puig, D., Petkov, N., Radeva, P.: Hierarchical approach to classify food scenes in egocentric photo-streams. IEEE J. Biomed. Health Inf. 24, 866–877 (2019)
13.
Zurück zum Zitat Talavera, E., Wuerich, C., Petkov, N., Radeva, P.: Topic modelling for routine discovery from egocentric photo-streams. Pattern Recogn 104, 107330 (2020)CrossRef Talavera, E., Wuerich, C., Petkov, N., Radeva, P.: Topic modelling for routine discovery from egocentric photo-streams. Pattern Recogn 104, 107330 (2020)CrossRef
14.
Zurück zum Zitat Poleg, Y., Arora, C., Peleg, S.: Temporal segmentation of egocentric videos. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2537–2544 (2014) Poleg, Y., Arora, C., Peleg, S.: Temporal segmentation of egocentric videos. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2537–2544 (2014)
16.
Zurück zum Zitat Dimiccoli, M., Bolaños, M., Talavera, E., Aghaei, M., Nikolov, S.G., Radeva, P.: Sr-clustering: semantic regularized clustering for egocentric photo streams segmentation. Comput. Vision Image Underst. 155, 55–69 (2017)CrossRef Dimiccoli, M., Bolaños, M., Talavera, E., Aghaei, M., Nikolov, S.G., Radeva, P.: Sr-clustering: semantic regularized clustering for egocentric photo streams segmentation. Comput. Vision Image Underst. 155, 55–69 (2017)CrossRef
17.
Zurück zum Zitat Dias, C., Dimiccoli, M.: Learning event representations by encoding the temporal context. In: Proceedings of the European Conference on Computer Vision (ECCV) (2018) Dias, C., Dimiccoli, M.: Learning event representations by encoding the temporal context. In: Proceedings of the European Conference on Computer Vision (ECCV) (2018)
18.
Zurück zum Zitat Pirsiavash, H., Ramanan, D.: Detecting activities of daily living in first-person camera views. In: Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR), pp. 2847–2854. IEEE (2012) Pirsiavash, H., Ramanan, D.: Detecting activities of daily living in first-person camera views. In: Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR), pp. 2847–2854. IEEE (2012)
19.
Zurück zum Zitat Sudhakaran, S., Lanz, O.: Attention is all we need: Nailing down object-centric attention for egocentric activity recognition. In: Proceedings of the British Machine Vision Conference (BMVC) (2018) Sudhakaran, S., Lanz, O.: Attention is all we need: Nailing down object-centric attention for egocentric activity recognition. In: Proceedings of the British Machine Vision Conference (BMVC) (2018)
20.
Zurück zum Zitat García Hernando, G., Yuan, S., Baek, S., Kim, T.-K.: First-person hand action benchmark with rgb-d videos and 3D hand pose annotations. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018) García Hernando, G., Yuan, S., Baek, S., Kim, T.-K.: First-person hand action benchmark with rgb-d videos and 3D hand pose annotations. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018)
24.
Zurück zum Zitat Song, S., et al.: Multimodal multi-stream deep learning for egocentric activity recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 24–31 (2016) Song, S., et al.: Multimodal multi-stream deep learning for egocentric activity recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 24–31 (2016)
26.
Zurück zum Zitat Chollet, F.: Deep Learning with Python, 1st edn., pp. 219–221. Manning Publications Co, Greenwich (2017) Chollet, F.: Deep Learning with Python, 1st edn., pp. 219–221. Manning Publications Co, Greenwich (2017)
27.
Zurück zum Zitat Ng, J.Y.-H., Hausknecht, M., Vijayanarasimhan, S., Vinyals, O., Monga, R., Toderici, G.: Beyond short snippets: deep networks for video classification. In: Computer Vision and Pattern Recognition (2015) Ng, J.Y.-H., Hausknecht, M., Vijayanarasimhan, S., Vinyals, O., Monga, R., Toderici, G.: Beyond short snippets: deep networks for video classification. In: Computer Vision and Pattern Recognition (2015)
28.
Zurück zum Zitat Chollet, F.: Xception: deep learning with depthwise separable convolutions, pp. 1800–1807 (2017) Chollet, F.: Xception: deep learning with depthwise separable convolutions, pp. 1800–1807 (2017)
29.
Zurück zum Zitat King, G., Zeng, L.: Logistic regression in rare events data. Polit. Anal. 9(2), 137–163 (2001)CrossRef King, G., Zeng, L.: Logistic regression in rare events data. Polit. Anal. 9(2), 137–163 (2001)CrossRef
30.
Zurück zum Zitat Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: CVPR09 (2009) Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: CVPR09 (2009)
31.
Zurück zum Zitat Breiman, L., Friedman, J., Stone, C.J., Olshen, R.A.: Classification and Regression Trees. CRC Press, Boca Raton (1984)MATH Breiman, L., Friedman, J., Stone, C.J., Olshen, R.A.: Classification and Regression Trees. CRC Press, Boca Raton (1984)MATH
32.
Zurück zum Zitat Garcia del Molino, A., Lim, J.-H., Tan, A.-H.: Predicting visual context for unsupervised event segmentation in continuous photo-streams. In: 2018 ACM Multimedia Conference on Multimedia Conference, pp. 10–17. ACM (2018) Garcia del Molino, A., Lim, J.-H., Tan, A.-H.: Predicting visual context for unsupervised event segmentation in continuous photo-streams. In: 2018 ACM Multimedia Conference on Multimedia Conference, pp. 10–17. ACM (2018)
Metadaten
Titel
Understanding Event Boundaries for Egocentric Activity Recognition from Photo-Streams
verfasst von
Alejandro Cartas
Estefania Talavera
Petia Radeva
Mariella Dimiccoli
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-68796-0_24