2021 | OriginalPaper | Book chapter

Understanding Event Boundaries for Egocentric Activity Recognition from Photo-Streams

Authors: Alejandro Cartas, Estefania Talavera, Petia Radeva, Mariella Dimiccoli

Published in: Pattern Recognition. ICPR International Workshops and Challenges

Publisher: Springer International Publishing

Abstract

The recognition of human activities captured by a wearable photo-camera is especially suited for understanding the behavior of a person. However, it has received little attention compared with activity recognition from fixed cameras. In this work, we propose to use segmented events from photo-streams as temporal boundaries to improve the performance of activity recognition. Furthermore, we robustly measure the effectiveness of this approach both when images of the evaluated person have been seen during training and when the person is completely unknown during testing. Experimental results show that leveraging temporal boundary information on pictures of seen people improves all classification metrics; in particular, it raises classification accuracy to as much as 85.73%.

Title: Understanding Event Boundaries for Egocentric Activity Recognition from Photo-Streams
Authors: Alejandro Cartas, Estefania Talavera, Petia Radeva, Mariella Dimiccoli
Publisher: Springer International Publishing
Book: Pattern Recognition. ICPR International Workshops and Challenges
Print ISBN: 978-3-030-68795-3
Electronic ISBN: 978-3-030-68796-0
Copyright year: 2021
DOI: https://doi.org/10.1007/978-3-030-68796-0_24
