skip to main content
10.1145/3341162.3344865acmconferencesArticle/Chapter ViewAbstractPublication PagesubicompConference Proceedingsconference-collections
research-article

Cross-dataset deep transfer learning for activity recognition

Published:09 September 2019Publication History

ABSTRACT

Convolution Neural Network (CNN) filters learned on one domain can be used as feature extractors on another similar domain. Transferring filters allow reusing datasets across domains and reducing labelling costs. In this paper, four activity recognition datasets were analyzed to study the effects of transferring filters across the datasets. A spectro-temporal ResNet was implemented as a deep, end-to-end learning architecture. We analyzed the number of transferred CNN residual blocks with respect to the size of the target-adaptation data. The analysis showed that transfer learning using small adaptation subsets is more useful when the target domain contains a small number of different activities. Furthermore, the similarity between the domains, participating in the transfer learning scenario, seems to play a role in its success. The most successful transfer achieved an F1-score of 93%, which is an increase of 9 percentage points compared to a domain-specific baseline model.

References

  1. L. Deng, D. Yu. (2014). Deep Learning: Methods and Applications (PDF). Foundations and Trends in Signal Processing. 7 (3--4): 1--199.Google ScholarGoogle Scholar
  2. W. McCulloch, W. Pitts (1943). A Logical Calculus of Ideas Immanent in Nervous Activity. Bulletin of Mathematical Biophysics. 5 (4): 115--133.Google ScholarGoogle ScholarCross RefCross Ref
  3. Y. Bengio (2009). "Learning Deep Architectures for AI." Technical Report 1312. {Online. Accessible at: https://www.iro.umontreal.ca/~lisa/pointeurs/TR1312.pdf}Google ScholarGoogle Scholar
  4. A. Krizhevsky, I. Sutskever, G. E. Hinton (2012). ImageNet classification with deep convolutional neural networks. in International Conference on Neural Information Processing Systems, pp. 1097--1105. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. H. Lee, R. Grosse, R. Ranganath, A.Y. Ng (2009). Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In Proceedings of the 26th Annual International Conference on Machine Learning, pages 609--616. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. K. Jarrett, K. Kavukcuoglu, M. A. Ranzato, and Y. LeCun (2009). What is the best multi-stage architecture for object recognition? In International Conference on Computer V sion, pages 2146--2153. IEEE.Google ScholarGoogle ScholarCross RefCross Ref
  7. Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, vol. 86, no. 11, pp. 2278--2324.Google ScholarGoogle ScholarCross RefCross Ref
  8. H. Gjoreski, J. Bizjak, M. Gjoreski, M. Gams (2016). Comparing deep and classical machine learning methods for human activity recognition using wrist accelerometer, in: Proceedings of the IJCAI 2016 Workshop on Deep Learning for Artificial Intelligence, New York, NY, USA, Vol. 10.Google ScholarGoogle Scholar
  9. D. Ravi, C. Wong, B. Lo, G.-Z. Yang (2017). A deep learning approach to on-node sensor data analytics for mobile or wearable devices, IEEE journal of biomedical and health informatics 21 (1) 56--64.Google ScholarGoogle ScholarCross RefCross Ref
  10. S. Bhattacharya, N. D. Lane, Sparsification and separation of deep learning layers for constrained resource inference on wearables, in: Proceedings of the 14th ACM Conference on Embedded Network Sensor Systems CD- ROM, ACM, 2016, pp. 176--189. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. Zeng, L. T. Nguyen, B. Yu, O. J. Mengshoel, J. Zhu, P. Wu, J. Zhang (2014). Convolutional neural networks for human activity recognition using mobile sensors, in: 6th International Conference on Mobile Computing, Applications and Services, IEEE, pp. 197--205.Google ScholarGoogle ScholarCross RefCross Ref
  12. J. Yang, M. N. Nguyen, P. P. San, X. L. Li, S. Krishnaswamy (2015). Deep convolutional neural networks on multichannel time series for human activity recognition, in: Twenty-Fourth International Joint Conference on Artificial Intelligence. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. F. Ordonez, D. Roggen (2016). Deep convolutional and LSTM recurrent neural networks for multimodal wearable activity recognition, Sensors 16 (1) 115.Google ScholarGoogle ScholarCross RefCross Ref
  14. S. Hochreiter, J. Schmidhuber (1997). Long short-term memory, Neural computation 9 (8) 1735--1780. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. K. Krishna, D. Jain, S. V. Mehta, S. Choudhary (2018). An LSTM based system for prediction of human activities with durations, Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 1 (4) 147. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. D. Cook, K. D. Feuz, N.C. Krishnan (2013). Transfer learning for activity recognition: A survey. Knowledge and information systems, 36(3), pp. 537--556. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. S. Pan, Q. Yang (2010). A survey on transfer learning. Knowledge Data Entering IEEE Trans 22(10):1345--1359 Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. A. Calatroni, D. Roggen, and G. Tröster (2011). Automatic transfer of activity recognition capabilities between body-worn motion sensors: Training newcomers to recognize locomotion. In Proc. INSS, Vol. 6.Google ScholarGoogle Scholar
  19. M. Kurz, M. HÖlzle, A. Ferscha, A. Calatroni, and others (2011). Real-time transfer and evaluation of activity recognition capabilities in an opportunistic system. In Proc. ADAPTIVE. 73--78.Google ScholarGoogle Scholar
  20. R. Ding et al. (2019). Empirical Study and Improvement on Deep Transfer Learning for Human Activity Recognition. Sensors, 19(1), 57.Google ScholarGoogle ScholarCross RefCross Ref
  21. F. J. O. Morales, D. Roggen, (2016). Deep convolutional feature transfer across mobile activity recognition domains, sensor modalities and locations. In Proceedings of the 2016 ACM International Symposium on Wearable Computers (pp. 92--99). ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. D. Roggen, et al. (2010). Collecting complex activity datasets in highly rich networked sensor environments. In 2010 Seventh international conference on networked sensing systems (INSS) (pp. 233--240). IEEE.Google ScholarGoogle ScholarCross RefCross Ref
  23. P. Zappi et al. (2008). Activity recognition from on-body sensors: accuracy-power trade-off by dynamic sensor selection. In European Conference on Wireless Sensor Networks (pp. 17--33). Springer, Berlin, Heidelberg. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. H. Gjoreski, B. Kaluža, M. Gams, Radoje Milić, M. Luštrek (2015). Context-based Ensemble Method for Human Energy Expenditure Estimation. Applied Soft Computing, Pages 960--970. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. S. Kozina, H. Gjoreski, M. Gams, M. Luštrek (2013). Three-layer activity recognition combining domain knowledge and meta-classification. Journal of Medical and Biological Engineering, vol. 33, no. 4. p.p. 406--414.Google ScholarGoogle ScholarCross RefCross Ref
  26. A. Reiss, D. Stricker (2012). Introducing a new benchmarked dataset for activity monitoring. In 2012 16th International Symposium on Wearable Computers (pp. 108--109). IEEE. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. M. Gjoreski et al. (2019). Classical and deep learning methods for recognizing human activities and modes of transportation with smartphone sensors. Information Fusion, Under review.Google ScholarGoogle Scholar
  28. S. Hochreiter, Y. Bengio, P. Frasconi, J. Schmidhuber, et al. (2001). Gradient flow in recurrent nets: the difficulty of learning long-term dependenciesGoogle ScholarGoogle Scholar
  29. H. Gjoreski, M. Ciliberto, L. Wang, F. J. O. Morales, S. Mekki, S. Valentin, D. Roggen (2018). The University of Sussex-Huawei Locomotion and Transportation Dataset for Multimodal Analytics with Mobile Devices." IEEE Access, 2018Google ScholarGoogle Scholar
  30. S. Ioffe, C. Szegedy (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift, arXiv preprint arXiv:1502.03167. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. V. Nair, G. E. Hinton (2010). Rectified linear units improve restricted Boltzmann machines, in: Proceedings of the 27th international conference on machine learning (ICML-10), 2010, pp. 807--814. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. S. Inoue, X Pan. (2016). Supervised and Unsupervised Transfer Learning for Activity Recognition from Simple In-home Sensors. MobiQuitous. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Cross-dataset deep transfer learning for activity recognition

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        UbiComp/ISWC '19 Adjunct: Adjunct Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2019 ACM International Symposium on Wearable Computers
        September 2019
        1234 pages
        ISBN:9781450368698
        DOI:10.1145/3341162

        Copyright © 2019 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 9 September 2019

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate764of2,912submissions,26%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader