Skip to main content

2020 | OriginalPaper | Buchkapitel

Detection and Recognition of Food in Photo Galleries for Analysis of User Preferences

verfasst von : Evgeniy Miasnikov, Andrey Savchenko

Erschienen in: Image Analysis and Recognition

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Food analysis is one of the most important parts of user preference prediction engines for recommendation systems in the travel domain. In this paper, we describe and study the neural network method that allows you to recognize food in a gallery of photos taken with mobile devices. The described method consists of three main stages, including the classification of scenes, food detection, and subsequent classification. An essential feature of the developed method is the use of lightweight neural network models, which allows its usage on mobile devices. The development of the method was carried out using both known open data and a proprietary data set.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Savchenko, A.V., Demochkin, K.V., Grechikhin, I.S.: User preference prediction in visual data on mobile devices. arXiv preprint 1907.04519 (2019) Savchenko, A.V., Demochkin, K.V., Grechikhin, I.S.: User preference prediction in visual data on mobile devices. arXiv preprint 1907.04519 (2019)
2.
Zurück zum Zitat Matsuda, Y., Yanai, K.: Multiple-food recognition considering co-occurrence employing manifold ranking. In: Proceedings of the 21st International Conference on Pattern Recognition, ICPR 2012, pp. 2017–2020, November 2012 Matsuda, Y., Yanai, K.: Multiple-food recognition considering co-occurrence employing manifold ranking. In: Proceedings of the 21st International Conference on Pattern Recognition, ICPR 2012, pp. 2017–2020, November 2012
3.
Zurück zum Zitat Kitamura, K., Yamasaki, T., Aizawa, K.: FoodLog: capture, analysis and retrieval of personal food images via web. In: Proceedings of the ACM Multimedia 2009 Workshop on Multimedia for Cooking and Eating Activities, CEA 2009, pp. 23–30. Association for Computing Machinery, New York (2009) Kitamura, K., Yamasaki, T., Aizawa, K.: FoodLog: capture, analysis and retrieval of personal food images via web. In: Proceedings of the ACM Multimedia 2009 Workshop on Multimedia for Cooking and Eating Activities, CEA 2009, pp. 23–30. Association for Computing Machinery, New York (2009)
4.
Zurück zum Zitat Farinella, G.M., Allegra, D., Stanco, F., Battiato, S.: On the exploitation of one class classification to distinguish food vs non-food images. In: Murino, V., Puppo, E., Sona, D., Cristani, M., Sansone, C. (eds.) ICIAP 2015. LNCS, vol. 9281, pp. 375–383. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-23222-5_46CrossRef Farinella, G.M., Allegra, D., Stanco, F., Battiato, S.: On the exploitation of one class classification to distinguish food vs non-food images. In: Murino, V., Puppo, E., Sona, D., Cristani, M., Sansone, C. (eds.) ICIAP 2015. LNCS, vol. 9281, pp. 375–383. Springer, Cham (2015). https://​doi.​org/​10.​1007/​978-3-319-23222-5_​46CrossRef
5.
Zurück zum Zitat Ragusa, F., Tomaselli, V., Furnari, A., Battiato, S., Farinella, G.M.: Food vs non-food classification. In: Proceedings of the International Workshop on Multimedia Assisted Dietary Management (MADiMa), pp. 77–81. ACM (2016) Ragusa, F., Tomaselli, V., Furnari, A., Battiato, S., Farinella, G.M.: Food vs non-food classification. In: Proceedings of the International Workshop on Multimedia Assisted Dietary Management (MADiMa), pp. 77–81. ACM (2016)
6.
Zurück zum Zitat Myers, A., et al.: Im2Calories: towards an automated mobile vision food diary. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1233–1241, December 2015 Myers, A., et al.: Im2Calories: towards an automated mobile vision food diary. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1233–1241, December 2015
7.
Zurück zum Zitat Anzawa, M., Amano, S., Yamakata, Y., Motonaga, K., Kamei, A., Aizawa, K.: Recognition of multiple food items in a single photo for use in a buffet-style restaurant. IEICE Trans. Inf. Syst. E102.D(2), 410–414 (2019) Anzawa, M., Amano, S., Yamakata, Y., Motonaga, K., Kamei, A., Aizawa, K.: Recognition of multiple food items in a single photo for use in a buffet-style restaurant. IEICE Trans. Inf. Syst. E102.D(2), 410–414 (2019)
8.
Zurück zum Zitat Kagaya, H., Aizawa, K., Ogawa, M.: Food detection and recognition using convolutional neural network. In: Proceedings of the 22nd ACM International Conference on Multimedia, MM 2014, pp. 1085–1088. Association for Computing Machinery, New York (2014) Kagaya, H., Aizawa, K., Ogawa, M.: Food detection and recognition using convolutional neural network. In: Proceedings of the 22nd ACM International Conference on Multimedia, MM 2014, pp. 1085–1088. Association for Computing Machinery, New York (2014)
9.
Zurück zum Zitat Singla, A., Yuan, L., Ebrahimi, T.: Food/non-food image classification and food categorization using pre-trained GoogLeNet model. In: Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. MADiMa 2016. Association for Computing Machinery, New York (2016) Singla, A., Yuan, L., Ebrahimi, T.: Food/non-food image classification and food categorization using pre-trained GoogLeNet model. In: Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. MADiMa 2016. Association for Computing Machinery, New York (2016)
11.
Zurück zum Zitat Oliveira, L., Costa, V., Neves, G., Oliveira, T., Jorge, E., Lizarraga, M.: A mobile, lightweight, poll-based food identification system. Pattern Recogn. 47(5), 1941–1952 (2014)CrossRef Oliveira, L., Costa, V., Neves, G., Oliveira, T., Jorge, E., Lizarraga, M.: A mobile, lightweight, poll-based food identification system. Pattern Recogn. 47(5), 1941–1952 (2014)CrossRef
12.
Zurück zum Zitat Martinel, N., Piciarelli, C., Micheloni, C., Foresti, G.L.: A structured committee for food recognition. In: 2015 IEEE International Conference on Computer Vision Workshop (ICCVW), pp. 484–492 (2015) Martinel, N., Piciarelli, C., Micheloni, C., Foresti, G.L.: A structured committee for food recognition. In: 2015 IEEE International Conference on Computer Vision Workshop (ICCVW), pp. 484–492 (2015)
13.
Zurück zum Zitat Zheng, J., Wang, Z., Zhu, C.: Food image recognition via superpixel based low-level and mid-level distance coding for smart home applications. Sustainability 9(5), 856 (2017)CrossRef Zheng, J., Wang, Z., Zhu, C.: Food image recognition via superpixel based low-level and mid-level distance coding for smart home applications. Sustainability 9(5), 856 (2017)CrossRef
14.
Zurück zum Zitat Bettadapura, V., Thomaz, E., Parnami, A., Abowd, G.D., Essa, I.: Leveraging context to support automated food recognition in restaurants. In: Proceedings of the Winter Conference on Applications of Computer Vision (WACV), pp. 580–587. IEEE (2015) Bettadapura, V., Thomaz, E., Parnami, A., Abowd, G.D., Essa, I.: Leveraging context to support automated food recognition in restaurants. In: Proceedings of the Winter Conference on Applications of Computer Vision (WACV), pp. 580–587. IEEE (2015)
15.
Zurück zum Zitat Bolanos, M., P., R.: Simultaneous food localization and recognition. In: International Conference on Pattern Recognition, pp. 3140–3145 (2017) Bolanos, M., P., R.: Simultaneous food localization and recognition. In: International Conference on Pattern Recognition, pp. 3140–3145 (2017)
16.
Zurück zum Zitat Wu, H., Merler, M., Uceda-Sosa, R., Smith, J.R.: Learning to make better mistakes: semantics-aware visual food recognition. In: Proceedings of the 24th International Conference on Multimedia (MM), pp. 172–176. ACM (2016) Wu, H., Merler, M., Uceda-Sosa, R., Smith, J.R.: Learning to make better mistakes: semantics-aware visual food recognition. In: Proceedings of the 24th International Conference on Multimedia (MM), pp. 172–176. ACM (2016)
17.
Zurück zum Zitat Ciocca, G., Napoletano, P., Schettini, R.: Food recognition: a new dataset, experiments, and results. IEEE J. Biomed. Health Inform. 21(3), 588–598 (2016)CrossRef Ciocca, G., Napoletano, P., Schettini, R.: Food recognition: a new dataset, experiments, and results. IEEE J. Biomed. Health Inform. 21(3), 588–598 (2016)CrossRef
18.
Zurück zum Zitat Kaur, P., Sikka, K., Wang, W., Belongie, S., Divakaran, A.: FoodX-251: a dataset for fine-grained food classification. arXiv preprint 1907.06167 (2019) Kaur, P., Sikka, K., Wang, W., Belongie, S., Divakaran, A.: FoodX-251: a dataset for fine-grained food classification. arXiv preprint 1907.06167 (2019)
21.
Zurück zum Zitat Xin Wang, Kumar, D., Thome, N., Cord, M., Precioso, F.: Recipe recognition with large multimodal food dataset. In: Proceedings of the International Conference on Multimedia Expo Workshops (ICMEW), pp. 1–6. IEEE (2015) Xin Wang, Kumar, D., Thome, N., Cord, M., Precioso, F.: Recipe recognition with large multimodal food dataset. In: Proceedings of the International Conference on Multimedia Expo Workshops (ICMEW), pp. 1–6. IEEE (2015)
22.
Zurück zum Zitat Liu, C., Cao, Y., Luo, Y., Chen, G., Vokkarane, V., Ma, Y.: DeepFood: deep learning-based food image recognition for computer-aided dietary assessment. In: Chang, C.K., Chiari, L., Cao, Y., Jin, H., Mokhtari, M., Aloulou, H. (eds.) ICOST 2016. LNCS, vol. 9677, pp. 37–48. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-39601-9_4CrossRef Liu, C., Cao, Y., Luo, Y., Chen, G., Vokkarane, V., Ma, Y.: DeepFood: deep learning-based food image recognition for computer-aided dietary assessment. In: Chang, C.K., Chiari, L., Cao, Y., Jin, H., Mokhtari, M., Aloulou, H. (eds.) ICOST 2016. LNCS, vol. 9677, pp. 37–48. Springer, Cham (2016). https://​doi.​org/​10.​1007/​978-3-319-39601-9_​4CrossRef
25.
Zurück zum Zitat Zhang, H., Cissé, M., Dauphin, Y.N., Lopez-Paz, D.: Mixup: beyond empirical risk minimization. CoRR abs/1710.09412 (2017) Zhang, H., Cissé, M., Dauphin, Y.N., Lopez-Paz, D.: Mixup: beyond empirical risk minimization. CoRR abs/1710.09412 (2017)
26.
Zurück zum Zitat Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. CoRR abs/1709.01507 (2017) Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. CoRR abs/1709.01507 (2017)
Metadaten
Titel
Detection and Recognition of Food in Photo Galleries for Analysis of User Preferences
verfasst von
Evgeniy Miasnikov
Andrey Savchenko
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-50347-5_9