2023 | OriginalPaper | Book chapter

Review of Modern Technologies of Computer Vision

Written by: Ekaterina Bezuglova, Andrey Gladkov, Georgy Valuev

Published in: Current Problems in Applied Mathematics and Computer Science and Systems

Publisher: Springer Nature Switzerland

Abstract

Today, artificial intelligence technologies are becoming increasingly popular. Scientific and technological progress has increased the power of hardware and produced effective ways to implement techniques such as machine learning, neural networks, and deep learning. This has made it possible to build effective methods for recognizing images and video data, which is the task of computer vision. By 2022, a large number of computer vision methods, technologies, and techniques had been developed. This paper studies the use of computer vision in 2022. The results indicate a decrease in the popularity of computer vision in the scientific community and its introduction into industry, medicine, zoology, and human social life. The most popular computer vision method is the ResNet neural network model.

Metadata
Title
Review of Modern Technologies of Computer Vision
Written by
Ekaterina Bezuglova
Andrey Gladkov
Georgy Valuev
Copyright year
2023
DOI
https://doi.org/10.1007/978-3-031-34127-4_31
