Skip to main content

2022 | OriginalPaper | Buchkapitel

Implementation of a Method Using Image Sequentialization, Patch Embedding and ViT Encoder to Detect the Breast Cancer on RGBA Images and Binary Masks

verfasst von : Tanishka Dixit, Namrata Singh, Geetika Srivastava, Meenakshi Srivastava

Erschienen in: Artificial Intelligence and Speech Technology

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper uses an approach where we will be training the unlabelled and labelled datasets into the system and these datasets comprises of: RGBA images and Binary masks. This will be a benefit in order to get a much better result and more accurate also as pre-training method always helps in getting excellent output. We will use a Transformer Model in this paper where we will apply an image sequentialization technique having 51 million parameters which will help us in attaining the smoother images without any noise. At times, this approach could be time consuming due to used datasets and its sizes, however it shows more accurate and efficient results. Further, we will try to figure out the pixel wise label map using patch embedding technique. Once these techniques are applied then CNN-Transformer Hybrid will come into the role which will encode and decode the images to high-level feature extractions and full spatial resolution respectively. This way of doing encoding and decoding is also known as forward pass and back propagation. Also, it will involve the cascaded upsampler where we will try to use self-attention processes into the design of encoder using the transformers. This entire mechanism will involve few of the best evaluation metrics and those are: Pixel accuracy, IoU, Mean-IoU and Recall/Precision/F1 Score giving the effective and best results.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Dosovitskiy, A., et al.: An Image is Worth 16X16 Words: Transformers for Image Recognition at Scale. Published as a conference paper at ICLR (2021) Dosovitskiy, A., et al.: An Image is Worth 16X16 Words: Transformers for Image Recognition at Scale. Published as a conference paper at ICLR (2021)
2.
3.
5.
Zurück zum Zitat Liu, X., He, P., Chen, W., Gao, J.: Improving multi-task deep neural networks via knowledge distillation for natural language understanding. arXiv:1904.09482v1 [cs.CL] (2019) Liu, X., He, P., Chen, W., Gao, J.: Improving multi-task deep neural networks via knowledge distillation for natural language understanding. arXiv:​1904.​09482v1 [cs.CL] (2019)
6.
Zurück zum Zitat Sohn, K., et al.: FixMatch: Simplifying semi-supervised learning with consistency and confidence. In: 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada (2020) Sohn, K., et al.: FixMatch: Simplifying semi-supervised learning with consistency and confidence. In: 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada (2020)
8.
Zurück zum Zitat Al-Rfou, R., Choe, D., Constant, N., Guo, M., Jones, L.: Character-level language modeling with deeper self-attention. arXiv:1808.04444v2 [cs.CL] (2018) Al-Rfou, R., Choe, D., Constant, N., Guo, M., Jones, L.: Character-level language modeling with deeper self-attention. arXiv:​1808.​04444v2 [cs.CL] (2018)
9.
Zurück zum Zitat Khandelwal, U., He, H., Qi, P., Jurafsky, D.: Sharp nearby, fuzzy far away: how neural language models use context. Association for Computational Linguistics (2018) Khandelwal, U., He, H., Qi, P., Jurafsky, D.: Sharp nearby, fuzzy far away: how neural language models use context. Association for Computational Linguistics (2018)
10.
Zurück zum Zitat Conneau, A., Schwenk, H., Cun, Y.L., Barrault, L.: Very deep convolutional networks for text classification. arXiv:1606.01781v2 [cs.CL] (2017) Conneau, A., Schwenk, H., Cun, Y.L., Barrault, L.: Very deep convolutional networks for text classification. arXiv:​1606.​01781v2 [cs.CL] (2017)
11.
Zurück zum Zitat Arısoy, E., Sainath, T.N., Kingsbury, B., Ramabhadran, B.: Deep neural network language models. Association for Computational Linguistics (2012) Arısoy, E., Sainath, T.N., Kingsbury, B., Ramabhadran, B.: Deep neural network language models. Association for Computational Linguistics (2012)
12.
Zurück zum Zitat Bengio, Y.: Learning deep architectures for AI. Technical Report 1312 (2009) Bengio, Y.: Learning deep architectures for AI. Technical Report 1312 (2009)
13.
14.
Zurück zum Zitat Girshick, R., Donahue, J., Darrell, J., Malik, J., Berkeley, U.C.: Rich feature hierarchies for accurate object detection and semantic segmentation Tech report (v5). arXiv:1311.2524v5 [cs.CV] (2014) Girshick, R., Donahue, J., Darrell, J., Malik, J., Berkeley, U.C.: Rich feature hierarchies for accurate object detection and semantic segmentation Tech report (v5). arXiv:​1311.​2524v5 [cs.CV] (2014)
15.
Zurück zum Zitat Arazo, E., Ortego, D., Albert, P., O’Connor, N.E., McGuinness, K.: Pseudo-labeling and confirmation bias in deep semi-supervised learning. arXiv:1908.02983v5 [cs.CV] (2020) Arazo, E., Ortego, D., Albert, P., O’Connor, N.E., McGuinness, K.: Pseudo-labeling and confirmation bias in deep semi-supervised learning. arXiv:​1908.​02983v5 [cs.CV] (2020)
16.
Zurück zum Zitat Xie, S., Sun, C., Huang, J., Tu, Z., Murphy, K.: Rethinking Spatiotemporal feature learning: speed-accuracy trade-offs in video classification. arXiv:1712.04851v2 [cs.CV] (2018) Xie, S., Sun, C., Huang, J., Tu, Z., Murphy, K.: Rethinking Spatiotemporal feature learning: speed-accuracy trade-offs in video classification. arXiv:​1712.​04851v2 [cs.CV] (2018)
18.
Zurück zum Zitat Phang, J., F´evry, T., Bowman, S.R.: Sentence encoders on STILTs: supplementary training on intermediate labeled-data tasks. arXiv:1811.01088v2 [cs.CL] (2019) Phang, J., F´evry, T., Bowman, S.R.: Sentence encoders on STILTs: supplementary training on intermediate labeled-data tasks. arXiv:​1811.​01088v2 [cs.CL] (2019)
19.
Zurück zum Zitat Cho, K., et al.: Learning phrase representations using RNN encoder–decoder for statistical machine translation. arXiv:1406.1078v3 [cs.CL] (2014) Cho, K., et al.: Learning phrase representations using RNN encoder–decoder for statistical machine translation. arXiv:​1406.​1078v3 [cs.CL] (2014)
21.
Zurück zum Zitat Gamil, M.E., Fouad, M.M., Abd El Ghany, M.A., Hoflman, K.: Fully automated CADx for early breast cancer detection using image processing and machine learning. In:30th International Conference on Microelectronics (ICM). IEEE (2018). 978-l-5386-8167-l/18/$31.00 ©2018 Gamil, M.E., Fouad, M.M., Abd El Ghany, M.A., Hoflman, K.: Fully automated CADx for early breast cancer detection using image processing and machine learning. In:30th International Conference on Microelectronics (ICM). IEEE (2018). 978-l-5386-8167-l/18/$31.00 ©2018
22.
Zurück zum Zitat Maicas, G., Carneiro, G., Bradley, A.P.: Globally Optimal Breast Mass Segmentation from DCE-MRI Using Deep Semantic Segmentation as Shape Prior. IEEE (2017). 978-1-5090-1172-8/17/$31.00 ©2017 Maicas, G., Carneiro, G., Bradley, A.P.: Globally Optimal Breast Mass Segmentation from DCE-MRI Using Deep Semantic Segmentation as Shape Prior. IEEE (2017). 978-1-5090-1172-8/17/$31.00 ©2017
23.
Zurück zum Zitat Saturi, R., Prem Chand, P.: Implementation of efficient segmentation method for histopathological images. In: Proceedings of the Fifth International Conference on Inventive Computation Technologies (ICICT-2020). IEEE (2020). 978-1-7281-4685-0/20/$31.00 ©2020 Saturi, R., Prem Chand, P.: Implementation of efficient segmentation method for histopathological images. In: Proceedings of the Fifth International Conference on Inventive Computation Technologies (ICICT-2020). IEEE (2020). 978-1-7281-4685-0/20/$31.00 ©2020
24.
Zurück zum Zitat Yin, X., Neamtiu, I., Patil, S., Andrews, S.T.: Implementation-induced inconsistency and nondeterminism in deterministic clustering algorithms. In: 2020 IEEE 13th International Conference on Software Testing, Validation and Verification (ICST). IEEE (2020). 978-1-7281-5778-8/20/$31.00 ©2020 Yin, X., Neamtiu, I., Patil, S., Andrews, S.T.: Implementation-induced inconsistency and nondeterminism in deterministic clustering algorithms. In: 2020 IEEE 13th International Conference on Software Testing, Validation and Verification (ICST). IEEE (2020). 978-1-7281-5778-8/20/$31.00 ©2020
27.
31.
Zurück zum Zitat Li, Q., et al.: Direct extraction of tumor response based on ensemble empirical mode decomposition for image reconstruction of early breast cancer detection by UWB. IEEE Trans. Biomed. Circuits Syst. 9(5), 710–724 (2015)CrossRef Li, Q., et al.: Direct extraction of tumor response based on ensemble empirical mode decomposition for image reconstruction of early breast cancer detection by UWB. IEEE Trans. Biomed. Circuits Syst. 9(5), 710–724 (2015)CrossRef
34.
Zurück zum Zitat Wu, N., et al.: Deep neural networks improve radiologists’ performance in breast cancer screening. IEEE Trans. Med. Imaging 39(4), 1184–1194 (2020)CrossRef Wu, N., et al.: Deep neural networks improve radiologists’ performance in breast cancer screening. IEEE Trans. Med. Imaging 39(4), 1184–1194 (2020)CrossRef
Metadaten
Titel
Implementation of a Method Using Image Sequentialization, Patch Embedding and ViT Encoder to Detect the Breast Cancer on RGBA Images and Binary Masks
verfasst von
Tanishka Dixit
Namrata Singh
Geetika Srivastava
Meenakshi Srivastava
Copyright-Jahr
2022
DOI
https://doi.org/10.1007/978-3-030-95711-7_47

Premium Partner