Skip to main content
Erschienen in: Automatic Control and Computer Sciences 6/2020

01.11.2020

Multi-Attention Mechanism Medical Image Segmentation Combined with Word Embedding Technology

verfasst von: Junlong Cheng, Shengwei Tian, Long Yu, Hongfeng You

Erschienen in: Automatic Control and Computer Sciences | Ausgabe 6/2020

Einloggen, um Zugang zu erhalten

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In order to solve the problems of low gray scale contrast and blurred organ boundaries in some medical images, we proposed a joint algorithm of Multi-Attention Parallel CNNs and Independent Recurrent Neural Networks (MACIR) with word embedding technique combined. First, the word embedding technique is used to map the sparse spatial relation matrix into a real dense vector, which is combined with gray scale and edge matrix as input features. The multi -attention mechanism is used to add weight information to capture the importance of each feature more sensitive. Then, the Parallel Convolutional Neural Networks are used to fully exploit the deep semantic information, and IndRNN is introduced to avoid the loss of pixel hierarchy information and realize the integration of information flow. Finally, the Softmax classifier is used to complete the medical image segmentation task. Experiments showed that word embedding MACIR algorithm could effectively improve the segmentation performance of medical images on the data sets of lung X-ray and cervical CT images.
Literatur
1.
Zurück zum Zitat Hwang, S. and Park, S., Accurate lung segmentation via network-wise training of convolutional networks, in Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, Cham: Springer, 2017. Hwang, S. and Park, S., Accurate lung segmentation via network-wise training of convolutional networks, in Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, Cham: Springer, 2017.
2.
Zurück zum Zitat Tian Juanxiu, Liu Guocai, Gu Shanshan, et al., Research and challenge of medical image analysis deep learning method, J. Autom., 2018, vol. 44, no. 3, pp. 401–424. Tian Juanxiu, Liu Guocai, Gu Shanshan, et al., Research and challenge of medical image analysis deep learning method, J. Autom., 2018, vol. 44, no. 3, pp. 401–424.
3.
Zurück zum Zitat LeCun, Y., Bengio, Y., and Hinton, G., Deep learning, Nature, 2015, vol. 521, no. 7553, p. 436.CrossRef LeCun, Y., Bengio, Y., and Hinton, G., Deep learning, Nature, 2015, vol. 521, no. 7553, p. 436.CrossRef
4.
Zurück zum Zitat Ma Chao, Liu Yashu, Luo Gongning, et al., 3D MR image segmentation based on cascaded random forest and active contours, J. Autom., 2019, vol. 45, no. 5, pp. 1004–1014. Ma Chao, Liu Yashu, Luo Gongning, et al., 3D MR image segmentation based on cascaded random forest and active contours, J. Autom., 2019, vol. 45, no. 5, pp. 1004–1014.
5.
Zurück zum Zitat Li Xiangxia, Li Bin, Tian Lianfang, et al., Segmentation of ground glass-type pulmonary nodules based on sparse representation and random walk, J. Autom., 2018, vol. 44, no. 9, pp. 1637–1647. Li Xiangxia, Li Bin, Tian Lianfang, et al., Segmentation of ground glass-type pulmonary nodules based on sparse representation and random walk, J. Autom., 2018, vol. 44, no. 9, pp. 1637–1647.
6.
Zurück zum Zitat Onoma, D.P., Ruan, S., Thureau, S., et al., Segmentation of heterogeneous or small FDG PET positive tissue based on a 3D-locally adaptive random walk algorithm, Comput. Med. Imaging Graphics, 2014, vol. 38, no. 8, pp. 753–763.CrossRef Onoma, D.P., Ruan, S., Thureau, S., et al., Segmentation of heterogeneous or small FDG PET positive tissue based on a 3D-locally adaptive random walk algorithm, Comput. Med. Imaging Graphics, 2014, vol. 38, no. 8, pp. 753–763.CrossRef
7.
Zurück zum Zitat Garnavi, R., Aldeen, M., Celebi, M.E., et al., Border detection in dermoscopy images using hybrid thresholding on optimized color channels, Comput. Med. Imaging Graphics, 2011, vol. 35, no. 2, pp. 105–115.CrossRef Garnavi, R., Aldeen, M., Celebi, M.E., et al., Border detection in dermoscopy images using hybrid thresholding on optimized color channels, Comput. Med. Imaging Graphics, 2011, vol. 35, no. 2, pp. 105–115.CrossRef
8.
Zurück zum Zitat Ge, Z., Demyanov, S., Bozorgtabar, B., et al., Exploiting local and generic features for accurate skin lesions classification using clinical and dermoscopy imaging, 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017), 2017, pp. 986–990. Ge, Z., Demyanov, S., Bozorgtabar, B., et al., Exploiting local and generic features for accurate skin lesions classification using clinical and dermoscopy imaging, 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017), 2017, pp. 986–990.
9.
Zurück zum Zitat Abuzaghleh, O., Barkana, B.D., and Faezipour, M., Automated skin lesion analysis based on color and shape geometry feature set for melanoma early detection and prevention, IEEE Long Island Systems, Applications and Technology (LISAT) Conference 2014, 2014, pp. 1–6. Abuzaghleh, O., Barkana, B.D., and Faezipour, M., Automated skin lesion analysis based on color and shape geometry feature set for melanoma early detection and prevention, IEEE Long Island Systems, Applications and Technology (LISAT) Conference 2014, 2014, pp. 1–6.
10.
Zurück zum Zitat Akbulut, Y., Guo, Y., Sengür, A., et al., An effective color texture image segmentation algorithm based on hermite transform, Appl. Soft Comput., 2018, vol. 67, pp. 494–504.CrossRef Akbulut, Y., Guo, Y., Sengür, A., et al., An effective color texture image segmentation algorithm based on hermite transform, Appl. Soft Comput., 2018, vol. 67, pp. 494–504.CrossRef
11.
Zurück zum Zitat Tahir, B., Iqbal, S., Usman, Ghani., Khan, M., et al., Feature enhancement framework for brain tumor segmentation and classification, Microsc. Res. Tech., 2019, vol. 82, no. 6, pp. 803–811.CrossRef Tahir, B., Iqbal, S., Usman, Ghani., Khan, M., et al., Feature enhancement framework for brain tumor segmentation and classification, Microsc. Res. Tech., 2019, vol. 82, no. 6, pp. 803–811.CrossRef
12.
Zurück zum Zitat Barui, S., Latha, S., Samiappan, D., et al., SVM pixel classification on colour image segmentation, J. Phys.: Conf. Ser., 2018, vol. 1000. Barui, S., Latha, S., Samiappan, D., et al., SVM pixel classification on colour image segmentation, J. Phys.: Conf. Ser., 2018, vol. 1000.
13.
Zurück zum Zitat Chan, Y.H., Zeng, Y.Z., Wu, H.C., et al., Effective pneumothorax detection for chest X-ray images using local binary pattern and support vector machine, J. Healthcare Eng., 2018, vol. 2018. Chan, Y.H., Zeng, Y.Z., Wu, H.C., et al., Effective pneumothorax detection for chest X-ray images using local binary pattern and support vector machine, J. Healthcare Eng., 2018, vol. 2018.
14.
Zurück zum Zitat Long, J., Shelhamer, E., and Darrell, T., Fully convolutional networks for semantic segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015, pp. 3431–3440. Long, J., Shelhamer, E., and Darrell, T., Fully convolutional networks for semantic segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015, pp. 3431–3440.
15.
Zurück zum Zitat Ronneberger, O., Fischer, P., and Brox, T., U-net: Convolutional networks for biomedical image segmentation, International Conference on Medical Image Computing and Computer-Assisted Intervention, Cham: Springer, 2015, pp. 234–241. Ronneberger, O., Fischer, P., and Brox, T., U-net: Convolutional networks for biomedical image segmentation, International Conference on Medical Image Computing and Computer-Assisted Intervention, Cham: Springer, 2015, pp. 234–241.
16.
Zurück zum Zitat Li, Z., Gan, Y., Liang, X., et al., LSTM-CF: Unifying context modeling and fusion with LSTMS for RGB-d scene labeling, European Conference on Computer Vision, Cham: Springer, 2016, pp. 541–557. Li, Z., Gan, Y., Liang, X., et al., LSTM-CF: Unifying context modeling and fusion with LSTMS for RGB-d scene labeling, European Conference on Computer Vision, Cham: Springer, 2016, pp. 541–557.
17.
Zurück zum Zitat Wang, X., Girshick, R., Gupta, A., et al., Non-local neural networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 7794–7803. Wang, X., Girshick, R., Gupta, A., et al., Non-local neural networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 7794–7803.
18.
Zurück zum Zitat Wang, S., Zhou, M., Liu, Z., et al., Central focused convolutional neural networks: Developing a data-driven model for lung nodule segmentation, Med. Image Anal., 2017, vol. 40, pp. 172–183.CrossRef Wang, S., Zhou, M., Liu, Z., et al., Central focused convolutional neural networks: Developing a data-driven model for lung nodule segmentation, Med. Image Anal., 2017, vol. 40, pp. 172–183.CrossRef
19.
Zurück zum Zitat Goldberg, Y. and Levy, O., Word2vec explained: Deriving Mikolov et al.'s negative-sampling word-embedding method, 2014. arXiv:1402.3722. Goldberg, Y. and Levy, O., Word2vec explained: Deriving Mikolov et al.'s negative-sampling word-embedding method, 2014. arXiv:1402.3722.
20.
Zurück zum Zitat Henry, S., Cuffy, C., and McInnes, B.T., Vector representations of multi-word terms for semantic relatedness, J. Biomed. Inf., 2018, p. 77. Henry, S., Cuffy, C., and McInnes, B.T., Vector representations of multi-word terms for semantic relatedness, J. Biomed. Inf., 2018, p. 77.
21.
Zurück zum Zitat Bamler, R. and Mandt, S., Dynamic word embeddings, Proceedings of the 34th International Conference on Machine Learning, 2017, vol. 70, pp. 380–389. Bamler, R. and Mandt, S., Dynamic word embeddings, Proceedings of the 34th International Conference on Machine Learning, 2017, vol. 70, pp. 380–389.
22.
Zurück zum Zitat Mikolov, T., Karafiát, M., Burget, L., et al., Recurrent neural network based language model, Eleventh Annual Conference of the International Speech Communication Association, 2010. Mikolov, T., Karafiát, M., Burget, L., et al., Recurrent neural network based language model, Eleventh Annual Conference of the International Speech Communication Association, 2010.
23.
Zurück zum Zitat Li, S., Li, W., Cook, C., et al., Independently recurrent neural network (INDRNN): Building a longer and deeper RNN, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 5457–5466. Li, S., Li, W., Cook, C., et al., Independently recurrent neural network (INDRNN): Building a longer and deeper RNN, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 5457–5466.
24.
Zurück zum Zitat Van Ginneken, B., Stegmann, M.B., and Loog, M., Segmentation of anatomical structures in chest radiographs using supervised methods: A comparative study on a public database, Med. Image Anal., 2006, vol. 10, no. 1, pp. 19–40.CrossRef Van Ginneken, B., Stegmann, M.B., and Loog, M., Segmentation of anatomical structures in chest radiographs using supervised methods: A comparative study on a public database, Med. Image Anal., 2006, vol. 10, no. 1, pp. 19–40.CrossRef
25.
Zurück zum Zitat Clark, K., Vendt, B., Smith, K., et al., The Cancer Imaging Archive (TCIA): Maintaining and operating a public information repository, J. Digital Imaging, 2013, vol. 26, no. 6, pp. 1045–1057.CrossRef Clark, K., Vendt, B., Smith, K., et al., The Cancer Imaging Archive (TCIA): Maintaining and operating a public information repository, J. Digital Imaging, 2013, vol. 26, no. 6, pp. 1045–1057.CrossRef
Metadaten
Titel
Multi-Attention Mechanism Medical Image Segmentation Combined with Word Embedding Technology
verfasst von
Junlong Cheng
Shengwei Tian
Long Yu
Hongfeng You
Publikationsdatum
01.11.2020
Verlag
Pleiades Publishing
Erschienen in
Automatic Control and Computer Sciences / Ausgabe 6/2020
Print ISSN: 0146-4116
Elektronische ISSN: 1558-108X
DOI
https://doi.org/10.3103/S0146411620060024

Weitere Artikel der Ausgabe 6/2020

Automatic Control and Computer Sciences 6/2020 Zur Ausgabe

Neuer Inhalt