skip to main content
survey

Understanding Deep Learning Techniques for Image Segmentation

Published:30 August 2019Publication History
Skip Abstract Section

Abstract

The machine learning community has been overwhelmed by a plethora of deep learning--based approaches. Many challenging computer vision tasks, such as detection, localization, recognition, and segmentation of objects in an unconstrained environment, are being efficiently addressed by various types of deep neural networks, such as convolutional neural networks, recurrent networks, adversarial networks, and autoencoders. Although there have been plenty of analytical studies regarding the object detection or recognition domain, many new deep learning techniques have surfaced with respect to image segmentation techniques. This article approaches these various deep learning techniques of image segmentation from an analytical perspective. The main goal of this work is to provide an intuitive understanding of the major techniques that have made a significant contribution to the image segmentation domain. Starting from some of the traditional image segmentation approaches, the article progresses by describing the effect that deep learning has had on the image segmentation domain. Thereafter, most of the major segmentation algorithms have been logically categorized with paragraphs dedicated to their unique contribution. With an ample amount of intuitive explanations, the reader is expected to have an improved ability to visualize the internal dynamics of these processes.

Skip Supplemental Material Section

Supplemental Material

References

  1. Radhakrishna Achanta, Appu Shaji, Kevin Smith, Aurelien Lucchi, Pascal Fua, Sabine Süsstrunk, et al. 2012. SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Transactions on Pattern Analysis and Machine Intelligence 34, 11 (2012), 2274--2282. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Aseem Agarwala, Aaron Hertzmann, David H. Salesin, and Steven M. Seitz. 2004. Keyframe-based tracking for rotoscoping and animation. 23, 584--591. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Jamil Ahmad, Irfan Mehmood, and Sung Wook Baik. 2017. Efficient object-based surveillance image search using spatial pooling of convolutional features. Journal of Visual Communication and Image Representation 45 (2017), 62--76. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Fahim Irfan Alam, Jun Zhou, Alan Wee-Chung Liew, and Xiuping Jia. 2016. CRF learning with CNN features for hyperspectral image segmentation. In Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS’16). IEEE, Los Alamitos, CA, 6890--6893.Google ScholarGoogle ScholarCross RefCross Ref
  5. Alberto Albiol, Luis Torres, and Edward J. Delp. 2001. An unsupervised color image segmentation algorithm for face detection applications. In Proceedings of the 2001 International Conference on Image Processing, Vol. 2. IEEE, Los Alamitos, CA, 681--684.Google ScholarGoogle Scholar
  6. Teresa Araújo, Guilherme Aresta, Eduardo Castro, José Rouco, Paulo Aguiar, Catarina Eloy, António Polónia, and Aurélio Campilho. 2017. Classification of breast cancer histology images using convolutional neural networks. PloS One 12, 6 (2017), e0177544.Google ScholarGoogle ScholarCross RefCross Ref
  7. Aamer Ather. 2009. A Quality Analysis of OpenStreetMap Data. Master’s Thesis. University College London, London, UK.Google ScholarGoogle Scholar
  8. Vijay Badrinarayanan, Alex Kendall, and Roberto Cipolla. 2017. Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 12 (2017), 2481--2495.Google ScholarGoogle ScholarCross RefCross Ref
  9. John Barlow, Steven Franklin, and Yvonne Martin. 2006. High spatial resolution satellite imagery, DEM derivatives, and image segmentation for the detection of mass wasting processes. Photogrammetric Engineering and Remote Sensing 72, 6 (2006), 687--692.Google ScholarGoogle ScholarCross RefCross Ref
  10. Serge Belongie, Chad Carson, Hayit Greenspan, and Jitendra Malik. 1998. Color-and texture-based image segmentation using EM and its application to content-based image retrieval. In Proceedings of the 6th International Conference on Computer Vision. IEEE, Los Alamitos, CA, 675--682. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Yoshua Bengio, Pascal Lamblin, Dan Popovici, and Hugo Larochelle. 2007. Greedy layer-wise training of deep networks. In Advances in Neural Information Processing Systems. 153--160. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. L. Sant’Anna Bins, L. M. Garcia Fonseca, G. J. Erthal, and F. Mitsuo Ii. 1996. Satellite imagery segmentation: A region growing approach. Simpósio Brasileiro de Sensoriamento Remoto 8, 1996 (1996), 677--680.Google ScholarGoogle Scholar
  13. Ali Borji. 2015. What is a salient object? A dataset and a baseline model for salient object detection. IEEE Transactions on Image Processing 24, 2 (2015), 742--756.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Ali Borji, Ming-Ming Cheng, Qibin Hou, Huaizu Jiang, and Jia Li. 2014. Salient object detection: A survey. arXiv:1411.5878.Google ScholarGoogle Scholar
  15. Ali Borji, Ming-Ming Cheng, Huaizu Jiang, and Jia Li. 2015. Salient object detection: A benchmark. IEEE Transactions on Image Processing 24, 12 (2015), 5706--5722.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Gabriel J. Brostow, Julien Fauqueur, and Roberto Cipolla. 2009. Semantic object classes in video: A high-definition ground truth database. Pattern Recognition Letters 30, 2 (2009), 88--97. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Nathan D. Cahill and Lawrence A. Ray. 2007. Method and system for compositing images to produce a cropped image. US Patent 7,162,102.Google ScholarGoogle Scholar
  18. Aaron Carass, Snehashis Roy, Amod Jog, Jennifer L. Cuzzocreo, Elizabeth Magrath, Adrian Gherman, et al. 2017. Longitudinal multiple sclerosis lesion segmentation data resource. Data in Brief 12 (2017), 346--350.Google ScholarGoogle ScholarCross RefCross Ref
  19. Lluis Castrejon, Kaustav Kundu, Raquel Urtasun, and Sanja Fidler. 2017. Annotating object instances with a polygon-rnn. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5230--5238.Google ScholarGoogle ScholarCross RefCross Ref
  20. Ping-Lin Chang and Wei-Guang Teng. 2007. Exploiting the self-organizing map for medical image segmentation. In Proceedings of the 20th IEEE International Symposium on Computer-Based Medical Systems (CBMS’07). IEEE, Los Alamitos, CA, 281--288. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Jianxu Chen, Lin Yang, Yizhe Zhang, Mark Alber, and Danny Z. Chen. 2016. Combining fully convolutional and recurrent neural networks for 3D biomedical image segmentation. In Advances in Neural Information Processing Systems. 3036--3044. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille. 2014. Semantic image segmentation with deep convolutional nets and fully connected CRFs. arXiv:1412.7062.Google ScholarGoogle Scholar
  23. Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille. 2018. DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 4 (2018), 834--848.Google ScholarGoogle ScholarCross RefCross Ref
  24. Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Vibhav Vineet, Zhizhong Su, Dalong Du, Chang Huang, and Philip H. S. Torr. 2015. Conditional random fields as recurrent neural networks. In Proceedings of the IEEE International Conference on Computer Vision. 1529--1537. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Liang-Chieh Chen, George Papandreou, Florian Schroff, and Hartwig Adam. 2017. Rethinking atrous convolution for semantic image segmentation. arXiv:1706.05587.Google ScholarGoogle Scholar
  26. Liang-Chieh Chen, Yi Yang, Jiang Wang, Wei Xu, and Alan L. Yuille. 2016. Attention to scale: Scale-aware semantic image segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3640--3649.Google ScholarGoogle Scholar
  27. Liang-Chieh Chen, Yukun Zhu, George Papandreou, Florian Schroff, and Hartwig Adam. 2018. Encoder-decoder with atrous separable convolution for semantic image segmentation. arXiv:1802.02611.Google ScholarGoogle Scholar
  28. Kuo-Sheng Cheng, Jzau-Sheng Lin, and Chi-Wu Mao. 1996. The application of competitive Hopfield neural network to medical image segmentation. IEEE Transactions on Medical Imaging 15, 4 (1996), 560--567.Google ScholarGoogle ScholarCross RefCross Ref
  29. Ming-Ming Cheng, Niloy J. Mitra, Xiaolei Huang, and Shi-Min Hu. 2014. SalientShape: Group saliency in image collections. Visual Computer 30, 4 (2014), 443--453. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Ming-Ming Cheng, Niloy J. Mitra, Xiaolei Huang, Philip H. S. Torr, and Shi-Min Hu. 2015. Global contrast based salient region detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 37, 3 (2015), 569--582. https://mmcheng.net/msra10k/.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Keh-Shih Chuang, Hong-Long Tzeng, Sharon Chen, Jay Wu, and Tzong-Jer Chen. 2006. Fuzzy c-means clustering with spatial information for image segmentation. Computerized Medical Imaging and Graphics 30, 1 (2006), 9--15.Google ScholarGoogle ScholarCross RefCross Ref
  32. Dorin Comaniciu and Peter Meer. 1997. Robust analysis of feature spaces: Color image segmentation. In Proceedings of the 1997 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, Los Alamitos, CA, 750--755. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, and Bernt Schiele. 2016. The Cityscapes dataset for semantic urban scene understanding. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3213--3223.Google ScholarGoogle ScholarCross RefCross Ref
  34. Jifeng Dai, Kaiming He, and Jian Sun. 2015. BoxSup: Exploiting bounding boxes to supervise convolutional networks for semantic segmentation. In Proceedings of the IEEE International Conference on Computer Vision. 1635--1643. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Jifeng Dai, Kaiming He, and Jian Sun. 2016. Instance-aware semantic segmentation via multi-task network cascades. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3150--3158.Google ScholarGoogle ScholarCross RefCross Ref
  36. Jifeng Dai, Yi Li, Kaiming He, and Jian Sun. 2016. R-FCN: Object detection via region-based fully convolutional networks. In Advances in Neural Information Processing Systems. 379--387. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Aritra Das, Swarnendu Ghosh, Ritesh Sarkhel, Sandipan Choudhuri, Nibaran Das, and Mita Nasipuri. 2019. Combining multilevel contexts of superpixel using convolutional neural networks to perform natural scene labeling. In Recent Developments in Machine Learning and Data Analytics. Springer, 297--306.Google ScholarGoogle Scholar
  38. M. Portes De Albuquerque, I. A. Esquef, and A. R. Gesualdi Mello. 2004. Image thresholding using Tsallis entropy. Pattern Recognition Letters 25, 9 (2004), 1059--1065. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Marleen de Bruijne, Bram van Ginneken, Max A. Viergever, and Wiro J. Niessen. 2004. Interactive segmentation of abdominal aortic aneurysms in CTA images. Medical Image Analysis 8, 2 (2004), 127--138.Google ScholarGoogle ScholarCross RefCross Ref
  40. Ilke Demir, Krzysztof Koperski, David Lindenbaum, Guan Pang, Jing Huang, Saikat Basu, Forest Hughes, Devis Tuia, and Ramesh Raskar. 2018. DeepGlobe 2018: A challenge to parse the earth through satellite images. arXiv:1805.06561.Google ScholarGoogle Scholar
  41. Yingzi Du, Emrah Arslanturk, Zhi Zhou, and Craig Belcher. 2011. Video-based noncooperative iris image segmentation. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 41, 1 (2011), 64--74. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Lixin Duan, Ivor W. Tsang, Dong Xu, and Tat-Seng Chua. 2009. Domain adaptation from multiple sources via auxiliary classifiers. In Proceedings of the 26th Annual International Conference on Machine Learning. ACM, New York, NY, 289--296. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Vincent Dumoulin and Francesco Visin. 2016. A guide to convolution arithmetic for deep learning. arXiv:1603.07285.Google ScholarGoogle Scholar
  44. Mark Everingham, Luc Van Gool, Christopher K. I. Williams, John Winn, and Andrew Zisserman. 2010. The Pascal Visual Object Classes (VOC) Challenge. International Journal of Computer Vision 88, 2 (2010), 303--338. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Clement Farabet, Camille Couprie, Laurent Najman, and Yann LeCun. 2013. Learning hierarchical features for scene labeling. IEEE Transactions on Pattern Analysis and Machine Intelligence 35, 8 (2013), 1915--1929. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. International Society for Photogrammetry and Remote Sensing. {n.d.}. ISPRS 2D Semantic Labeling Contest. Retrieved August 1, 2019 from http://www2.isprs.org/commissions/comm3/wg4/semantic-labeling.htmlGoogle ScholarGoogle Scholar
  47. Muhammad Moazam Fraz, Paolo Remagnino, Andreas Hoppe, Bunyarit Uyyanonvara, Alicja R. Rudnicka, Christopher G. Owen, and Sarah A. Barman. 2012. Blood vessel segmentation methodologies in retinal images—A survey. Computer Methods and Programs in Biomedicine 108, 1 (2012), 407--433. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Jordi Freixenet, Xavier Muñoz, David Raba, Joan Martí, and Xavier Cufí. 2002. Yet another survey on image segmentation: Region and boundary information integration. In Proceedings of the European Conference on Computer Vision. 408--422. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Nir Friedman and Stuart Russell. 1997. Image segmentation in video sequences: A probabilistic approach. In Proceedings of the 13th Conference on Uncertainty in Artificial Intelligence. 175--181. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. K.-S. Fu and J. K. Mui. 1981. A survey on image segmentation. Pattern Recognition 13, 1 (1981), 3--16.Google ScholarGoogle ScholarCross RefCross Ref
  51. Fabio Galasso, Naveen Shankar Nagaraja, Tatiana Jimenez Cardenas, Thomas Brox, and Bernt Schiele. 2013. A unified video segmentation benchmark: Annotation, metrics and analysis. In Proceedings of the IEEE International Conference on Computer Vision. 3527--3534. Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Abhishek Gangwar and Akanksha Joshi. 2016. DeepIrisNet: Deep iris representation with applications in iris recognition and cross-sensor iris recognition. In Proceedings of the 2016 IEEE International Conference on Image Processing (ICIP’16). IEEE, Los Alamitos, CA, 2301--2305.Google ScholarGoogle ScholarCross RefCross Ref
  53. Alberto Garcia-Garcia, Sergio Orts-Escolano, Sergiu Oprea, Victor Villena-Martinez, and Jose Garcia-Rodriguez. 2017. A review on deep learning techniques applied to semantic segmentation. arXiv:1704.06857.Google ScholarGoogle Scholar
  54. Andreas Geiger, Philip Lenz, and Raquel Urtasun. 2012. Are we ready for autonomous driving? The Kitti vision benchmark suite. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’12). IEEE, Los Alamitos, CA, 3354--3361. Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. Qichuan Geng, Zhong Zhou, and Xiaochun Cao. 2018. Survey of recent progress in semantic image segmentation with CNNs. Science China Information Sciences 61, 5 (2018), 051101.Google ScholarGoogle ScholarCross RefCross Ref
  56. Ross Girshick. 2015. Fast R-CNN. arXiv:1504.08083. Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Ross Girshick, Jeff Donahue, Trevor Darrell, and Jitendra Malik. 2014. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 580--587. Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. Stephen Gould, Richard Fulton, and Daphne Koller. 2009. Decomposing a scene into geometric and semantically consistent regions. In Proceedings of the 2009 IEEE 12th International Conference on Computer Vision. IEEE, Los Alamitos, CA, 1--8.Google ScholarGoogle ScholarCross RefCross Ref
  59. Xiao Han. 2017. Automatic liver lesion segmentation using a deep convolutional neural network method. arXiv:1704.07239. https://competitions.codalab.org/competitions/17094.Google ScholarGoogle Scholar
  60. Bharath Hariharan, Pablo Arbelaez, Lubomir Bourdev, Subhransu Maji, and Jitendra Malik. 2011. Semantic contours from inverse detectors. In Proceedings of the International Conference on Computer Vision (ICCV’11). Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask R-CNN. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV’17). IEEE, Los Alamitos, CA, 2980--2988.Google ScholarGoogle Scholar
  62. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 37, 9 (2015), 1904--1916.Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 770--778.Google ScholarGoogle ScholarCross RefCross Ref
  64. Seunghoon Hong, Tackgeun You, Suha Kwak, and Bohyung Han. 2015. Online tracking by learning discriminative saliency map with convolutional neural network. In Proceedings of the International Conference on Machine Learning. 597--606. Google ScholarGoogle ScholarDigital LibraryDigital Library
  65. Yang Hu, Andrea Soltoggio, Russell Lock, and Steve Carter. 2019. A fully convolutional two-stream fusion network for interactive image segmentation. Neural Networks 109 (2019), 31--42.Google ScholarGoogle ScholarCross RefCross Ref
  66. Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv:1502.03167.Google ScholarGoogle Scholar
  67. Humayun Irshad, Antoine Veillard, Ludovic Roux, and Daniel Racoceanu. 2014. Methods for nuclei detection, segmentation, and classification in digital histopathology: A review—Current status and future potential. IEEE Reviews in Biomedical Engineering 7 (2014), 97--114.Google ScholarGoogle ScholarCross RefCross Ref
  68. Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, and Alexei A. Efros. 2017. Image-to-image translation with conditional adversarial networks. arXiv:1611.07004.Google ScholarGoogle Scholar
  69. Firas Ajil Jassim and Fawzi H. Altaani. 2013. Hybridization of Otsu method and median filter for color image segmentation. arXiv:1305.1052.Google ScholarGoogle Scholar
  70. Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, and Yoshua Bengio. 2017. The one hundred layers tiramisu: Fully convolutional densenets for semantic segmentation. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW’17). IEEE, Los Alamitos, CA, 1175--1183.Google ScholarGoogle ScholarCross RefCross Ref
  71. Cheng-Bin Jin, Shengzhe Li, Trung Dung Do, and Hakil Kim. 2015. Real-time human action recognition using CNN over temporal images for static video surveillance cameras. In Proceedings of the Pacific Rim Conference on Multimedia. 330--339. Google ScholarGoogle ScholarDigital LibraryDigital Library
  72. A. H. Kam, T. T. Ng, N. G. Kingsbury, and W. J. Fitzgerald. 2000. Content based image retrieval through object extraction and querying. In Proceedings of the Workshop on Content-Based Access of Image and Visual Libraries (CBAIVL’00). IEEE, Los Alamitos, CA, 91. Google ScholarGoogle ScholarDigital LibraryDigital Library
  73. Konstantinos Kamnitsas, Christian Ledig, Virginia F. J. Newcombe, Joanna P. Simpson, Andrew D. Kane, David K. Menon, Daniel Rueckert, and Ben Glocker. 2017. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation. Medical Image Analysis 36 (2017), 61--78.Google ScholarGoogle ScholarCross RefCross Ref
  74. Asako Kanezaki. 2018. Unsupervised image segmentation by backpropagation. In Proceedings of the 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP’18). IEEE, Los Alamitos, CA, 1543--1547.Google ScholarGoogle ScholarCross RefCross Ref
  75. Jiayin Kang, Xiao Li, Qingxian Luan, Jinzhu Liu, and Lequan Min. 2006. Dental plaque quantification using cellular neural network-based image segmentation. In Intelligent Computing in Signal Processing and Pattern Recognition. Springer, 797--802.Google ScholarGoogle Scholar
  76. Jiayin Kang and Wenjuan Zhang. 2009. Fingerprint segmentation using cellular neural network. In Proceedings of the International Conference on Computational Intelligence and Natural Computing (CINC’09), Vol. 2. IEEE, Los Alamitos, CA, 11--14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  77. Kai Kang and Xiaogang Wang. 2014. Fully convolutional neural networks for crowd segmentation. arXiv:1411.4464.Google ScholarGoogle Scholar
  78. Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv:1412.6980.Google ScholarGoogle Scholar
  79. Tao Kong, Anbang Yao, Yurong Chen, and Fuchun Sun. 2016. HyperNet: Towards accurate region proposal generation and joint object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 845--853.Google ScholarGoogle ScholarCross RefCross Ref
  80. Philipp Krähenbühl and Vladlen Koltun. 2011. Efficient inference in fully connected CRFs with Gaussian edge potentials. In Advances in Neural Information Processing Systems. 109--117. Google ScholarGoogle ScholarDigital LibraryDigital Library
  81. Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, Adela Barriuso, and Antonio Torralba. 2016. Semantic understanding of scenes through the ADE20K dataset. arXiv:1608.05442. Google ScholarGoogle ScholarDigital LibraryDigital Library
  82. Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems. 1097--1105. Google ScholarGoogle ScholarDigital LibraryDigital Library
  83. Alfonso B. Labao and Prospero C. Naval. 2017. Weakly-labelled semantic segmentation of fish objects in underwater videos using a deep residual network. In Proceedings of the Asian Conference on Intelligent Information and Database Systems. 255--265.Google ScholarGoogle Scholar
  84. W. Ladys Law Skarbek and Andreas Koschan. 1994. Colour image segmentation a survey. IEEE Transactions on Circuits and Systems for Video Technology 14, 7 (1994).Google ScholarGoogle Scholar
  85. Rodney LaLonde and Ulas Bagci. 2018. Capsules for object segmentation. arXiv:1804.04241.Google ScholarGoogle Scholar
  86. Martin Längkvist, Andrey Kiselev, Marjan Alirezaie, and Amy Loutfi. 2016. Classification and segmentation of satellite orthoimagery using convolutional neural networks. Remote Sensing 8, 4 (2016), 329.Google ScholarGoogle ScholarCross RefCross Ref
  87. Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. 1998. Gradient-based learning applied to document recognition. Proceedings of the IEEE 86, 11 (1998), 2278--2324.Google ScholarGoogle ScholarCross RefCross Ref
  88. Seong-Hun Lee, Min Su Cho, Kyomin Jung, and Jin Hyung Kim. 2010. Scene text extraction with edge constraint and text collinearity. In Proceedings of the 2010 International Conference on Pattern Recognition. IEEE, Los Alamitos, CA, 3983--3986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  89. Bastian Leibe, Edgar Seemann, and Bernt Schiele. 2005. Pedestrian detection in crowded scenes. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), Vol. 1. IEEE, Los Alamitos, CA, 878--885. Google ScholarGoogle ScholarDigital LibraryDigital Library
  90. Dan Levi, Noa Garnett, Ethan Fetaya, and Israel Herzlyia. 2015. StixelNet: A deep convolutional network for obstacle detection and road segmentation. In Proceedings of the British Machine Vision Association (BMVC’15). 109.Google ScholarGoogle ScholarCross RefCross Ref
  91. Anat Levin, Dani Lischinski, and Yair Weiss. 2008. A closed-form solution to natural image matting. IEEE Transactions on Pattern Analysis and Machine Intelligence 30, 2 (2008), 228--242. Google ScholarGoogle ScholarDigital LibraryDigital Library
  92. Xiaoxiao Li, Ziwei Liu, Ping Luo, Chen Change Loy, and Xiaoou Tang. 2017. Not all pixels are equal: Difficulty-aware semantic segmentation via deep layer cascade. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3193--3202.Google ScholarGoogle ScholarCross RefCross Ref
  93. Yin Li, Xiaodi Hou, Christof Koch, James M. Rehg, and Alan L. Yuille. 2014. The secrets of salient object segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 280--287. Google ScholarGoogle ScholarDigital LibraryDigital Library
  94. Yi Li, Haozhi Qi, Jifeng Dai, Xiangyang Ji, and Yichen Wei. 2016. Fully convolutional instance-aware semantic segmentation. arXiv:1611.07709.Google ScholarGoogle Scholar
  95. Wen-Nung Lie. 1995. Automatic target segmentation by locally adaptive image thresholding. IEEE Transactions on Image Processing 4, 7 (1995), 1036--1041. Google ScholarGoogle ScholarDigital LibraryDigital Library
  96. Guosheng Lin, Anton Milan, Chunhua Shen, and Ian Reid. 2017. RefineNet: Multi-path refinement networks for high-resolution semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).Google ScholarGoogle ScholarCross RefCross Ref
  97. Min Lin, Qiang Chen, and Shuicheng Yan. 2013. Network in network. arXiv:1312.4400.Google ScholarGoogle Scholar
  98. Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C. Lawrence Zitnick. 2014. Microsoft COCO: Common objects in context. In Proceedings of the European Conference on Computer Vision. 740--755.Google ScholarGoogle Scholar
  99. Geert Litjens, Thijs Kooi, Babak Ehteshami Bejnordi, Arnaud Arindra Adiyoso Setio, Francesco Ciompi, Mohsen Ghafoorian, Jeroen AWM van der Laak, Bram van Ginneken, and Clara I Sánchez. 2017. A survey on deep learning in medical image analysis. Medical Image Analysis 42 (2017), 60--88.Google ScholarGoogle ScholarCross RefCross Ref
  100. Nianfeng Liu, Haiqing Li, Man Zhang, Jing Liu, Zhenan Sun, and Tieniu Tan. 2016. Accurate iris segmentation in non-cooperative environments using fully convolutional networks. In Proceedings of the 2016 International Conference on Biometrics (ICB’16). IEEE, Los Alamitos, CA, 1--8.Google ScholarGoogle ScholarCross RefCross Ref
  101. Sifei Liu, Shalini De Mello, Jinwei Gu, Guangyu Zhong, Ming-Hsuan Yang, and Jan Kautz. 2017. Learning affinity via spatial propagation networks. In Advances in Neural Information Processing Systems. 1520--1530. Google ScholarGoogle ScholarDigital LibraryDigital Library
  102. Ying Liu, Dengsheng Zhang, Guojun Lu, and Wei-Ying Ma. 2007. A survey of content-based image retrieval with high-level semantics. Pattern Recognition 40, 1 (2007), 262--282. Google ScholarGoogle ScholarDigital LibraryDigital Library
  103. Ziwei Liu, Xiaoxiao Li, Ping Luo, Chen-Change Loy, and Xiaoou Tang. 2015. Semantic image segmentation via deep parsing network. In Proceedings of the IEEE International Conference on Computer Vision. 1377--1385. Google ScholarGoogle ScholarDigital LibraryDigital Library
  104. Christos P. Loizou, Víctor Murray, Marios S. Pattichis, Ioannis Seimenis, Marios Pantziaris, and Constantinos S. Pattichis. 2011. Multiscale amplitude-modulation frequency-modulation (AM--FM) texture analysis of multiple sclerosis in brain MRI images. IEEE Transactions on Information Technology in Biomedicine 15, 1 (2011), 119--129. Google ScholarGoogle ScholarDigital LibraryDigital Library
  105. Jonathan Long, Evan Shelhamer, and Trevor Darrell. 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3431--3440.Google ScholarGoogle ScholarCross RefCross Ref
  106. Karen López-Linares, Nerea Lete, Luis Kabongo, Mario Ceresa, Gregory Maclair, Ainhoa García-Familiar, Iván Macía, and Miguel Ángel González Ballester. 2018. Comparison of regularization techniques for DCNN-based abdominal aortic aneurysm segmentation. In Proceedings of the 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI’18). IEEE, Los Alamitos, CA, 864--867.Google ScholarGoogle ScholarCross RefCross Ref
  107. Ping Lu, Livia Barazzetti, Vimal Chandran, Kate Gavaghan, Stefan Weber, Nicolas Gerber, and Mauricio Reyes. 2018. Highly accurate facial nerve segmentation refinement from CBCT/CT imaging using a super-resolution classification approach. IEEE Transactions on Biomedical Engineering 65, 1 (2018), 178--188.Google ScholarGoogle ScholarCross RefCross Ref
  108. Pauline Luc, Camille Couprie, Soumith Chintala, and Jakob Verbeek. 2016. Semantic segmentation using adversarial networks. arXiv:1611.08408.Google ScholarGoogle Scholar
  109. Emmanuel Maggiori, Yuliya Tarabalka, Guillaume Charpiat, and Pierre Alliez. 2017. Can semantic labeling methods generalize to any city? The INRIA aerial image labeling benchmark. In Proceedings of the IEEE International Symposium on Geoscience and Remote Sensing (IGARSS’17).Google ScholarGoogle ScholarCross RefCross Ref
  110. Oskar Maier, Bjoern H. Menze, Janina von der Gablentz, Levin Häni, Mattias P. Heinrich, Matthias Liebrand, et al. 2017. ISLES 2015—A public evaluation benchmark for ischemic stroke lesion segmentation from multispectral MRI. Medical Image Analysis 35 (2017), 250--269.Google ScholarGoogle ScholarCross RefCross Ref
  111. Rupesh Mandal and Nupur Choudhury. 2016. Automatic video surveillance for theft detection in ATM machines: An enhanced approach. In Proceedings of the 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom’16). IEEE, Los Alamitos, CA, 2821--2826.Google ScholarGoogle Scholar
  112. Kevis-Kokitsi Maninis, Sergi Caelles, Jordi Pont-Tuset, and Luc Van Gool. 2018. Deep extreme cut: From extreme points to object segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 616--625.Google ScholarGoogle ScholarCross RefCross Ref
  113. Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Pablo Arbeláez, and Luc Van Gool. 2016. Convolutional oriented boundaries. In Proceedings of the European Conference on Computer Vision. 580--596.Google ScholarGoogle ScholarCross RefCross Ref
  114. D. Martin, C. Fowlkes, D. Tal, and J. Malik. 2001. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In Proceedings of the 8th International Conference on Computer Vision, Vol. 2. 416--423.Google ScholarGoogle Scholar
  115. Jonathan Masci, Ueli Meier, Dan Cireşan, and Jürgen Schmidhuber. 2011. Stacked convolutional auto-encoders for hierarchical feature extraction. In Proceedings of the International Conference on Artificial Neural Networks. 52--59. Google ScholarGoogle ScholarDigital LibraryDigital Library
  116. L. R. Medsker and L. C. Jain. 2001. Recurrent Neural Networks: Design and Applications. CRC Press, Boca Raton, FL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  117. B. M. Mehtre, N. N. Murthy, S. Kapoor, and B. Chatterjee. 1987. Segmentation of fingerprint images using the directional image. Pattern Recognition 20, 4 (1987), 429--435. Google ScholarGoogle ScholarDigital LibraryDigital Library
  118. Bjoern H. Menze, Andras Jakab, Stefan Bauer, Jayashree Kalpathy-Cramer, Keyvan Farahani, Justin Kirby, et al. 2015. The multimodal brain tumor image segmentation benchmark (BRATS). IEEE Transactions on Medical Imaging 34, 10 (2015), 1993--2024.Google ScholarGoogle ScholarCross RefCross Ref
  119. Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, Adela Barriuso, and Antonio Torralba. 2017. Scene parsing through ADE20K dataset. In Proceedings of the Conference on Computer Vision and Pattern Recognition.Google ScholarGoogle ScholarCross RefCross Ref
  120. Andrew Merlino, Daryl Morey, and Mark Maybury. 1997. Broadcast news navigation using story segmentation. In Proceedings of the 5th ACM International Conference on Multimedia. ACM, New York, NY, 381--391. Google ScholarGoogle ScholarDigital LibraryDigital Library
  121. Filippo Molinari, Guang Zeng, and Jasjit S. Suri. 2010. A state of the art review on intima-media thickness (IMT) measurement and wall segmentation techniques for carotid ultrasound. Computer Methods and Programs in Biomedicine 100, 3 (2010), 201--221. Google ScholarGoogle ScholarDigital LibraryDigital Library
  122. Takayasu Moriya, Holger R. Roth, Shota Nakamura, Hirohisa Oda, Kai Nagara, Masahiro Oda, and Kensaku Mori. 2018. Unsupervised segmentation of 3D medical images based on clustering and deep representation learning. In Medical Imaging 2018: Biomedical Applications in Molecular, Structural, and Functional Imaging, Vol. 10578. International Society for Optics and Photonics, Bellingham, WA, 1057820.Google ScholarGoogle Scholar
  123. T. Nathan Mundhenk, Daniel Ho, and Barry Y. Chen. 2018. Improvements to context based self-supervised learning. In Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’18).Google ScholarGoogle Scholar
  124. Vinod Nair and Geoffrey E. Hinton. 2010. Rectified linear units improve restricted Boltzmann machines. In Proceedings of the 27th International Conference on Machine Learning (ICML’10). 807--814. Google ScholarGoogle ScholarDigital LibraryDigital Library
  125. Ahmed Nassar, Karim Amer, Reda El Hakim, and Mohamed El Helw. 2018. A deep CNN-based framework for enhanced aerial imagery registration with applications to UAV geolocalization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 1513--1523.Google ScholarGoogle ScholarCross RefCross Ref
  126. Gerhard Neuhold, Tobias Ollmann, S. Rota Bulo, and Peter Kontschieder. 2017. The Mapillary Vistas dataset for semantic understanding of street scenes. In Proceedings of the International Conference on Computer Vision (ICCV’17). 22--29.Google ScholarGoogle ScholarCross RefCross Ref
  127. Hyeonwoo Noh, Seunghoon Hong, and Bohyung Han. 2015. Learning deconvolution network for semantic segmentation. In Proceedings of the IEEE International Conference on Computer Vision. 1520--1528. Google ScholarGoogle ScholarDigital LibraryDigital Library
  128. Mehdi Noroozi and Paolo Favaro. 2016. Unsupervised learning of visual representations by solving jigsaw puzzles. In Proceedings of the European Conference on Computer Vision. 69--84.Google ScholarGoogle ScholarCross RefCross Ref
  129. Christine M. Onyango and John A. Marchant. 2001. Physics-based colour image segmentation for scenes containing vegetation and soil. Image and Vision Computing 19, 8 (2001), 523--538.Google ScholarGoogle ScholarCross RefCross Ref
  130. Anisha Pal, Shourya Jaiswal, Swarnendu Ghosh, Nibaran Das, and Mita Nasipuri. {n.d.}. SegFast: A faster SqueezeNet based semantic image segmentation technique using depth-wise separable convolutions. In Proceedings of the 11th Indian Conference on Computer Vision, Graphics, and Image Processing (ICVGIP’18). ACM, New York, NY, 7.Google ScholarGoogle Scholar
  131. Nikhil R. Pal and Sankar K. Pal. 1993. A review on image segmentation techniques. Pattern Recognition 26, 9 (1993), 1277--1294.Google ScholarGoogle ScholarCross RefCross Ref
  132. George Papandreou, Liang-Chieh Chen, Kevin P. Murphy, and Alan L. Yuille. 2015. Weakly- and semi-supervised learning of a deep convolutional network for semantic image segmentation. In Proceedings of the IEEE International Conference on Computer Vision. 1742--1750. Google ScholarGoogle ScholarDigital LibraryDigital Library
  133. Adam Paszke, Abhishek Chaurasia, Sangpil Kim, and Eugenio Culurciello. 2016. ENet: A deep neural network architecture for real-time semantic segmentation. arXiv:1606.02147.Google ScholarGoogle Scholar
  134. Deepak Pathak, Philipp Krahenbuhl, Jeff Donahue, Trevor Darrell, and Alexei A. Efros. 2016. Context encoders: Feature learning by inpainting. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2536--2544.Google ScholarGoogle Scholar
  135. Chao Peng, Xiangyu Zhang, Gang Yu, Guiming Luo, and Jian Sun. 2017. Large kernel matters—Improve semantic segmentation by global convolutional network. arXiv:1703.02719.Google ScholarGoogle Scholar
  136. Pedro O. Pinheiro, Ronan Collobert, and Piotr Dollár. 2015. Learning to segment object candidates. In Advances in Neural Information Processing Systems. 1990--1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  137. Pedro O. Pinheiro, Tsung-Yi Lin, Ronan Collobert, and Piotr Dollár. 2016. Learning to refine object segments. In Proceedings of the European Conference on Computer Vision. 75--91.Google ScholarGoogle ScholarCross RefCross Ref
  138. Jordi Pont-Tuset, Federico Perazzi, Sergi Caelles, Pablo Arbeláez, Alex Sorkine-Hornung, and Luc Van Gool. 2017. The 2017 Davis Challenge on video object segmentation. arXiv:1704.00675.Google ScholarGoogle Scholar
  139. Prasanna Porwal, Samiksha Pachade, Ravi Kamble, Manesh Kokare, Girish Deshmukh, Vivek Sahasrabuddhe, et al. 2018. Diabetic retinopathy: Segmentation and grading challenge workshop. In Proceedings of the IEEE International Symposium on Biomedical Imaging(ISBI’18).https://idrid.grand-challenge.org/organizers/Google ScholarGoogle Scholar
  140. Huafeng Qin and Mounim A. El-Yacoubi. 2017. Deep representation-based feature extraction and recovering for finger-vein verification. IEEE Transactions on Information Forensics and Security 12, 8 (2017), 1816--1829. Google ScholarGoogle ScholarDigital LibraryDigital Library
  141. P. Radau, Y. Lu, K. Connelly, G. Paul, A. Dick, and G. Wright. 2009. Evaluation framework for algorithms segmenting short axis cardiac MRI. MIDAS Journal 49 (2009).Google ScholarGoogle Scholar
  142. Anurag Ranjan, Varun Jampani, Kihwan Kim, Deqing Sun, Jonas Wulff, and Michael J. Black. 2018. Adversarial collaboration: Joint unsupervised learning of depth, camera motion, optical flow and motion segmentation. arXiv:1805.09806.Google ScholarGoogle Scholar
  143. Mahdyar Ravanbakhsh, Moin Nabi, Hossein Mousavi, Enver Sangineto, and Nicu Sebe. 2016. Plug-and-play CNN for crowd motion analysis: An application in abnormal event detection. arXiv:1610.00307.Google ScholarGoogle Scholar
  144. Mengye Ren and Richard S. Zemel. 2017. End-to-end instance segmentation with recurrent attention. arXiv:1605.09410.Google ScholarGoogle Scholar
  145. Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. In Advances in Neural Information Processing Systems. 91--99. Google ScholarGoogle ScholarDigital LibraryDigital Library
  146. Bernardino Romera-Paredes and Philip Hilaire Sean Torr. 2016. Recurrent instance segmentation. In Proceedings of the European Conference on Computer Vision. 312--329.Google ScholarGoogle ScholarCross RefCross Ref
  147. Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-Net: Convolutional networks for biomedical image segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention. 234--241.Google ScholarGoogle ScholarCross RefCross Ref
  148. German Ros, Laura Sellart, Joanna Materzynska, David Vazquez, and Antonio M. Lopez. 2016. The Synthia dataset: A large collection of synthetic images for semantic segmentation of urban scenes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3234--3243.Google ScholarGoogle Scholar
  149. Brandon Rothrock, Ryan Kennedy, Chris Cunningham, Jeremie Papon, Matthew Heverly, and Masahiro Ono. 2016. Spoc: Deep learning-based terrain classification for Mars Rover missions. In Proceedings of the AIAA SPACE 2016 Conference. 5539.Google ScholarGoogle ScholarCross RefCross Ref
  150. David E. Rumelhart, Geoffrey E. Hinton, and Ronald J. Williams. 1986. Learning representations by back-propagating errors. Nature 323, 6088 (1986), 533.Google ScholarGoogle ScholarCross RefCross Ref
  151. Sara Sabour, Nicholas Frosst, and Geoffrey E. Hinton. 2017. Dynamic routing between capsules. In Advances in Neural Information Processing Systems. 3856--3866. Google ScholarGoogle ScholarDigital LibraryDigital Library
  152. N. Senthilkumaran and R. Rajesh. 2009. Edge detection techniques for image segmentation—A survey of soft computing approaches. International Journal of Recent Trends in Engineering 1, 2 (2009), 250--254.Google ScholarGoogle Scholar
  153. Neeraj Sharma and Lalit M. Aggarwal. 2010. Automated medical image segmentation techniques. Journal of Medical Physics/Association of Medical Physicists of India 35, 1 (2010), 3.Google ScholarGoogle Scholar
  154. Jianbo Shi and Jitendra Malik. 2000. Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22, 8 (2000), 888--905. Google ScholarGoogle ScholarDigital LibraryDigital Library
  155. Jianping Shi, Qiong Yan, Li Xu, and Jiaya Jia. 2016. Hierarchical image saliency detection on extended CSSD. IEEE Transactions on Pattern Analysis and Machine Intelligence 38, 4 (2016), 717--729. Google ScholarGoogle ScholarDigital LibraryDigital Library
  156. Jamie Shotton, John Winn, Carsten Rother, and Antonio Criminisi. 2006. TextonBoost: Joint appearance, shape and context modeling for multi-class object recognition and segmentation. In Proceedings of the European Conference on Computer Vision. 1--15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  157. Margarida Silveira, Jacinto C. Nascimento, Jorge S. Marques, André R. S. Marçal, Teresa Mendonça, Syogo Yamauchi, Junji Maeda, and Jorge Rozeira. 2009. Comparison of segmentation methods for melanoma diagnosis in dermoscopy images. IEEE Journal of Selected Topics in Signal Processing 3, 1 (2009), 35--45.Google ScholarGoogle ScholarCross RefCross Ref
  158. Yan Song, Yuemei Zhu, Guangliang Li, Chen Feng, Bo He, and Tianhong Yan. 2017. Side scan sonar segmentation using deep convolutional neural network. In Proceedings of the 2017 OCEANS--Anchorage Conference. IEEE, Los Alamitos, CA, 1--4.Google ScholarGoogle Scholar
  159. Joes Staal, Michael D. Abràmoff, Meindert Niemeijer, Max A. Viergever, and Bram Van Ginneken. 2004. Ridge-based vessel segmentation in color images of the retina. IEEE Transactions on Medical Imaging 23, 4 (2004), 501--509.Google ScholarGoogle ScholarCross RefCross Ref
  160. Tamás Szirányi, Károly László, László Czúni, and Francesco Ziliani. 1999. Object oriented motion-segmentation for video-compression in the CNN-UM. Journal of VLSI Signal Processing Systems for Signal, Image and Video Technology 23, 2-3 (1999), 479--496. Google ScholarGoogle ScholarDigital LibraryDigital Library
  161. Khang Siang Tan and Nor Ashidi Mat Isa. 2011. Color image segmentation using histogram thresholding—Fuzzy c-means hybrid approach. Pattern Recognition 44, 1 (2011), 1--15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  162. Orlando José Tobias and Rui Seara. 2002. Image segmentation by histogram thresholding using fuzzy sets. IEEE Transactions on Image Processing 11, 12 (2002), 1457--1465. Google ScholarGoogle ScholarDigital LibraryDigital Library
  163. Michael Treml, José Arjona-Medina, Thomas Unterthiner, Rupesh Durgesh, Felix Friedmann, Peter Schuberth, et al. 2016. Speeding up semantic segmentation for autonomous driving. In Proceedings of the MLITS NIPS Workshop.Google ScholarGoogle Scholar
  164. Wei-Chih Tu, Ming-Yu Liu, Varun Jampani, Deqing Sun, Shao-Yi Chien, Ming-Hsuan Yang, and Jan Kautz. 2018. Learning superpixels with segmentation-aware affinity loss. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 568--576.Google ScholarGoogle ScholarCross RefCross Ref
  165. Jasper R. R. Uijlings, Koen E. A. Van De Sande, Theo Gevers, and Arnold W. M. Smeulders. 2013. Selective search for object recognition. International Journal of Computer Vision 104, 2 (2013), 154--171. Google ScholarGoogle ScholarDigital LibraryDigital Library
  166. Koen E. A. Van de Sande, Jasper R. R. Uijlings, Theo Gevers, and Arnold W. M. Smeulders. 2011. Segmentation as selective search for object recognition. In Proceedings of the 2011 IEEE International Conference on Computer Vision (ICCV’11). IEEE, Los Alamitos, CA, 1879--1886. Google ScholarGoogle ScholarDigital LibraryDigital Library
  167. Bram Van Ginneken, Mikkel B. Stegmann, and Marco Loog. 2006. Segmentation of anatomical structures in chest radiographs using supervised methods: A comparative study on a public database. Medical Image Analysis 10, 1 (2006), 19--40.Google ScholarGoogle ScholarCross RefCross Ref
  168. G. Varma, A. Subramanian, A. Namboodiri, M. Chandraker, and C. V. Jawahar. 2018. IDD: A dataset for exploring problems of autonomous navigation in unconstrained environments. arXiv:1811.10200.Google ScholarGoogle Scholar
  169. Andreas Veit, Tomas Matera, Lukas Neumann, Jiri Matas, and Serge Belongie. 2016. Coco-text: Dataset and benchmark for text detection and recognition in natural images. arXiv:1601.07140.Google ScholarGoogle Scholar
  170. David L. Vilarino, Diego Cabello, and Victor M. Brea. 2002. An analogic CNN-algorithm of pixel level snakes for tracking and surveillance tasks. In Proceedings of the 2002 7th IEEE International Workshop on Cellular Neural Networks and Their Applications (CNNA’02). IEEE, Los Alamitos, CA, 84--91.Google ScholarGoogle Scholar
  171. Kai Wang, Boris Babenko, and Serge Belongie. 2011. End-to-end scene text recognition. In Proceedings of the 2011 IEEE International Conference on Computer Vision (ICCV’11). IEEE, Los Alamitos, CA, 1457--1464. Google ScholarGoogle ScholarDigital LibraryDigital Library
  172. Guo-Qing Wei, Klaus Arbter, and Gerd Hirzinger. 1997. Real-time visual servoing for laparoscopic surgery. Controlling robot motion with color image segmentation. IEEE Engineering in Medicine and Biology Magazine 16, 1 (1997), 40--45.Google ScholarGoogle ScholarCross RefCross Ref
  173. Xide Xia and Brian Kulis. 2017. W-Net: A deep model for fully unsupervised image segmentation. arXiv:1711.08506.Google ScholarGoogle Scholar
  174. Jieqiong Xu, Guoyu Wang, and Feifei Sun. 2013. A novel method for detecting and tracking vehicles in traffic-image sequence. In Proceedings of the 5th International Conference on Digital Image Processing (ICDIP’13), Vol. 8878. 88782P.Google ScholarGoogle ScholarCross RefCross Ref
  175. Ning Xu, Linjie Yang, Yuchen Fan, Dingcheng Yue, Yuchen Liang, Jianchao Yang, and Thomas Huang. 2018. YouTube-VOS: A large-scale video object segmentation benchmark. arXiv:1809.03327.Google ScholarGoogle Scholar
  176. Chuan Yang, Lihe Zhang, Huchuan Lu, Xiang Ruan, and Ming-Hsuan Yang. 2013. Saliency detection via graph-based manifold ranking. In Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’13). IEEE, Los Alamitos, CA, 3166--3173. Google ScholarGoogle ScholarDigital LibraryDigital Library
  177. Fisher Yu and Vladlen Koltun. 2015. Multi-scale context aggregation by dilated convolutions. arXiv:1511.07122.Google ScholarGoogle Scholar
  178. Fisher Yu, Wenqi Xian, Yingying Chen, Fangchen Liu, Mike Liao, Vashisht Madhavan, and Trevor Darrell. 2018. BDD100K: A diverse driving video database with scalable annotation tooling. arXiv:1805.04687.Google ScholarGoogle Scholar
  179. Jiangye Yuan, Shaun S. Gleason, and Anil M. Cheriyadat. 2013. Systematic benchmarking of aerial image segmentation. IEEE Geoscience and Remote Sensing Letters 10, 6 (2013), 1527--1531.Google ScholarGoogle ScholarCross RefCross Ref
  180. Matthew D. Zeiler and Rob Fergus. 2014. Visualizing and understanding convolutional networks. In Proceedings of the European Conference on Computer Vision. 818--833.Google ScholarGoogle Scholar
  181. Darko Zikic, Yani Ioannou, Matthew Brown, and Antonio Criminisi. 2014. Segmentation of brain tumor tissues with convolutional neural networks. In Proceedings of the MICCAI Workshop on Multimodal Brain Tumor Segmentation Challenge (BRATS'14). 36--39.Google ScholarGoogle Scholar
  182. Xiaohang Zhan, Xingang Pan, Ziwei Liu, Dahua Lin, and Chen Change Loy. 2019. Self-supervised learning via conditional motion propagation. arXiv:1903.11412.Google ScholarGoogle Scholar
  183. Qi Zhang, Sally A. Goldman, Wei Yu, and Jason E. Fritts. 2002. Content-based image retrieval using multiple-instance learning. In Proceedings of the 19th International Conference on Machine Learning (ICML’02), Vol. 2. 682--689. Google ScholarGoogle ScholarDigital LibraryDigital Library
  184. Richard Zhang, Phillip Isola, and Alexei A. Efros. 2016. Colorful image colorization. In Proceedings of the European Conference on Computer Vision. 649--666.Google ScholarGoogle Scholar
  185. Bo Zhao, Jiashi Feng, Xiao Wu, and Shuicheng Yan. 2017. A survey on deep learning-based fine-grained object classification and semantic segmentation. International Journal of Automation and Computing 14, 2 (2017), 119--135. Google ScholarGoogle ScholarDigital LibraryDigital Library
  186. Hengshuang Zhao, Jianping Shi, Xiaojuan Qi, Xiaogang Wang, and Jiaya Jia. 2017. Pyramid scene parsing network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17). 2881--2890.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Understanding Deep Learning Techniques for Image Segmentation

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Computing Surveys
        ACM Computing Surveys  Volume 52, Issue 4
        July 2020
        769 pages
        ISSN:0360-0300
        EISSN:1557-7341
        DOI:10.1145/3359984
        • Editor:
        • Sartaj Sahni
        Issue’s Table of Contents

        Copyright © 2019 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 30 August 2019
        • Accepted: 1 May 2019
        • Revised: 1 February 2019
        • Received: 1 July 2018
        Published in csur Volume 52, Issue 4

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • survey
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format