Skip to main content
Erschienen in: Neural Processing Letters 1/2020

18.06.2020

A New Supervised Clustering Framework Using Multi Discriminative Parts and Expectation–Maximization Approach for a Fine-Grained Animal Breed Classification (SC-MPEM)

verfasst von: Divya Meena Sundaram, Agilandeeswari Loganathan

Erschienen in: Neural Processing Letters | Ausgabe 1/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Fine-grained image classification is active research in the field of computer vision. Specifically, animal breed classification is an arduous task due to the challenges in camera traps images like occlusion, camouflage, poor illumination, pose variation, etc. In this paper, we propose a fine-grained animal breed classification model using supervised clustering based on Multi Part-Convolutional Neural Network (MP-CNN) and Expectation–Maximization (EM) clustering. The proposed model follows a straightforward pipeline that combines the deep feature extraction using the CNN pre-trained on ImageNet and classifies unsupervised data using EM clustering. Further, we also propose a multi discriminative part selection and detection for the precise classification of animal breeds without using bounding box and annotations on both training and testing phases. The model is tested on several benchmark datasets for animals, including the largest camera trap Snapshot Serengeti dataset and has achieved a cumulative accuracy of 98.4%. The results from the proposed model strengthen the belief that supervised training of deep CNN on a large and versatile dataset, extracts better features than most of the traditional approaches, even for the unsupervised tasks.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat Swanson A, Kosmala M, Lintott C, Simpson R, Smith A, Packer C (2015) Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna. Sci Data 2:150026 Swanson A, Kosmala M, Lintott C, Simpson R, Smith A, Packer C (2015) Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna. Sci Data 2:150026
2.
Zurück zum Zitat Deng J, Dong W, Socher R, Li LJ, Li K, Fei LF (2009) ImageNet: a large-scale hierarchical image database. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 248–255 Deng J, Dong W, Socher R, Li LJ, Li K, Fei LF (2009) ImageNet: a large-scale hierarchical image database. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 248–255
6.
Zurück zum Zitat Norouzzadeh MS, Nguyen A, Kosmala M, Swanson A, Palmer MS, Packer C, Clune J (2018) Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning. Proc Natl Acad Sci 115(25):5716–5725 Norouzzadeh MS, Nguyen A, Kosmala M, Swanson A, Palmer MS, Packer C, Clune J (2018) Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning. Proc Natl Acad Sci 115(25):5716–5725
7.
Zurück zum Zitat Jaskó G, Giosan I, Nedevschi S (2017) Animal detection from traffic scenarios based on monocular color vision. In: 2017 13th IEEE international conference on intelligent computer communication and processing (ICCP), IEEE, pp 363–368 Jaskó G, Giosan I, Nedevschi S (2017) Animal detection from traffic scenarios based on monocular color vision. In: 2017 13th IEEE international conference on intelligent computer communication and processing (ICCP), IEEE, pp 363–368
8.
Zurück zum Zitat Sharma SU, Shah DJ (2016) A practical animal detection and collision avoidance system using computer vision technique. IEEE Access 5:347–358 Sharma SU, Shah DJ (2016) A practical animal detection and collision avoidance system using computer vision technique. IEEE Access 5:347–358
9.
Zurück zum Zitat Meena SD, Agilandeeswari L (2020) Stacked convolutional autoencoder for detecting animal images in cluttered scenes with a novel feature extraction framework. In: Soft computing for problem solving, Springer, Singapore, pp 513–522 Meena SD, Agilandeeswari L (2020) Stacked convolutional autoencoder for detecting animal images in cluttered scenes with a novel feature extraction framework. In: Soft computing for problem solving, Springer, Singapore, pp 513–522
11.
Zurück zum Zitat Gupta P, Verma GK (2017) Wild animal detection using discriminative feature-oriented dictionary learning. In: 2017 International conference on computing, communication and automation (ICCCA), IEEE, pp 104–109 Gupta P, Verma GK (2017) Wild animal detection using discriminative feature-oriented dictionary learning. In: 2017 International conference on computing, communication and automation (ICCCA), IEEE, pp 104–109
12.
Zurück zum Zitat Antônio WH, Da Silva M, Miani RS, Souza JR (2019) A proposal of an animal detection system using machine learning. Appl Artif Intell 33(13):1093–1106 Antônio WH, Da Silva M, Miani RS, Souza JR (2019) A proposal of an animal detection system using machine learning. Appl Artif Intell 33(13):1093–1106
13.
Zurück zum Zitat Xie L, Tian Q, Hong R, Yan S, Zhang B (2013) Hierarchical part matching for fine-grained visual categorization. In: International conference of computer vision (ICCV), pp 1641–1648 Xie L, Tian Q, Hong R, Yan S, Zhang B (2013) Hierarchical part matching for fine-grained visual categorization. In: International conference of computer vision (ICCV), pp 1641–1648
14.
Zurück zum Zitat Berg T, Belhumeur P (2013) Poof: part-based one-vs.-one features for fine-grained categorization, face verification, and attribute estimation. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 955–962 Berg T, Belhumeur P (2013) Poof: part-based one-vs.-one features for fine-grained categorization, face verification, and attribute estimation. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 955–962
15.
Zurück zum Zitat Branson S, VanHorn G, Belongie S, Perona P (2014) Bird species categorization using pose normalized deep convolutional nets. arxiv:1406.2952 Branson S, VanHorn G, Belongie S, Perona P (2014) Bird species categorization using pose normalized deep convolutional nets. arxiv:1406.2952
16.
Zurück zum Zitat Zhang N, Donahue J, Girshick R, Darrell T (2014) Part based R-CNNs for fine-grained category detection. In: European conference on computer vision (ECCV), pp 834–849 Zhang N, Donahue J, Girshick R, Darrell T (2014) Part based R-CNNs for fine-grained category detection. In: European conference on computer vision (ECCV), pp 834–849
17.
Zurück zum Zitat Lin D, Shen X, Lu C, Jia J (2015) Deep lac: deep localization, alignment and classification for fine-grained recognition. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 1666–1674 Lin D, Shen X, Lu C, Jia J (2015) Deep lac: deep localization, alignment and classification for fine-grained recognition. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 1666–1674
18.
Zurück zum Zitat Huang S, Xu Z, Tao D, Zhang Y (2016) Part-stacked CNN for fine-grained visual categorization. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 1173–1182 Huang S, Xu Z, Tao D, Zhang Y (2016) Part-stacked CNN for fine-grained visual categorization. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 1173–1182
19.
Zurück zum Zitat Yao H, Zhang S, Zhang Y, Li J, Tian Q (2016) Coarse-to-fine description for fine-grained visual categorization. IEEE Trans Image Process (TIP) 25(10):4858–4872MathSciNet Yao H, Zhang S, Zhang Y, Li J, Tian Q (2016) Coarse-to-fine description for fine-grained visual categorization. IEEE Trans Image Process (TIP) 25(10):4858–4872MathSciNet
20.
Zurück zum Zitat Xu Z, Huang S, Zhang Y, Tao D (2016) Webly-supervised fine-grained visual categorization via deep domain adaptation. In: IEEE transactions on pattern analysis and machine intelligence (TPAMI) Xu Z, Huang S, Zhang Y, Tao D (2016) Webly-supervised fine-grained visual categorization via deep domain adaptation. In: IEEE transactions on pattern analysis and machine intelligence (TPAMI)
21.
Zurück zum Zitat Xu Z, Tao D, Huang S, Zhang Y (2017) Friend or foe: fine-grained categorization with weak supervision. IEEE Trans Image Process (TIP) 26(1):135–146MathSciNetMATH Xu Z, Tao D, Huang S, Zhang Y (2017) Friend or foe: fine-grained categorization with weak supervision. IEEE Trans Image Process (TIP) 26(1):135–146MathSciNetMATH
22.
Zurück zum Zitat Xie L, Tian Q, Wang M, Zhang B (2014) Spatial pooling of heterogeneous features for image classification. IEEE Trans Image Process (TIP) 23(5):1994–2008MathSciNetMATH Xie L, Tian Q, Wang M, Zhang B (2014) Spatial pooling of heterogeneous features for image classification. IEEE Trans Image Process (TIP) 23(5):1994–2008MathSciNetMATH
23.
Zurück zum Zitat Krause J, Jin H, Yang J, Fei-Fei L (2015) Fine-grained recognition without part annotations. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 5546–5555 Krause J, Jin H, Yang J, Fei-Fei L (2015) Fine-grained recognition without part annotations. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 5546–5555
24.
Zurück zum Zitat Simon M, Rodner E (2015) Neural activation constellations: unsupervised part model discovery with convolutional networks. In: International conference of computer vision (ICCV), pp 1143–1151 Simon M, Rodner E (2015) Neural activation constellations: unsupervised part model discovery with convolutional networks. In: International conference of computer vision (ICCV), pp 1143–1151
25.
Zurück zum Zitat Lin TY, Chowdhury AR, Maji S (2015) Bilinear CNN models for fine-grained visual recognition. In: International conference of computer vision (ICCV), pp 1449–1457 Lin TY, Chowdhury AR, Maji S (2015) Bilinear CNN models for fine-grained visual recognition. In: International conference of computer vision (ICCV), pp 1449–1457
26.
Zurück zum Zitat Zhang X, Xiong H, Zhou W, Tian Q (2016) Fused one-vs-all features with semantic alignments for fine-grained visual categorization. IEEE Trans Image Process (TIP) 25(2):878–892MathSciNetMATH Zhang X, Xiong H, Zhou W, Tian Q (2016) Fused one-vs-all features with semantic alignments for fine-grained visual categorization. IEEE Trans Image Process (TIP) 25(2):878–892MathSciNetMATH
27.
Zurück zum Zitat Zhang L, Yang Y, Wang M, Hong R, Nie L, Li X (2016) Detecting densely distributed graph patterns for fine grained image categorization. IEEE Trans Image Process (TIP) 25(2):553–565MathSciNetMATH Zhang L, Yang Y, Wang M, Hong R, Nie L, Li X (2016) Detecting densely distributed graph patterns for fine grained image categorization. IEEE Trans Image Process (TIP) 25(2):553–565MathSciNetMATH
28.
Zurück zum Zitat Zheng H, Fu J, Mei T, Luo J (2017) Learning multi-attention convolutional neural network for fine-grained image recognition. In: International conference of computer (ICCV), pp 5209–5217 Zheng H, Fu J, Mei T, Luo J (2017) Learning multi-attention convolutional neural network for fine-grained image recognition. In: International conference of computer (ICCV), pp 5209–5217
29.
Zurück zum Zitat Liu J, Kanazawa A, Jacobs D, Belhumeur P (2012) Dog breed classification using part localization. In: European conference on computer vision, Springer, Berlin, pp 172–185 Liu J, Kanazawa A, Jacobs D, Belhumeur P (2012) Dog breed classification using part localization. In: European conference on computer vision, Springer, Berlin, pp 172–185
30.
Zurück zum Zitat Parkhi OM, Vedaldi A, Zisserman A, Jawahar CV (2012) Cats and dogs. In: 2012 IEEE conference on computer vision and pattern recognition. IEEE, pp 3498–3505 Parkhi OM, Vedaldi A, Zisserman A, Jawahar CV (2012) Cats and dogs. In: 2012 IEEE conference on computer vision and pattern recognition. IEEE, pp 3498–3505
31.
Zurück zum Zitat Khosla A, Jayadevaprakash N, Yao B, Li FF (2011) Novel dataset for fine-grained image categorization: Stanford dogs. In: Proc. CVPR workshop on fine-grained visual categorization (FGVC), vol 2, no 1 Khosla A, Jayadevaprakash N, Yao B, Li FF (2011) Novel dataset for fine-grained image categorization: Stanford dogs. In: Proc. CVPR workshop on fine-grained visual categorization (FGVC), vol 2, no 1
32.
Zurück zum Zitat Mulligan K, Rivas P (2019) Dog breed identification with a neural network over learned representations from the exception cnn architecture. In: 21st International conference on artificial intelligence (ICAI 2019) Mulligan K, Rivas P (2019) Dog breed identification with a neural network over learned representations from the exception cnn architecture. In: 21st International conference on artificial intelligence (ICAI 2019)
33.
Zurück zum Zitat Cubuk ED, Zoph B, Mane D, Vasudevan V, Le QV (2019) Autoaugment: learning augmentation strategies from data. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 113–123 Cubuk ED, Zoph B, Mane D, Vasudevan V, Le QV (2019) Autoaugment: learning augmentation strategies from data. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 113–123
34.
Zurück zum Zitat Touvron H, Vedaldi A, Douze M, Jégou H (2019) Fixing the train-test resolution discrepancy. In: Advances in neural information processing systems, pp 8250–8260 Touvron H, Vedaldi A, Douze M, Jégou H (2019) Fixing the train-test resolution discrepancy. In: Advances in neural information processing systems, pp 8250–8260
35.
Zurück zum Zitat Kolesnikov A, Beyer L, Zhai X, Puigcerver J, Yung J, Gelly S, Houlsby N (2019). Large scale learning of general visual representations for transfer. arXiv preprint arXiv:1912.11370 Kolesnikov A, Beyer L, Zhai X, Puigcerver J, Yung J, Gelly S, Houlsby N (2019). Large scale learning of general visual representations for transfer. arXiv preprint arXiv:​1912.​11370
36.
Zurück zum Zitat Lee J, Won T, Hong K (2020) Compounding the performance improvements of assembled techniques in a convolutional neural network. arXiv preprint arXiv:2001.06268 Lee J, Won T, Hong K (2020) Compounding the performance improvements of assembled techniques in a convolutional neural network. arXiv preprint arXiv:​2001.​06268
37.
Zurück zum Zitat Meena SD, Agilandeeswari L (2019) An efficient framework for animal breeds classification using semi-supervised learning and multi-part convolutional neural network (MP-CNN). IEEE Access 7:151783–151802 Meena SD, Agilandeeswari L (2019) An efficient framework for animal breeds classification using semi-supervised learning and multi-part convolutional neural network (MP-CNN). IEEE Access 7:151783–151802
38.
Zurück zum Zitat Liu X, Xia T, Wang J, Yang Y, Zhou F, Lin Y (2016) Fully convolutional attention networks for fine-grained recognition. arXiv preprint arXiv:1603.06765 Liu X, Xia T, Wang J, Yang Y, Zhou F, Lin Y (2016) Fully convolutional attention networks for fine-grained recognition. arXiv preprint arXiv:​1603.​06765
39.
Zurück zum Zitat Zheng H, Fu J, Mei T, Luo J (2017) Learning multi-attention convolutional neural network for fine-grained image recognition. In: Proceedings of the IEEE international conference on computer vision, pp 5209–5217 Zheng H, Fu J, Mei T, Luo J (2017) Learning multi-attention convolutional neural network for fine-grained image recognition. In: Proceedings of the IEEE international conference on computer vision, pp 5209–5217
40.
Zurück zum Zitat Sun M, Yuan Y, Zhou F, Ding E (2018) Multi-attention multi-class constraint for fine-grained image recognition. In; Proceedings of the European conference on computer vision (ECCV), pp 805–821 Sun M, Yuan Y, Zhou F, Ding E (2018) Multi-attention multi-class constraint for fine-grained image recognition. In; Proceedings of the European conference on computer vision (ECCV), pp 805–821
41.
Zurück zum Zitat Dubey A, Gupta O, Guo P, Raskar R, Farrell R, Naik N (2018) Pairwise confusion for fine-grained visual classification. In: Proceedings of the European conference on computer vision (ECCV). pp 70–86 Dubey A, Gupta O, Guo P, Raskar R, Farrell R, Naik N (2018) Pairwise confusion for fine-grained visual classification. In: Proceedings of the European conference on computer vision (ECCV). pp 70–86
42.
Zurück zum Zitat Sun G, Cholakkal H, Khan S, Khan FS, Shao L (2019) Fine-grained recognition: accounting for subtle differences between similar classes. arXiv preprint arXiv:1912.06842 Sun G, Cholakkal H, Khan S, Khan FS, Shao L (2019) Fine-grained recognition: accounting for subtle differences between similar classes. arXiv preprint arXiv:​1912.​06842
43.
Zurück zum Zitat Hu T, Qi H, Huang Q, Lu Y (2019) See better before looking closer: weakly supervised data augmentation network for fine-grained visual classification. arXiv preprint arXiv:1901.09891 Hu T, Qi H, Huang Q, Lu Y (2019) See better before looking closer: weakly supervised data augmentation network for fine-grained visual classification. arXiv preprint arXiv:​1901.​09891
44.
Zurück zum Zitat Zhuang P, Wang Y, Qiao . (2020) Learning attentive pairwise interaction for fine-grained classification. arXiv preprint arXiv:2002.10191 Zhuang P, Wang Y, Qiao . (2020) Learning attentive pairwise interaction for fine-grained classification. arXiv preprint arXiv:​2002.​10191
45.
Zurück zum Zitat Guo J, Ma S, Guo S (2019) MAANet: multi-view aware attention networks for image super-resolution. arXiv preprint arXiv:1904.06252 Guo J, Ma S, Guo S (2019) MAANet: multi-view aware attention networks for image super-resolution. arXiv preprint arXiv:​1904.​06252
46.
Zurück zum Zitat Fu J, Liu J, Tian H, Li Y, Bao Y, Fang Z, Lu H (2019) Dual attention network for scene segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3146–3154 Fu J, Liu J, Tian H, Li Y, Bao Y, Fang Z, Lu H (2019) Dual attention network for scene segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3146–3154
47.
Zurück zum Zitat Hu T, Yang P, Zhang C, Yu G, Mu Y, Snoek CG (2019) Attention-based multi-context guiding for few-shot semantic segmentation. In: Proceedings of the AAAI conference on artificial intelligence, vol 33, pp 8441–8448 Hu T, Yang P, Zhang C, Yu G, Mu Y, Snoek CG (2019) Attention-based multi-context guiding for few-shot semantic segmentation. In: Proceedings of the AAAI conference on artificial intelligence, vol 33, pp 8441–8448
48.
Zurück zum Zitat Zhang L, Nizampatnam S, Gangopadhyay A, Conde MV (2019) Multi-attention networks for temporal localization of video-level labels. arXiv preprint arXiv:1911.06866 Zhang L, Nizampatnam S, Gangopadhyay A, Conde MV (2019) Multi-attention networks for temporal localization of video-level labels. arXiv preprint arXiv:​1911.​06866
49.
Zurück zum Zitat Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2001) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357MATH Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2001) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357MATH
50.
Zurück zum Zitat Yan X, Ai T, Yang M, Yin H (2019) A graph convolutional neural network for classification of building patterns using spatial vector data. ISPRS J Photogram Remote Sens 150:259–273 Yan X, Ai T, Yang M, Yin H (2019) A graph convolutional neural network for classification of building patterns using spatial vector data. ISPRS J Photogram Remote Sens 150:259–273
51.
Zurück zum Zitat Liu JE, An FP (2020) Image classification algorithm based on deep learning-kernel function. In: Scientific programming Liu JE, An FP (2020) Image classification algorithm based on deep learning-kernel function. In: Scientific programming
52.
Zurück zum Zitat Huang C, Li H, Xie Y, Qingbo W, Luo B (2017) PBC: Polygon-based classifier for fine-grained categorization. IEEE Trans Multimed (TMM) 19(4):673–684 Huang C, Li H, Xie Y, Qingbo W, Luo B (2017) PBC: Polygon-based classifier for fine-grained categorization. IEEE Trans Multimed (TMM) 19(4):673–684
53.
Zurück zum Zitat Guérin J, Boots B (2018) Improving image clustering with multiple pretrained cnn feature extractors. arXiv preprint arXiv:1807.07760 Guérin J, Boots B (2018) Improving image clustering with multiple pretrained cnn feature extractors. arXiv preprint arXiv:​1807.​07760
54.
Zurück zum Zitat Long X, Gan C, De Melo G, Wu J, Liu X, Wen S (2018) Attention clusters: purely attention based local feature integration for video classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7834–7843 Long X, Gan C, De Melo G, Wu J, Liu X, Wen S (2018) Attention clusters: purely attention based local feature integration for video classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7834–7843
Metadaten
Titel
A New Supervised Clustering Framework Using Multi Discriminative Parts and Expectation–Maximization Approach for a Fine-Grained Animal Breed Classification (SC-MPEM)
verfasst von
Divya Meena Sundaram
Agilandeeswari Loganathan
Publikationsdatum
18.06.2020
Verlag
Springer US
Erschienen in
Neural Processing Letters / Ausgabe 1/2020
Print ISSN: 1370-4621
Elektronische ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-020-10246-3

Weitere Artikel der Ausgabe 1/2020

Neural Processing Letters 1/2020 Zur Ausgabe

Neuer Inhalt