Skip to main content

2018 | OriginalPaper | Buchkapitel

Universal Sketch Perceptual Grouping

verfasst von : Ke Li, Kaiyue Pang, Jifei Song, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Honggang Zhang

Erschienen in: Computer Vision – ECCV 2018

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this work we aim to develop a universal sketch grouper. That is, a grouper that can be applied to sketches of any category in any domain to group constituent strokes/segments into semantically meaningful object parts. The first obstacle to this goal is the lack of large-scale datasets with grouping annotation. To overcome this, we contribute the largest sketch perceptual grouping (SPG) dataset to date, consisting of 20, 000 unique sketches evenly distributed over 25 object categories. Furthermore, we propose a novel deep universal perceptual grouping model. The model is learned with both generative and discriminative losses. The generative losses improve the generalisation ability of the model to unseen object categories and datasets. The discriminative losses include a local grouping loss and a novel global grouping loss to enforce global grouping consistency. We show that the proposed model significantly outperforms the state-of-the-art groupers. Further, we show that our grouper is useful for a number of sketch analysis tasks including sketch synthesis and fine-grained sketch-based image retrieval (FG-SBIR).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Their relationship is analogous to that between unsupervised image segmentation [5, 6] and semantic segmentation [7].
 
Literatur
1.
Zurück zum Zitat Wagemans, J., et al.: A century of gestalt psychology in visual perception: I. Perceptual grouping and figure-ground organization. Psychol. Bull. 138(6), 1172–1217 (2012)CrossRef Wagemans, J., et al.: A century of gestalt psychology in visual perception: I. Perceptual grouping and figure-ground organization. Psychol. Bull. 138(6), 1172–1217 (2012)CrossRef
2.
Zurück zum Zitat Wagemans, J.: A century of gestalt psychology in visual perception: II. Conceptual and theoretical foundations. Psychol. Bull. 138(6), 1218–1252 (2012)CrossRef Wagemans, J.: A century of gestalt psychology in visual perception: II. Conceptual and theoretical foundations. Psychol. Bull. 138(6), 1218–1252 (2012)CrossRef
3.
Zurück zum Zitat Wertheimer, M.: Laws of Organization in Perceptual Forms. Kegan Paul, Trench, Trubner & Company, London, England (1938)CrossRef Wertheimer, M.: Laws of Organization in Perceptual Forms. Kegan Paul, Trench, Trubner & Company, London, England (1938)CrossRef
4.
Zurück zum Zitat Ren, X., Malik, J.: Learning a classification model for segmentation. In: ICCV (2003) Ren, X., Malik, J.: Learning a classification model for segmentation. In: ICCV (2003)
5.
Zurück zum Zitat Xia, X., Kulis, B.: W-net: A deep model for fully unsupervised image segmentation. ArXiv e-prints (2017) Xia, X., Kulis, B.: W-net: A deep model for fully unsupervised image segmentation. ArXiv e-prints (2017)
6.
Zurück zum Zitat Wang, C., Yang, B., Liao, Y.: Unsupervised image segmentation using convolutional autoencoder with total variation regularization as preprocessing. In: ICASSP (2017) Wang, C., Yang, B., Liao, Y.: Unsupervised image segmentation using convolutional autoencoder with total variation regularization as preprocessing. In: ICASSP (2017)
7.
Zurück zum Zitat Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. arXiv:1606.00915 (2016) Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. arXiv:​1606.​00915 (2016)
9.
Zurück zum Zitat Huang, Z., Fu, H., Lau, R.W.: Data-driven segmentation and labeling of freehand sketches. TOG (2014) Huang, Z., Fu, H., Lau, R.W.: Data-driven segmentation and labeling of freehand sketches. TOG (2014)
10.
Zurück zum Zitat Schneider, R.G., Tuytelaars, T.: Example-based sketch segmentation and labeling using CRFs. TOG (2016) Schneider, R.G., Tuytelaars, T.: Example-based sketch segmentation and labeling using CRFs. TOG (2016)
11.
Zurück zum Zitat Qi, Y., Guo, J., Li, Y., Zhang, H., Xiang, T., Song, Y.Z.: Sketching by perceptual grouping. In: ICIP (2013) Qi, Y., Guo, J., Li, Y., Zhang, H., Xiang, T., Song, Y.Z.: Sketching by perceptual grouping. In: ICIP (2013)
12.
Zurück zum Zitat Qi, Y., et al.: Making better use of edges via perceptual grouping. In: CVPR (2015) Qi, Y., et al.: Making better use of edges via perceptual grouping. In: CVPR (2015)
13.
Zurück zum Zitat Hinton, G., Salakhutdinov, R.: Reducing the dimensionality of data with neural networks. Science 313, 504–507 (2006)MathSciNetCrossRef Hinton, G., Salakhutdinov, R.: Reducing the dimensionality of data with neural networks. Science 313, 504–507 (2006)MathSciNetCrossRef
14.
Zurück zum Zitat Amir, A., Lindenbaum, M.: A generic grouping algorithm and its quantitative analysis. TPAMI 20(2), 168–185 (1998)CrossRef Amir, A., Lindenbaum, M.: A generic grouping algorithm and its quantitative analysis. TPAMI 20(2), 168–185 (1998)CrossRef
15.
Zurück zum Zitat Lun, Z., et al.: Learning to group discrete graphical patterns. TOG (2017) Lun, Z., et al.: Learning to group discrete graphical patterns. TOG (2017)
17.
Zurück zum Zitat Muhammad, U.R., Song, Y.Z., Xiang, T., Hospedales, T.: Learning deep sketch abstraction. In: CVPR (2018) Muhammad, U.R., Song, Y.Z., Xiang, T., Hospedales, T.: Learning deep sketch abstraction. In: CVPR (2018)
18.
Zurück zum Zitat Li, Y., Hospedales, T.M., Song, Y.Z., Gong, S.: Fine-grained sketch-based image retrieval by matching deformable part models. In: BMVC (2014) Li, Y., Hospedales, T.M., Song, Y.Z., Gong, S.: Fine-grained sketch-based image retrieval by matching deformable part models. In: BMVC (2014)
19.
Zurück zum Zitat Yu, Q., Liu, F., Song, Y.Z., Xiang, T., Hospedales, T.M., Loy, C.C.: Sketch me that shoe. In: CVPR (2016) Yu, Q., Liu, F., Song, Y.Z., Xiang, T., Hospedales, T.M., Loy, C.C.: Sketch me that shoe. In: CVPR (2016)
20.
Zurück zum Zitat Song, J., Song, Y.Z., Xiang, T., Hospedales, T., Ruan, X.: Deep multi-task attribute-driven ranking for fine-grained sketch-based image retrieval. In: BMVC (2016) Song, J., Song, Y.Z., Xiang, T., Hospedales, T., Ruan, X.: Deep multi-task attribute-driven ranking for fine-grained sketch-based image retrieval. In: BMVC (2016)
21.
Zurück zum Zitat Pang, K., Song, Y.Z., Xiang, T., Hospedales, T.M.: Cross-domain generative learning for fine-grained sketch-based image retrieval. In: BMVC (2017) Pang, K., Song, Y.Z., Xiang, T., Hospedales, T.M.: Cross-domain generative learning for fine-grained sketch-based image retrieval. In: BMVC (2017)
22.
Zurück zum Zitat Hu, C., Li, D., Song, Y.Z., Xiang, T., Hospedales, T.: Sketch-a-classifier: Sketch-based photo classifier generation. In: CVPR (2018) Hu, C., Li, D., Song, Y.Z., Xiang, T., Hospedales, T.: Sketch-a-classifier: Sketch-based photo classifier generation. In: CVPR (2018)
23.
Zurück zum Zitat Song, J., Qian, Y., Song, Y.Z., Xiang, T., Hospedales, T.: Deep spatial-semantic attention for fine-grained sketch-based image retrieval. In: ICCV (2017) Song, J., Qian, Y., Song, Y.Z., Xiang, T., Hospedales, T.: Deep spatial-semantic attention for fine-grained sketch-based image retrieval. In: ICCV (2017)
24.
Zurück zum Zitat Yu, Q., Yang, Y., Liu, F., Song, Y.Z., Xiang, T., Hospedales, T.M.: Sketch-a-net: A deep neural network that beats humans. IJCV (2017) Yu, Q., Yang, Y., Liu, F., Song, Y.Z., Xiang, T., Hospedales, T.M.: Sketch-a-net: A deep neural network that beats humans. IJCV (2017)
25.
Zurück zum Zitat Bowman, S.R., Vilnis, L., Vinyals, O., Dai, A.M., Jozefowicz, R., Bengio, S.: Generating sentences from a continuous space. Computer Science (2015) Bowman, S.R., Vilnis, L., Vinyals, O., Dai, A.M., Jozefowicz, R., Bengio, S.: Generating sentences from a continuous space. Computer Science (2015)
26.
Zurück zum Zitat Kingma, D.P., Welling, M.: Auto-encoding variational bayes. ArXiv e-prints (2013) Kingma, D.P., Welling, M.: Auto-encoding variational bayes. ArXiv e-prints (2013)
28.
Zurück zum Zitat Schuster, M., Paliwal, K.K.: Bidirectional recurrent neural networks. TSP 45, 2673–2681 (1997) Schuster, M., Paliwal, K.K.: Bidirectional recurrent neural networks. TSP 45, 2673–2681 (1997)
29.
Zurück zum Zitat Yang, J., Parikh, D., Batra, D.: Joint unsupervised learning of deep representations and image clusters. In: CVPR (2016) Yang, J., Parikh, D., Batra, D.: Joint unsupervised learning of deep representations and image clusters. In: CVPR (2016)
31.
Zurück zum Zitat Arbelez, P., Maire, M., Fowlkes, C., Malik, J.: Contour detection and hierarchical image segmentation. TPAMI 33, 898–916 (2011)CrossRef Arbelez, P., Maire, M., Fowlkes, C., Malik, J.: Contour detection and hierarchical image segmentation. TPAMI 33, 898–916 (2011)CrossRef
32.
Zurück zum Zitat Sangkloy, P., Lu, J., Fang, C., Yu, F., Hays, J.: Scribbler: Controlling deep image synthesis with sketch and color. In: CVPR (2017) Sangkloy, P., Lu, J., Fang, C., Yu, F., Hays, J.: Scribbler: Controlling deep image synthesis with sketch and color. In: CVPR (2017)
33.
Zurück zum Zitat Qi, Y., Guo, J., Song, Y.Z., Xiang, T., Zhang, H., Tan, Z.H.: Im2sketch: Sketch generation by unconflicted perceptual grouping. Neurocomputing 165, 338–349 (2015)CrossRef Qi, Y., Guo, J., Song, Y.Z., Xiang, T., Zhang, H., Tan, Z.H.: Im2sketch: Sketch generation by unconflicted perceptual grouping. Neurocomputing 165, 338–349 (2015)CrossRef
34.
Zurück zum Zitat Eitz, M., Hays, J., Alexa, M.: How do humans sketch objects? TOG (2012) Eitz, M., Hays, J., Alexa, M.: How do humans sketch objects? TOG (2012)
Metadaten
Titel
Universal Sketch Perceptual Grouping
verfasst von
Ke Li
Kaiyue Pang
Jifei Song
Yi-Zhe Song
Tao Xiang
Timothy M. Hospedales
Honggang Zhang
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-030-01237-3_36