Published in: World Wide Web 2/2023

24-09-2022

Learning refined features for open-world text classification with class description and commonsense knowledge

Authors: Haopeng Ren, Zeting Li, Yi Cai, Xingwei Tan, Xin Wu

Abstract

Open-world classification requires a classifier not only to classify samples from the observed classes but also to detect samples that do not belong to any known class. State-of-the-art methods train a feature extractor on limited training data to separate the known classes, and then apply a strategy such as an outlier detector to reject samples from unknown classes in the resulting feature space. However, these methods tend to extract only the features that discriminate among the known classes and fail to model the known classes comprehensively, which causes classification errors when detecting samples from unknown classes in an open-world scenario. Motivated by theories from psychology and cognitive science, we use both class descriptions and commonsense knowledge summarized by humans to refine the discriminative features, and we propose a regularization strategy. The regularization is incorporated into the feature extractor to further improve the model's performance in an open-world environment. Extensive experiments and visualization analysis are conducted to evaluate the effectiveness of the proposed model.
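
The general open-world setup described in the abstract (classify among known classes, reject everything else in feature space) can be illustrated with a minimal sketch. This is not the authors' model: it swaps in a plain nearest-centroid classifier with a distance threshold as the rejection rule, and the toy 2-D vectors, the class names, the `threshold` value, and all function names are hypothetical stand-ins for the output of a trained feature extractor.

```python
import math


def centroid(vectors):
    """Component-wise mean of a list of equal-length feature vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def fit_centroids(features_by_class):
    """Map each known class label to the centroid of its training features."""
    return {label: centroid(vecs) for label, vecs in features_by_class.items()}


def predict_open_world(x, centroids, threshold):
    """Return the nearest known class, or "unknown" when x lies farther
    than `threshold` from every class centroid (a simple rejection rule)."""
    label, dist = min(
        ((lbl, math.dist(x, c)) for lbl, c in centroids.items()),
        key=lambda pair: pair[1],
    )
    return label if dist <= threshold else "unknown"


# Hypothetical toy features; a real system would obtain these
# from a trained feature extractor.
train = {
    "sports": [[0.0, 0.0], [0.2, 0.0], [0.0, 0.2]],
    "politics": [[5.0, 5.0], [5.2, 5.0], [5.0, 5.2]],
}
centroids = fit_centroids(train)

print(predict_open_world([0.1, 0.1], centroids, threshold=1.0))
print(predict_open_world([2.5, 2.5], centroids, threshold=1.0))
```

In this sketch the rejection rule only works if samples of each known class cluster tightly in feature space, which is one way to read the abstract's argument for refining the features rather than only separating the known classes.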

Footnotes
1
http://qwone.com/~jason/20Newsgroups/
 
Metadata
Title
Learning refined features for open-world text classification with class description and commonsense knowledge
Authors
Haopeng Ren
Zeting Li
Yi Cai
Xingwei Tan
Xin Wu
Publication date
24-09-2022
Publisher
Springer US
Published in
World Wide Web / Issue 2/2023
Print ISSN: 1386-145X
Electronic ISSN: 1573-1413
DOI
https://doi.org/10.1007/s11280-022-01102-6
