World Wide Web

Published in:

24-09-2022

Learning refined features for open-world text classification with class description and commonsense knowledge

Authors: Haopeng Ren, Zeting Li, Yi Cai, Xingwei Tan, Xin Wu

Published in: World Wide Web | Issue 2/2023

Abstract

Open-world classification requires a classifier not only to assign samples to the known (observed) classes but also to detect samples that do not belong to any of them. State-of-the-art methods train a feature extractor on limited training data to separate the known classes, and then apply a strategy such as an outlier detector in the resulting feature space to reject samples from unknown classes. However, such extractors tend to learn only the features that discriminate among the known classes and fail to model the known classes comprehensively, which leads to classification errors when samples from unknown classes must be detected in an open-world scenario. Motivated by theories from psychology and cognitive science, we use both class descriptions and commonsense knowledge summarized by humans to refine the discriminative features, and we propose a regularization strategy. The regularization is incorporated into the feature extractor to further improve the model's performance in an open-world environment. Extensive experiments and visualization analysis are conducted to evaluate the effectiveness of the proposed model.
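
To make the open-world setting concrete, the following is a minimal sketch of the generic "extract features, then reject outliers" pipeline that the abstract attributes to prior methods. It is not the authors' model: the nearest-centroid rule, the fixed `reject_distance` threshold, the toy 2-D features, and all function names are assumptions chosen purely for illustration.

```python
import math


def centroid(vectors):
    """Mean of a list of equal-length feature vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def fit_centroids(features_by_class):
    """Map each known class label to the centroid of its training features."""
    return {label: centroid(vecs) for label, vecs in features_by_class.items()}


def predict_open_world(x, centroids, reject_distance):
    """Return the nearest known class, or "unknown" if x is too far from all of them."""
    label, dist = min(
        ((lbl, math.dist(x, c)) for lbl, c in centroids.items()),
        key=lambda pair: pair[1],
    )
    # Hypothetical rejection rule: a single global distance threshold.
    return label if dist <= reject_distance else "unknown"


# Toy 2-D features standing in for the output of a trained feature extractor.
train = {
    "sports": [[0.0, 0.0], [0.2, 0.0], [0.0, 0.2]],
    "politics": [[5.0, 5.0], [5.2, 5.0], [5.0, 5.2]],
}
centroids = fit_centroids(train)
near_sports = predict_open_world([0.1, 0.1], centroids, reject_distance=1.0)
far_from_all = predict_open_world([10.0, -3.0], centroids, reject_distance=1.0)
print(near_sports, far_from_all)
```

In this sketch the rejection step can only be as good as the feature space it operates on. That is the weakness the abstract points to: features learned solely to separate the known classes may place unknown-class samples close to a known centroid.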

Title: Learning refined features for open-world text classification with class description and commonsense knowledge
Authors: Haopeng Ren, Zeting Li, Yi Cai, Xingwei Tan, Xin Wu
Publication date: 24-09-2022
Publisher: Springer US
Published in: World Wide Web / Issue 2/2023
Print ISSN: 1386-145X
Electronic ISSN: 1573-1413
DOI: https://doi.org/10.1007/s11280-022-01102-6
