2021 | OriginalPaper | Chapter

ProPC: A Dataset for In-Domain and Cross-Domain Proposition Classification Tasks

Authors: Mengyang Hu, Pengyuan Liu, Lin Bo, Yuting Mao, Ke Xu, Wentao Su

Published in: Natural Language Processing and Chinese Computing

Publisher: Springer International Publishing

Abstract

Correctly identifying proposition types helps in understanding the logical relationships between sentences and is important for natural language understanding, reasoning, and generation. Previous studies, however, have three limitations: 1) they consider only explicit propositions, while most propositions in text are implicit; 2) they only detect whether a sentence is a proposition, although identifying which proposition type it belongs to is more meaningful; 3) they cover only the encyclopedia domain, whereas propositions occur widely across domains. We present ProPC, a dataset for in-domain and cross-domain proposition classification. It consists of 15,000 sentences covering 4 classes and 5 domains. We define two new tasks: 1) in-domain proposition classification, which identifies the proposition type of a given sentence (not limited to explicit propositions); 2) cross-domain proposition classification, which takes encyclopedia as the source domain and the other 4 domains as target domains. We use Matching, BERT, and RoBERTa as baseline methods and run experiments on each task. The results show that models can indeed learn the characteristics of the various proposition types from explicit propositions and classify implicit propositions, but their domain generalization ability still needs to be strengthened. Our dataset, ProPC, is publicly available at https://github.com/NLUSoCo/ProPC.
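
The two tasks defined in the abstract differ only in how examples are split by domain. The sketch below illustrates that difference under stated assumptions: the record fields (`text`, `label`, `domain`), the function names, the 0.8 train fraction, and the domain names other than "encyclopedia" are all hypothetical and are not taken from the ProPC release.

```python
from collections import defaultdict

def in_domain_split(examples, domain, train_fraction=0.8):
    """Train and test on sentences from one domain (fraction is an assumption)."""
    rows = [ex for ex in examples if ex["domain"] == domain]
    cut = int(len(rows) * train_fraction)
    return rows[:cut], rows[cut:]

def cross_domain_split(examples, source="encyclopedia"):
    """Train on the source domain; build one test set per remaining domain."""
    train = [ex for ex in examples if ex["domain"] == source]
    targets = defaultdict(list)
    for ex in examples:
        if ex["domain"] != source:
            targets[ex["domain"]].append(ex)
    return train, dict(targets)

# Toy records; "news" and "stories" are made-up domain names.
toy = [
    {"text": "s1", "label": "A", "domain": "encyclopedia"},
    {"text": "s2", "label": "B", "domain": "encyclopedia"},
    {"text": "s3", "label": "A", "domain": "news"},
    {"text": "s4", "label": "B", "domain": "stories"},
]
train, targets = cross_domain_split(toy)
```

In the cross-domain setting no target-domain sentence reaches the training set, which is what makes the task a test of domain generalization.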

Footnotes
1
Logical keywords, like “all...are...”, “both...and...”, “if...,then...”, “either...or...”, etc.
 
2
For example, in “if you don’t fight, you fail”, the logical keywords should be “if...then”; because “then” is omitted, the sentence is an implicit proposition.
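
The distinction drawn in footnotes 1 and 2 can be illustrated with a small keyword matcher. This is a minimal sketch and not the paper's Matching baseline, whose details are not given here. It uses the English keyword pairs from footnote 1 purely for illustration, and the names `EXPLICIT_PATTERNS` and `match_explicit` are hypothetical.

```python
import re
from typing import List

# Keyword pairs from footnote 1; a sentence counts as explicit only if
# both parts of a pair appear in order.
EXPLICIT_PATTERNS = {
    "all...are": re.compile(r"\ball\b.*\bare\b", re.IGNORECASE),
    "both...and": re.compile(r"\bboth\b.*\band\b", re.IGNORECASE),
    "if...then": re.compile(r"\bif\b.*\bthen\b", re.IGNORECASE),
    "either...or": re.compile(r"\beither\b.*\bor\b", re.IGNORECASE),
}

def match_explicit(sentence: str) -> List[str]:
    """Return the keyword pairs that are fully present in the sentence."""
    return [name for name, pattern in EXPLICIT_PATTERNS.items()
            if pattern.search(sentence)]

explicit = match_explicit("If it rains, then the ground is wet")
implicit = match_explicit("if you don't fight, you fail")
```

The first sentence matches the full “if...then” pair, while the example from footnote 2 matches nothing because “then” is missing. Pure keyword matching therefore cannot label implicit propositions.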
 
Metadata
Title
ProPC: A Dataset for In-Domain and Cross-Domain Proposition Classification Tasks
Authors
Mengyang Hu
Pengyuan Liu
Lin Bo
Yuting Mao
Ke Xu
Wentao Su
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-88480-2_5
