Skip to main content
Top

2018 | OriginalPaper | Chapter

TSABCNN: Two-Stage Attention-Based Convolutional Neural Network for Frame Identification

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

As an essential sub-task of frame-semantic parsing, Frame Identification (FI) is a fundamentally important research topic in shallow semantic parsing. However, most existing work is based on sophisticated, hand-crafted features which might not be compatible with FI procedure. Besides that, they usually heavily rely on available natural language processing (NLP) toolkits and various lexical resources. Thus existing methods with hand-crafted features may not achieve satisfactory performance. In this paper, we propose a two-stage attention-based convolutional neural network (TSABCNN) to alleviate this problem and capture the most important context features for FI task. In order to dynamically adjust the weight of each feature, we build two levels of attention over instances at input layer and pooling layer respectively. Furthermore, the proposed model is an end-to-end learning framework which does not need any complicated NLP toolkits and feature engineering, and can be applied to any language. Experiments results on FrameNet and Chinese FrameNet (CFN) show the effectiveness of the proposed approach for the FI task.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley FrameNet project. In: Meeting of the Association for Computational Linguistics and, International Conference on Computational Linguistics, vol. 47, pp. 86–90 (1998) Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley FrameNet project. In: Meeting of the Association for Computational Linguistics and, International Conference on Computational Linguistics, vol. 47, pp. 86–90 (1998)
go back to reference Fillmore, C.J.: Frame semantics. In: Linguistics in the Morning Calm, pp. 111–138 (1982) Fillmore, C.J.: Frame semantics. In: Linguistics in the Morning Calm, pp. 111–138 (1982)
go back to reference Hermann, K.M., et al.: Teaching machines to read and comprehend. In: NIPS, pp. 1693–1701 (2015) Hermann, K.M., et al.: Teaching machines to read and comprehend. In: NIPS, pp. 1693–1701 (2015)
go back to reference Das, D., Schneider, N., Chen, D., Smith, N.A.: Probabilistic frame-semantic parsing. In: Human Language Technologies: The 2010 Conference of the North American Chapter of the Association for Computational Linguistics, vol. 40, pp. 948–956 (2010) Das, D., Schneider, N., Chen, D., Smith, N.A.: Probabilistic frame-semantic parsing. In: Human Language Technologies: The 2010 Conference of the North American Chapter of the Association for Computational Linguistics, vol. 40, pp. 948–956 (2010)
go back to reference Das, D., Chen, D., Schneider, N., Smith, N.A.: Frame-semantic parsing. Comput. Linguist. 40(1), 9–56 (2014)CrossRef Das, D., Chen, D., Schneider, N., Smith, N.A.: Frame-semantic parsing. Comput. Linguist. 40(1), 9–56 (2014)CrossRef
go back to reference Carreras, X., Litkowski, K.C., Stevenson, S.: Semantic role labeling: an introduction to the special issue. Comput. Linguist. 34(2), 145–159 (2008)CrossRef Carreras, X., Litkowski, K.C., Stevenson, S.: Semantic role labeling: an introduction to the special issue. Comput. Linguist. 34(2), 145–159 (2008)CrossRef
go back to reference Kalchbrenner, N., Grefenstette, E., Blunsom, P.: A Convolutional Neural Network for Modelling Sentences. eprint arxiv (2014) Kalchbrenner, N., Grefenstette, E., Blunsom, P.: A Convolutional Neural Network for Modelling Sentences. eprint arxiv (2014)
go back to reference Kim, Y.: Convolutional Neural Networks for Sentence Classification. eprint arxiv (2014) Kim, Y.: Convolutional Neural Networks for Sentence Classification. eprint arxiv (2014)
go back to reference Zeng, D.J., Liu, K., Lai, S.W., Zhou, G.Y., Zhao, J.: Relation classification via convolutional deep neural network. In: Proceedings of COLING, Dublin, Ireland (2014) Zeng, D.J., Liu, K., Lai, S.W., Zhou, G.Y., Zhao, J.: Relation classification via convolutional deep neural network. In: Proceedings of COLING, Dublin, Ireland (2014)
go back to reference Chen, Y., Xu, L., Liu, K., Zeng, D., Zhao, J.: Event extraction via dynamic multi-pooling convolutional neural networks. In: The Meeting of the Association for Computational Linguistics, Beijing, China, pp. 167–176 (2015) Chen, Y., Xu, L., Liu, K., Zeng, D., Zhao, J.: Event extraction via dynamic multi-pooling convolutional neural networks. In: The Meeting of the Association for Computational Linguistics, Beijing, China, pp. 167–176 (2015)
go back to reference Dong, L., Wei, F., Zhou, M., Xu, K.: Question answering over freebase with multi-column convolutional neural networks. In: Meeting of the Association for Computational Linguistics and the International Joint Conference on Natural Language Processing, Beijing, China, pp. 260–269 (2015) Dong, L., Wei, F., Zhou, M., Xu, K.: Question answering over freebase with multi-column convolutional neural networks. In: Meeting of the Association for Computational Linguistics and the International Joint Conference on Natural Language Processing, Beijing, China, pp. 260–269 (2015)
go back to reference Severyn, A., Moschitti, A.: Learning to rank short text pairs with convolutional deep neural networks. In: The International ACM SIGIR Conference, pp. 373–382. ACM (2015) Severyn, A., Moschitti, A.: Learning to rank short text pairs with convolutional deep neural networks. In: The International ACM SIGIR Conference, pp. 373–382. ACM (2015)
go back to reference Yin, W., Schütze, H.: MultiGranCNN: an architecture for general matching of text chunks on multiple levels of granularity. In: Meeting of the Association for Computational Linguistics and the International Joint Conference on Natural Language Processing, Beijing, China, pp. 63–73 (2015) Yin, W., Schütze, H.: MultiGranCNN: an architecture for general matching of text chunks on multiple levels of granularity. In: Meeting of the Association for Computational Linguistics and the International Joint Conference on Natural Language Processing, Beijing, China, pp. 63–73 (2015)
go back to reference Gildea, D., Jurafsky, D.: Automatic labeling of semantic roles. Comput. Linguist. 28(28), 245–288 (2002)CrossRef Gildea, D., Jurafsky, D.: Automatic labeling of semantic roles. Comput. Linguist. 28(28), 245–288 (2002)CrossRef
go back to reference Carreras, X.: Introduction to the CoNLL-2004 shared task: semantic role labeling. In: Proceedings of 8th Conference on Natural Language Learning (CoNLL), Boston, MA, vol. 47, pp. 5–9 (2004) Carreras, X.: Introduction to the CoNLL-2004 shared task: semantic role labeling. In: Proceedings of 8th Conference on Natural Language Learning (CoNLL), Boston, MA, vol. 47, pp. 5–9 (2004)
go back to reference Carreras, X., Màrquez, L.: Introduction to the CoNLL-2005 shared task: semantic role labeling. In: Proceedings of the 9th Conference on Computational Natural Language Learning (CoNLL), ANN Arbor, MI, pp. 152–164 (2005) Carreras, X., Màrquez, L.: Introduction to the CoNLL-2005 shared task: semantic role labeling. In: Proceedings of the 9th Conference on Computational Natural Language Learning (CoNLL), ANN Arbor, MI, pp. 152–164 (2005)
go back to reference Fleischman, M., Kwon, N., Hovy, E.: Maximum entropy models for FrameNet classification. In: Conference on Empirical Methods in Natural Language Processing, Sapporo, pp. 49–56 (2003) Fleischman, M., Kwon, N., Hovy, E.: Maximum entropy models for FrameNet classification. In: Conference on Empirical Methods in Natural Language Processing, Sapporo, pp. 49–56 (2003)
go back to reference Erk, K.: Frame assignment as word sense disambiguation. In: proceedings of the 6th International Workshop on Computational Semantic (IWCS-6) (2005) Erk, K.: Frame assignment as word sense disambiguation. In: proceedings of the 6th International Workshop on Computational Semantic (IWCS-6) (2005)
go back to reference Johansson, R., Nugues, P.: LTH: semantic structure extraction using nonprojective dependency trees. In: Proceedings of SemEval, vol. 13, no. 4, pp. 227–230 (2007) Johansson, R., Nugues, P.: LTH: semantic structure extraction using nonprojective dependency trees. In: Proceedings of SemEval, vol. 13, no. 4, pp. 227–230 (2007)
go back to reference Bejan, C.A., Hathaway, C.: UTD-SRL: a pipeline architecture for extracting frame semantic structures. In: International Workshop on Semantic Evaluations, vol. 8764, pp. 460–463 (2007) Bejan, C.A., Hathaway, C.: UTD-SRL: a pipeline architecture for extracting frame semantic structures. In: International Workshop on Semantic Evaluations, vol. 8764, pp. 460–463 (2007)
go back to reference Hermann, K.M., Das, D., Weston, J., Ganchev, K.: Semantic frame identification with distributed word representations. In: Meeting of the Association for Computational Linguistics, vol. 1, pp. 1448–1458 (2014) Hermann, K.M., Das, D., Weston, J., Ganchev, K.: Semantic frame identification with distributed word representations. In: Meeting of the Association for Computational Linguistics, vol. 1, pp. 1448–1458 (2014)
go back to reference Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. arXiv: 1409.0473 (2014) Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. arXiv: 1409.0473 (2014)
go back to reference Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12(1), 2493–2537 (2011)MATH Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12(1), 2493–2537 (2011)MATH
go back to reference Yin, W.P., Schütze, H., Xiang, B., Zhou, B.W.: ABCNN: attention-based convolutional neural network for modeling sentence pairs. Action Editor: Brian Roark, vol. 4, pp. 259–272 (2016) Yin, W.P., Schütze, H., Xiang, B., Zhou, B.W.: ABCNN: attention-based convolutional neural network for modeling sentence pairs. Action Editor: Brian Roark, vol. 4, pp. 259–272 (2016)
go back to reference Collobert R., Weston J.: A unified architecture for natural language processing: deep neural networks with multitask learning. International Conference on Machine Learning, pp.160–167. ACM (2008) Collobert R., Weston J.: A unified architecture for natural language processing: deep neural networks with multitask learning. International Conference on Machine Learning, pp.160–167. ACM (2008)
go back to reference Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I., Salakhutdinov, R. R.: Improving neural networks by preventing co-adaptation of feature detectors. Comput. Sci. vol. 3(4), 212–223 (2012) Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I., Salakhutdinov, R. R.: Improving neural networks by preventing co-adaptation of feature detectors. Comput. Sci. vol. 3(4), 212–223 (2012)
go back to reference Zhang, Y., Wallace, B.: A sensitivity analysis of (and practitioners’ guide to) convolutional neural networks for sentence classification. In: Proceedings of the 8th International Joint Conference on Natural Language Processing, Taipei, Taiwan, pp. 253–263 (2017) Zhang, Y., Wallace, B.: A sensitivity analysis of (and practitioners’ guide to) convolutional neural networks for sentence classification. In: Proceedings of the 8th International Joint Conference on Natural Language Processing, Taipei, Taiwan, pp. 253–263 (2017)
go back to reference Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, vol. 26, pp. 3111–3119 (2013) Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, vol. 26, pp. 3111–3119 (2013)
go back to reference Zhou, P., et al.: Attention-based bidirectional long short-term memory networks for relation classification. In: Meeting of the Association for Computational Linguistics, pp. 207–212 (2016) Zhou, P., et al.: Attention-based bidirectional long short-term memory networks for relation classification. In: Meeting of the Association for Computational Linguistics, pp. 207–212 (2016)
go back to reference Li, R., Liu, H.J., Li, S.H.: Chinese frame identification using T-CRF model. In: International Conference on Computational Linguistics, COLING 2010, pp. 674–682 (2010) Li, R., Liu, H.J., Li, S.H.: Chinese frame identification using T-CRF model. In: International Conference on Computational Linguistics, COLING 2010, pp. 674–682 (2010)
go back to reference Zhao, H.Y., Li, R., Zhang, S., Zhang, L.W.: Chinese frame identification with deep neural network. J. Chin. Inf. Process. 30(6), 75–83 (2016) Zhao, H.Y., Li, R., Zhang, S., Zhang, L.W.: Chinese frame identification with deep neural network. J. Chin. Inf. Process. 30(6), 75–83 (2016)
Metadata
Title
TSABCNN: Two-Stage Attention-Based Convolutional Neural Network for Frame Identification
Authors
Hongyan Zhao
Ru Li
Fei Duan
Zepeng Wu
Shaoru Guo
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-030-01716-3_24

Premium Partner