Skip to main content

2019 | OriginalPaper | Buchkapitel

A Latent Variable CRF Model for Labeling Prediction

verfasst von : Jerry Chun-Wei Lin, Jimmy Ming-Tai Wu, Yinan Shao, Matin Pirouz, Binbin Zhang

Erschienen in: Multidisciplinary Social Networks Research

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

A latent variable conditional random fields (CRF) model is proposed to improve sequence labeling, which utilizes the BIO encoding schema as latent variable to capture the latent structure of hidden variables and observation data. The proposed model automatically selects the best encoding schema for each given input sequence. Through experimentation, it is demonstrated that the proposed model unveils the latent variable while performing robustly on sequence-labeling prediction tasks.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Baum, L.E.: An inequality and associated maximization technique in statistical estimation of probabilistic functions of a markov process. Inequalities 3, 1–8 (1972) Baum, L.E.: An inequality and associated maximization technique in statistical estimation of probabilistic functions of a markov process. Inequalities 3, 1–8 (1972)
2.
Zurück zum Zitat Baum, L.E., Eagon, J.A.: An inequality with applications to statistical estimation for probabilistic functions of markov processes and to a model for ecology. Bull. Am. Math. Soc. 37(3), 360–363 (1967)MathSciNetCrossRef Baum, L.E., Eagon, J.A.: An inequality with applications to statistical estimation for probabilistic functions of markov processes and to a model for ecology. Bull. Am. Math. Soc. 37(3), 360–363 (1967)MathSciNetCrossRef
3.
Zurück zum Zitat Baum, L.E., Petrie, T.: Statistical inference for probabilistic functions of finite state Markov chains. Ann. Math. Stat. 37(6), 1554–1563 (1966)MathSciNetCrossRef Baum, L.E., Petrie, T.: Statistical inference for probabilistic functions of finite state Markov chains. Ann. Math. Stat. 37(6), 1554–1563 (1966)MathSciNetCrossRef
4.
Zurück zum Zitat Berger, A.L., Pietra, S.A.D., Pietra, V.J.D.: A maximum entropy approach to natural language processing. Comput. Linguist. 22(1), 39–71 (1996) Berger, A.L., Pietra, S.A.D., Pietra, V.J.D.: A maximum entropy approach to natural language processing. Comput. Linguist. 22(1), 39–71 (1996)
5.
Zurück zum Zitat Cuong, N.V., Ye, N., Lee, W.S., Chieu, H.L.: Conditional random field with high-order dependencies for sequence labeling and segmentation. J. Mach. Learn. Res. 15(1), 981–1009 (2014)MathSciNetMATH Cuong, N.V., Ye, N., Lee, W.S., Chieu, H.L.: Conditional random field with high-order dependencies for sequence labeling and segmentation. J. Mach. Learn. Res. 15(1), 981–1009 (2014)MathSciNetMATH
6.
Zurück zum Zitat Dai, H., Lai, P., Chang, Y., Tsa, R.T.: Enhancing of chemical compound and drug name recognition using representative tag scheme and fine-grained tokenization. J. Cheminformatics 7(1), 1–10 (2015)CrossRef Dai, H., Lai, P., Chang, Y., Tsa, R.T.: Enhancing of chemical compound and drug name recognition using representative tag scheme and fine-grained tokenization. J. Cheminformatics 7(1), 1–10 (2015)CrossRef
7.
Zurück zum Zitat Fine, S., Singer, Y., Tishby, N.: The hierarchical hidden Markov model: analysis and applications. Mach. Learn. 32(1), 41–62 (1998)CrossRef Fine, S., Singer, Y., Tishby, N.: The hierarchical hidden Markov model: analysis and applications. Mach. Learn. 32(1), 41–62 (1998)CrossRef
8.
Zurück zum Zitat Guo, S., Chang, M.W., Kiciman, E.: To link or not to link? a study on end-to-end tweet entity linking. In: The Conference of the North American Chapter of the Association of Computational Linguistics, pp. 1020–1030 (2013) Guo, S., Chang, M.W., Kiciman, E.: To link or not to link? a study on end-to-end tweet entity linking. In: The Conference of the North American Chapter of the Association of Computational Linguistics, pp. 1020–1030 (2013)
9.
Zurück zum Zitat Gupta , P., Andrassy, B.: Table filling multi-task recurrent neural network for joint entity and relation extraction. In: The International Conference on Computational Linguistics, pp. 2537–2547 (2016) Gupta , P., Andrassy, B.: Table filling multi-task recurrent neural network for joint entity and relation extraction. In: The International Conference on Computational Linguistics, pp. 2537–2547 (2016)
11.
Zurück zum Zitat Lafferty, J.D., Mccallum, A., Pereira, F.C.N.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: The Eighteenth International Conference on Machine Learning, pp. 282–289 (2001) Lafferty, J.D., Mccallum, A., Pereira, F.C.N.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: The Eighteenth International Conference on Machine Learning, pp. 282–289 (2001)
12.
Zurück zum Zitat Liu, Y., Che, W., Guo, J., Bin, Q., Liu, T.: Exploring segment representations for neural segmentation models. In: The International Joint Conference on Artificial Intelligence, pp. 2880–288 (2016) Liu, Y., Che, W., Guo, J., Bin, Q., Liu, T.: Exploring segment representations for neural segmentation models. In: The International Joint Conference on Artificial Intelligence, pp. 2880–288 (2016)
13.
Zurück zum Zitat Lu, J., Venugopal, D., Gogate, V., Ng, V.: Joint inference for event coreference resolution. In: The International Conference on Computational Linguistics, pp. 3264–3275 (2016) Lu, J., Venugopal, D., Gogate, V., Ng, V.: Joint inference for event coreference resolution. In: The International Conference on Computational Linguistics, pp. 3264–3275 (2016)
14.
Zurück zum Zitat Ma, X., Hovy, E.: End-to-end sequence labeling via bi-directional LSTM-CNNS-CRF. In: The Annual Meeting of the Association for Computational Linguistics, pp. 1064–1074 (2016) Ma, X., Hovy, E.: End-to-end sequence labeling via bi-directional LSTM-CNNS-CRF. In: The Annual Meeting of the Association for Computational Linguistics, pp. 1064–1074 (2016)
15.
Zurück zum Zitat McCallum, A., Freitag, D., Pereira, F.C.N.: Maximum entropy Markov models for information extraction and segmentation. In: The International Conference on Machine Learning, pp. 591–598 (1999) McCallum, A., Freitag, D., Pereira, F.C.N.: Maximum entropy Markov models for information extraction and segmentation. In: The International Conference on Machine Learning, pp. 591–598 (1999)
16.
Zurück zum Zitat Mintz, M., Bills, R.S.S., Jurafsky, D.: Distant supervision for relation extraction without labeled data. In: The Annual Meeting of the Association for Computational Linguistics, pp. 1003–1011 (2009) Mintz, M., Bills, R.S.S., Jurafsky, D.: Distant supervision for relation extraction without labeled data. In: The Annual Meeting of the Association for Computational Linguistics, pp. 1003–1011 (2009)
17.
Zurück zum Zitat Muis, A.O., Lu, W.: Weak semi-Markov CRFS for noun phrase chunking in informal text. In: The North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 714–719 (2016) Muis, A.O., Lu, W.: Weak semi-Markov CRFS for noun phrase chunking in informal text. In: The North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 714–719 (2016)
18.
Zurück zum Zitat Nguyen, V.C., Lee, W.S., Ye, N., Hai, L.C.: Semi-Markov conditional random field with high-order feature, pp. 1–4 (2011) Nguyen, V.C., Lee, W.S., Ye, N., Hai, L.C.: Semi-Markov conditional random field with high-order feature, pp. 1–4 (2011)
19.
Zurück zum Zitat Okanohara, D., Miyao, Y., Tsuruoka, Y., Tisuji, J.: Improving the scalability of semi-Markov conditional random fields for named entity recognition. In: The Annual Meeting of the Association for Computational Linguistics, pp. 465–472 (2006) Okanohara, D., Miyao, Y., Tsuruoka, Y., Tisuji, J.: Improving the scalability of semi-Markov conditional random fields for named entity recognition. In: The Annual Meeting of the Association for Computational Linguistics, pp. 465–472 (2006)
20.
Zurück zum Zitat Petrov, S., Dan, K.: Sparse multi-scale grammars for discriminative latent variable parsing. In: The Conference on Empirical Methods in Natural Language Processing, pp. 867–876 (2008) Petrov, S., Dan, K.: Sparse multi-scale grammars for discriminative latent variable parsing. In: The Conference on Empirical Methods in Natural Language Processing, pp. 867–876 (2008)
21.
Zurück zum Zitat Ratinov, L., Roth, D.: Design challenges and misconceptions in named entity recognition. In: The Conference on Computational Natural Language Learning, pp. 147–155 (2009) Ratinov, L., Roth, D.: Design challenges and misconceptions in named entity recognition. In: The Conference on Computational Natural Language Learning, pp. 147–155 (2009)
22.
Zurück zum Zitat Ratnaparkhi, A.: A maximum entropy model for part-of-speech tagging. In: The Conference on Empirical Methods in Natural Language Processing, pp. 133–142 (1996) Ratnaparkhi, A.: A maximum entropy model for part-of-speech tagging. In: The Conference on Empirical Methods in Natural Language Processing, pp. 133–142 (1996)
25.
Zurück zum Zitat Sarawagi, S., Cohen, W.W.: Semi-Markov conditional random fields for information extraction. In: The Neural Information Processing Systems, pp. 1185–1192 (2004) Sarawagi, S., Cohen, W.W.: Semi-Markov conditional random fields for information extraction. In: The Neural Information Processing Systems, pp. 1185–1192 (2004)
26.
Zurück zum Zitat Sun, X., Huang, D., Ren, F.: Detecting new words from chinese text using latent semi-CRF models. IEICE Trans. Inform. Syst. 93(6), 1386–1393 (2010)CrossRef Sun, X., Huang, D., Ren, F.: Detecting new words from chinese text using latent semi-CRF models. IEICE Trans. Inform. Syst. 93(6), 1386–1393 (2010)CrossRef
27.
Zurück zum Zitat Sun, X., Nan, X.: Chinese base phrases chunking based on latent semi-CRF mode. In: The International Conference on Natural Language Processing and Knowledge Engineering, pp. 1–7 (2010) Sun, X., Nan, X.: Chinese base phrases chunking based on latent semi-CRF mode. In: The International Conference on Natural Language Processing and Knowledge Engineering, pp. 1–7 (2010)
28.
Zurück zum Zitat Tseng, H., Chang, P., Andrew, G., Jurafsky, D., Manning, C.: Sequential labeling with latent variables. In: The Workshop on Chinese Language Processing, pp. 168–171 (2015) Tseng, H., Chang, P., Andrew, G., Jurafsky, D., Manning, C.: Sequential labeling with latent variables. In: The Workshop on Chinese Language Processing, pp. 168–171 (2015)
29.
Zurück zum Zitat Zhang, H.P., Liu, Q., Cheng, X.Q., Zhang, H., Yu, H.K.: Chinese lexical analysis using hierarchical hidden Markov model. In: The Workshop on Chinese Language Processing, pp. 63–70 (2003) Zhang, H.P., Liu, Q., Cheng, X.Q., Zhang, H., Yu, H.K.: Chinese lexical analysis using hierarchical hidden Markov model. In: The Workshop on Chinese Language Processing, pp. 63–70 (2003)
30.
Zurück zum Zitat Zhao, H., Huang, C.N., Li, M., Kudo, T.: An improved Chinese word segmentation system with conditional random field. In: The Workshop on Chinese Language Processing, pp. 162–165 (2006) Zhao, H., Huang, C.N., Li, M., Kudo, T.: An improved Chinese word segmentation system with conditional random field. In: The Workshop on Chinese Language Processing, pp. 162–165 (2006)
Metadaten
Titel
A Latent Variable CRF Model for Labeling Prediction
verfasst von
Jerry Chun-Wei Lin
Jimmy Ming-Tai Wu
Yinan Shao
Matin Pirouz
Binbin Zhang
Copyright-Jahr
2019
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-15-1758-7_6