Skip to main content
Erschienen in: Soft Computing 21/2020

28.04.2020 | Methodologies and Application

LegalCap: a model for complex case discrimination based on capsule neural network

verfasst von: Dunlu Peng, Qiankun Wu

Erschienen in: Soft Computing | Ausgabe 21/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In recent years, artificial intelligence, especially deep learning techniques, has provided an excellent theoretical and technical basis for the development of intelligent court. Given that case description usually contains multiple charges, charge prediction can be regarded as a task of multi-label text classification. We utilize the gated recurrent unit (GRU), a variant of recurrent neural network (RNN), to extract the chronological features of sequence. Due to the lack of retaining the positional relationship of entities when using RNN to extract features from text, a fraction of information will be lost in the context of multi-label classification. To address the drawbacks aforementioned, capsule neural network is utilized to extract the positional relation. Considering the basic and positional feature of instance, a composite model LegalCap, which combines capsule neural network with GRU, is proposed to model the complicated cases. Extensive experiments prove that the proposed model outperforms the selected baselines in the task of charge prediction. To overcome the unsatisfactory performance of directly conducting deep learning in charge prediction on the minority cases, we relieve the impact of training case imbalance by means of re-sampling. Experimental results demonstrate that after modifying the data distribution, the LegalCap model has a significant improvement on models’ bias on labels, i.e., better predictions on minority cases.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Agarwal B, Ramampiaro H, Langseth H, Ruocco M (2018) A deep network model for paraphrase detection in short text messages. Inf Process Manag 54(6):922–937CrossRef Agarwal B, Ramampiaro H, Langseth H, Ruocco M (2018) A deep network model for paraphrase detection in short text messages. Inf Process Manag 54(6):922–937CrossRef
Zurück zum Zitat Bengio Y, Ducharme R, Vincent P, Janvin C (2003) A neural probabilistic language model. J Machi Learn Res 3:1137–1155MATH Bengio Y, Ducharme R, Vincent P, Janvin C (2003) A neural probabilistic language model. J Machi Learn Res 3:1137–1155MATH
Zurück zum Zitat Chung J, Gülçehre Ç, Cho K, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. CoRR arXiv:1412.3555 Chung J, Gülçehre Ç, Cho K, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. CoRR arXiv:​1412.​3555
Zurück zum Zitat Grave E, Mikolov T, Joulin A, Bojanowski P (2017) Bag of tricks for efficient text classification. In: Proceedings of the 15th conference of the European chapter of the association for computational linguistics, EACL 2017, Valencia, Spain, April 3–7, 2017, volume 2: short papers, pp 427–431 Grave E, Mikolov T, Joulin A, Bojanowski P (2017) Bag of tricks for efficient text classification. In: Proceedings of the 15th conference of the European chapter of the association for computational linguistics, EACL 2017, Valencia, Spain, April 3–7, 2017, volume 2: short papers, pp 427–431
Zurück zum Zitat Hu Z, Li X, Tu C, Liu Z, Sun M (2018) Few-shot charge prediction with discriminative legal attributes. In: Proceedings of the 27th international conference on computational linguistics, COLING 2018, Santa Fe, New Mexico, USA, August 20–26, 2018, pp 487–498 Hu Z, Li X, Tu C, Liu Z, Sun M (2018) Few-shot charge prediction with discriminative legal attributes. In: Proceedings of the 27th international conference on computational linguistics, COLING 2018, Santa Fe, New Mexico, USA, August 20–26, 2018, pp 487–498
Zurück zum Zitat Kalchbrenner N, Grefenstette E, Blunsom P (2014) A convolutional neural network for modelling sentences. In: Proceedings of the 52nd annual meeting of the association for computational linguistics, ACL 2014, June 22–27, 2014, Baltimore, MD, USA, volume 1: long papers, pp 655–665 Kalchbrenner N, Grefenstette E, Blunsom P (2014) A convolutional neural network for modelling sentences. In: Proceedings of the 52nd annual meeting of the association for computational linguistics, ACL 2014, June 22–27, 2014, Baltimore, MD, USA, volume 1: long papers, pp 655–665
Zurück zum Zitat Kastrati Z, Imran AS, Yayilgan SY (2019) The impact of deep learning on document classification using semantically rich representations. Inf Process Manag 56(5):1618–1632CrossRef Kastrati Z, Imran AS, Yayilgan SY (2019) The impact of deep learning on document classification using semantically rich representations. Inf Process Manag 56(5):1618–1632CrossRef
Zurück zum Zitat Katz DM, Bommarito MJ II, Blackman J (2017) A general approach for predicting the behavior of the Supreme Court of the United States. PloS ONE 12(4):e0174698CrossRef Katz DM, Bommarito MJ II, Blackman J (2017) A general approach for predicting the behavior of the Supreme Court of the United States. PloS ONE 12(4):e0174698CrossRef
Zurück zum Zitat Kim Y (2014) Convolutional neural networks for sentence classification. In: Proceedings of the 2014 conference on empirical methods in natural language processing, EMNLP 2014, October 25–29, 2014, Doha, Qatar. A meeting of SIGDAT, a special interest group of the ACL, pp 1746–1751 Kim Y (2014) Convolutional neural networks for sentence classification. In: Proceedings of the 2014 conference on empirical methods in natural language processing, EMNLP 2014, October 25–29, 2014, Doha, Qatar. A meeting of SIGDAT, a special interest group of the ACL, pp 1746–1751
Zurück zum Zitat Kingma DP, Ba J (2015) Adam: a method for stochastic optimization. In: 3rd International conference on learning representations, ICLR 2015, San Diego, CA, USA, May 7–9, 2015, conference track proceedings Kingma DP, Ba J (2015) Adam: a method for stochastic optimization. In: 3rd International conference on learning representations, ICLR 2015, San Diego, CA, USA, May 7–9, 2015, conference track proceedings
Zurück zum Zitat Ko Y (2017) How to use negative class information for naive Bayes classification. Inf Process Manag 53(6):1255–1268CrossRef Ko Y (2017) How to use negative class information for naive Bayes classification. Inf Process Manag 53(6):1255–1268CrossRef
Zurück zum Zitat Lee JY, Dernoncourt F (2016) Sequential short-text classification with recurrent and convolutional neural networks. In: NAACL HLT 2016, the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies, San Diego California, USA, June 12–17, 2016, pp 515–520 Lee JY, Dernoncourt F (2016) Sequential short-text classification with recurrent and convolutional neural networks. In: NAACL HLT 2016, the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies, San Diego California, USA, June 12–17, 2016, pp 515–520
Zurück zum Zitat Liu C, Liao T (2005) Classifying criminal charges in chinese for web-based legal services. In: Web technologies research and development—APWeb 2005, 7th Asia-Pacific web conference, Shanghai, China, March 29—April 1, 2005, Proceedings, pp 64–75 Liu C, Liao T (2005) Classifying criminal charges in chinese for web-based legal services. In: Web technologies research and development—APWeb 2005, 7th Asia-Pacific web conference, Shanghai, China, March 29—April 1, 2005, Proceedings, pp 64–75
Zurück zum Zitat Liu C, Chang C, Ho J (2004) Case instance generation and refinement for case-based criminal summary judgments in Chinese. J Inf Sci Eng 20(4):783–800 Liu C, Chang C, Ho J (2004) Case instance generation and refinement for case-based criminal summary judgments in Chinese. J Inf Sci Eng 20(4):783–800
Zurück zum Zitat Liu P, Qiu X, Huang X (2016) Recurrent neural network for text classification with multi-task learning. In: Proceedings of the twenty-fifth international joint conference on artificial intelligence, IJCAI 2016, New York, NY, USA, 9–15 July 2016, pp 2873–2879 Liu P, Qiu X, Huang X (2016) Recurrent neural network for text classification with multi-task learning. In: Proceedings of the twenty-fifth international joint conference on artificial intelligence, IJCAI 2016, New York, NY, USA, 9–15 July 2016, pp 2873–2879
Zurück zum Zitat Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space, In: 1st International conference on learning representations, ICLR 2013, Scottsdale, Arizona, USA, May 2–4, 2013, workshop track proceedings Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space, In: 1st International conference on learning representations, ICLR 2013, Scottsdale, Arizona, USA, May 2–4, 2013, workshop track proceedings
Zurück zum Zitat Pavlinek M, Podgorelec V (2017) Text classification method based on self-training and LDA topic models. Expert Syst Appl 80:83–93CrossRef Pavlinek M, Podgorelec V (2017) Text classification method based on self-training and LDA topic models. Expert Syst Appl 80:83–93CrossRef
Zurück zum Zitat Pineda FJ (1987) Generalization of back propagation to recurrent and higher order neural networks. In: Neural information processing systems, Denver, Colorado, USA, pp 602–611 Pineda FJ (1987) Generalization of back propagation to recurrent and higher order neural networks. In: Neural information processing systems, Denver, Colorado, USA, pp 602–611
Zurück zum Zitat Qian Q, Huang M, Lei J, Zhu X (2017) Linguistically regularized LSTM for sentiment classification. In: Proceedings of the 55th annual meeting of the association for computational linguistics, ACL 2017, Vancouver, Canada, July 30–August 4, volume 1: long papers, pp 1679–1689 Qian Q, Huang M, Lei J, Zhu X (2017) Linguistically regularized LSTM for sentiment classification. In: Proceedings of the 55th annual meeting of the association for computational linguistics, ACL 2017, Vancouver, Canada, July 30–August 4, volume 1: long papers, pp 1679–1689
Zurück zum Zitat Rodríguez AS, Murillo EC (2017) Automatic parametrization of support vector machines for short texts polarity detection. CLEI Electron J 20(1):6:1–6:19 Rodríguez AS, Murillo EC (2017) Automatic parametrization of support vector machines for short texts polarity detection. CLEI Electron J 20(1):6:1–6:19
Zurück zum Zitat Sabour S, Frosst N, Hinton GE (2017) Dynamic routing between capsules. In: Advances in neural information processing systems 30: annual conference on neural information processing systems 2017, 4–9 December 2017, Long Beach, CA, USA, pp 3859–3869 Sabour S, Frosst N, Hinton GE (2017) Dynamic routing between capsules. In: Advances in neural information processing systems 30: annual conference on neural information processing systems 2017, 4–9 December 2017, Long Beach, CA, USA, pp 3859–3869
Zurück zum Zitat Srivastava N, Hinton GE, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15(1):1929–1958MathSciNetMATH Srivastava N, Hinton GE, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15(1):1929–1958MathSciNetMATH
Zurück zum Zitat Sun A, Lim E, Liu Y (2009) On strategies for imbalanced text classification using SVM: a comparative study. Decis Support Syst 48(1):191–201CrossRef Sun A, Lim E, Liu Y (2009) On strategies for imbalanced text classification using SVM: a comparative study. Decis Support Syst 48(1):191–201CrossRef
Zurück zum Zitat Waltl B, Bonczek G, Scepankova E, Matthes F (2019) Semantic types of legal norms in German laws: classification and analysis using local linear explanations. Artif Intell Law 27(1):43–71CrossRef Waltl B, Bonczek G, Scepankova E, Matthes F (2019) Semantic types of legal norms in German laws: classification and analysis using local linear explanations. Artif Intell Law 27(1):43–71CrossRef
Zurück zum Zitat Xiao C, Zhong H, Guo Z, Tu C, Liu Z, Sun M, Feng Y, Han X, Hu Z, Wang H, Xu J (2018) CAIL2018: a large-scale legal dataset for judgment prediction. CoRR arXiv:1807.02478 Xiao C, Zhong H, Guo Z, Tu C, Liu Z, Sun M, Feng Y, Han X, Hu Z, Wang H, Xu J (2018) CAIL2018: a large-scale legal dataset for judgment prediction. CoRR arXiv:​1807.​02478
Zurück zum Zitat Yang M, Zhao W, Ye J, Lei Z, Zhao Z, Zhang S (2018) Investigating capsule networks with dynamic routing for text classification. In: Proceedings of the 2018 conference on empirical methods in natural language processing, Brussels, Belgium, October 31–November 4, 2018, pp 3110–3119 Yang M, Zhao W, Ye J, Lei Z, Zhao Z, Zhang S (2018) Investigating capsule networks with dynamic routing for text classification. In: Proceedings of the 2018 conference on empirical methods in natural language processing, Brussels, Belgium, October 31–November 4, 2018, pp 3110–3119
Zurück zum Zitat Ye H, Jiang X, Luo Z, Chao W (2018) Interpretable charge predictions for criminal cases: learning to generate court views from fact descriptions. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, NAACL-HLT 2018, New Orleans, Louisiana, USA, June 1–6, 2018, volume 1 (long papers), pp 1854–1864 Ye H, Jiang X, Luo Z, Chao W (2018) Interpretable charge predictions for criminal cases: learning to generate court views from fact descriptions. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, NAACL-HLT 2018, New Orleans, Louisiana, USA, June 1–6, 2018, volume 1 (long papers), pp 1854–1864
Zurück zum Zitat Zhang X, Zhao JJ, LeCun Y (2015) Character-level convolutional networks for text classification. In: Advances in neural information processing systems 28: annual conference on neural information processing systems 2015, December 7–12, 2015, Montreal, QC, Canada, pp 649–657 Zhang X, Zhao JJ, LeCun Y (2015) Character-level convolutional networks for text classification. In: Advances in neural information processing systems 28: annual conference on neural information processing systems 2015, December 7–12, 2015, Montreal, QC, Canada, pp 649–657
Metadaten
Titel
LegalCap: a model for complex case discrimination based on capsule neural network
verfasst von
Dunlu Peng
Qiankun Wu
Publikationsdatum
28.04.2020
Verlag
Springer Berlin Heidelberg
Erschienen in
Soft Computing / Ausgabe 21/2020
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-020-04922-8

Weitere Artikel der Ausgabe 21/2020

Soft Computing 21/2020 Zur Ausgabe

Premium Partner