Skip to main content
Top
Published in: Artificial Intelligence and Law 4/2020

25-01-2020 | Original Research

Encoded summarization: summarizing documents into continuous vector space for legal case retrieval

Authors: Vu Tran, Minh Le Nguyen, Satoshi Tojo, Ken Satoh

Published in: Artificial Intelligence and Law | Issue 4/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We present our method for tackling a legal case retrieval task by introducing our method of encoding documents by summarizing them into continuous vector space via our phrase scoring framework utilizing deep neural networks. On the other hand, we explore the benefits from combining lexical features and latent features generated with neural networks. Our experiments show that lexical features and latent features generated with neural networks complement each other to improve the retrieval system performance. Furthermore, our experimental results suggest the importance of case summarization in different aspects: using provided summaries and performing encoded summarization. Our approach achieved F1 of 65.6% and 57.6% on the experimental datasets of legal case retrieval tasks.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Bench-Capon T, Araszkiewicz M, Ashley K, Atkinson K, Bex F, Borges F, Bourcier D, Bourgine P, Conrad JG, Francesconi E et al (2012) A history of ai and law in 50 papers: 25 years of the international conference on ai and law. Artif Intell Law 20(3):215–319CrossRef Bench-Capon T, Araszkiewicz M, Ashley K, Atkinson K, Bex F, Borges F, Bourcier D, Bourgine P, Conrad JG, Francesconi E et al (2012) A history of ai and law in 50 papers: 25 years of the international conference on ai and law. Artif Intell Law 20(3):215–319CrossRef
go back to reference Chen Q, Zhu X, Ling ZH, Wei S, Jiang H, Inkpen D (2017) Enhanced LSTM for natural language inference. In: Proceedings of the 55th annual meeting of the association for computational linguistics (volume 1: long papers), association for computational linguistics, Vancouver, Canada, pp 1657–1668 Chen Q, Zhu X, Ling ZH, Wei S, Jiang H, Inkpen D (2017) Enhanced LSTM for natural language inference. In: Proceedings of the 55th annual meeting of the association for computational linguistics (volume 1: long papers), association for computational linguistics, Vancouver, Canada, pp 1657–1668
go back to reference Duchi J, Hazan E, Singer Y (2011) Adaptive subgradient methods for online learning and stochastic optimization. J Mach Learn Res 12(Jul):2121–2159MathSciNetMATH Duchi J, Hazan E, Singer Y (2011) Adaptive subgradient methods for online learning and stochastic optimization. J Mach Learn Res 12(Jul):2121–2159MathSciNetMATH
go back to reference Galgani F, Compton P, Hoffmann A (2012a) Citation based summarisation of legal texts. In: Pacific rim international conference on artificial intelligence, Springer, New York, pp 40–52 Galgani F, Compton P, Hoffmann A (2012a) Citation based summarisation of legal texts. In: Pacific rim international conference on artificial intelligence, Springer, New York, pp 40–52
go back to reference Galgani F, Compton P, Hoffmann A (2012b) Towards automatic generation of catchphrases for legal case reports. In: Proceedings of the 13th international conference on computational linguistics and intelligent text processing—volume part II, CICLing’12, Springer, Berlin, pp 414–425, https://doi.org/10.1007/978-3-642-28601-8_35 Galgani F, Compton P, Hoffmann A (2012b) Towards automatic generation of catchphrases for legal case reports. In: Proceedings of the 13th international conference on computational linguistics and intelligent text processing—volume part II, CICLing’12, Springer, Berlin, pp 414–425, https://​doi.​org/​10.​1007/​978-3-642-28601-8_​35
go back to reference Getman AP, Karasiuk VV (2014) A crowdsourcing approach to building a legal ontology from text. Artif Intell Law 22(3):313–335CrossRef Getman AP, Karasiuk VV (2014) A crowdsourcing approach to building a legal ontology from text. Artif Intell Law 22(3):313–335CrossRef
go back to reference Hachey B, Grover C (2004) A rhetorical status classifier for legal text summarisation. Text Summarization Branches Out Hachey B, Grover C (2004) A rhetorical status classifier for legal text summarisation. Text Summarization Branches Out
go back to reference Jackson P, Al-Kofahi K, Tyrrell A, Vachher A (2003) Information extraction from case law and retrieval of prior cases. Artif Intell 150(1–2):239–290CrossRef Jackson P, Al-Kofahi K, Tyrrell A, Vachher A (2003) Information extraction from case law and retrieval of prior cases. Artif Intell 150(1–2):239–290CrossRef
go back to reference Johnson R, Zhang T (2015) Effective use of word order for text categorization with convolutional neural networks. In: Proceedings of the 2015 conference of the North American chapter of the association for computational linguistics: human language technologies, association for computational linguistics, Denver, Colorado, pp 103–112 Johnson R, Zhang T (2015) Effective use of word order for text categorization with convolutional neural networks. In: Proceedings of the 2015 conference of the North American chapter of the association for computational linguistics: human language technologies, association for computational linguistics, Denver, Colorado, pp 103–112
go back to reference Kalchbrenner N, Grefenstette E, Blunsom P (2014) A convolutional neural network for modelling sentences. In: Proceedings of the 52nd annual meeting of the association for computational linguistics (volume 1: long papers), association for computational linguistics, Baltimore, Maryland, pp 655–665 Kalchbrenner N, Grefenstette E, Blunsom P (2014) A convolutional neural network for modelling sentences. In: Proceedings of the 52nd annual meeting of the association for computational linguistics (volume 1: long papers), association for computational linguistics, Baltimore, Maryland, pp 655–665
go back to reference Kano Y, Kim MY, Yoshioka M, Lu Y, Rabelo J, Kiyota N, Goebel R, Satoh K (2018) Coliee-2018: evaluation of the competition on legal information extraction and entailment. In: Twelfth international workshop on Juris-informatics (JURISIN), COLIEE Kano Y, Kim MY, Yoshioka M, Lu Y, Rabelo J, Kiyota N, Goebel R, Satoh K (2018) Coliee-2018: evaluation of the competition on legal information extraction and entailment. In: Twelfth international workshop on Juris-informatics (JURISIN), COLIEE
go back to reference Kim MY, Xu Y, Goebel R (2013) Summarization of legal texts with high cohesion and automatic compression rate. In: Motomura Y, Butler A, Bekki D (eds) New frontiers in artificial intelligence. Springer, Berlin, pp 190–204CrossRef Kim MY, Xu Y, Goebel R (2013) Summarization of legal texts with high cohesion and automatic compression rate. In: Motomura Y, Butler A, Bekki D (eds) New frontiers in artificial intelligence. Springer, Berlin, pp 190–204CrossRef
go back to reference Kim Y (2014) Convolutional neural networks for sentence classification. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), association for computational linguistics, Doha, Qatar, pp 1746–1751 Kim Y (2014) Convolutional neural networks for sentence classification. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), association for computational linguistics, Doha, Qatar, pp 1746–1751
go back to reference Le Q, Mikolov T (2014) Distributed representations of sentences and documents. In: Proceedings of the 31st international conference on machine learning (ICML-14), pp 1188–1196 Le Q, Mikolov T (2014) Distributed representations of sentences and documents. In: Proceedings of the 31st international conference on machine learning (ICML-14), pp 1188–1196
go back to reference Levy O, Goldberg Y (2014) Dependency-based word embeddings. In: ACL (2), pp 302–308 Levy O, Goldberg Y (2014) Dependency-based word embeddings. In: ACL (2), pp 302–308
go back to reference Mandal A, Chaki R, Saha S, Ghosh K, Pal A, Ghosh S (2017a) Measuring similarity among legal court case documents. In: Proceedings of the 10th annual ACM India compute conference on ZZZ, ACM, pp 1–9 Mandal A, Chaki R, Saha S, Ghosh K, Pal A, Ghosh S (2017a) Measuring similarity among legal court case documents. In: Proceedings of the 10th annual ACM India compute conference on ZZZ, ACM, pp 1–9
go back to reference Mandal A, Ghosh K, Pal A, Ghosh S (2017b) Automatic catchphrase identification from legal court case documents. In: Proceedings of the 2017 ACM on conference on information and knowledge management, CIKM ’17, ACM, New York, pp 2187–2190, https://doi.org/10.1145/3132847.3133102 Mandal A, Ghosh K, Pal A, Ghosh S (2017b) Automatic catchphrase identification from legal court case documents. In: Proceedings of the 2017 ACM on conference on information and knowledge management, CIKM ’17, ACM, New York, pp 2187–2190, https://​doi.​org/​10.​1145/​3132847.​3133102
go back to reference Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp 3111–3119 Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp 3111–3119
go back to reference Olsson LT (1999) Guide to uniform production of judgments. Australian Institute of Judicial Administration Olsson LT (1999) Guide to uniform production of judgments. Australian Institute of Judicial Administration
go back to reference Pennington J, Socher R, Manning C (2014) Glove: Global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1532–1543 Pennington J, Socher R, Manning C (2014) Glove: Global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1532–1543
go back to reference Severyn A, Moschitti A (2015) Learning to rank short text pairs with convolutional deep neural networks. In: Proceedings of the 38th International ACM SIGIR conference on research and development in information retrieval, ACM, pp 373–382 Severyn A, Moschitti A (2015) Learning to rank short text pairs with convolutional deep neural networks. In: Proceedings of the 38th International ACM SIGIR conference on research and development in information retrieval, ACM, pp 373–382
go back to reference Tran VD, Nguyen ML, Satoh K (2018) Automatic catchphrase extraction from legal case documents via scoring using deep neural networks. In: Workshop on MIning and REasoning with legal texts Tran VD, Nguyen ML, Satoh K (2018) Automatic catchphrase extraction from legal case documents via scoring using deep neural networks. In: Workshop on MIning and REasoning with legal texts
go back to reference Wyner A (2008) An ontology in owl for legal case-based reasoning. Artif Intell Law 16(4):361CrossRef Wyner A (2008) An ontology in owl for legal case-based reasoning. Artif Intell Law 16(4):361CrossRef
go back to reference Wyner A, Hoekstra R (2012) A legal case owl ontology with an instantiation of popov v. hayashi. Artif Intell Law 20(1):83–107CrossRef Wyner A, Hoekstra R (2012) A legal case owl ontology with an instantiation of popov v. hayashi. Artif Intell Law 20(1):83–107CrossRef
go back to reference Zeng Y, Wang R, Zeleznikow J, Kemp E (2005) Knowledge representation for the intelligent legal case retrieval. In: Khosla R, Howlett RJ, Jain LC (eds) Knowledge-based intelligent information and engineering systems. Springer, Berlin, pp 339–345CrossRef Zeng Y, Wang R, Zeleznikow J, Kemp E (2005) Knowledge representation for the intelligent legal case retrieval. In: Khosla R, Howlett RJ, Jain LC (eds) Knowledge-based intelligent information and engineering systems. Springer, Berlin, pp 339–345CrossRef
Metadata
Title
Encoded summarization: summarizing documents into continuous vector space for legal case retrieval
Authors
Vu Tran
Minh Le Nguyen
Satoshi Tojo
Ken Satoh
Publication date
25-01-2020
Publisher
Springer Netherlands
Published in
Artificial Intelligence and Law / Issue 4/2020
Print ISSN: 0924-8463
Electronic ISSN: 1572-8382
DOI
https://doi.org/10.1007/s10506-020-09262-4

Premium Partner