nach oben

European Actuarial Journal

Erschienen in:

29.08.2021 | Original Research Paper

Loss amount prediction from textual data using a double GLM with shrinkage and selection

verfasst von: Scott Manski, Kaixu Yang, Gee Y. Lee, Tapabrata Maiti

Erschienen in: European Actuarial Journal | Ausgabe 2/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

The Gamma model has been widely utilized in a variety of fields, including actuarial science, where it has important applications in insurance loss predictions. Meanwhile, high dimensional models and their applications have become more common in the statistics literature in recent years. The availability of such high dimensional models have allowed the analysis of non-traditional data, including those containing textual descriptions of the response. In the models used in such applications, the dispersion may be designed to be related to a set of covariates, as opposed to being a single fixed value for the entire population. Following this approach, we incorporate a group Lasso type penalty in both the dispersion and the mean parameterization for a Gamma model, and illustrate its use in a predictive analytics application in actuarial science. In particular, we apply the method to an insurance claim prediction problem involving textual data analysis methods. Simulations are conducted to illustrate the variable selection and model fitting performance of our method.

Vorheriger Artikel A nonparametric sequential learning procedure for estimating the pure premium

Nächster Artikel The effect of risk constraints on the optimal insurance policy

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Tibshirani R (1996) Regression shrinkage and selection via the lasso. Stat Comput 58:267–288MathSciNetMATH

Yuan M, Lin Y (2006) Model selection and estimation in regression with grouped variables. J R Stat Soc Ser B (Methodol) 68:49–67MathSciNetCrossRefMATH

Chan F, Chan L, Mead E (1982) Properties and modifications of Whittaker-Henderson graduation. Scand Actuar J 1982:56–61MathSciNetCrossRefMATH

Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33:1–22CrossRef

Yang Y, Zou H (2015) A fast unified algorithm for solving group-lasso penalize learning problems. Stat Comput 25:1129–1141MathSciNetCrossRefMATH

Qian W, Yang Y, Zou H (2016) Tweedie’s compound Poisson model with grouped elastic net. J Comput Graph Stat 25:606–625MathSciNetCrossRef

Frees EW, Lee G (2015) Rating endorsements using generalized linear models. Variance 10:51–74

Yin C, Lin X (2016) Efficient estimation of Erlang mixtures using iSCAD penalty with insurance application. ASTIN Bull J IAA 46(3):779–799MathSciNetCrossRefMATH

Jeong H, Chang H, Valdez EA (2021) A non-convex regularization approach for stable estimation of loss development factors. Scand Actuar J. https://doi.org/10.1080/03461238.2021.1882550MathSciNetCrossRefMATH

10.

Tzougas G, Karlis D (2020) An EM algorithm for fitting a new class of mixed exponential regression models with varying dispersion. ASTIN Bull J IAA 50(2):555–583MathSciNetCrossRefMATH

11.

Tzougas G, Jeong H (2021) EM estimation for the exponential generalized inverse Gaussian regression model with varying dispersion and shape for modelling the aggregate claim amount. Risks 9(1):19. https://doi.org/10.3390/risks9010019CrossRef

12.

Devriendt S, Antonio K, Reynkens T, Verbelen R (2020) Sparse regression with multi-type regularized feature modeling. Insur Math Econ 96:248–261MathSciNetCrossRefMATH

13.

Lee GY, Manski S, Maiti T (2020) Actuarial applications of word embedding models. ASTIN Bull. 50(1):1–24. https://doi.org/10.1017/asb.2019.28MathSciNetCrossRefMATH

14.

Smyth GK (1989) Generalized linear models with varying dispersion. J R Stat Soc Ser B (Methodol) 51:47–60MathSciNet

15.

Smyth GK, Jørgensen B (2002) Fitting Tweedie’s compound Poisson model to insurance claims data: dispersion modelling. ASTIN Bull 32:143–157MathSciNetCrossRefMATH

16.

Shi P (2016) Insurance ratemaking using a copula-based multivariate Tweedie model. Scand Actuar J 2016(3):198–215MathSciNetCrossRefMATH

17.

Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. Adv Neural Inf Process Syst 26:3111–3119

18.

Pennington J, Socher R, Manning CD (2014) GloVe: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), October 25–29, 2014, Doha, Qatar, pp 1532–1543

19.

Wood SN (2017) Generalized additive models: an introduction with R, 2nd edn. CRC Press, Boca RatonCrossRefMATH

Titel: Loss amount prediction from textual data using a double GLM with shrinkage and selection
verfasst von: Scott Manski
Kaixu Yang
Gee Y. Lee
Tapabrata Maiti
Publikationsdatum: 29.08.2021
Verlag: Springer Berlin Heidelberg
Erschienen in: European Actuarial Journal / Ausgabe 2/2022
Print ISSN: 2190-9733
Elektronische ISSN: 2190-9741
DOI: https://doi.org/10.1007/s13385-021-00294-x

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Wirtschaft"

Weitere Artikel der Ausgabe 2/2022

The slowdown in mortality improvement rates 2011–2017: a multi-country analysis

The only constant is change: opportunities and challenges for actuaries in a changing world

The effect of risk constraints on the optimal insurance policy

Discussion on ‘A long-term care multi-state Markov model revisited: a Markov chain Monte Carlo approach’ (Fleichmann et al.)

Optimal dynamic reinsurance with worst-case default of the reinsurer

Model transparency and interpretability: survey and application to the insurance industry