Skip to main content
Top

Hint

Swipe to navigate through the articles of this issue

Published in: Neural Processing Letters 5/2021

24-06-2021

Semi-Supervised Clustering for Financial Risk Analysis

Authors: Yihan Han, Tao Wang

Published in: Neural Processing Letters | Issue 5/2021

Login to get access
share
SHARE

Abstract

Many methods have been developed for financial risk analysis. In general, the conventional unsupervised approaches lack sufficient accuracy and semantics for the clustering, and the supervised approaches rely on large amount of training data for the classification. This paper explores the semi-supervised scheme for the financial data prediction, in which accurate predictions are expected with a small amount of labeled data. Due to lack of sufficient distinguishability in financial data, it is hard for the existing semi-supervised approaches to obtain satisfactory results. In order to improve the performance, we first convert the input labeled clues to the global prior probability, and propagate the’soft’ prior probability to learn the posterior probability instead of directly propagating the’hard’ labeled data. A label diffusion model is then constructed to adaptively fuse the information at feature space and label space, which makes the structures of data affinity and labeling more consistent. Experiments on two public real financial datasets validate the effectiveness of the proposed method.
Appendix
Available only for authorised users
Literature
1.
go back to reference Tay FEH, Cao LJ (2002) ε-Descending support vector machines for financial time series forecasting. Neural Process Lett 15(2):179–195 CrossRef Tay FEH, Cao LJ (2002) ε-Descending support vector machines for financial time series forecasting. Neural Process Lett 15(2):179–195 CrossRef
2.
go back to reference Arratia A, Belanche LA, Fábregues L (2019) An evaluation of equity premium prediction using multiple kernel learning with financial features. Neural Process Lett 52:117–134 CrossRef Arratia A, Belanche LA, Fábregues L (2019) An evaluation of equity premium prediction using multiple kernel learning with financial features. Neural Process Lett 52:117–134 CrossRef
3.
go back to reference MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability, University of California Press, pp. 281–297 MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability, University of California Press, pp. 281–297
4.
go back to reference Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J Roy Stat Soc Ser B Meth 39(1):1–38 MathSciNetMATH Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J Roy Stat Soc Ser B Meth 39(1):1–38 MathSciNetMATH
5.
go back to reference Abou-Rjeili A, Karypis G (2006) Multilevel algorithms for partitioning power-law graphs. In: IEEE International Parallel and Distributed Processing Symposium (IPDPS) Abou-Rjeili A, Karypis G (2006) Multilevel algorithms for partitioning power-law graphs. In: IEEE International Parallel and Distributed Processing Symposium (IPDPS)
6.
go back to reference Brockett P, Xia X, Derrig R (1998) Using Kohonen’s self organizing feature map to uncover automobile bodily injury claims fraud. J Risk Insur 65(2):245–274 CrossRef Brockett P, Xia X, Derrig R (1998) Using Kohonen’s self organizing feature map to uncover automobile bodily injury claims fraud. J Risk Insur 65(2):245–274 CrossRef
7.
go back to reference Cox E (1995) A fuzzy system for detecting anomalous behaviors in healthcare provider claims. In: Goonatilake S, Treleaven P (eds) Intelligent systems for finance and business. Wiley, New York, pp 111–134 Cox E (1995) A fuzzy system for detecting anomalous behaviors in healthcare provider claims. In: Goonatilake S, Treleaven P (eds) Intelligent systems for finance and business. Wiley, New York, pp 111–134
8.
go back to reference Moreau Y, Lerouge E, Verrelst H, Vandewalle J, Stormann C, Burge P (1999) BRUTUS: a hybrid system for fraud detection in mobile communications. In: Proceedings of the European Symposium Artificial Neural Networks, pp. 447–454 Moreau Y, Lerouge E, Verrelst H, Vandewalle J, Stormann C, Burge P (1999) BRUTUS: a hybrid system for fraud detection in mobile communications. In: Proceedings of the European Symposium Artificial Neural Networks, pp. 447–454
9.
go back to reference Yeo AC, Smith KA, Willis RJ, Brooks M (2001) Clustering technique for risk classification and prediction of claim costs in the automobile insurance industry. Intell Syst Acc Finance Manage 10(1):39–50 CrossRef Yeo AC, Smith KA, Willis RJ, Brooks M (2001) Clustering technique for risk classification and prediction of claim costs in the automobile insurance industry. Intell Syst Acc Finance Manage 10(1):39–50 CrossRef
10.
go back to reference Kou G, Peng Y, Wang G (2014) Evaluation of clustering algorithms for financial risk analysis using MCDM methods. Inf Sci 275:1–12 CrossRef Kou G, Peng Y, Wang G (2014) Evaluation of clustering algorithms for financial risk analysis using MCDM methods. Inf Sci 275:1–12 CrossRef
11.
go back to reference Nachev A, Hill S, Barry C, Stoyanov B (2010) Fuzzy, distributed, instance counting, and default artmap neural networks for financial diagnosis. Int J Inform Technol Decis Making 9(6):959–978 CrossRef Nachev A, Hill S, Barry C, Stoyanov B (2010) Fuzzy, distributed, instance counting, and default artmap neural networks for financial diagnosis. Int J Inform Technol Decis Making 9(6):959–978 CrossRef
12.
go back to reference Ngoc MT, Park DC (2018) Centroid neural network with pairwise constraints for semi-supervised learning. Neural Process Lett 48(3):1721–1747 CrossRef Ngoc MT, Park DC (2018) Centroid neural network with pairwise constraints for semi-supervised learning. Neural Process Lett 48(3):1721–1747 CrossRef
13.
go back to reference Zhou D, Bousquet O, Lal TN, Weston J, Scholkopf B (2004) Learning with local and global consistency. Adv Neural Inf Process Syst 16(4):321–328 Zhou D, Bousquet O, Lal TN, Weston J, Scholkopf B (2004) Learning with local and global consistency. Adv Neural Inf Process Syst 16(4):321–328
14.
go back to reference Wang Y, Meng Y, Fu Z et al (2017) Towards safe semi-supervised classification: adjusted cluster assumption via clustering. Neural Process Lett 46(3):1031–1042 CrossRef Wang Y, Meng Y, Fu Z et al (2017) Towards safe semi-supervised classification: adjusted cluster assumption via clustering. Neural Process Lett 46(3):1031–1042 CrossRef
15.
go back to reference Ma X, Gao L, Yong X, Lidong Fu (2010) Semi-supervised clustering algorithm for community structure detection in complex networks. Physica A 389:187–197 CrossRef Ma X, Gao L, Yong X, Lidong Fu (2010) Semi-supervised clustering algorithm for community structure detection in complex networks. Physica A 389:187–197 CrossRef
16.
go back to reference Bai S, Zhou Z, Wang J, Bai X, Latecki LJ, Tian Q (2017) Ensemble diffusion for retrieval. In: Proceedings of the IEEE International conference on computer vision pp. 774–783 Bai S, Zhou Z, Wang J, Bai X, Latecki LJ, Tian Q (2017) Ensemble diffusion for retrieval. In: Proceedings of the IEEE International conference on computer vision pp. 774–783
18.
go back to reference Quinlan JR (1993) C45: Programs for machine learning. Morgan Kaufmann, San Franciso Quinlan JR (1993) C45: Programs for machine learning. Morgan Kaufmann, San Franciso
19.
go back to reference Zhao Y, Karypis G (2001) Criterion functions for document clustering: experiments and analysis, Technical Report TR 01-40, Department of Computer Science, University of Minnesota Zhao Y, Karypis G (2001) Criterion functions for document clustering: experiments and analysis, Technical Report TR 01-40, Department of Computer Science, University of Minnesota
20.
go back to reference Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco MATH Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco MATH
21.
go back to reference Bai S, Bai X, Tian Q, Latecki LJ (2017) Regularized diffusion process for visual retrieval. Proc AAAI Conf Artif Intell 31:3967–3973 Bai S, Bai X, Tian Q, Latecki LJ (2017) Regularized diffusion process for visual retrieval. Proc AAAI Conf Artif Intell 31:3967–3973
Metadata
Title
Semi-Supervised Clustering for Financial Risk Analysis
Authors
Yihan Han
Tao Wang
Publication date
24-06-2021
Publisher
Springer US
Published in
Neural Processing Letters / Issue 5/2021
Print ISSN: 1370-4621
Electronic ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-021-10564-0

Other articles of this Issue 5/2021

Neural Processing Letters 5/2021 Go to the issue