Elsevier

Applied Soft Computing

Volume 44, July 2016, Pages 144-152
Applied Soft Computing

Financial distress prediction using the hybrid associative memory with translation

https://doi.org/10.1016/j.asoc.2016.04.005Get rights and content

Highlights

  • We explore the hybrid associative memory with translation for default prediction.

  • We analyze the behavior of this neural network under the presence of class imbalance.

  • We study how the class overlapping affects the performance of the associative memory.

  • We compare its performance with that of other prediction models.

  • The associative memory is the best model, especially to predict the default cases.

Abstract

This paper presents an alternative technique for financial distress prediction systems. The method is based on a type of neural network, which is called hybrid associative memory with translation. While many different neural network architectures have successfully been used to predict credit risk and corporate failure, the power of associative memories for financial decision-making has not been explored in any depth as yet. The performance of the hybrid associative memory with translation is compared to four traditional neural networks, a support vector machine and a logistic regression model in terms of their prediction capabilities. The experimental results over nine real-life data sets show that the associative memory here proposed constitutes an appropriate solution for bankruptcy and credit risk prediction, performing significantly better than the rest of models under class imbalance and data overlapping conditions in terms of the true positive rate and the geometric mean of true positive and true negative rates.

Introduction

A large number of techniques have been developed to help decision-makers and analysts in predicting financial distress. Traditionally, decisions on credit risk of a corporate borrower were exclusively based upon subjective judgments made by human experts, using past experiences and some guiding principles [69]. However, two major problems with this approach are the difficulty to make consistent estimates and the fact that it tends to be reactive rather than predictive. The world financial crisis has led to increasing attention of banks and financial institutions on this question because of its significant impact on the decisions made [14], resulting in the development of numerous techniques to face the important challenge of credit risk and bankruptcy prediction from financial ratios using mathematical models. From the pioneer work by Altman [7], based on multivariate discriminant analysis, a variety of statistical and operations research methods have subsequently been applied to credit risk and bankruptcy prediction, including linear and logistic regression, multivariate adaptive regression splines, survival analysis, linear and quadratic programming, and multiple criteria programming. Most of these techniques typically rely on the assumptions of linear separability, multivariate normality and independence of the predictive variables, but they are very often violated in real-life problems [25], [34], [55].

Popular computational intelligence tools such as decision trees, neural networks, support vector machines, fuzzy systems, rough sets, artificial immune systems, and evolutionary algorithms are techniques that can deal with non-linearity. Besides, these methods are highly capable of extracting meaningful information from imprecise data and detecting trends that are too complex to be discovered by either humans or conventional systems. Despite various studies have concluded that no technique is clearly superior to other competing algorithms because it depends on the characteristics of the problem analyzed [13], [15], [16], different neural network architectures have shown good performance in comparison to other methods for a range of financial applications [10], [19], [48], [53], [78]. However, when the number of examples is relatively small, several works have demonstrated that the accuracy and generalization performance of a support vector machine (SVM) is usually better than that of statistical and other soft computing techniques [23], [24], [65], [67]. While typical neural networks used in this context are the multi-layer perceptron (MLP), the radial basis function (RBF) and the probabilistic or Bayesian network (BN), other neural models such as the associative memories have not been explored as yet.

The ability of human brain to make associations from partial information has historically attracted great interest among researchers, leading to a variety of theoretical neural networks that act as associative memories. An associative memory [39] is an early type of artificial neural network that relates an input vector x with an output vector y. The functionality of associative memories is reached in two phases: learning and recall. The learning process consists of building a connection matrix W with a value for each association (xk, yk). In the recall phase, an output vector y, which corresponds to the most similar to the input vector x, is obtained from the associative memory. These models are powerful computational tools due to their conceptual and implementational simplicity, their strong mathematical foundation, and their capability of storing huge amounts of data that allow to properly recover the most similar patterns to an input vector with low computational efforts [77].

Representative examples of associative memories are lernmatrix [66], the linear associator [8], [38], the Moore–Penrose generalized inverse associative memory [40], the Hopfield network [28], the bidirectional associative memory [41], the fuzzy associative memory [42], the morphological associative memory [58], and the alpha–beta associative memory [2]. Some of these models have been used to solve very different problems. Sabourin and Mitiche [59] developed a Kohonen associative memory with selective multiresolution for OCR. A fuzzy associative memory was introduced to determine rock types from well-log signatures [17]. The bidirectional associative memory networks were used to find the relations between various cancers and elemental contents in serum samples with the aim of diagnosing cancer [81]. A hybrid classifier based on self-organizing maps and associative memories was designed for speaker recognition [31]. Zhang et al. [79] proposed a modular face recognition scheme by combining the wavelet subband representations and kernel associative memories. An associative memory based on the restricted Coulomb energy was also applied to human face recognition [49]. Namba and Zhang [50] devised an associative memory to recognize Braille images. A novel system for medical diagnosis based on associative memories was proposed by Aldape-Pérez et al. [5]. Itkar and Kulkarni [32] developed an efficient algorithm for mining frequent patterns using an auto-associative memory.

Apart from the associative memories just mentioned, Santiago-Montero [63] introduced the hybrid associative classifier and its extension, the hybrid associative classifier with translation (HACT). Both these associative memories are based on the learning phase of the linear associator and the recall phase of the Steinbuch's lernmatrix. This paper applies the HACT neural network to decision making problems for financial distress prediction and presents an empirical comparison with other popular prediction methods. To the best of our knowledge, this model has not been used for classification purposes, and even less in the context of finance and management. The aim of this paper therefore is four-fold:

  • 1.

    To explore the capability of the HACT model in the prediction of bankruptcy and credit risk.

  • 2.

    To analyze the behavior of this neural network under the presence of imbalance in class distribution, which constitutes a data complexity often neglected in financial applications.

  • 3.

    To investigate how the class overlapping affects the performance of the associative memory.

  • 4.

    To compare the performance of HACT with that of other prediction techniques.

From now on, the paper is organized as follows. Section 2 provides a review of works related to neural networks used for corporate bankruptcy and credit risk prediction. Section 3 introduces the fundamental concepts of the associative memories and describes the bases of the HACT model. The experimental set-up and databases are given in Section 4, while the results are discussed in Section 5. Finally, Section 6 presents the concluding remarks and outlines some directions for future research.

Section snippets

A review of neural networks applied to financial distress prediction

From the beginning of the 1990s, the development of artificial neural network technologies for bankruptcy and credit risk prediction problems has been the subject of considerable attention and research efforts. The first reference to using neural networks can be found in the paper by Odom and Sharda [51], showing that a three-layer feed-forward perceptron is more accurate and robust than multi-variate discriminant analysis. After this seminal work, many other studies have proposed the use of

Hybrid associative classifier with translation

In its most general form, an associative memory is a content-addressable neural network based on matrix algebra [39], [57] that maps input patterns (examples) to output patterns by using the p different associated pattern pairs (xk, yk) stored during the learning phase. The associative memory takes the form of a connection weight matrix W=[wi,j]m×n generated from a finite set of p encoded associations, called fundamental set of associations, {(xμ, yμ)|μ = 1, 2, …, p}, where xμn are the

Experimental set-up

Nine data sets related to bankruptcy/creditworthiness have been employed in order to make a comprehensive comparison of the HACT model with four well-known neural networks (MLP, RBF, BN and the voted perceptron, VP), whose architectures and parameter settings are reported in Table 1. In addition, an SVM with a linear kernel (widely acknowledged as one of the best soft computing techniques) and the logit model (a classical econometric method) have also been included in this study. Note that,

Experimental results and discussion

Table 4 reports the true positive rate averaged across the 10 runs for each database, the average values across all the databases and the Friedman's average rank for each neural network approach (the one with the lowest average rank has to be deemed as the best solution). The values for the best performing method in each database are underlined. Based on the Friedman's average ranks, the results reveal that the HACT model corresponds to the algorithm with the best performance, followed by MLP

Conclusions and future work

From the first works in the beginning of the 1990s, the artificial neural networks emerged as an effective method for bankruptcy and credit risk prediction. They differ from classical financial prediction systems, such as the models based on statistical techniques, mainly in their black-box nature and in the assumption of a non-linear relation among variables. In this paper, the hybrid associative memory with translation has been explored and compared to other well-known neural models (MLP,

Acknowledgments

This work has partially been supported by the Mexican CONACYT through the Postdoctoral Fellowship Program [232167], the Spanish Ministry of Economy [TIN2013-46522-P], the Generalitat Valenciana [PROMETEOII/2014/062] and the Mexican PRODEP [DSA/103.5/15/7004]. We would like to thank the Reviewers for their valuable comments and suggestions, which have helped to improve the quality of this paper substantially.

References (81)

  • Y. Ding et al.

    Forecasting financial condition of Chinese listed companies based on support vector machine

    Expert Syst. Appl.

    (2008)
  • N.C. Hsieh

    Hybrid mining approach in the design of credit scoring models

    Expert Syst. Appl.

    (2005)
  • C. Hung et al.

    A selective ensemble based on expected probabilities for bankruptcy prediction

    Expert Syst. Appl.

    (2009)
  • A. Khashman

    Neural networks for credit risk evaluation: investigation of different neural models and learning schemes

    Expert Syst. Appl.

    (2010)
  • A. Khashman

    Credit risk evaluation using neural networks: emotional versus conventional models

    Appl. Soft Comput.

    (2011)
  • R.C. Lacher et al.

    A neural network for classifying the financial health of a firm

    Eur. J. Oper. Res.

    (1995)
  • T.S. Lee et al.

    A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression splines

    Expert Syst. Appl.

    (2005)
  • T.S. Lee et al.

    Credit scoring using the hybrid neural discriminant technique

    Expert Syst. Appl.

    (2002)
  • S. Lessmann et al.

    Benchmarking state-of-the-art classification algorithms for credit scoring: an update of research

    Eur. J. Oper. Res.

    (2015)
  • T.H. Lin

    A cross model study of corporate financial distress prediction in Taiwan: multiple discriminant analysis, logit, probit and neural networks models

    Neurocomputing

    (2009)
  • P.C. Pendharkar

    A threshold-varying artificial neural network approach for classification and its application to bankruptcy prediction problem

    Comput. Oper. Res.

    (2005)
  • V. Ravi et al.

    Soft computing system for bank performance prediction

    Appl. Soft Comput.

    (2008)
  • V. Ravi et al.

    Threshold accepting trained principal component neural network and feature subset selection: application to bankruptcy prediction in banks

    Appl. Soft Comput.

    (2008)
  • M. Sabourin et al.

    Modeling and classification of shape using a Kohonen associative memory with selective multiresolution

    Neural Netw.

    (1993)
  • C. Serrano-Cinca et al.

    Partial least square discriminant analysis for bankruptcy prediction

    Decis. Support Syst.

    (2013)
  • K.S. Shin et al.

    An application of support vector machines in bankruptcy prediction model

    Expert Syst. Appl.

    (2005)
  • J. Sun et al.

    Financial distress prediction using support vector machines: ensemble vs. individual

    Appl. Soft Comput.

    (2012)
  • C.F. Tsai et al.

    A comparative study of classifier ensembles for bankruptcy prediction

    Appl. Soft Comput.

    (2014)
  • C.F. Tsai et al.

    Using neural network ensembles for bankruptcy prediction and credit scoring

    Expert Syst. Appl.

    (2008)
  • D. West

    Neural network credit scoring models

    Comput. Oper. Res.

    (2000)
  • D. West et al.

    Neural network ensemble strategies for financial decision applications

    Comput. Oper. Res.

    (2005)
  • L. Yu et al.

    Credit risk assessment with a multistage neural network ensemble learning approach

    Expert Syst. Appl.

    (2008)
  • G. Zhang et al.

    Artificial neural networks in bankruptcy prediction: general framework and cross-validation analysis

    Eur. J. Oper. Res.

    (1999)
  • Z. Zhang et al.

    Classification of cancer patients based on elemental contents of serums using bidirectional associative memory networks

    Anal. Chim. Acta

    (2001)
  • M.E. Acevedo-Mosqueda et al.

    Alpha–beta bidirectional associative memories: theory and applications

    Neural Process. Lett.

    (2007)
  • J. Alcalá-Fdez et al.

    KEEL data-mining software tool: data set repository, integration of algorithms and experimental analysis framework

    J. Mult. Valued Log. Soft Comput.

    (2011)
  • J. Alcalá-Fdez et al.

    KEEL: a software tool to assess evolutionary algorithms to data mining problems

    Soft Comput.

    (2009)
  • E. Altman

    Financial ratios, discriminant analysis, and the prediction of corporate bankruptcy

    J. Financ.

    (1968)
  • A.F. Atiya

    Bankruptcy prediction for credit risk using neural networks: a survey and new results

    IEEE Trans. Neural Netw.

    (2001)
  • J. Baek et al.

    Bankruptcy prediction for credit risk using an auto-associative neural network in Korean firms

  • Cited by (70)

    • Long-horizon predictions of credit default with inconsistent customers

      2024, Technological Forecasting and Social Change
    • A novel ensemble feature selection method by integrating multiple ranking information combined with an SVM ensemble model for enterprise credit risk prediction in the supply chain

      2022, Expert Systems with Applications
      Citation Excerpt :

      The cost of incorrectly classifying minority class instances into the majority class is very high (Sun et al., 2020). At present, there are three main kinds of solutions to solve the class imbalance problem: the data-level method (Cordón et al., 2018; Gao et al., 2014; Li et al., 2018; Zhang & Li, 2014), the algorithm-level method (Cleofas-Sánchez et al., 2016; Raghuwanshi & Shukla, 2018) and the ensemble learning method (Shen et al., 2020; Wang & Sun, 2021). Scholars are devoting increasing attention to developing new models by combining the data-level method and the ensemble learning method.

    View all citing articles on Scopus
    View full text