ARTICLE IN PRESS Available online at www.sciencedirect.com
Expert Systems with Applications Expert Systems with Applications xxx (2008) xxx–xxx www.elsevier.com/locate/eswa
A genetic programming model for bankruptcy prediction: Empirical evidence from Iran Hossein Etemadi, Ali Asghar Anvary Rostamy *, Hassan Farajzadeh Dehkordi Tarbiat Modares University (TMU), No. 12, Shahid Rahnama Alley, Saidi Street, Lavasani Avenue, P.O. Box 19546, Tehran, Iran
Abstract Prediction of corporate bankruptcy is a phenomenon of increasing interest to investors/creditors, borrowing firms, and governments alike. Timely identification of firms’ impending failure is indeed desirable. By this time, several methods have been used for predicting bankruptcy but some of them suffer from underlying shortcomings. In recent years, Genetic Programming (GP) has reached great attention in academic and empirical fields for efficient solving high complex problems. GP is a technique for programming computers by means of natural selection. It is a variant of the genetic algorithm, which is based on the concept of adaptive survival in natural organisms. In this study, we investigated application of GP for bankruptcy prediction modeling. GP was applied to classify 144 bankrupt and non-bankrupt Iranian firms listed in Tehran stock exchange (TSE). Then a multiple discriminant analysis (MDA) was used to benchmarking GP model. Genetic model achieved 94% and 90% accuracy rates in training and holdout samples, respectively; while MDA model achieved only 77% and 73% accuracy rates in training and holdout samples, respectively. McNemar test showed that GP approach outperforms MDA to the problem of corporate bankruptcy prediction. Ó 2008 Elsevier Ltd. All rights reserved. Keywords: Bankruptcy prediction; Financial ratios; Genetic Programming; Multiple discriminant analysis; Iranian companies
1. Introduction Corporate bankruptcy is a very important economic phenomenon. The health and success of the firms are of widespread concern to policy makers, industry participants, investors, and managers (O’Leary, 1998). It also is a problem that affects the economy of every country. The number of failing firms is important for the economy of a country and it can be considered as an index of the development and robustness of the economy (Zopounidis & Dimitras, 1998). The high individual, economic, and social costs encountered in corporate failures or bankruptcies have spurred searches for better understanding and prediction capability (McKee & Lensberg, 2002).
*
Corresponding author. Tel.: +98 21 22291235; fax: +98 21 22291279. E-mail addresses:
[email protected] (H. Etemadi), Anvary@ modares.ac.ir,
[email protected] (A.A. Anvary Rostamy),
[email protected] (H.F. Dehkordi).
Prediction of corporate bankruptcy is a phenomenon of increasing interest to investors/creditors, borrowing firms, and governments alike. Timely identification of firms’ impending failure is indeed desirable (Jones, 1987). By this time, several methods have been used for predicting bankruptcy. Early research focused primarily on univariate models such as individual financial ratios. Among these studies Beaver (1966) is more noticeable than the others. He introduced a univariate technique for the classification of firms in two groups using some financial ratios. The ratios were used individually and a cut-off score was calculated for each ratio on the basis of minimizing misclassification. The univariate methods were later criticized, in spite of its considerable results, because of the correlation among ratios and providing different signals for a firm by ratios (Dimitras, Zanakis, & Zopounidis, 1996). Later research turned to multivariate models. Researchers found that corporate bankruptcy can be affected by many different factors at the same time. Altman (1968) introduced a multivariable technique, multiple discriminant
0957-4174/$ - see front matter Ó 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.eswa.2008.01.012
Please cite this article in press as: Etemadi, H. et al., A genetic programming model for bankruptcy ..., Expert Systems with Applications (2008), doi:10.1016/j.eswa.2008.01.012
ARTICLE IN PRESS 2
H. Etemadi et al. / Expert Systems with Applications xxx (2008) xxx–xxx
analysis (MDA), for failure prediction. Because this study made use of more than one variable for bankruptcy prediction and applied an advanced statistical technique for determining the relationship among predictor variables, it was of much interest. MDA provides good predictions but suffers from some limitations. Hence, variety methods introduced to overcome MDA shortcomings and improving accuracy. These methods can be grouped in two categories: statistical and artificial intelligence models. First group consists of Logit (Foreman, 2002; Ohlson, 1980; Zavgren, 1985), Probit (Casey, Mc Gee, & Stinkey, 1986; Theodossiou, 1991), Linear Probability (Stone & Rasp, 1991; Vranas, 1992) Cumulative Sums (Kahya & Theodossiou, 1999), and etc. Neural Networks (Altman, Marco, & Varetto, 1994; Coats & Fant, 1993; Jo, Han, & Lee 1997), Genetic Algorithms (Shin & Lee, 2002; Varetto, 1998), Case Based Reasoning (Park & Han, 2002), Rough Sets (Dimitras, Slowinski, Susmaga, & Zopounidis 1999; McKee & Lensberg, 2002), Support Vector Machine (Min & Lee, 2005), and etc, constitute second group. Some of these models have high predictive accuracy levels but because of absence bankruptcy theory, attempts to establish a generally accepted model for bankruptcy prediction are not successful. Some studies have provided comprehensive surveys on bankruptcy prediction methods such as Dimitras et al. (1996), Jones (1987), and Kumar and Ravi 2007. A common approach to bankruptcy prediction is to review the literature to identify a large set of potential predictive financial and/or non-financial variables and then develop a reduced set of variables, through some combination of judgmental and mathematical analysis that will predict bankruptcy (Lensberg, Eilifsen, & McKee 2006). In this study we implemented such approach for variables selection stage. After this, a relatively new technique for bankruptcy prediction, Genetic programming, constructed an accurate classification model for bankruptcy prediction. This model was benchmarked with the MDA, the most common used classification model for this subject. In the rest of the paper, first, we discuss about GP and MDA, two techniques were used for bankruptcy prediction modeling. In the section 4 we explain variable selection process and after that models development, empirical results and conclusion will be discussed. 2. Genetic programming Genetic programming (GP) is a search methodology belonging to the family of evolutionary computation (EC). GP can be considered an extension of Genetic algorithms (GAs) (Koza, 1992). GAs is stochastic search techniques that can search large and complicated spaces on the ideas from natural genetics and evolutionary principle (Goldberg, 1989; Holland, 1975). They have been demonstrated to be effective and robust in searching very large spaces in a wide range of applications (Colin, 1994; Shin & Han, 1999). GAs has been applied in wide range finan-
+ –
* X
Y
/
6 Z
8
Fig. 1. Tree representation of the program (expression): (X*Y) + 6 (Z/ 8).
cial fields such as trading system (Deboeck, 1994), stock selection (Mahfoud & Mani, 1995), bankruptcy prediction (Shin & Lee, 2002), and etc. GP is basically a GA applied to a population of computer programs (CP). While a GA usually operates on (coded) strings of numbers, a GP has to operate on CP. GP allows, in comparison with GA, the optimization of much more complicated structures and can therefore be applied to a greater diversity of problems (Sette & Boullart, 2001). Koza (1992) has extensively described GP. Because bankruptcy prediction can be considered as a classification problem, in continues, we offer a necessary description of GP with emphasis on its application in classification role. In general, genetic programming models were inspired by the Darwinian theory of evolution. According to the most common implementations, a population of candidate solutions is maintained, and after a generation is accomplished, the population is expected fitted better for a given problem. Genetic programming uses tree-like individuals that can represent mathematical expressions, making valuable the application of GP in symbolic regression problems. Such a GP individual is shown in Fig. 1. Three genetic operators are mostly used in these algorithms: reproduction, crossover, and mutation. 2.1. Reproduction The reproduction operator simply chooses an individual in the current population and copies it without changes into the new population. 2.2. Crossover Two parent individuals are selected and a subtree is picked on each one. Then crossover swaps the nodes and their relative sub-trees from one parent to the other. This operator must ensure the respect of the depth limits. If a condition is violated the too-large offspring is simply replaced by one of the parents. There are other parameters that specify the frequency with which internal or external points are selected as crossover points. Figs. 2 and 3 show an example of crossover operator. 2.3. Mutation The mutation operator can be applied to either a function node or a terminal node. A node in the tree is
Please cite this article in press as: Etemadi, H. et al., A genetic programming model for bankruptcy ..., Expert Systems with Applications (2008), doi:10.1016/j.eswa.2008.01.012
ARTICLE IN PRESS H. Etemadi et al. / Expert Systems with Applications xxx (2008) xxx–xxx –
W
–
*
/
T
6. Repeat steps (3)–(6) until an acceptable classification rule is found or the specified maximum number of generations has been reached. 7. Repeat steps (2)–(7) until one rule is determined for each class in the database. 8. Assign each example in the training and in the test sets to one and only one class.
+
+
*
X Z
X
Y
/
6
Z
Z
3
8
Fig. 2. Representation of crossover (parents).
2.4. Fitness function +
–
–
+ T
W
* /
6 Z
X
/ Y
*
6
8
Z
Z
Fig. 3. Representation of crossover (children).
randomly selected. If the chosen node is a terminal it is simply replaced by another terminal. If it is a function and point mutation is to be performed, it is replaced by a new function with the same arity. If, instead, tree mutation is to be carried out, a new function node (not necessarily with the same arity) is chosen, and the original node together with its relative subtree is substituted by a new randomly generated subtree. A depth ramp is used to set bounds on size when generating the replacement subtree. Naturally it is to check that this replacement does not violate the depth limit. If this happens mutation just reproduces the original tree into the new generation. Further parameters specify the probability with which internal or external points are selected as mutation points. An example of mutation operator is shown in Fig. 4. The major steps of this genetic programming can be formalized as follows: 1. Generate at random an initial population of rules representing potential solutions to the prediction of the bankrupt and non-bankrupt firms classes. 2. Evaluate each rule on the training set by means of a fitness function. 3. Select the rules to undergo the mechanism of reproduction. 4. Apply the genetic operators crossover, reproduction and mutation to produce new rules. 5. Reinsert these offspring to create the new population.
+
+
–
* X
Y
–
* /
6 Z
X
Y
8
Fig. 4. Representation of mutation.
The population is constituted by the possible class descriptions, that is to say sets of conditions on attributes of the objects to classify. For all classification problems, in order to be able to apply a particular fitness function, the learning algorithms must convert the value returned by the evolved model into ‘‘1” or ‘‘0” using the 0/1 Rounding Threshold. If the value returned by the evolved model is equal to or greater than the rounding threshold, then the record is classified as ‘‘1”, ‘‘0” otherwise. There are many varieties fitness function such as number of hits, sensitivity/specificity, relative squared error (RSE), mean squared error (MSE) and etc. that can be applied for evaluating performance of generated classification rules. We used ‘‘number of hits” as fitness function because of its simplicity in efficiency, and is based on the number of samples correctly classified. More formally, the fitness fi of an individual program corresponds to the number of hits and is evaluated by the formula: fi ¼ h
ð1Þ
where h is the number of fitness cases correctly evaluated (number of hits). So, for this fitness function, maximum fitness fmax is given by: fmax ¼ n
ð2Þ
where n is the number of fitness cases. Its counterpart with ‘‘parsimony pressure” uses this fitness measure fi as ‘‘raw fitness” rfi and complements it with a parsimony term. Parsimony pressure puts a little pressure on the size of the evolving solutions, allowing the discovery of more compact models. Thus, in this case, raw maximum fitness rfmax = n. And the overall fitness fppi (that is, fitness with parsimony pressure) is evaluated by the formula: 1 S max S i fppi ¼ rfi 1 þ ð3Þ 5000 S max S min where Si is the size of the program, Smax and Smin represent, respectively, maximum and minimum program sizes and are evaluated by the formulas: S max ¼ Gðh þ tÞ
ð4Þ
S min ¼ G
ð5Þ
^
6 S
W
where G is the number of genes, and h and t are the head and tail sizes (note that, for simplicity, the linking function was not taken into account). Thus, when rfi = rfmax and Si = Smin (highly improbable, though, as this can only
Please cite this article in press as: Etemadi, H. et al., A genetic programming model for bankruptcy ..., Expert Systems with Applications (2008), doi:10.1016/j.eswa.2008.01.012
ARTICLE IN PRESS 4
H. Etemadi et al. / Expert Systems with Applications xxx (2008) xxx–xxx
happen for very simple functions as this means that all the sub-ETs are composed of just one node), fppi = fppmax , with fppmax evaluated by the formula: fppmax ¼ 1:0002 rfmax
ð6Þ
Usual termination criterions appear to be the accomplishment of a number of generations, the achievement of a desired classification error, etc. The described procedure is depicted in the flowchart of Fig. 5. (Tsakonas, 2006). 2.5. Genetic programming for solving the bankruptcy prediction problem The aim is the implementation of a genetic model able to automatically extract an intelligible classification rule for prediction classes of bankrupt and non-bankrupt firms in a sample, given the values of some financial ratios, called predicting variables. Each rule is constituted by a logical combination of these ratios. This combination determines a class description which is used to construct the classification rule. Once defined a fitness function, bankruptcy prediction problem becomes a search problem of the best solution in the search space of all the possible solutions, that is to say an optimization of the fitness function for which optimization techniques can be used. Given a number of variables describing each firm and their related domains, it is easily understandable that for
bankruptcy prediction problems, the number of possible solutions is enormous. An exhaustive search by enumerating all the possible solutions is computationally impracticable. Hence, we appeal to GP which is a powerful search method inspired by natural selection. It does not guarantee to find the global optimum; nonetheless it usually allows retrieving a suboptimal solution in a reasonable computation time. The evolving population is constituted by individuals or ‘programs’ representing the classification rules in the form of trees whose sizes are intrinsically variable in length. Each rule is constituted by a number of conditional clauses, in which conditions on or between certain variables are set, and by a predictive clause representing the firms class. A class together with its description forms a classification rule of type ‘‘If {solution} then {bankrupt/ non-bankrupt}”. The conditional part of the rule is formed by all the active conditional clauses. A population composed by a set of these candidate rules is maintained and gradually improved by constructing new fitter ones until rules of sufficient quality are found or other stopping criteria are satisfied. The model allows attaining rules covering the different classes by running the evolutionary algorithm as many times as the number of the classes to predict. In practice the model will analyze one class at a time. To construct the bankruptcy prediction model, data is partitioned into two sets: the training and the test sets. The training set contains the known firms used during the evolution process to find an explicit classification rule able to separate an instance of a class of bankrupt firms from instances of non-bankrupt class, while the test set is used to evaluate the generalization ability of the rule found (De Falco, Della Cioppa, & Tarantino, 2002). 3. Multiple discriminant analysis (MDA) MDA is a statistical technique used to classify an observation into one of several a priori groupings dependent upon the observation’s individual characteristics. Therefore, MDA allows the researcher to study the difference between two or more groups of objects with respect to several variables simultaneously (Klecka, 1985). It is used primarily to classify and/or make predictions in problems where the dependent variable appears in qualitative. In the case of two groups consisting of bankrupt firms on one hand and non-bankrupt firms on the other, MDA attempts to establish a linear combination of characteristics (financial ratios) which ‘‘best” discriminates between the groups (Altman & Narayanan, 1997). The discriminant function is specified as Z i ¼ a1 X 1i þ a2 X 2i þ ::: þ an X ni
Fig. 5. Overview of genetic programming process.
ð7Þ
where a1, a2, ..., an = discriminant coefficients, X1i, X2i ..., Xni = classifying variables (financial ratios) of firm i. The value of the dividing point of the two groups is calculated as
Please cite this article in press as: Etemadi, H. et al., A genetic programming model for bankruptcy ..., Expert Systems with Applications (2008), doi:10.1016/j.eswa.2008.01.012
ARTICLE IN PRESS H. Etemadi et al. / Expert Systems with Applications xxx (2008) xxx–xxx
D ¼ ðZ b þ Z nb Þ=2
ð8Þ
where Zb is the mean value of the Zs in the bankrupt group and Znb is the mean of the Zs in the non-bankrupt group. After coefficients a1 through an are estimated and the discriminant function is established, the Z value of a particular firm can be calculated and compared to the dividing point D. The classification depends on whether the Z value is greater or smaller than the dividing point D. The quality of the model is measured by the degree of success in reclassifying the two groups of firms correctly (Fernandez, 2003). Altman (1993) has provided complete description of MDA and its financial applications. As a model for dichotomous events, MDA has its shortcomings. A major limitation of MDA is its restrictive assumption of normal distribution of each independent variable (Hamer, 1983). Other assumptions are: The predictors are not highly correlated with each other. The mean and variance of a given predictor are not correlated. The correlation between two predictors is constant across groups. MDA is prevalent technique in bankruptcy prediction (Aziz & Dar, 2006). In terms of classification or prediction ability among traditional models, while some studies, have found Logit model superior to MDA (Press & Wilson, 1978), other studies (Gu, 2002; Aziz & Dar, 2006) have shown that the two models are equally efficient. 4. Development of models 4.1. Data collection The data set used for this research consists of 144 Iranian companies. All of them were or still are listed on the Tehran Stock Exchange (TSE). 72 companies went bankrupt under paragraph 141 of Iran Trade Law1 from 1998 through 2005. The other 72 companies are ‘‘matched” companies, from the same period of listing on the TSE. Because of small population, we could not match two groups in each of industries. Also size of the firms as a potential explanatory variable has been considered in variable selection phase. 4.2. Variables selection We apply a three stages predictive variable selection process. At the first stage, bankruptcy prediction literature was reviewed and 65 variables among more than 250 financial ratios were selected as predictive variables. These financial ratios were chosen based on popularity in literature. In the second stage, 43 variables were selected based 1
Under paragraph 141 of Iran Trade Law, a firm is bankrupt when its total value of retained earnings is equal or greater than 50% of its listed capital.
5
on availability of the necessary data. Table 1 shows the selected variables. In the third stage, using stepwise discriminant analysis (SDA) was used to select final variables. SDA is a prevalent procedure for reducing dimensions of a problem and selecting the most significant variables that can among widespread set of variables. SDA select variable based on a cut-off F value for pre-specified statistical significance level until no variable is qualified for selecting. With significance level set at the 0.05 level, the discriminant stepwise procedure selected 5 variables from the 43 candidates for the model which could best discriminate the bankrupt firms from the non-bankrupt firms. These selected financial ratios are: (1) Operational income to sales ratio (X36), (2) Total liability to total assets (X9), (3) Sales to current assets ratio (X43), (4) Interest expense to gross profit (X25), and (5) Quick assets to total assets (X20). Operational income to sales (margin of profit) ratio reflects profitability of firm that shows the ability of a firm in covering all production costs and providing some reasonable margin of profit. A higher ratio can be translated as better coverage production costs and lower default risk. Therefore, margin of profit has a negative relation to bankruptcy. This ratio had the highest influence in segregation of two groups. Total liability to total assets (Debt ratio) measure the degree of indebtedness and the ability of paying off debt interests and principal and is categorized as solvency ratios. Solvency may reveal fundamental causes of bankruptcy rooted in a firm’s financing policy. This ratio has a negative relation with bankruptcy occurrence. Sales to current assets (current assets turnover) ratio, measuring revenue created per dollar of current assets is used as efficiency ratios. A higher efficiency means higher profitability and better liquidity and finally to lower default risk. However, as Gu and Gao (2000) found, a high efficiency ratio accompanied by poor profitability may indicate fast sales growth without proper costs control, thus signaling a high default risk. Analyzing balance sheet statements of firms in our sample specify that, in average, current assets constitute over than 60% total assets, hence, inclusion of current assets in the model can improve it’s the performance. Interest expense to gross profit measures firms’ ability to pay interest expense. When a firm has good profitability then creditors are sure that their interests in this firm can be achievable. This ratio has positive relation with bankruptcy occurrence; the lower interest expense to gross profit ratio, the lower bankruptcy risk. Quick assets to total assets ratio is one of the liquidity ratios and shows firms’ ability of paying short term debts. It can be interpreted as closeness of firms’ assets to cash. Creditors prefer that firms hold assets with high liquidity because these assets have lower risk than the others and firms can quickly convert them to money in extraordinary circumstances. Thus liquidity ratios have a momentous role
Please cite this article in press as: Etemadi, H. et al., A genetic programming model for bankruptcy ..., Expert Systems with Applications (2008), doi:10.1016/j.eswa.2008.01.012
ARTICLE IN PRESS 6
H. Etemadi et al. / Expert Systems with Applications xxx (2008) xxx–xxx
Table 1 variables used in the study and comparison of means in two groups #
Definition of variables
Means of non-bankrupt companies (72)
1 EBIT/TA 0.21767 3 RE/SC 0.47895 5 MVE/SE 3.68678 7 Ca/TA 0.05437 *TL/TA 9 0.68384 11 CL/TL 0.86624 13 (R+Inv)/TA 0.53664 15 R/Inv 14.5389 17 SE/TA 0.34665 19 QA/CL 0.66072 21 FA/(SE+LTD) 0.64560 23 CA/TA 0.66938 25 *IE/GP 0.41801 27 S/TA 0.88878 29 PIC/SE 0.44833 31 RE/TA 0.05836 33 NI/S 0.20475 35 D/NI 0.79203 37 OI/TA 0.19046 39 EBIT/S 0.21757 41 S/SE 3.06774 43 *S/CA 1.40478 CA: Current assets Ca: Cash CL: Current liabilities PIC: Paid in capital EBIT: Earnings before interest & taxes FA: Fixed assets GP: Gross profit IE: Interest expenses Inv: Inventory LA : Liquid assets LTD: Long term debt MVE: Marked value of equity *Final variables selected by SDA.
Means of bankrupt companies (72)
Tests of equality of group means (Sig level)
#
Definition of variables
Means of non-bankrupt companies (72)
Means of bankrupt companies (72)
Tests of equality of group means
0.05754 0.02016 1.73033 0.03103 0.78209 0.83518 0.62095 0.92670 0.24232 0.50096 0.78329 0.71408 1.68147 0.78572 0.52932 0.02026 0.00552 1.0847 0.02571 0.0035 3.09217 1.13062
0.000 0.000 0.032 0.009 0.000 0.234 0.041 0.292 0.000 0.002 0.410 0.201 0.211 0.107 0.199 0.000 0.000 0.311 0.000 0.000 0.987 0.006
2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38 40 42
LTD/SE MVE/TL MVE/TA Size(logTA) CL/SE (Ca+STI)/CL R/S SE/TL CA/CL *QA/TA FA/TA Ca/CL S/Ca WC/TA S/WC NI/SE NI/TA *OI/S EBIT/IE GP/S S/FA
2.54796 2.09833
2.85370 0.59374
0.851 0.008
5.22278 2.28985 0.15794 0.44839 0.56374 1.17647 0.56170 0.23400 0.10662 45.2066 0.07149 17.8260 0.53856 0.17323 0.22485 33.00551 0.29107 6.03377
4.87388 2.51011 0.05352 0.35287 0.33791 1.13654 0.44675 0.23308 0.04838 122.611 0.05773 5.19366 0.23618 0.01155 0.01508 8.02330 0.10676 6.17452
0.001 0.874 0.000 0.445 0.000 0.567 0.001 0.976 0.002 0.245 0.696 0.213 0.072 0.000 0.000 0.078 0.000 0.893
in debt contracts and cost of capital. In general, firms with higher liquidity have a lower bankruptcy risk. 4.3. Genetic programming model The formerly mentioned procedure applied for GP model. For this study, the Number of Hit, as described in the pervious section, was used as the fitness function. For testing predictive power of model, data set was randomly separated into two groups. Training sample consists of 104 (51 bankrupt and 53 non-bankrupt) firms and Holdout sample contains 40 (21 bankrupt and 19 non-bankrupt) firms. For implementing GP process and developing bankruptcy model, GeneXproTools software version 4.0 was used. Crossover and mutation operators were set as 0.6 and 0.06, respectively. The representation of a solution for the problem provided by the GP algorithm is a decision tree. Each node of this tree is a function node taking one of the values from the set {+, , *, ^, NOT, and LT}: ‘‘+” adds up two variables, ‘‘” subtracts them, ‘‘*” multiply them, ‘‘^” is sym-
NI: Net income OI: Operational income QA: Quick assets R: Receivables RE: Retained earnings S: Sales SC: Stock capital SE: Shareholders’ equity STI: Short term investments TA: Total assets TL: Total liabilities WC: Working capital
bol of the power, ‘‘NOT” means one minus a variables, and ‘‘LT” returns 1 when the first variable is lesser than the second, else 0. When return of the GP model (decision tree) for a training or test firm is greater or equal 0.5 (Rounding Threshold), this firm is marked as ‘‘bankrupt firm”. Alternatively, when return of the GP model for a training or test firm is lesser than 0.5, this firm is classified as ‘‘non-bankrupt firm”. Comparing real class of firms with predicted class by the GP model, in the training and test sample, determines accuracy of the model. Fig. 6 Shows the best GP model obtained. This model has been divided in three sub-trees (each tree represent a Gene that’s means the model is a Chromosome consisted of three genes). Sum of the returns of sub-trees for a firm should be compared with ‘‘Rounding Threshold” for determining class of the firm. 4.4. Multiple discriminant analysis model To determine the relative prediction effectiveness of GP model, we benchmarked it against a MDA model,
Please cite this article in press as: Etemadi, H. et al., A genetic programming model for bankruptcy ..., Expert Systems with Applications (2008), doi:10.1016/j.eswa.2008.01.012
ARTICLE IN PRESS H. Etemadi et al. / Expert Systems with Applications xxx (2008) xxx–xxx ×
+ –
6.2
–
X9
7
^3
X20
×
X20
–
X36 Sub-Tree 1
×
X43
X36
X52
-9.66
Sub-Tree 2
+ –
^2
LT
NOT
+
X52
×
-5.5
-2.5
X20
X36
X43 Sub-Tree 3
Fig. 6. The best GP model obtained.
as a more prevalent methodology. Accordingly, for comparison purposes, we developed a MDA model using same five variables (ratios) and same training and test data set. A basic assumption of MDA is the multivariate normality of the classifying variables (Kleinbaum, Kupper, & Muller, 1988) but it is applicable just when the sample size is less than 50 observations (Gu, 2002). Since in this study the sample size was 140 (firms) then the normality test was not necessary. We used SPSS software version 13 for constructing MDA model. Significance levels were determined at 0.1 values. The estimated MDA model is as: Z ¼ 3:764X 20 3:609X 9 þ :719X 43 þ 0:102X 36 0:034X 52 0:251
ð9Þ
To estimate the model, SPSS software adjusted the dividing point to a cut-off value of zero using constant 0.251. Z score is calculated by above equation for each firm. Companies with Z scores above zero were put into non-bankrupt group. To assess the classification accuracy of the discriminant model, we used a cross validation procedure. In this procedure, (n 1) firms out of n in training sample are used. MDA estimates discriminant functions based on (n 1) firms and applies them to classify a firm left out. This cross validation is performed for each of n training firms. The classification accuracy rate for each group is estimated from the proportion of sample firms that are classified correctly.
5. Experimental results 5.1. GP model Table 2 shows obtained result from GP model. This model could classify firms in the training sample with 94% overall accuracy rate. In detailed view, the GP model has achieved 96% accuracy rate for correct classifying bankrupt firms and 93% accuracy rate for correct classifying non-bankrupt firms in the training sample. In addition, the GP model was applied to the holdout sample for testing. This model could correct classify 90% firms in the holdout sample. That this rates for correct classifying bankrupt and non-bankrupt firms are 95% and 84%, respectively. Table 2 Results of genetic programming model Samples* Real group membership
Learning sample
Holdout sample
1 0 1 0 1 0 1 0
(count) (count) (%) (%) (count) (count) (%) (%)
Predicted group ‘‘GP model” 1
0
Total
49 4 96 7 20 3 95 16
5 49 4 93 1 16 5 84
51 53 100 100 21 19 100 100
*Group 1: bankrupt firms; Group 0: non-bankrupt firms.
Please cite this article in press as: Etemadi, H. et al., A genetic programming model for bankruptcy ..., Expert Systems with Applications (2008), doi:10.1016/j.eswa.2008.01.012
ARTICLE IN PRESS 8
H. Etemadi et al. / Expert Systems with Applications xxx (2008) xxx–xxx
Table 3 Results of multiple discriminant analysis model Samples*
Predicted group membership by MDA model Learning sample
Real group membership Real group membership
1 (count) 0 (count) 1 (%) 0 (%) (%)
Cross-validated
Holdout sample
1
0
Total
1
0
Total
1
0
Total
40 13 78 25 78
11 40 22 75 75
51 53 100 100 77
38 14 75 25 75
13 40 25 75 75
51 53 100 100 75
15 5 71 32 71
6 14 24 74 74
21 19 100 100 73
Overall accuracy *Group 1: bankrupt firms; Group 0: non-bankrupt firms.
5.2. MDA model Table 3 summarizes results of MDA model on training and test samples. Among 104 firms in training sample, 11 bankrupt and 13 non-bankrupt firms were misclassified. The overall classification accuracy rate of MDA model was approximately 77% in training sample. The overall classification accuracy rate of the model was approximately 75% in cross validation procedure. Accuracy rate of classification, obtained from validation data, is nearly similar to classification accuracy rate of training data; we can conclude that the derived classification function has a good potential and validation. For testing, we applied MDA model to classify firms in the holdout sample. Among 19 non-bankrupt firms in the holdout sample, 6 firms were misclassified and only one of the bankrupt firms was misclassified. 5.3. Comparing GP & MDA models We used the McNemar test to examine whether or not the classification performance of the GP model is significantly higher than that of MDA model. The McNemar test is a non-parametric test of the hypothesis that two related dichotomous variables have the same mean. As we are interested in the correct classification of cases, the measure for testing is classification accuracy (the number of correct classifications to the total number of training and holdout samples). There are 25 out of 144 observations were differ classified by two models. McNemar test showed that the models were significantly different at .000 significant level. We confirmed this result also by performing a Wilcoxon test. 6. Summary and conclusions Bankruptcy is a highly significant worldwide problem that affects the economic well being of all countries. The high social costs incurred by various stakeholders associated with bankrupt firms have spurred searches for better theoretical understanding and prediction capability. In this paper genetic programming (GP) and multiple discriminant analysis (MDA) techniques have been used to find out whether it is possible to predict the survival or failure of Iranian corporations based on financial ratios.
An important contribution of this paper is identification of the effective predictive financial ratios. The financial ratios of OI/TA, S/CA, TL/TA, QA/TA, and IE/GP are identified as highly predictive factors by SDA method. These five ratios cover most significant financial aspects of a firm. In addition According to Dimitras et al. (1996), these financial ratios are members of most popular predictive variables in bankruptcy prediction literature. Using these variables, a GP model was developed. The GP model was 94% and 90% accurate on training and holdout samples, respectively. Also, a MDA model was developed using the same five variables and same data set. The model achieved only 77% and 73% accuracy in training and holdout samples, respectively. In summary, GP produced what can be deemed as a highly accurate bankruptcy prediction model considering both the nature of sample companies and prediction period. As a final conclusion, it can be claimed that GP model was more accurate than traditional MDA model. The model provides insight into the complex interaction of bankruptcy related factors and suggests avenues for future research. References Altman, E. I. (1968). Financial ratios, discriminant analysis and the prediction of corporate bankruptcy. Journal of Finance, 23(4), 589–609. Altman, E. I. (1993). Corporate financial distress and bankruptcy (2nd ed.). NY: John Wiley & Sons. Altman, E. I., & Narayanan, P. (1997). An international survey of business failure classification models. Financial Markets, Institutions & Instruments, 6(2), 57. Altman, E. I., Marco, G., & Varetto, F. (1994). Corporate distress diagnosis: Comparisons using linear discriminant analysis and neural networks (the Italian experience). Journal of Banking and Finance, 18, 505–529. Aziz, M. A., & Dar, H. A. (2006). Predicting corporate bankruptcy: Where we stand? Corporate Governance, 6(1), 18–33. Beaver, W. (1966). Financial ratios as predictors of failures, empirical research in accounting, selected studies. Supplement to the Journal of Accounting Research, 5(4), 71–127. Casey, M., Mc Gee, V., & Stinkey, C. (1986). Discriminating between reorganized and liquidated firms in bankruptcy. Accounting Review, 249–262. Coats, P. K., & Fant, L. F. (1993). Recognizing financial distress patterns using a neural network tool. Financial Management, 22, 142–155. Colin, A. M. (1994). Genetic algorithms for financial modeling. In G. J. Deboeck (Ed.), Trading on the edge (pp. 148–173). NY: Wiley.
Please cite this article in press as: Etemadi, H. et al., A genetic programming model for bankruptcy ..., Expert Systems with Applications (2008), doi:10.1016/j.eswa.2008.01.012
ARTICLE IN PRESS H. Etemadi et al. / Expert Systems with Applications xxx (2008) xxx–xxx Deboeck, G. J. (1994). Using GAs to optimize a trading system. In G. J. Deboeck (Ed.), Trading on the edge (pp. 174–188). NY: Wiley. De Falco, I., Della Cioppa, A., & Tarantino, E. (2002). Discovering interesting classification rules, with genetic programming. Applied Soft Computing, 1, 257–269. Dimitras, A. I., Slowinski, R., Susmaga, R., & Zopounidis, C. (1999). Business failure prediction using rough sets. European Journal of Operational Research, 114, 263–280. Dimitras, A. I., Zanakis, S. H., & Zopounidis, C. (1996). A survey of business failure with an emphasis on prediction methods and industrial application. European Journal of Operational Research, 90, 487–513. Fernandez, G. (2003). Data mining using SAS applications. Chapman & Hall/CRC. Foreman, R. D. (2002). A logistic analysis of bankruptcy within the US local telecommunications industry. Journal of Economics and Business, 1–32. Goldberg, D. (1989). Genetic algorithms in search, optimization and machine learning. Addison-Wesley. Gu, Z. (2002). Analyzing bankruptcy in the restaurant industry: A multiple discriminant model. Hospitality Management, 21, 25–42. Gu, Z., & Gao, L. (2000). A multivariate model for predicting business failures of hospitality firms. Tourism and Hospitality Research: The Surrey Quarterly Review, 2, 37–49. Hamer, M. M. (1983). Failure prediction: sensitivity of classification accuracy to alternative statistical methods and variable sets. Journal of Accounting and Public Policy, 2, 289–307. Holland, J. H. (1975). Adaptation in natural and artificial systems. Ann Arbor, MI: University of Michigan Press. Jo, H., Han, I., & Lee, H. (1997). Bankruptcy prediction using case-based reasoning, neural networks, and discriminant analysis. Expert Systems with Applications, 13(2), 97–108. Jones, F. L. (1987). Current techniques in bankruptcy prediction. Journal of Accounting Literature, 6, 131–164. Kahya, E., & Theodossiou, P. (1999). Predicting corporate financial distress: A time-series CUSUM methodology. Review of Quantitative Finance and Accounting, 13, 323–345. Klecka, W. R. (1985). Discriminant analysis. London, Beverly Hills: Sage Publications. Kleinbaum, D. G., Kupper, L. L., & Muller, K. E. (1988). Applied regression analysis and other multivariable methods. Boston: PWSKENT Publishing Company. Koza, J. R. (1992). Genetic programming: On the programming of computers by means of natural selection. Cambridge, MA: MIT Press. Kumar, P. R., & Ravi, V. (2007). Bankruptcy prediction in banks and firms via statistical and intelligent techniques – A review. European Journal of Operational Research, 180(1), 1–28. Lensberg, T., Eilifsen, A., & McKee, T. E. (2006). Bankruptcy theory development and classification via genetic programming. European Journal of Operational Research, 169, 677–697.
9
Mahfoud, S., & Mani, G. (1995). Genetic algorithms for predicting individual stock performance. In Proceedings of the third international conference on artificial intelligence applications on wall street (pp. 174–181). McKee, T. E., & Lensberg, T. (2002). Genetic programming and rough sets: A hybrid approach to bankruptcy classification. European Journal of Operational Research, 138, 436–451. Min, J. H., & Lee, Y. C. (2005). Bankruptcy prediction using support vector machine with optimal choice of kernel function parameters. Expert Systems with Applications, 28, 603–614. O’Leary, D. E. (1998). Using neural networks to predict corporate failure. International Journal of Intelligent Systems in Accounting Finance and Management, 7(3), 187–197. Ohlson, J. A. (1980). Financial ratios and the probabilistic prediction of bankruptcy. Journal of Accounting Research, 109–131. Park, C., & Han, I. (2002). A case-based reasoning with the feature weights derived by analytic hierarchy process for bankruptcy prediction. Expert Systems with Applications, 23(3), 225–264. Press, S. J., & Wilson, S. (1978). Choosing between logistic regression and discriminant analysis. Journal of American Statistics Association, 73, 699–705. Sette, S., & Boullart, L. (2001). Genetic programming: principles and applications. Engineering Applications of Artificial Intelligence, 14, 727–736. Shin, K. S., & Han, I. (1999). Case-based reasoning supported by genetic algorithms for corporate bond rating. Expert Systems with Applications, 16(2), 85–95. Shin, K., & Lee, Y. (2002). A genetic algorithm application in bankruptcy prediction modeling. Expert Systems with Applications, 23(3), 321–328. Stone, M., & Rasp, J. (1991). ‘Tradeoffs in the choice between Logit and OLS for accounting choice studies. The Accounting Review(1), 170–178. Theodossiou, P. T. (1991). Alternative models for assessing the financial condition of business in Greece. Journal of Business Finance and Accounting, 18(5), 697–720. Tsakonas, A. (2006). A comparison of classification accuracy of four genetic programming-evolved intelligent structures. Information Sciences, 176, 691–724. Varetto, F. (1998). Genetic algorithms applications in the analysis of insolvency risk. Journal of Banking and Finance, 22, 1421–1439. Vranas, A. S. (1992). The significance of financial characteristics in predicting business failure: An analysis in the Greek context. Foundations of Computing and Decision Sciences, 17(4), 257–275. Zavgren, C. V. (1985). Assessing the vulnerability to failure of American industrial firms: A logistic analysis. Journal of Banking and Finance, 12(1), 19–45. Zopounidis, C., & Dimitras, A. (1998). Multicriteria decision aid methods for the prediction of business failure. Dordrecht: Kluwer Academic Publishers.
Please cite this article in press as: Etemadi, H. et al., A genetic programming model for bankruptcy ..., Expert Systems with Applications (2008), doi:10.1016/j.eswa.2008.01.012