Acta Univ. Agric. Silvic. Mendelianae Brun. 2013, 61(7), 2709-2716 | DOI: 10.11118/actaun201361072709

Indeterminate values of target variable in development of credit scoring models

Martin Řezáč1, Lukáš Toma2
1 Department of Mathematics and Statistics, Faculty of Science, Masaryk University, Kotlářská 2, 611 37 Brno, Czech Republic
2 Polytech Nantes, Université de Nantes 1, quai de Tourville BP 13522, 44035 Nantes Cedex 1, France

In the beginning of every modelling procedure, the first question to ask is what we are trying to predict by the model. In credit scoring the most frequent case is modelling of probability of default; however other situations, such as fraud, revolving of the credit or success of collections could be predicted as well. Nevertheless, the first step is always to define the target variable.
The target variable is generally an 'output' of the model. It contains the information on the available data that we want to predict in future data. In credit scoring it is commonly called good/bad definition. In this paper we study the effect of use of indeterminate value of target variable in development of credit scoring models. We explain the basic principles of logistic regression modelling and selection of target variable. Next, the focus is given to introduction of some of the widely used statistics for model assessment. The main part of the paper is devoted to development and assessment of 27 credit scoring models on real credit data, which are built up and assessed according various definitions of target variable. We show that there is a valid reason for some target definitions to include the indeterminate value into the modelling process, as it provided us with convincing results.

Keywords: credit scoring, indeterminate value, target variable, Gini, KS, lift

Received: June 19, 2013; Published: December 24, 2013  Show citation

ACS AIP APA ASA Harvard Chicago IEEE ISO690 MLA NLM Turabian Vancouver
Řezáč, M., & Toma, L. (2013). Indeterminate values of target variable in development of credit scoring models. Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis61(7), 2709-2716. doi: 10.11118/actaun201361072709
Download citation

References

  1. ANDERSON, R., 2007: The credit scoring toolkit: theory and practice for retail credit risk management and decision automation. Oxford: Oxford University Press. 731, ISBN 978-0-19-922640-5.
  2. BEARDSELL, M., 2004: Impact of Indeterminate Performance Exclusions on Credit Score Model Development. In: Mays, E. (ed.), Credit Scoring for risk managers: The handbook for lenders. Mason: South-Western Publishing, 147-156. ISBN 978-1450578967.
  3. BOLTON, CH., 2009: Logistic regression and its application in credit scoring, http://upetd.up.ac.za/thesis/available/etd-08172010-202405/unrestricted/dissertation.pdf. Accessed on January 12, 2013.
  4. COPPOCK, D. S., 2002: Why Lift? DM Review Online. http://www.dmreview.com/news/5329-1.html. Accessed on December 1, 2009.
  5. GIUDICI, P., 2003: Applied Data Mining: statistical methods for business and industry. Chichester: Wiley. 364, ISBN 978-0-470-84679-7.
  6. HOSMER, D. W. and Lemeshow, S., 2000: Applied Logistic Regression. 2nd ed., New York: John Wiley & Sons Inc. 375, ISBN 978-0-471-35632-5. Go to original source...
  7. KOČENDA, E. and VOJTEK, M., 2009: Default Predictors and Credit Scoring Models for Retail Banking. CESifo Working Paper, No. 2862. Go to original source...
  8. MÜLLER, M. and RÖNZ, B., 2000: Credit Scoring using Semiparametric Methods. In: FRANKE, J., HÄRDLE, W., STAHL, G. (Eds.): Measuring Risk in Complex Stochastic Systems. New York: Springer-Verlag, 85-102. ISBN 978-0-387-98996-9. Go to original source...
  9. SARLIJA, N., BENSIC, M., ZEKIC-SUSAC, M., 2006: A neural network classification of credit applicants in consumer credit scoring. In: AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications. Innsbruck: ACTA Press, Anaheim, 205-210. ISBN 0-88986-556-6.
  10. ŘEZÁČ, M. and KOLÁČEK, J., 2012: Lift-Based Quality Indexes for Credit Scoring Models as an Alternative to Gini and KS. Journal of Statistics: Advances in Theory and Applications 7, 1: 1-23. ISSN 0975-1262.
  11. ŘEZÁČ, M. and ŘEZÁČ, F., 2011: How to Measure the Quality of Credit Scoring Models. Finance a úvěr - Czech Journal of Economics and Finance, 61, 5: 486-507. ISSN 0015-1920.
  12. SIDDIQI, N., 2006: Credit Risk Scorecards: developing and implementing intelligent credit scoring, New Jersey: Wiley, 196. ISBN 978-0-471-75451-0.
  13. THOMAS, L. C., EDELMAN, D. B., CROOK, J. N., 2002: Credit Scoring and Its Applications. Philadelphia: SIAM Monographs on Mathematical Modeling and Computation, 248. ISBN 0-89871-483-4. Go to original source...
  14. WILKIE, A. D., 2004: Measures for comparing scoring systems. In: THOMAS, L. C., EDELMAN, D. B., CROOK, J. N. (Eds.): Readings in Credit Scoring. Oxford: Oxford University Press, pp. 51-62. ISBN 978-0-19-852797-8.
  15. XU, K., 2003: How has the literature on Gini's index evolved in past 80 years? http://economics.dal.ca/RePEc/dal/wparch/howgini.pdf. Accessed on December 1, 2009. Go to original source...

This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY NC ND 4.0), which permits non-comercial use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.