
Credit Evaluation System Based on FICO
- 1 School of Computer and Information Engineering, Henan University of Economics and Law, Zhengzhou, Henan Province, China
- 2 School of AI and Advanced Computing Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu Province, China
* Author to whom correspondence should be addressed.
Abstract
As credit transactions become more prevalent, financial institutions require effective methods to assess credit risk and reduce the likelihood of borrower default. In the U.S., the Fair Isaac Credit Organization (FICO) score is widely used by banks and insurers to evaluate personal creditworthiness. This paper aims to develop an automated credit scoring tool based on the FICO system to help financial institutions improve risk assessment. The paper leverages the random forest algorithm for data preprocessing and feature engineering to extract key variables, such as the borrower's financial status and credit history. To ensure data stability and interpretability, Information Value and Weight of Evidence techniques are applied to process these variables. Additionally, the Sigmoid function is used to map the model output to a range between 0 and 1, making it suitable for generating credit scores. This random forest algorithm helps handle non-linear relationships and missing data, while cross-validation enhances the model’s generalization ability. After training, the paper achieved an automated credit scoring system, closely aligned with the FICO scoring system. The model’s Area Under the Curve (AUC) value reached 0.84, indicating strong predictive accuracy and reliability. This tool enables financial institutions to more accurately assess credit risk, offering a robust, data-driven approach to improve decision-making and risk management.
Keywords
Credit Evaluation, FICO, Machine Learning, Python.
[1]. Mester, L. J. (1997). What’s the point of credit scoring?. Business Review, 3, 3-16.
[2]. Thomas, L. C., Crook, J. N., & Edelman, D. B. (2017). Credit scoring and its applications. SIAM.
[3]. Baesens, B., Van Gestel, T., Stepanova, M., Suykens, J., & Vanthienen, J. (2003). Benchmarking logistic regression and support vector machine classifiers for credit scoring. Journal of the Operational Research Society, 54(6), 627-635.
[4]. Lessmann, S., Baesens, B., Seow, H. V., & Thomas, L. C. (2015). Benchmarking state-of-the-art classification algorithms for credit scoring: An update of the research literature. European Journal of Operational Research, 247(1), 124-136.
[5]. Bellotti, T., & Crook, J. (2009). Support vector machines for credit scoring and discovery of significant features. Expert Systems with Applications, 36(2), 3302-3308.
[6]. Stekhoven, D. J., & Bühlmann, P. (2012). MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics, 28(1), 112-118.
[7]. Siddiqi, N. (2012). Credit risk scorecards: Developing and implementing intelligent credit scoring. John Wiley & Sons.
[8]. Fawcett, T. (2006). An introduction to ROC analysis. Pattern Recognition Letters, 27(8), 861-874.
[9]. Hand, D. J. (2009). Measuring classifier performance: a coherent alternative to the area under the ROC curve. Machine Learning, 77(1), 103-123.
[10]. Kshetri, N. (2016). Big data’s role in expanding access to financial services in China. International Journal of Information Management, 36(3), 297-308.
[11]. Baesens, B., Van Gestel, T., Stepanova, M., Suykens, J., & Vanthienen, J. (2003). Benchmarking logistic regression and support vector machine classifiers for credit scoring. Journal of the Operational Research Society, 54(6), 627-635.
[12]. Lessmann, S., Baesens, B., Seow, H. V., & Thomas, L. C. (2015). Benchmarking state-of-the-art classification algorithms for credit scoring: An update of the research literature. European Journal of Operational Research, 247(1), 124-136.
[13]. Stekhoven, D. J., & Bühlmann, P. (2012). MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics, 28(1), 112-118.
[14]. Siddiqi, N. (2012). Credit risk scorecards: Developing and implementing intelligent credit scoring. John Wiley & Sons.
[15]. Fawcett, T. (2006). An introduction to ROC analysis. Pattern Recognition Letters, 27(8), 861-874.
Cite this article
Li,Y.;Shi,Y. (2024). Credit Evaluation System Based on FICO. Applied and Computational Engineering,96,48-55.
Data availability
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
Disclaimer/Publisher's Note
The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
About volume
Volume title: Proceedings of the 2nd International Conference on Machine Learning and Automation
© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and
conditions of the Creative Commons Attribution (CC BY) license. Authors who
publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons
Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this
series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published
version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial
publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and
during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See
Open access policy for details).