Research Article
Open access
Published on 8 November 2024
Download pdf
Yu,K.;Xia,S.;Zhang,Y.;Wang,S. (2024). Loan Approval Prediction Improved by XGBoost Model Based on Four-Vector Optimization Algorithm. Applied and Computational Engineering,82,35-44.
Export citation

Loan Approval Prediction Improved by XGBoost Model Based on Four-Vector Optimization Algorithm

Keke Yu *,1, Siwei Xia 2, Yitian Zhang 3, Shikai Wang 4
  • 1 University of California, Santa Barbara, CA, US
  • 2 Electrical and Computer Engineering, New York University, NY, USA
  • 3 Accounting, UW-Madison, WI, USA
  • 4 Electrical and Computer Engineering, New York University, NY, USA

* Author to whom correspondence should be addressed.

https://doi.org/10.54254/2755-2721/82/20241317

Abstract

This paper discusses an improved XGBoost model based on four-vector optimization algorithm to improve the accuracy of loan approval prediction. Through the analysis of correlation heat maps, we found that there were significant positive and negative correlations among some variables, which laid the foundation for subsequent machine learning analysis. Based on this, we compare the traditional machine learning algorithm with the improved model to evaluate its performance in loan approval forecasting. In the confusion matrix analysis of the training set, the improved XGBoost model demonstrated excellent performance, with all loan approval predictions being correct with 100% accuracy. However, the performance in the test set was slightly different, with 1,182 projects receiving correct loan approval predictions and 99 projects forecasting errors. Of these, 26 projects that should have been predicted to be "unapproved" were incorrectly labeled as "approved," while 73 projects that should have been predicted to be "approved" were incorrectly labeled as "unapproved." These results suggest that we still need to pay attention to the misjudgment of the model in practical application. By synthesizing all model evaluation indicators, we found that the improved XGBoost model based on the four-vector optimization algorithm has a higher accuracy in loan approval prediction than the traditional XGBoost model, with an increase of 1.5%. In addition, the other evaluation indicators also show a trend of significantly better than the traditional model. This study shows that the four-vector optimization algorithm can effectively improve the application effect of XGBoost model in the field of loan approval, and provide more accurate data support and decision-making basis for the financial industry. In the future, we will continue to explore the potential and application prospects of this algorithm in other fields.

Keywords

Four-vector optimization algorithm, XGBoost, Loan approval forecast.

[1]. Viswanatha, V., et al. "Prediction of loan approval in banks using machine learning approach." International Journal of Engineering and Management Research 13.4 (2023): 7-19.

[2]. Uddin, Nazim, et al. "An ensemble machine learning based bank loan approval predictions system with a smart application." International Journal of Cognitive Computing in Engineering 4 (2023): 327-339.

[3]. Krishnaraj, P., S. Rita, and Jitendra Jaiswal. "Comparing Machine Learning Techniques for Loan Approval Prediction." Proceedings of the 1st International Conference on Artificial Intelligence, Communication, IoT, Data Engineering and Security, IACIDS 2023, 23-25 November 2023, Lavasa, Pune, India. 2024.

[4]. Sandhu, Harjyot Singh, Varun Sharma, and Vishali Jassi. "Loan Approval Prediction Using Machine Learning."

[5]. Liu, Yaru, and Huifang Feng. "Hybrid 1DCNN-Attention with Enhanced Data Preprocessing for Loan Approval Prediction." Journal of Computer and Communications 12.8 (2024): 224-241.

[6]. Kandula, Ashok Reddy, et al. "Comparative Analysis for Loan Approval Prediction System Using Machine Learning Algorithms." International Conference on Computer & Communication Technologies. Singapore: Springer Nature Singapore, 2023.

[7]. Phan, Chung, Stefano Filomeni, and Kok Seng Kiong. "The impact of technology on access to credit: A review of loan approval and terms in rural Vietnam and Thailand." Research in International Business and Finance (2024): 102504.

[8]. Ogundunmade, Tayo P., Adedayo A. Adepoju, and Abdelaziz Allam. "Stock price forecasting: Machine learning models with K-fold and repeated cross validation approaches." Mod Econ Manag 1 (2022).

[9]. Budiharto, Widodo. "Data science approach to stock prices forecasting in Indonesia during Covid-19 using Long Short-Term Memory (LSTM)." Journal of big data 8 (2021): 1-9.

Cite this article

Yu,K.;Xia,S.;Zhang,Y.;Wang,S. (2024). Loan Approval Prediction Improved by XGBoost Model Based on Four-Vector Optimization Algorithm. Applied and Computational Engineering,82,35-44.

Data availability

The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.

Disclaimer/Publisher's Note

The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

About volume

Volume title: Proceedings of the 2nd International Conference on Machine Learning and Automation

Conference website: https://2024.confmla.org/
ISBN:978-1-83558-565-8(Print) / 978-1-83558-566-5(Online)
Conference date: 21 November 2024
Editor:Mustafa ISTANBULLU, Anil Fernando
Series: Applied and Computational Engineering
Volume number: Vol.82
ISSN:2755-2721(Print) / 2755-273X(Online)

© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. Authors who publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See Open access policy for details).