
Bank Customer Churn Prediction with Machine Learning Methods
- 1 University College London
* Author to whom correspondence should be addressed.
Abstract
This paper examines and analyses customer churn prediction in the banking sector using the data from ABC Bank. The analysis conducted will document the determinants of bank customer churn and provide insights to the most important factors which influence the customers decision to quit utilizing the services of a bank. The investigation is based on the results of two machine learning algorithms with k-fold-cross-validation and same boosting methods. The result of the analysis reveals that out of logistic regression and random forests algorithms, the random forest methods show a higher accuracy score which corresponds with the literature review studied. Furthermore, the statistic of the research indicates that customer’s age has the highest association with the likelihood of customer churning, while whether the customer has a credit card at the bank has the lowest interconnection. The results of this research may provide valid explanations to customer churn in the banking sector and bring further intuitions of the advantages which machine learning methods may provide to future financial analysis.
Keywords
Bank Customer, Prediction, Machine Learning
[1]. Briker, V., Farrow, R., Trevino, W., & Allen, B. (2019). SMU Data Science Review, 2(3).
[2]. Buckinx, W., & Van den Poel, D. (2005). c. European Journal of Operational Research, 164(1), 252–268. doi:10.1016/j.ejor.2003.12.010
[3]. Cole, A. (2020). Retrieved from https://towardsdatascience.com/predicting-customer-churn-using-logistic-regression-c6076f37eaca
[4]. Czímer, B., Dietz, M., László, V., & Sengupta, J. (2022). Retrieved from https://www.mckinsey.com/industries/financial-services/our-insights/the-future-of-banks-a-20-trillion-dollar-breakup-opportunity
[5]. de Lima Lemos, R. A., Silva, T. C., & Tabak, B. M. (2022). Propension to customer churn in a financial institution: A machine learning approach. Neural Computing and Applications, 34(14), 11751–11768. doi:10.1007/s00521-022-07067-x
[6]. Guliyev, H., & Yerdelen Tatoğlu, F. (2021). Customer churn analysis in banking sector: Evidence from explainable machine learning models. Journal of Applied Microeconometrics, 1(2), 85–99. doi:10.53753/jame.1.2.03
[7]. J, S., Gangadhar, Ch., Arora, R. K., Renjith, P. N., Bamini, J., & Chincholkar, Y. devidas. (2023). E-commerce customer churn prevention using machine learning-based business intelligence strategy. Measurement: Sensors, 27, 100728. doi:10.1016/j.measen.2023.100728
[8]. Jain, H., Khunteta, A., & Srivastava, S. (2020). Churn prediction in telecommunication using logistic regression and logit boost. Procedia Computer Science, 167, 101–112. doi:10.1016/j.procs.2020.03.187
[9]. Jamal, Z., & Bucklin, R. E. (2006). Improving the diagnosis and prediction of customer churn: A heterogeneous hazard modeling approach. Journal of Interactive Marketing, 20(3–4), 16–29. doi:10.1002/dir.20064
[10]. Neslin, S. A., Gupta, S., Kamakura, W., Lu, J., & Mason, C. H. (2006). Defection detection: Measuring and understanding the predictive accuracy of customer churn models. Journal of Marketing Research, 43(2), 204–211. doi:10.1509/jmkr.43.2.204
[11]. Stoltzfus, J. C. (2011). Logistic regression: A brief primer. Academic Emergency Medicine, 18(10), 1099–1104. doi:10.1111/j.1553-2712.2011.01185.x
[12]. Suh, Y. (2023). Machine learning based customer churn prediction in home appliance rental business. Journal of Big Data, 10(1). doi:10.1186/s40537-023-00721-8
[13]. Ullah, I., Raza, B., Malik, A. K., Imran, M., Islam, S. U., & Kim, S. W. (2019). A churn prediction model using Random Forest: Analysis of machine learning techniques for churn prediction and factor identification in telecom sector. IEEE Access, 7, 60134–60149. doi:10.1109/access.2019.2914999
[14]. Vafeiadis, T., Diamantaras, K. I., Sarigiannidis, G., & Chatzisavvas, K. Ch. (2015). A comparison of machine learning techniques for customer churn prediction. Simulation Modelling Practice and Theory, 55, 1–9. doi:10.1016/j.simpat.2015.03.003
Cite this article
Zhu,H. (2024). Bank Customer Churn Prediction with Machine Learning Methods. Advances in Economics, Management and Political Sciences,69,23-29.
Data availability
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
Disclaimer/Publisher's Note
The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
About volume
Volume title: Proceedings of the 2nd International Conference on Financial Technology and Business Analysis
© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and
conditions of the Creative Commons Attribution (CC BY) license. Authors who
publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons
Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this
series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published
version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial
publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and
during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See
Open access policy for details).