Prediction of heart disease based on logistic regression
- 1 School of Data Science, Capital University of Economics and Business, Beijing, 100000, China
* Author to whom correspondence should be addressed.
Abstract
Heart disease is a major threat to human health, with a variety of contributing factors, and is not easily cured. This paper will present a dataset from a cardiovascular study of residents of Framingham, Massachusetts. First, the validity of the three models, logistic regression, random forest, and decision tree, is estimated by comparing information such as accuracy, precision, recall, and F1 values. The optimal model, i.e., the logistic regression model, was selected by plotting ROC curves and using AUC as a reference criterion for assessing the predictive effectiveness of the models. Then the raw data and data were preprocessed, including dealing with missing values. Finally, a logistic regression model was developed to analyze the influencing factors of heart disease. The purpose of this study was to use the results of the logistic model to help doctors and patients in heart disease treatment. The results show that the model has a good predictive effect.
Keywords
Logistic regression, heart disease, ROC curve
[1]. Jiang L, Ding S, Zhang L P, et al. 2017 Changes in plasma neutrophil gelatinase-associated lipid transport protein (NGAL) in relation to acute postoperative lung injury in infants and children with congenital heart disease. Advances in Modern Biomedicine, 17(8), 1570-1573.
[2]. Kondo T, Nakano Y, Adachi S, et al. 2019 Efects of tobacco smoking on cardiovascular disease. Circ J, 83(10), 1980-1985.
[3]. Wu S, Xu W, Guan C, et al. 2023 Global burden of cardiovascular disease attributable to metabolic risk factors,1990 -2019: an analysis of observational data from a 2019 Global Burden of Disease study. BMJ Open, 13(5).
[4]. Chia C W, Egan J M, Ferrucci L 2018 Age-related changes in glucose metabolism, hyperglycemia, and cardiovascular risk. Circ Res, 123(7), 886-904.
[5]. Jin Y, So H, Cerin E, et al. 2023 The temporal trend of disease burden at-tributable to metabolic risk factors in China, 1990-2019: an analysis of the Global Burden of Disease study. Front Nutr.
[6]. Huang Y, Li Y L, Yan W T, Wang G, Wang B W, Xie P 2024 Trend Analysis and Future Trend Forecast of Ischemic Heart Disease Burden Attributable to Fasting Hyperglycemia in China, 1990-2019. Chronic Disease Prevention and Control in China, 32(3), 176-182.
[7]. Janssen F, Bardoutsos A, Ei Gewily S, et al. 2021 Future life expectancy in europe taking into account the impact of smoking, obesity and alcohol. ELife.
[8]. Li FW, Wen SJ, Tang QX, et al. 2020 Impact of injury-related deaths on life expectancy in China. Cadernos de saude publica, 36(11).
[9]. Larsson S C, Wolk A, Beck M 2017 Alcohol consumption, cigarete smoking and incidence of aortic valve stenosis. J Intern Med, 282(4).
[10]. Markus M R, Lieb W, Stritzke J, et al. 2015 Light to moderat alcohol consumption is asociated with lower risk of aortic valve sclerosis: the study of health in pomerania (SHIP). Arterioscler Thromb Vasc Biol, 35(5).
[11]. Wang R F, Luo Y, Chen Z S, et al. 2021 Relationship between cardiometabolic co-morbidities and disability in Chinese middle-aged and elderly people. Journal of Jilin University (Medical Edition). 47(3), 761-769.
[12]. Ji H T, Zhao Y X, Yu X Q, Zhang C C, Liu Z D and Chai Q 2023 Effect of smoking and low-density lipoprotein cholesterol interaction on valvular heart disease. Preventive Medicine Forum, 29(1), 46-49.
[13]. Ni Z H 2023 Construction of a predictive model for the risk of limited ability to perform activities of daily living in elderly cardiac patients. Geriatrics research, 4(6), 33-38.
[14]. Zhang X H 2023 Factor Analysis of Heart Disease Diagnosis Based on Logistic Regression and Decision Tree. Modern information technology, 7(7), 117-123.
[15]. Zhang Y Y, Ge R G and Sun G 2020 A study of patients' perceptions of excessive medical examinations and influencing factors based on binary logistics regression. China Health Care Management, 37(12), 893-895+899.
[16]. Yan J J, Wu H, and Han B D 2020 Multifactorial Logistics Regression Analysis of Risk Factors for Residual Cavity Formation after Tuberculous Septic Thorax Surgery. Medical Innovation in China, 17(18), 128-131.
Cite this article
Zhang,Z. (2024).Prediction of heart disease based on logistic regression.Theoretical and Natural Science,51,1-7.
Data availability
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
Disclaimer/Publisher's Note
The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content
About volume
Volume title: Proceedings of the Quantum Machine Learning: Bridging Quantum Physics and Computational Simulations - CONFMPCS 2024
© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and
conditions of the Creative Commons Attribution (CC BY) license. Authors who
publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons
Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this
series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published
version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial
publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and
during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See
Open access policy for details).