Research Article
Open access
Published on 27 September 2024
Download pdf
Zhu,W. (2024). High school student GPA prediction by various linear regression models. Theoretical and Natural Science,52,153-162.
Export citation

High school student GPA prediction by various linear regression models

Weijia Zhu *,1,
  • 1 Mathematics Science and Statistics Program, University of Toronto, ON, L5L 1C6, Canada

* Author to whom correspondence should be addressed.

https://doi.org/10.54254/2753-8818/52/2024CH0136

Abstract

Academic performance (GPA) is a significant index for high school students in North America. The research aims to develop and validate the best predictive regression model to evaluate the GPA of high school students. In addition, the prediction numeric data of GPA can correspond to specific classification data of GPA (Grade Level) to calculate and compare the models’ accuracy for predicting Grade Level. The research subjects are high school students from different backgrounds in North America, including their background, study habits, participation in extracurricular activities, etc. The experiment explores the impact of different factors on student GPA and finds that the number of absences from lectures is a key factor in predicting student GPA. Multiple linear regression analysis is used as the main model in the experiment, which may be improved by the stepwise regression methods. The generalization ability of the model is evaluated through cross-validation (CV) methods. Also, the boosting or random forest model is used to be the comparing model for predicting GPA. The experimental result shows that the multiple linear regression model has high accuracy (84%) and reliability (R^2 value is 0.95) in predicting student GPA. The conclusion of the research emphasizes the importance of predicting student GPA in high school education and the potential for guiding educational practice through data analysis. Future work will consider introducing more subjects and variables, such as different subject learning abilities, mental health, and social support, to further improve the predictive accuracy of the model.

Keywords

GPA, linear regression models, stepwise regression, prediction

[1]. Sawyer R 2013 Beyond Correlations: Usefulness of High School GPA and Test Scores in Making College Admissions Decisions. Applied Measurement in Education, 26(2), 89-112.

[2]. Kobrin J L, et al. 2008 Validity of the SAT® for Predicting First-Year College Grade Point Average. Research Report, College Board.

[3]. Miami U O 2014 New Research Shows How Students' High School GPA Could Affect Their Income. Lab Manager.

[4]. Philippe F L, et al. 2023 Organized civic and non-civic activities as predictors of academic GPA in high school students. Applied Developmental Science, 27(2), 189-204.

[5]. Rahafar A, Maghsudloo M, Farhangnia S, Vollmer C and Randler C 2015 The role of chronotype, gender, test anxiety, and conscientiousness in academic achievement of high school students. Chronobiology International, 33(1), 1-9.

[6]. Warren J E 2016 Small Learning Communities and High School Academic Success. ProQuest Dissertations & Theses.

[7]. Paolo A M, Bonaminio G A, Durham D and Stites S W 2004 Comparison and Cross-validation of Simple and Multiple Logistic Regression Models to Predict USMLE Step 1 Performance. Teaching and Learning in Medicine, 16(1), 69-73.

[8]. Hassan Y M I, Elkorany A and Wassif K 202. Utilizing Social Clustering-Based Regression Model for Predicting Student’s GPA. IEEE Access, 10, 48948-48963.

[9]. Nasiri M, Minaei B and Vafaei F 2012 Predicting GPA and academic dismissal in LMS using educational data mining: A case mining. 6th National and 3rd International Conference of E-Learning and E-Teaching, 53-58.

[10]. Cai J, Xu K, Zhu Y, Hu F and Li L 2020 Prediction and analysis of net ecosystem carbon exchange based on gradient boosting regression and random forest. Applied Energy, 262, 114566.

Cite this article

Zhu,W. (2024). High school student GPA prediction by various linear regression models. Theoretical and Natural Science,52,153-162.

Data availability

The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.

Disclaimer/Publisher's Note

The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

About volume

Volume title: Proceedings of CONF-MPCS 2024 Workshop: Quantum Machine Learning: Bridging Quantum Physics and Computational Simulations

Conference website: https://2024.confmpcs.org/
ISBN:978-1-83558-621-1(Print) / 978-1-83558-622-8(Online)
Conference date: 9 August 2024
Editor:Anil Fernando, Marwan Omar
Series: Theoretical and Natural Science
Volume number: Vol.52
ISSN:2753-8818(Print) / 2753-8826(Online)

© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. Authors who publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See Open access policy for details).