
Hybrid Deep Learning Framework for Student Grade Prediction
- 1 Beijing-Dublin International College, Beijing University of Technology, Beijing, 100124, China
* Author to whom correspondence should be addressed.
Abstract
This paper constructs a hybrid prediction framework combining Multi-Layer Perceptron, Long Short-Term Memory, and Transformer models to predict students' academic performance. The framework takes the Grades dataset, combines advanced feature engineering technologies such as time attenuation weighting and topic correlation matrix, and uses mean square error (MSE) and R² index to evaluate the prediction performance. The results show that the performance of the Long Short-Term Memory (LSTM) model is better than other models, and the minimum mean square error of its test set is 4.4821, proving the LSTM model's effectiveness in capturing students' learning time series patterns. The transformer model also performs well, but the mean square error is slightly higher. In addition, the interpretability analysis of SHapley Additive exPlanations (SHAP) reveals the significant contribution of G2 characteristics to the prediction, which provides a basis for targeted education intervention. This study emphasizes the potential of deep learning in educational data mining and emphasizes the importance of interpretability and feature diversity. Future work may explore the integration of more complex feature engineering technologies and further improve the prediction accuracy by combining the advantages of LSTM and transformer models.
Keywords
Learning behavior analysis, academic performance prediction, Long Short-Term Memory, attention mechanism
[1]. Luo, Z.; Mai, J.; Feng, C.; Kong, D.; Liu, J.; Ding, Y.; Qi, B.; Zhu, Z. (2024). A Method for Prediction and Analysis of Student Performance That Combines Multi-Dimensional Features of Time and Space. Mathematics, 12, 3597.
[2]. Li, L. Z. (2021). A Study on the Grade Prediction Model Based on Online Learning Behavior Analysis. Master’s thesis, Computer Application Technology. Guiyang, Guangdong.
[3]. Zhang, Y., An, R., Liu, S., Cui, J., & Shang, X. (2023). Predicting and Understanding Student Learning Performance Using Multi-Source Sparse Attention Convolutional Neural Networks. IEEE Transactions on Big Data, 9(1), 118–133.
[4]. Wang, J., & Yu, Y. (2025). Machine learning approach to student performance prediction of online learning. PLoS ONE, 20(1), e0299018.
[5]. Chui, K. T., Liu, R. W., Zhao, M., & Ordoñez de Pablos, P. (2020). Predicting students’ performance with school and family tutoring using generative adversarial network-based deep support vector machine. IEEE Access, 8(10), 103355–103365.
[6]. Kaggle. (2025). Grades dataset. 2025/4/1. https://www.kaggle.com/datasets/arushikhokharr/grades-dataset
[7]. Li, M., Wang, X., Ruan, S., Zhang, K., & Liu, Q. (2025). Student performance prediction model based on dual-channel attention mechanism. Journal of Computer Research and Development,57(08),1729-1740.
[8]. Liu, K., et al. (2024). A multi-dimensional student performance prediction model (MSPP). Expert Systems with Applications.
[9]. An, M., Han, M., Ren, R., Ke, C., & Chang, Y. (2024). Online learning behavior analysis and grade prediction method. Computer Application Information Technology and Informatization, (9), 17–21.
[10]. Alatawi, H. Q., & Hechmi, S. (2022). A survey of data mining methods for early prediction of students’ performance. In 2022 2nd International Conference on Computing and Information Technology (ICCIT) (pp. 171–174). Tabuk, Saudi Arabia: IEEE.
[11]. Gupta, S., & Agarwal, J. (2022, October). Approaches for student performance prediction. In 2022 10th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO) (pp. 1–6). IEEE.
[12]. Zhao, J Y. (2025). Teaching for Justice and Care -- Discussion and practical Appeal on the Relationship between educational justice ethics and educational care ethics. Education Guide, (03), 20-28.
Cite this article
Xiao,F. (2025). Hybrid Deep Learning Framework for Student Grade Prediction. Applied and Computational Engineering,154,198-203.
Data availability
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
Disclaimer/Publisher's Note
The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
About volume
Volume title: Proceedings of CONF-SEML 2025 Symposium: Machine Learning Theory and Applications
© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and
conditions of the Creative Commons Attribution (CC BY) license. Authors who
publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons
Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this
series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published
version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial
publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and
during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See
Open access policy for details).