Application of Lightweight Characteristic Residual Frame in Small Sample Score Prediction

Authors

  • Minghui Zhang Faculty of Information Science and Technology, Zhengzhou Normal University, Zhengzhou, China
  • Siti Khatijah Nor Abdul Rahim Faculty of Computer & Mathematical Sciences, Universiti Teknologi MARA, 40450 Shah Alam, Selangor, Malaysia
  • Raseeda Hamzah Faculty of Computer & Mathematical Sciences, Universiti Teknologi MARA, 77008 Jasin, Melaka,Malaysia

DOI:

https://doi.org/10.33736/jaspe.10465.2026

Keywords:

Multi-source and Heterogeneous Data, Feature Engineering, Equal Frequency Division Box, Cascade Residual.

Abstract

The analysis and modeling of educational data are of great significance to the evaluation of teaching quality and personalized learning guidance. However, the acquisition of academic data is often limited by the costs of data collection and the actual teaching scenarios that occur. Challenges like limited data access, small samples, and data sparsity make small-sample analysis both unavoidable and a persistent challenge in educational research. This study integrates the multi-feature data of 296 students majoring in computer science from a university in Zhengzhou, China. It proposed a feature residual cascade prediction framework that integrates binning technology. Firstly, the unified feature space of multimodal feature fusion is constructed through feature filtering and feature generation. Secondly, a high-precision and high-efficiency prediction model is established by combining the random forest strategy with box division residual error correction, named ReBin (Residual-Binned Model). The experimental results show that the method achieves excellent predictive performance with R ²=0.99 under limited sample conditions, and the improved ReBin model does not generate additional computational burden in terms of execution efficiency. By constructing a comprehensive comparative study of the system, significant breakthroughs have been made in both prediction accuracy and computational efficiency. This further confirms that this study not only provides an effective solution for the analysis of small sample data in education, but also provides an innovative modeling framework for the prediction research of small sample data in other fields, which has important theoretical reference and application value.

References

Li, X., Ke, S., Li, Y., Jin, W., Fu, X., Fu, G., & Bi, W. (2024). Temperature compensation based on BP neural network with small sample data for chloride ions optical fiber probe. Optics & Laser Technology, 176, 110973. https://doi.org/10.1016/j.optlastec.2024.110973

Yu, C., Li, W., Guo, Y., Sun, X., Hong, F., Sun, N., & Zhang, Q. (2024). Research on wear rate of train brake pads driven by small sample data. Wear, 536, 205169. https://doi.org/10.1016/j.wear.2023.205169

Wu, Z., Ma, C., Zhang, L., Gui, H., Liu, J., & Liu, Z. (2024). Predicting and compensating for small-sample thermal information data in precision machine tools: A spatial-temporal interactive integration network and digital twin system approach. Applied Soft Computing, 161, 111760. https://doi.org/10.1016/j.asoc.2024.111760

Li, C., Wang, L., Li, J., & Chen, Y. (2024). Application of multi-algorithm ensemble methods in high-dimensional and small-sample data of geotechnical engineering: A case study of swelling pressure of expansive soils. Journal of Rock Mechanics and Geotechnical Engineering, 16(5), 1896-1917. https://doi.org/10.1016/j.jrmge.2023.10.015

Cheng, Z., Chen, F., Zuo, E., Gong, Z., Xiao, M., Li, C., Liu, Y., Liu, P., Chen, C., Lv, X. & Chen, C. (2025). The MLSE-SCAM architecture combines with the improved DRSN-TIC model for Raman spectroscopy small-sample data learning. Expert Systems with Applications, 279, 127462. https://doi.org/10.1016/j.eswa.2025.127462

Marbouti, F., Diefes-Dux, H. A., & Madhavan, K. (2016). Models for early prediction of at-risk students in a course using standards-based grading. Computers & Education, 103, 1-15. https://doi.org/10.1016/j.compedu.2016.09.005

Lakkaraju, H., Aguiar, E., Shan, C., Miller, D., Bhanpuri, N., Ghani, R., & Addison, K. L. (2015, August). A machine learning framework to identify students at risk of adverse academic outcomes. In Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, Sydney, NSW, Australia (pp. 1909-1918). https://doi.org/10.1145/2783258.2788620

Zeineddine, H., Braendle, U., & Farah, A. (2021). Enhancing prediction of student success: Automated machine learning approach. Computers & Electrical Engineering, 89, 106903. https://doi.org/10.1016/j.compeleceng.2020.106903

Aggarwal, D., Mittal, S., & Bali, V. (2021). Significance of non-academic parameters for predicting student performance using ensemble learning techniques. International Journal of System Dynamics Applications (IJSDA), 10(3), 38-49. DOI: 10.4018/IJSDA.2021070103

Bache K. & Silva A, M. G. (2013).The development of a student performance prediction model for improved student retention. In Proceedings of the 5th Annual Future Business Technology Conference.

Kovacic Z. (2010).Early prediction of student success: Mining students’ enrolment data. In Proceedings of the InSITE 2010: Informing Science , IT Education Conference, Cassino, Italy.

Hu, Y. H., Lo, C. L., & Shih, S. P. (2014). Developing early warning systems to predict students’ online learning performance. Computers in Human Behavior, 36, 469-478. https://doi.org/10.1016/j.chb.2014.04.002

Schlegl, T., Seeböck, P., Waldstein, S. M., Langs, G., Schmidt-Erfurth, U. & Langs, G. (2017). Unsupervised anomaly detection with generative adversarial networks to guide marker discovery[C]. International Conference on Information Processing in Medical Imaging.Cham:Springer, 145-157.https://doi.org/10.48550/arXiv.1703.05921

Kuzilek, J., Hlosta, M., Herrmannova, D., Zdrahal, Z., Vaclavek, J., & Wolff, A. (2015). OU Analyse: analysing at-risk students at The Open University. Learning analytics review, 1-16. http://www.laceproject.eu/learning-analyticsreview/analysing-at-risk-students-at-open-university/

Xu, J., Moon, K. H., & Van Der Schaar, M. (2017). A machine learning approach for tracking and predicting student performance in degree programs. IEEE Journal of Selected Topics in Signal Processing, 11(5), 742-753. https://doi.org/10.1109/JSTSP.2017.2692560

Watson, C., Li, F. W., & Godwin, J. L. (2013, July). Predicting performance in an introductory programming course by logging and analyzing student programming behavior. In 2013 IEEE 13th international conference on advanced learning technologies (pp. 319-323). IEEE. https://doi.org/10.1109/ICALT.2013.99

Al-Shehri, H., Al-Qarni, A., Al-Saati, L., Batoaq, A., Badukhen, H., Alrashed, S., Alhiyafi, J. & Olatunji, S. O. (2017, April). Student performance prediction using support vector machine and k-nearest neighbor. In 2017 IEEE 30th canadian conference on electrical and computer engineering (CCECE) (pp. 1-4). IEEE. doi: 10.1109/CCECE.2017.7946847.

Buenaño-Fernández, D., Gil, D., & Luján-Mora, S. (2019). Application of machine learning in predicting performance for computer engineering students: A case study. Sustainability, 11(10), 2833. https://doi.org/10.3390/su11102833

Niyogisubizo, J., Liao, L., Nziyumva, E., Murwanashyaka, E., & Nshimyumukiza, P. C. (2022). Predicting student's dropout in university classes using two-layer ensemble machine learning approach: A novel stacked generalization. Computers and Education: Artificial Intelligence, 3, 100066. https://doi.org/10.1016/j.caeai.2022.100066

Thaker K.,Huang Y., Brusilovsky P., Daqing H. (2018).Dynamic knowledge modeling with heterogeneous activities for adaptive textbooks. In Proceedings of the 11th International Conference on Educational Data Mining, Buffalo, NY, USA, 592–595.

He, J., Bailey, J., Rubinstein, B., & Zhang, R. (2015). Identifying At-Risk Students in Massive Open Online Courses. Proceedings of the AAAI Conference on Artificial Intelligence, 29(1), 25-30. https://doi.org/10.1609/aaai.v29i1.9471

Iqbal, Z., Qadir, J., Mian, A. N., & Kamiran, F. (2017). Machine learning based student grade prediction: A case study. arXiv preprint arXiv:1708.08744. https://doi.org/10.48550/arXiv.1708.08744

Chowdhury, S., Mayilvahanan, P., & Govindaraj, R. (2022). Optimal feature extraction and classification-oriented medical insurance prediction model: machine learning integrated with the internet of things. International Journal of Computers and Applications, 44(3), 278-290. https://doi.org/10.1080/1206212X.2020.1733307

Daud, A., Aljohani, N. R., Abbasi, R. A., Lytras, M. D., Abbas, F., & Alowibdi, J. S. (2017, April). Predicting student performance using advanced learning analytics. In Proceedings of the 26th international conference on world wide web companion (pp. 415-421). https://doi.org/10.1145/3041021.3054164

Kotsiantis, S., Patriarcheas, K., & Xenos, M. (2010). A combinational incremental ensemble of classifiers as a technique for predicting students’ performance in distance education. Knowledge-Based Systems, 23(6), 529-535. https://doi.org/10.1016/j.knosys.2010.03.010

Sghir, N., Adadi, A., & Lahmer, M. (2023). Recent advances in predictive learning analytics: A decade systematic review (2012–2022). Education and information technologies, 28(7), 8299-8333. https://doi.org/10.1007/s10639-022-11536-0

Downloads

Published

2026-04-30

How to Cite

Zhang, M., Abdul Rahim, S. K. N., & Hamzah, R. (2026). Application of Lightweight Characteristic Residual Frame in Small Sample Score Prediction. Journal of Applied Science &Amp; Process Engineering, 13(1), 1–27. https://doi.org/10.33736/jaspe.10465.2026