A hybrid model for predicting academic performance of engineering undergraduates
Abstract
Predicting engineering students’ academic performance is crucial for teachers to implement teaching interventions. How to use students’ behavior records to predict grades during school has attracted extensive attention from researchers. Various machine learning methods have been proposed by previous researchers to predict engineering students’ academic performance through their learning records. However, one of the major challenges now is that many researchers ignore the problem of reduced accuracy of prediction models due to imbalanced data. Simultaneously, a single classifier is susceptible to data changes and has an unstable classification performance. To this end, this paper proposes the RBF-Naive Bayes (RBFNB) prediction model to predict students’ academic performance, which integrates Radial Basis Function (RBF) Network algorithm and Naive Bayes algorithm. The method of RBF Network is applied to predict the academic performance of students and generate prediction results. Then, the method of Naive Bayes is adopted to correct the previous prediction results for improving the accuracy of the prediction. The proposed RBFNB prediction model was applied to the data set about the engineering students’ academic performance from a university and the Kalboard 360 public data set, respectively.
References
- 1. , School choice in a market environment: Individual versus social expectations, Complexity 2018 :1–11, 2018. Crossref, Web of Science, Google Scholar
- 2. , Zero shot learning for code education: Rubric sampling with deep learning inference, Proc. AAAI Conf. Artificial Intelligence, pp. 782–790, 2019. Crossref, Google Scholar
- 3. , Transitioning from face-to-face to blended and full online learning engineering master’s program, IEEE Trans. Educ. 63(1) :2–9, 2019. Crossref, Web of Science, Google Scholar
- 4. , Web-based platform for K-12 AI education in China, Proc. AAAI Conf. Artificial Intelligence, Vol. 35, No. 17, pp. 15687–15694, 2021. Crossref, Google Scholar
- 5. , Incidence of digital competences in the completion rates of MOOCs: Case study on energy sustainability courses, IEEE Trans. Educ. 63(3) :183–189, 2020. Crossref, Web of Science, Google Scholar
- 6. , Respecting data privacy in educational data mining: An approach to the transparent handling of student data and dealing with the resulting missing value problem, 2018 IEEE 27th Int. Conf. Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE),
Bayonne, France , pp. 160–164, 2018. Crossref, Google Scholar - 7. , Metacognitive engagement during problem solving while in naturalistic homework study groups, Int. J. Eng. Educ. 37(1) :96–114, 2021. Web of Science, Google Scholar
- 8. , Strategies for continuously improving the professional development and practice of engineering educators, Int. J. Eng. Educ. 37(1) :287–297, 2021. Web of Science, Google Scholar
- 9. , Influence of context on item-specific self-efficacy and competence of engineering students, Int. J. Eng. Educ. 33(4) :1297–1306, 2017. Web of Science, Google Scholar
- 10. , Longitudinal academic performance analysis using a two-step clustering methodology, Int. J. Eng. Educ. 33(1) :203–215, 2017. Web of Science, Google Scholar
- 11. , Prediction for student academic performance using SMNaive bayes model, Int. Conf. Advanced Data Mining and Applications,
Dalian, China , pp. 712–725, 2019. Crossref, Google Scholar - 12. , Evaluating the effect of program visualization on student motivation, IEEE Trans. Educ. 60(3) :238–245, 2017. Crossref, Web of Science, Google Scholar
- 13. , Tracking student performance in introductory programming by means of machine learning, 2019 4th MEC International Conference on Big Data and Smart City (ICBDSC),
Muscat, Oman , pp. 1–6, 2019. Crossref, Google Scholar - 14. , A review on predictive modeling technique for student academic performance monitoring, MATEC Web of Conf.,
Malaysia , Vol. 255, p. 03004, 2019. Crossref, Google Scholar - 15. , Student performance prediction model for early-identification of at-risk students in traditional classroom settings, Proc. 10th Int. Conf. Management of Digital EcoSystems, pp. 239–245, 2018. Crossref, Google Scholar
- 16. , BLDC motor-driven fluid pumping system design: An extrapolated active learning case study for electrical machines classes, IEEE Trans. Educ. 63(3) :173–182, 2020. Crossref, Web of Science, Google Scholar
- 17. Mondal A. K., Dolz J., Desrosiers C., Few-shot 3d multi-modal medical image segmentation using generative adversarial learning, arXiv:1810.12241, 2018. Google Scholar
- 18. , Spatiotemporal data fusion in graph convolutional networks for traffic prediction, IEEE Access 8 :76632–76641, 2020. Crossref, Web of Science, Google Scholar
- 19. , Improving student engagement in teaching electric machines through blended learning, IEEE Trans. Educ. 62(4) :297–304, 2019. Crossref, Web of Science, Google Scholar
- 20. , Using topic modeling to extract pre-service teachers’ understandings of computational thinking from their coding reflections, IEEE Trans. Educ. 62(4) :325–332, 2019. Crossref, Web of Science, Google Scholar
- 21. , Towards emotionally aware AI smart classroom: Current issues and directions for engineering and education, IEEE Access 6 :5308–5331, 2018. Crossref, Web of Science, Google Scholar
- 22. , A machine learning approach for tracking and predicting student performance in degree programs, IEEE J. Sel. Topics Signal Process. 11(5) :742–753, 2017. Crossref, Web of Science, Google Scholar
- 23. Sebastian E., Kerstin S., Early detection of students at risk–predicting student dropouts using administrative student data and machine learning methods, Arqus Discussion Papers in Quantitative Tax Research, 2018. Google Scholar
- 24. , Machine learning and prediction in medicinebeyond the peak of inflated expectations, N. Engl. J. Med. 376(26) :2507, 2017. Crossref, Web of Science, Google Scholar
- 25. , Student academic performance prediction by using decision tree algorithm, 2018 4th Int. Conf. Computer and Information Sciences (ICCOINS),
Kuching, Malaysia , pp. 1–5, 2018. Crossref, Google Scholar - 26. , A machine learning framework to identify students at risk of adverse academic outcomes, Proc. 21th ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining,
New York, NY , pp. 1909–1918, 2015. Crossref, Google Scholar - 27. , Student academic performance prediction using supervised learning techniques, Int. J. Emerg. Technol. Learn. 14(14) :92, 2019. Crossref, Web of Science, Google Scholar
- 28. , A hybrid approach using oversampling technique and cost-sensitive learning for bankruptcy prediction, Complexity 2019 :1–12, 2019. Crossref, Web of Science, Google Scholar
- 29. , A predictive analytics framework as a countermeasure for attrition of students, Interac. Learn. Environ. 30(6) :1028–1043, 2022. Crossref, Web of Science, Google Scholar
- 30. , Prediction of defective software modules using class imbalance learning, Appl. Comput. Intell. Soft Comput. 2016 :Article ID 7658207, 2016. Google Scholar
- 31. , Selection of best decision tree algorithm for prediction and classification of students’ action, Am. Int. J. Res. Sci., Technol., Eng. Math. 16(1) :26–32, 2016. Google Scholar
- 32. , Psychological correlates of university students’ academic performance: A systematic review and meta-analysis, Psychol. Bull. 138(2) :353, 2012. Crossref, Web of Science, Google Scholar
- 33. , SMOTE for learning from imbalanced data: Progress and challenges, marking the 15-year anniversary, J. Artif. Intell. Res. 61 :863–905, 2018. Crossref, Web of Science, Google Scholar
- 34. , Classification of wine quality with imbalanced data, 2016 IEEE Int. Conf. Industrial Technology (ICIT),
Valencia, Spain , pp. 1712–1217, 2016. Crossref, Google Scholar - 35. , Training deep neural networks on imbalanced data sets, 2016 Int. Joint Conf. Neural Networks (IJCNN),
Shenzhen, China , pp. 4368–4374, 2016. Crossref, Google Scholar - 36. , Modelling student performance using data mining techniques: Inputs for academic program development, Proc. 2019 5th Int. Conf. Computing and Data Engineering,
New York, NY , pp. 36–40, 2019. Crossref, Google Scholar - 37. , Predicting academic performance: A bootstrapping approach for learning dynamic Bayesian networks, Int. Conf. Artificial Intelligence in Education,
Chicago , pp. 26–36, 2019. Crossref, Google Scholar - 38. , Iterative metric learning for imbalance data classification, IJCAI,
Stockholm , pp. 2805–2811, 2018. Crossref, Google Scholar - 39. , A novel ensemble method for classifying imbalanced data, Pattern Recogn. 48(5) :1623–1637, 2015. Crossref, Web of Science, Google Scholar
- 40. , Label propagation based semi-supervised learning for software defect prediction, Autom. Softw. Eng. 24(1) :47–69, 2017. Crossref, Web of Science, Google Scholar
- 41. , A model to predict low academic performance at a specific enrollment using data mining, IEEE Rev. Iberoam. Tecnol. Aprendiz. 10(3) :119–125, 2015. Google Scholar
- 42. , Symbolic one-class learning from imbalanced datasets: Application in medical diagnosis, Int. J. Artif. Intell. Tools 18(02) :273–309, 2009. Link, Web of Science, Google Scholar
- 43. , Predicting students’ academic performance and main behavioral features using data mining techniques, Int. Conf. Computing,
Riyadh, Saudi Arabia , pp. 245–259, 2019. Google Scholar - 44. , Fast-match: Fast affine template matching, Proc. IEEE Conf. Computer Vision and Pattern Recognition,
Portland, USA , pp. 2331–2338, 2013. Crossref, Google Scholar - 45. , Predicting honors student performance using RBFNN and PCA method, Int. Conf. Database Systems for Advanced Applications,
Suzhou, China , pp. 364–375, 2017. Crossref, Google Scholar - 46. , Data mining: Practical machine learning tools and techniques with Java implementations, ACM SIGMOD Record 31(1) :76–77, 2002. Crossref, Google Scholar
- 47. , Preprocessing and analyzing educational data set using X-API for improving student’s performance, 2015 IEEE Jordan Conf. Applied Electrical Engineering and Computing Technologies (AEECT),
Aqaba, Jordan , pp. 1–5, 2015. Crossref, Google Scholar - 48. , Analyzing students’ academic performance through educational data mining, 3c Tecnología: Glosas de innovación aplicadas a la pyme 8(1) :402–421, 2019. Google Scholar
- 49. , Mining educational data to predict student’s academic performance using ensemble methods, Int. J. Database Theory Appl. 9(8) :119–136, 2016. Crossref, Google Scholar
- 50. , Hybrid resampling to handle imbalanced class on classification of student performance in classroom, 2017 1st Int. Conf. Informatics and Computational Sciences (ICICoS),
Semarang, Indonesia , pp. 207–212, 2017. Crossref, Google Scholar - 51. , Analogy-based reasoning in classifier construction, in Transactions on Rough Sets IV,
Springer, Berlin , pp. 277–374, 2015. Google Scholar - 52. , Rseslib 3: Library of rough set and machine learning methods with extensible architecture, in Transactions on Rough Sets XXI,
Springer, Berlin , pp. 301–323, 2019. Crossref, Google Scholar - 53. , Comparison of predicting student’s performance using machine learning algorithms, Int. J. Intell. Syst. Appl. 11(12) :34, 2019. Google Scholar
- 54. , Attributes of low performing students in e-learning system using clustering technique, 2019 Int. Conf. Computational Science and Computational Intelligence (CSCI),
Las Vegas, USA , pp. 1324–1328, 2019. Crossref, Google Scholar - 55. , Learning analytics in Universiti Teknologi Brunei: predicting graduates performance, 2018 Fourth Int. Conf. Advances in Computing, Communication & Automation (ICACCA),
Subang Jaya, Malaysia , pp. 1–5, 2018. Crossref, Google Scholar