ISSN :2582-9793

A Hybrid Shannon Entropy-Driven Ensemble Framework Integrating Random Forest, XGBoost, and CatBoost for Robust Mental Stress Prediction Among School Students

Original Research (Published On: 19-May-2026 )
DOI : https://doi.org/10.54364/AAIML.2026.63305

Dr.A.Leo, meena sarumathi, mahila vasanthi thangam, Clement Sudhahar J, Kevin Joseph J and Lourdu Stepy P

Adv. Artif. Intell. Mach. Learn., XX (XX):-

1. Dr.A.Leo: Karunya Institute of Technology and Sciences

2. meena sarumathi: Karunya Institute of Technology and Science.

3. mahila vasanthi thangam: Karunya Institute of Technology and Sciences.

4. Clement Sudhahar J: ICFAI Business School, IFHE Deemed University.

5. Kevin Joseph J: Division of Electronics and Communication Engineering, Karunya Institute of Technology and Sciences, Tamil Nadu, India

6. Lourdu Stepy P: School of management studies, Karunya Institute of Technology and Science, Tamil Nadu, India.

Download PDF Here

DOI: 10.54364/AAIML.2026.63305

Article History: Received on: 14-Feb-26, Accepted on: 12-May-26, Published on: 19-May-26

Corresponding Author: Dr.A.Leo

Email: leoa@karunya.edu

Citation: Meena Sarumathi S, et al. A Hybrid Shannon Entropy-Driven Ensemble Framework Integrating Random Forest, XGBoost, and CatBoost for Robust Mental Stress Prediction Among School Students. Advances in Artificial Intelligence and Machine Learning.2026. (Ahead of Print) https://dx.doi.org/10.54364/AAIML.2026.63305


Abstract

    

The academic stress, anxiety and the uncertainty as regards the career matters have enormous psychological effects on the students who are at this important juncture of education. The prior support system and targeted guidance with the proper mechanisms of early detection are required. The machine Learning algorithms help facilitate in practical deployment and targeted guidance for students at their age group. The most underappreciated mental health challenges in modern education are mental stress among high school students. In this research Shannon Entropy based feature selection has been used. Through that the three ensemble classifiers namely Random Forest, XG Boost and Cat Boost were used to predict mental stress levels from structured questionnaire. The data was collected in five psychological domains from 500 respondents answering 15 questions through five-point Likert scale. The XG Boost delivered commendable 91% accuracy and AUC = 0.996, while random forest followed by 95% of accuracy and AUC of 0.998. CatBoost achieved the highest test accuracy (97%) with AUC of 1.000, though five-fold cross-validation yielded a more conservative estimate of 92.8% (±3.31%). The high test-set performance reflects the cluster-derived target variable methodology; external clinical validation remains essential. In this research the most discriminating features are of motivation related questions, Q14 and Q 15. The framework is small enough to be deployed in practice and comprehensible enough to be used in the context of target student support programs.

Statistics

   Article View: 24
   PDF Downloaded: 2