Ahmed, Usman and Jiangbin, Zheng and Almogren, Ahmad and Sadiq, Muhammad and Rehman, Ateeq Ur and Sadiq, MT and Choi, Jaeyoung (2024) Hybrid bagging and boosting with SHAP based feature selection for enhanced predictive modeling in intrusion detection systems. Scientific Reports, 14 (1). 30532-. DOI https://doi.org/10.1038/s41598-024-81151-1
Ahmed, Usman and Jiangbin, Zheng and Almogren, Ahmad and Sadiq, Muhammad and Rehman, Ateeq Ur and Sadiq, MT and Choi, Jaeyoung (2024) Hybrid bagging and boosting with SHAP based feature selection for enhanced predictive modeling in intrusion detection systems. Scientific Reports, 14 (1). 30532-. DOI https://doi.org/10.1038/s41598-024-81151-1
Ahmed, Usman and Jiangbin, Zheng and Almogren, Ahmad and Sadiq, Muhammad and Rehman, Ateeq Ur and Sadiq, MT and Choi, Jaeyoung (2024) Hybrid bagging and boosting with SHAP based feature selection for enhanced predictive modeling in intrusion detection systems. Scientific Reports, 14 (1). 30532-. DOI https://doi.org/10.1038/s41598-024-81151-1
Abstract
The novelty and growing sophistication of cyber threats mean that high accuracy and interpretable machine learning models are needed more than ever before for Intrusion Detection and Prevention Systems. This study aims to solve this challenge by applying Explainable AI techniques, including Shapley Additive explanations feature selection, to improve model performance, robustness, and transparency. The method systematically employs different classifiers and proposes a new hybrid method called Hybrid Bagging-Boosting and Boosting on Residuals. Then, performance is taken in four steps: the multistep evaluation of hybrid ensemble learning methods for binary classification and fine-tuning of performance; feature selection using Shapley Additive explanations values retraining the hybrid model for better performance and reducing overfitting; the generalization of the proposed model for multiclass classification; and the evaluation using standard information metrics such as accuracy, precision, recall, and F1-score. Key results indicate that the proposed methods outperform state-of-the-art algorithms, achieving a peak accuracy of 98.47% and an F1 score of 96.19%. These improvements stem from advanced feature selection and resampling techniques, enhancing model accuracy and balancing precision and recall. Integrating Shapley Additive explanations-based feature selection with hybrid ensemble methods significantly boosts the predictive and explanatory power of Intrusion Detection and Prevention Systems, addressing common pitfalls in traditional cybersecurity models. This study paves the way for further research on statistical innovations to enhance Intrusion Detection and Prevention Systems performance.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Bagging and Boosting; Network Security; Intrusion Detection and Prevention Systems; Explainable AI; SHAP Feature Selection |
Subjects: | Z Bibliography. Library Science. Information Resources > ZZ OA Fund (articles) |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 13 Jan 2025 12:45 |
Last Modified: | 13 Jan 2025 13:19 |
URI: | http://repository.essex.ac.uk/id/eprint/39949 |
Available files
Filename: s41598-024-81151-1.pdf
Licence: Creative Commons: Attribution 4.0