Zhang, Runkai and Rong, Rong and Gan, John Q and Xu, Yun and Wang, Haixian and Wang, Xiaoyun (2023) Reliable and fast automatic artifact rejection of Long-Term EEG recordings based on Isolation Forest. Medical and Biological Engineering and Computing, 62 (2). pp. 521-535. DOI https://doi.org/10.1007/s11517-023-02961-5
Zhang, Runkai and Rong, Rong and Gan, John Q and Xu, Yun and Wang, Haixian and Wang, Xiaoyun (2023) Reliable and fast automatic artifact rejection of Long-Term EEG recordings based on Isolation Forest. Medical and Biological Engineering and Computing, 62 (2). pp. 521-535. DOI https://doi.org/10.1007/s11517-023-02961-5
Zhang, Runkai and Rong, Rong and Gan, John Q and Xu, Yun and Wang, Haixian and Wang, Xiaoyun (2023) Reliable and fast automatic artifact rejection of Long-Term EEG recordings based on Isolation Forest. Medical and Biological Engineering and Computing, 62 (2). pp. 521-535. DOI https://doi.org/10.1007/s11517-023-02961-5
Abstract
Long-term electroencephalogram (Long-Term EEG) has the capacity to monitor over a long period, making it a valuable tool in medical institutions. However, due to the large volume of patient data, selecting clean data segments from raw Long-Term EEG for further analysis is an extremely time-consuming and labor-intensive task. Furthermore, the various actions of patients during recording make it difficult to use algorithms to denoise part of the EEG data, and thus lead to the rejection of these data. Therefore, tools for the quick rejection of heavily corrupted epochs in Long-Term EEG records are highly beneficial. In this paper, a new reliable and fast automatic artifact rejection method for Long-Term EEG based on Isolation Forest (IF) is proposed. Specifically, the IF algorithm is repetitively applied to detect outliers in the EEG data, and the boundary of inliers is promptly adjusted by using a statistical indicator to make the algorithm proceed in an iterative manner. The iteration is terminated when the distance metric between clean epochs and artifact-corrupted epochs remains unchanged. Six statistical indicators (i.e., min, max, median, mean, kurtosis, and skewness) are evaluated by setting them as centroid to adjust the boundary during iteration, and the proposed method is compared with several state-of-the-art methods on a retrospectively collected dataset. The experimental results indicate that utilizing the min value of data as the centroid yields the most optimal performance, and the proposed method is highly efficacious and reliable in the automatic artifact rejection of Long-Term EEG, as it significantly improves the overall data quality. Furthermore, the proposed method surpasses compared methods on most data segments with poor data quality, demonstrating its superior capacity to enhance the data quality of the heavily corrupted data. Besides, owing to the linear time complexity of IF, the proposed method is much faster than other methods, thus providing an advantage when dealing with extensive datasets.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Long-Term EEG; Automatic rejection; Isolation Forest; Outlier detection; Machine learning |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 21 Nov 2023 12:58 |
Last Modified: | 09 Nov 2024 02:00 |
URI: | http://repository.essex.ac.uk/id/eprint/36885 |
Available files
Filename: MBEC2023AcceptedVersion.pdf