Chen, Weijie and Daly, Ian and Chen, Yixin and Wu, Xiao and He, Xinjie and Wang, Xingyu and Cichocki, Andrzej and Jin, Jing (2026) Multiscale Pooling Spatial–Temporal Attention Network: Elevating Cross Session and Small Sample Decoding in Motor Imagery Brain–Computer Interfaces. IEEE Transactions on Systems, Man, and Cybernetics: Systems. DOI https://doi.org/10.1109/tsmc.2025.3650196
Chen, Weijie and Daly, Ian and Chen, Yixin and Wu, Xiao and He, Xinjie and Wang, Xingyu and Cichocki, Andrzej and Jin, Jing (2026) Multiscale Pooling Spatial–Temporal Attention Network: Elevating Cross Session and Small Sample Decoding in Motor Imagery Brain–Computer Interfaces. IEEE Transactions on Systems, Man, and Cybernetics: Systems. DOI https://doi.org/10.1109/tsmc.2025.3650196
Chen, Weijie and Daly, Ian and Chen, Yixin and Wu, Xiao and He, Xinjie and Wang, Xingyu and Cichocki, Andrzej and Jin, Jing (2026) Multiscale Pooling Spatial–Temporal Attention Network: Elevating Cross Session and Small Sample Decoding in Motor Imagery Brain–Computer Interfaces. IEEE Transactions on Systems, Man, and Cybernetics: Systems. DOI https://doi.org/10.1109/tsmc.2025.3650196
Abstract
Motor imagery (MI) is one of the most widely used paradigms in brain–computer interfaces (BCIs), known for its ability to trigger changes in brain activity without the need for an external “cue” stimulus. This unique characteristic has attracted significant attention from neuroscientists and researchers in fundamental science. However, compared to P300 and steady-state visual evoked potential (SSVEP), neural activity related to MI tends to be less stable and exhibits substantial variability between individuals. Consequently, accurately decoding MI, using both traditional machine learning and deep learning, has proven to be a considerable challenge. Moreover, given the difficulty of acquiring electroencephalography (EEG) data and the high data demands of deep learning, enhancing the accuracy of MI decoding with limited sample sizes remains a pressing issue that urgently needs to be addressed. This article addresses the challenges mentioned above by introducing a novel deep neural network designed for accurate MI decoding, which is designed to be effective with both small-sample sizes and larger datasets. This network, named the multiscale pooling spatial–temporal attention network (MPSTANet), integrates mix pooling techniques with spatial–temporal attention mechanisms. MPSTANet first employs local and global spatial attention, along with multiscale temporal attention, to thoroughly extract spatial–temporal information from EEG signals. Next, MPSTANet utilizes feature fusion and the proposed mix pooling technique to preserve as much of the extracted spatial–temporal information as possible. Finally, channel interaction attention (CIA) and 3-D weight attention (3-DWA) are employed to recalibrate the weights of the fused channels and spatial–temporal features, respectively. To validate the performance of our proposed MPSTANet model, we conducted experiments on four public datasets, including both small-sample sizes and subject-independent scenarios. MPSTANet achieved cross-session decoding accuracies of 84.82%, 72.92%, 88.20%, and 46.54% on the BCI Competition IV 2a dataset, the Open BMI dataset, the BCI Competition IV 2b dataset, and the PhysioNet dataset, respectively. Furthermore, MPSTANet demonstrated a significant lead compared to other deep learning models in both small-sample and subject-independent experiments. These results demonstrate the robustness of MPSTANet in MI decoding and its promising potential for BCI applications.
| Item Type: | Article |
|---|---|
| Uncontrolled Keywords: | Brain–computer interfaces (BCIs), cross session, motor imagery (MI), small sample |
| Subjects: | Z Bibliography. Library Science. Information Resources > ZR Rights Retention |
| Divisions: | Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
| SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
| Depositing User: | Unnamed user with email elements@essex.ac.uk |
| Date Deposited: | 20 Jan 2026 15:29 |
| Last Modified: | 20 Jan 2026 15:52 |
| URI: | http://repository.essex.ac.uk/id/eprint/42584 |
Available files
Filename: Multiscale_Pooling_SpatialTemporal_Attention_Network_Elevating_Cross_Session_and_Small_Sample_Decoding_in_Motor_Imagery_BrainComputer_Interfaces.pdf
Licence: Creative Commons: Attribution 4.0