Causality-Driven Convolutional Manifold Attention Network for Electroencephalogram Signal Decoding

Deep learning-based methods have achieved remarkable success in brain-computer interfaces (BCIs). However, its inherent assumption of independent and identically distributed (i.i.d.) data renders it vulnerable to out-of distribution (OOD) scenarios. To address this limitation, the present study prop...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:IEEE transactions on pattern analysis and machine intelligence Ročník PP; s. 1 - 12
Hlavní autori: Lu, Bin, Chen, Junxiang, Wang, Fuwang, Wen, Guilin, Fu, Rongrong, Hua, Changchun
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: United States IEEE 27.10.2025
Predmet:
ISSN:0162-8828, 1939-3539, 2160-9292, 1939-3539
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:Deep learning-based methods have achieved remarkable success in brain-computer interfaces (BCIs). However, its inherent assumption of independent and identically distributed (i.i.d.) data renders it vulnerable to out-of distribution (OOD) scenarios. To address this limitation, the present study proposed a causality-driven convolutional manifold attention network (CD-CMAN) that learned invariant representations from electroencephalogram (EEG) signals to enhance OOD generalization. The framework began with a spatiotemporal convolution module to extract rich temporal and spatial features. Guided by the defined structural causal model and leveraging the strengths of Riemannian geometry and deep learning, dual latent encoders with manifold attention units were crafted to explicitly separate spatiotemporal feature maps into semantic and variation latent factors. A reconstruction module with a dedicated loss was implemented to ensure these factors retaining informative, while the Hilbert-Schmidt independence criterion (HSIC) was introduced to enforce their statistical independence. Further, a variational information bottleneck and gradient reversal layer were incorporated to compress and disentangle the semantic and variation factors. Evaluations on two public datasets under both subject-dependent and subject independent settings demonstrated that CD-CMAN consistently outperforms comparative baselines. These findings suggest that the proposed model could provide a new solution for the practical application of BCI technology.
Bibliografia:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0162-8828
1939-3539
2160-9292
1939-3539
DOI:10.1109/TPAMI.2025.3625631