Topology-agnostic EEG foundation model that uses learned queries and cross-attention to unify arbitrary electrode montages into a fixed latent space, scaling linearly in channels.
Electroencephalography (EEG) is recorded with a bewildering variety of electrode montages: clinical setups may use 19 channels in the standard 10-20 layout, research rigs may use 64 or 128, and consumer devices fewer still. This heterogeneity has been a persistent obstacle for EEG foundation models, most of which assume a fixed channel configuration or flatten every electrode-patch into a dense sequence whose cost grows with the number of channels. LUNA (Latent Unified Network Architecture) addresses this by making the model agnostic to electrode topology while keeping compute and memory linear in the channel count.
Introduced in October 2025 by Berkay Döner, Thorir Mar Ingolfsson, Luca Benini, and Yawei Li at ETH Zurich's PULP Platform, LUNA is a self-supervised foundation model that reconciles disparate electrode geometries by projecting any montage into a fixed-size latent representation. A bank of learned queries attends to the incoming channels through cross-attention, so downstream temporal modeling operates on a compact latent sequence rather than on a matrix whose width depends on how many electrodes were used. The result is a single backbone that can be pretrained once and fine-tuned across datasets with different sensor layouts.
LUNA sits alongside EEG foundation models such as LaBraM and BIOT, but its distinguishing contribution is decoupling representational cost from electrode geometry, which yields large efficiency gains without sacrificing accuracy.
LUNA combines three components: a tokenizer that splits EEG into patches carrying temporal and frequency features, a Channel-Unification Module that uses Q learned queries and cross-attention (O(Q*C) complexity in C channels) to produce a fixed latent representation, and a RoPE-based Transformer temporal encoder. It is pretrained on the TUEG and Siena corpora (over 21,000 hours of raw EEG), with all downstream subjects excluded to prevent leakage. On Temple University Hospital benchmarks it reaches 81.57% balanced accuracy and 0.8957 AUROC on TUAB (abnormality), 0.921 AUROC on TUAR (artifacts, a state-of-the-art result), and 0.802 AUROC on TUSL (slowing), and it also transfers to emotion recognition. The code is released under Apache-2.0 in the BioFoundation repository, while the pretrained weights carry a CC BY-ND 4.0 license (unmodified redistribution with attribution only).
LUNA targets clinical and research EEG analysis tasks including abnormality detection, artifact rejection, slowing classification, and emotion recognition. Its topology-agnostic design is most valuable when montages vary across sites or when channel counts are high, letting practitioners fine-tune one pretrained backbone across heterogeneous datasets instead of training a separate model per layout. The dramatic reduction in FLOPs and memory also makes it attractive for resource-constrained or high-throughput settings. The authors emphasize that LUNA is a research tool for representation learning and explicitly not a medical device.
By decoupling representational cost and structure from electrode geometry, LUNA advances EEG foundation models toward the kind of montage-flexible, efficient backbones the field needs to pool data across clinical and research sources. State-of-the-art artifact-detection results combined with order-of-magnitude efficiency gains make the topology-unification idea a practical contribution rather than a purely conceptual one, and the public Apache-2.0 code, documented Hugging Face model card, and three released model sizes lower the barrier to adoption. As a 2025 preprint, its broader benchmarks and clinical relevance await peer review and independent replication, and the CC BY-ND weight license restricts redistribution of fine-tuned variants.
Döner, B., et al. (2025) LUNA: Efficient and Topology-Agnostic Foundation Model for EEG Signal Analysis. arXiv.org.
DOI: 10.48550/arXiv.2510.22257Papers that recently cited this model.
The most-cited papers that cite this model.
Not enough data