LUNA

EEG foundation model whose learned queries map any electrode montage into a fixed latent space, scaling linearly in the number of channels.

Released: October 2025

Parameters: 311 Million

Electroencephalography (EEG) is recorded with a bewildering variety of electrode montages: clinical setups may use 19 channels in the standard 10-20 layout, research rigs may use 64 or 128, and consumer devices fewer still. This heterogeneity has been a persistent obstacle for EEG foundation models, most of which assume a fixed channel configuration or flatten every electrode-patch into a dense sequence whose cost grows with the number of channels. LUNA (Latent Unified Network Architecture) addresses this by making the model agnostic to electrode topology while keeping compute and memory linear in the channel count.

Introduced in October 2025 by Berkay Döner, Thorir Mar Ingolfsson, Luca Benini, and Yawei Li at ETH Zurich's PULP Platform, LUNA is a self-supervised foundation model that reconciles disparate electrode geometries by projecting any montage into a fixed-size latent representation. A bank of learned queries attends to the incoming channels through cross-attention, so downstream temporal modeling operates on a compact latent sequence rather than on a matrix whose width depends on how many electrodes were used. The result is a single backbone that can be pretrained once and fine-tuned across datasets with different sensor layouts.

LUNA sits alongside EEG foundation models such as LaBraM and BIOT, but its distinguishing contribution is decoupling representational cost from electrode geometry, which yields large efficiency gains without sacrificing accuracy.

Key Features

Topology-agnostic by design: A Channel-Unification Module uses learned queries and cross-attention to map an arbitrary set of electrodes into a fixed latent space, so the same model handles 19-, 64-, or 128-channel recordings without retraining.
Linear-in-channels scaling: Because temporal modeling runs on the compact latent sequence rather than dense channel-patch matrices, LUNA reports up to 300x fewer FLOPs and roughly 10x lower GPU memory than dense-attention baselines.
Self-supervised pretraining: The model is pretrained on more than 21,000 hours of raw EEG with no task labels, learning transferable time-frequency representations.
RoPE temporal encoder: A Transformer encoder with rotary position embeddings processes the latent sequence, after a tokenizer segments EEG into patches with temporal and frequency features.
Three released sizes: LUNA ships as Base (7M), Large (43M), and Huge (311M) parameter variants, with pretrained weights and fine-tuning configs published on Hugging Face.

Technical Details

LUNA combines three components: a tokenizer that splits EEG into patches carrying temporal and frequency features, a Channel-Unification Module that uses Q learned queries and cross-attention (O(Q*C) complexity in C channels) to produce a fixed latent representation, and a RoPE-based Transformer temporal encoder. It is pretrained on the TUEG and Siena corpora (over 21,000 hours of raw EEG), with all downstream subjects excluded to prevent leakage. On Temple University Hospital benchmarks it reaches 81.57% balanced accuracy and 0.8957 AUROC on TUAB (abnormality), 0.921 AUROC on TUAR (artifacts, a state-of-the-art result), and 0.802 AUROC on TUSL (slowing), and it also transfers to emotion recognition. The code is released under Apache-2.0 in the BioFoundation repository, while the pretrained weights carry a CC BY-ND 4.0 license (unmodified redistribution with attribution only).

Applications

LUNA targets clinical and research EEG analysis tasks including abnormality detection, artifact rejection, slowing classification, and emotion recognition. Its topology-agnostic design is most valuable when montages vary across sites or when channel counts are high, letting practitioners fine-tune one pretrained backbone across heterogeneous datasets instead of training a separate model per layout. The dramatic reduction in FLOPs and memory also makes it attractive for resource-constrained or high-throughput settings. The authors emphasize that LUNA is a research tool for representation learning and explicitly not a medical device.

Impact

By decoupling representational cost and structure from electrode geometry, LUNA advances EEG foundation models toward the kind of montage-flexible, efficient backbones the field needs to pool data across clinical and research sources. State-of-the-art artifact-detection results combined with order-of-magnitude efficiency gains make the topology-unification idea a practical contribution rather than a purely conceptual one, and the public Apache-2.0 code, documented Hugging Face model card, and three released model sizes lower the barrier to adoption. As a 2025 preprint, its broader benchmarks and clinical relevance await peer review and independent replication, and the CC BY-ND weight license restricts redistribution of fine-tuned variants.

Citation

LUNA: Efficient and Topology-Agnostic Foundation Model for EEG Signal Analysis

Preprint

Döner, B., et al. (2025) LUNA: Efficient and Topology-Agnostic Foundation Model for EEG Signal Analysis. arXiv.org.

DOI: 10.48550/arXiv.2510.22257

Recent citations

Papers that recently cited this model.

CoCoT-EEG: Contrastive-Pretrained Multiscale Convolutional Transformer for EEG Decoding
Gabriel Mahuas, Victoria Shevchenko, Ugo Tanielian, et al.
Jul 2026
0
Stacked LoRA for Subject-Adaptive EEG Foundation Models in Motor Imagery Decoding
Aymen Sarhane, Fouad Lbakali, Mouad Souissi, et al.
Jul 2026
0
Evaluation of EEG Foundation Models for Event-Based Burst-Suppression Detection in ICU
Elisa Vasta, T. Ingolfsson, A. Cossettini, et al.
Jun 2026
0

Top citations

The most-cited papers that cite this model.

DeeperBrain: A Neuro-Grounded EEG Foundation Model Towards Universal BCI
Jiquan Wang, Sha Zhao, Yangxuan Zhou, et al.
arXiv.org · Jan 2026
7
TinyMyo: a Tiny Foundation Model for Flexible EMG Signal Processing at the Edge
Matteo Fasulo, Giusy Spacone, T. Ingolfsson, et al.
arXiv.org · Dec 2025
5
Neural Signals Generate Clinical Notes in the Wild
Jathurshan Pradeepkumar, Zheng Chen, Jimeng Sun
arXiv.org · Jan 2026
2Influential
Neural Brain Fields: A NeRF-Inspired Approach for Generating Nonexistent EEG Electrodes
Shahar Kedem, Itamar Zimerman, Eliya Nachmani
arXiv.org · Dec 2025
1
CoCoT-EEG: Contrastive-Pretrained Multiscale Convolutional Transformer for EEG Decoding
Gabriel Mahuas, Victoria Shevchenko, Ugo Tanielian, et al.
Jul 2026
0

Citations

Total Citations17

Influential2

References39

GitHub

Stars131

Forks15

Open Issues6

Contributors4

Last Push15d ago

LanguagePython

LicenseApache-2.0

HuggingFace

Downloads7.3K

Likes6

Last Modified3mo ago

Fields of citing research

Computer Science100%
Medicine56%
Engineering56%
Biology19%

Share of papers citing this model.

Openness

bio.rodeo opennessFully open · usable and reproducible

73Open

Usability — can I run it?71

Reproducibility — can I retrain it?81

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

GitHub Repository Research Paper HuggingFace Model

Key Features

Topology-agnostic by design: A Channel-Unification Module uses learned queries and cross-attention to map an arbitrary set of electrodes into a fixed latent space, so the same model handles 19-, 64-, or 128-channel recordings without retraining.

Linear-in-channels scaling: Because temporal modeling runs on the compact latent sequence rather than dense channel-patch matrices, LUNA reports up to 300x fewer FLOPs and roughly 10x lower GPU memory than dense-attention baselines.

Self-supervised pretraining: The model is pretrained on more than 21,000 hours of raw EEG with no task labels, learning transferable time-frequency representations.

RoPE temporal encoder: A Transformer encoder with rotary position embeddings processes the latent sequence, after a tokenizer segments EEG into patches with temporal and frequency features.

Three released sizes: LUNA ships as Base (7M), Large (43M), and Huge (311M) parameter variants, with pretrained weights and fine-tuning configs published on Hugging Face.

Technical Details

Applications

Impact

Recent citations

Papers that recently cited this model.

CoCoT-EEG: Contrastive-Pretrained Multiscale Convolutional Transformer for EEG Decoding

Gabriel Mahuas, Victoria Shevchenko, Ugo Tanielian, et al.

Jul 2026

Stacked LoRA for Subject-Adaptive EEG Foundation Models in Motor Imagery Decoding

Aymen Sarhane, Fouad Lbakali, Mouad Souissi, et al.

Jul 2026

Evaluation of EEG Foundation Models for Event-Based Burst-Suppression Detection in ICU

Elisa Vasta, T. Ingolfsson, A. Cossettini, et al.

Jun 2026

Top citations

The most-cited papers that cite this model.

2Influential

Neural Brain Fields: A NeRF-Inspired Approach for Generating Nonexistent EEG Electrodes

Shahar Kedem, Itamar Zimerman, Eliya Nachmani

arXiv.org · Dec 2025

CoCoT-EEG: Contrastive-Pretrained Multiscale Convolutional Transformer for EEG Decoding

Gabriel Mahuas, Victoria Shevchenko, Ugo Tanielian, et al.

Jul 2026

LUNA

#Key Features

#Technical Details

#Applications

#Impact

Citation

LUNA: Efficient and Topology-Agnostic Foundation Model for EEG Signal Analysis

Recent citations

CoCoT-EEG: Contrastive-Pretrained Multiscale Convolutional Transformer for EEG Decoding

Stacked LoRA for Subject-Adaptive EEG Foundation Models in Motor Imagery Decoding

Evaluation of EEG Foundation Models for Event-Based Burst-Suppression Detection in ICU

Top citations

CoCoT-EEG: Contrastive-Pretrained Multiscale Convolutional Transformer for EEG Decoding

Related models

Citations

GitHub

HuggingFace

Fields of citing research

Openness

Tags

Resources

LUNA

#Key Features

#Technical Details

#Applications

#Impact

Citation

LUNA: Efficient and Topology-Agnostic Foundation Model for EEG Signal Analysis

Recent citations

CoCoT-EEG: Contrastive-Pretrained Multiscale Convolutional Transformer for EEG Decoding

Stacked LoRA for Subject-Adaptive EEG Foundation Models in Motor Imagery Decoding

Evaluation of EEG Foundation Models for Event-Based Burst-Suppression Detection in ICU

Top citations

CoCoT-EEG: Contrastive-Pretrained Multiscale Convolutional Transformer for EEG Decoding

Related models

Citations

GitHub

HuggingFace

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact