NeuroLM

Shanghai Jiao Tong University / Microsoft

Multi-task EEG foundation model that treats brain signals as a foreign language, pairing a text-aligned neural tokenizer with a GPT-2 backbone.

Released: August 2024

Parameters: 1.7 Billion

NeuroLM is a foundation model for electroencephalography (EEG) that reframes brain-signal analysis as a language-modeling problem: raw EEG is converted into discrete tokens and processed by a large language model, allowing a single network to handle many decoding tasks without a separate classifier head per dataset. It was introduced by Wei-Bang Jiang and Bao-Liang Lu at Shanghai Jiao Tong University together with Yansen Wang and Dongsheng Li at Microsoft Research Asia, first posted to arXiv in August 2024 and accepted at ICLR 2025.

Most prior EEG deep-learning models are trained and evaluated on one task at a time, so a model built for sleep staging cannot also flag abnormal recordings or classify emotion without retraining. NeuroLM instead treats EEG as a "foreign language" that is aligned to text, then uses multi-task instruction tuning to teach one model to follow natural-language prompts across heterogeneous EEG benchmarks. To the authors' knowledge it is the first multi-task EEG foundation model, and it builds directly on the team's earlier neural tokenizer work, LaBraM.

The result is a unified system that ingests multi-channel EEG and emits predictions for tasks as different as abnormality detection, event classification, and emotion recognition, all driven by the same instruction-tuned backbone rather than task-specific fine-tuning pipelines.

Key Features

EEG as a foreign language: A text-aligned vector-quantized (VQ) neural tokenizer discretizes EEG, so brain signals share a token vocabulary with text and can be modeled by a standard autoregressive LLM.
Single multi-task model: Multi-task instruction tuning lets one set of weights serve six diverse EEG datasets simultaneously, replacing the usual one-model-per-task workflow.
Text-EEG alignment via adversarial training: The tokenizer is trained with vector-quantized temporal-frequency prediction and domain-classifier adversarial losses that pull EEG embeddings toward the text embedding space.
Multi-channel autoregressive learning: A GPT-2 backbone consumes the EEG tokens and learns channel-aware autoregressive structure, supporting sequences up to 1024 patches.
Three model scales: Released as NeuroLM-B (254M), NeuroLM-L (500M), and NeuroLM-XL (1.7B) parameters to trade off compute against capacity.

Technical Details

NeuroLM couples a VQ neural tokenizer with a GPT-2 language-model backbone. The tokenizer is trained to reconstruct both the temporal and frequency domains of EEG while adversarial domain classifiers align its codebook with a text embedding space; the LLM is then trained with multi-channel autoregressive objectives and adapted via instruction tuning. Pretraining used roughly 25,000 hours of EEG, dominated by the Temple University EEG corpus (~24,000 hours) and supplemented by datasets such as SEED and BCI Competition IV. The largest variant, NeuroLM-XL, has about 1.696 billion parameters. Evaluation spans six benchmarks: TUAB (abnormal detection), TUEV (event classification), SEED (emotion recognition), HMC (sleep staging), Workload (cognitive load), and TUSL (slowing events). A single instruction-tuned NeuroLM matches the broad capabilities of separately trained baselines, though on individual tasks dedicated models can still edge it out: on TUAB, NeuroLM-XL reaches a balanced accuracy of 0.797 versus 0.814 for the single-task LaBraM-Base.

Applications

NeuroLM targets clinical and neuroscience workflows where a lab must run many different EEG analyses—screening recordings for abnormality, classifying epileptiform or slowing events, staging sleep, gauging cognitive workload, or decoding affective state—from a shared backbone rather than maintaining a fleet of task-specific models. By exposing tasks as natural-language instructions, it lowers the engineering burden of adding a new EEG task and makes it feasible to deploy one model across the diagnostic and brain-computer-interface settings that previously each required bespoke training.

Impact

NeuroLM is notable as the first EEG foundation model to unify multiple decoding tasks under a single instruction-following LLM, demonstrating that the tokenize-then-language-model recipe that reshaped protein and genomic modeling can extend to neural time series. Its ICLR 2025 acceptance, open MIT-licensed code, and released checkpoints have made it a reference point for "EEG-as-language" research and multi-task biosignal foundation models. The main limitation is that per-task accuracy does not yet consistently surpass strong single-task specialists such as LaBraM, leaving headroom for better tokenization and alignment in future work.

Citation

NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals

Preprint

Jiang, W., et al. (2024) NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals. International Conference on Learning Representations.

DOI: 10.48550/arXiv.2409.00101

Recent citations

Papers that recently cited this model.

PGAP: Purity-Guided Active Prompting for EEG Decoding With LLMs [Research Frontier]
Jingwei Luo, Ziwei Wang, Dingkun Liu, et al.
IEEE Computational Intelligence Magazine · Aug 2026
0
MindAU: EEG-Conditioned Facial Action Unit Editing via Dual-Stream Manifold Alignment
Zhenhang Li, Xin Zhou, Hao Deng, et al.
Jul 2026
0
BrainFIBRE: A Foundation Model via Information Decomposition for Brain Microstructure
Zijian Dong, Yi Lin, Jixiang Fang, et al.
Jul 2026
0

Top citations

The most-cited papers that cite this model.

SensorLM: Learning the Language of Wearable Sensors
Yuwei Zhang, Kumar Ayush, Siyuan Qiao, et al.
arXiv.org · Jun 2025
47
CSBrain: A Cross-scale Spatiotemporal Brain Foundation Model for EEG Decoding
Yuchen Zhou, Jiamin Wu, Zichen Ren, et al.
arXiv.org · Jun 2025
35
Brain Foundation Models: A survey on advancements in neural signal processing and brain discovery
Xin-qiu Zhou, Chenyu Liu, Zhisheng Chen, et al.
IEEE Signal Processing Magazine · Mar 2025
34
EEG foundation models: a critical review of current progress and future directions
Gayal Kuruppu, Neeraj Wagh, V. Kremen, et al.
Journal of Neural Engineering · Jul 2025
20
EEG Emotion Copilot: Optimizing lightweight LLMs for emotional EEG interpretation with assisted medical record generation
Hongyu Chen, Weiming Zeng, Chengcheng Chen, et al.
Neural Networks · Sep 2024
20

Citations

Total Citations101

Influential15

References55

GitHub

Stars163

Forks22

Open Issues10

Contributors1

Last Push10mo ago

LanguagePython

LicenseMIT

HuggingFace

Downloads0

Likes5

Last Modified1y ago

Fields of citing research

Computer Science99%
Medicine44%
Engineering40%
Biology17%
Psychology3%
Education2%
Physics2%
Linguistics2%

Share of papers citing this model.

Openness

bio.rodeo opennessFully open · usable and reproducible

66Partial

Usability — can I run it?87

Reproducibility — can I retrain it?57

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

GitHub Repository Research Paper HuggingFace Model

Key Features

EEG as a foreign language: A text-aligned vector-quantized (VQ) neural tokenizer discretizes EEG, so brain signals share a token vocabulary with text and can be modeled by a standard autoregressive LLM.

Single multi-task model: Multi-task instruction tuning lets one set of weights serve six diverse EEG datasets simultaneously, replacing the usual one-model-per-task workflow.

Text-EEG alignment via adversarial training: The tokenizer is trained with vector-quantized temporal-frequency prediction and domain-classifier adversarial losses that pull EEG embeddings toward the text embedding space.

Multi-channel autoregressive learning: A GPT-2 backbone consumes the EEG tokens and learns channel-aware autoregressive structure, supporting sequences up to 1024 patches.

Three model scales: Released as NeuroLM-B (254M), NeuroLM-L (500M), and NeuroLM-XL (1.7B) parameters to trade off compute against capacity.

Technical Details

Applications

Impact

Citation

NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals

Preprint

Jiang, W., et al. (2024) NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals. International Conference on Learning Representations.

DOI: 10.48550/arXiv.2409.00101

Recent citations

Papers that recently cited this model.

PGAP: Purity-Guided Active Prompting for EEG Decoding With LLMs [Research Frontier]

Jingwei Luo, Ziwei Wang, Dingkun Liu, et al.

IEEE Computational Intelligence Magazine · Aug 2026

MindAU: EEG-Conditioned Facial Action Unit Editing via Dual-Stream Manifold Alignment

Zhenhang Li, Xin Zhou, Hao Deng, et al.

Jul 2026

BrainFIBRE: A Foundation Model via Information Decomposition for Brain Microstructure

Zijian Dong, Yi Lin, Jixiang Fang, et al.

Jul 2026

NeuroLM

#Key Features

#Technical Details

#Applications

#Impact

Citation

NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals

Recent citations

MindAU: EEG-Conditioned Facial Action Unit Editing via Dual-Stream Manifold Alignment

BrainFIBRE: A Foundation Model via Information Decomposition for Brain Microstructure

Top citations

Related models

Citations

GitHub

HuggingFace

Fields of citing research

Openness

Tags

Resources

NeuroLM

#Key Features

#Technical Details

#Applications

#Impact

Citation

NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals

Recent citations

MindAU: EEG-Conditioned Facial Action Unit Editing via Dual-Stream Manifold Alignment

BrainFIBRE: A Foundation Model via Information Decomposition for Brain Microstructure

Top citations

Related models

Citations

GitHub

HuggingFace

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact