MMAI Liquid Foundation Model (LFM2-2.6B-MMAI)

Small-molecule drug discovery foundation model covering ADMET, retrosynthesis, drug-target activity, and molecular optimization in a 2.6B checkpoint.

Released: March 2026

Parameters: 2.6 Billion

The MMAI Liquid Foundation Model (released as LFM2-2.6B-MMAI) is a compact, multi-task foundation model for small-molecule drug discovery, developed through a strategic partnership between Liquid AI and Insilico Medicine and announced in March 2026. Rather than relying on the in-context learning of very large general-purpose language models — which the authors show does not reliably deliver the scientific understanding required for chemistry — the model is purpose-trained to learn what its creators call "the language of molecules." A single 2.6-billion-parameter checkpoint covers tasks that are usually served by a patchwork of specialist models.

The model is built on LFM2, Liquid AI's hybrid Liquid Foundation Model architecture descended from liquid neural networks, and is specialized using Insilico's MMAI Gym for Science (MMAI = Multi-Modal AI). MMAI Gym is a structured training-and-evaluation environment that supplies curated molecular data formats, task-specific reasoning traces, and benchmarking recipes spanning over 1,000 pharmaceutical benchmarks. The central claim of the accompanying preprint is that a smaller, domain-trained foundation model can match or surpass models an order of magnitude larger across the drug discovery pipeline.

Because the checkpoint is small enough to run on private infrastructure, it is positioned for pharmaceutical and biotech teams that need state-of-the-art molecular reasoning without sending proprietary chemical data to external cloud APIs — a recurring barrier to AI adoption in regulated drug discovery settings.

Key Features

Single multi-task checkpoint: One 2.6B-parameter model handles molecular optimization, ADMET property prediction, retrosynthesis, drug-target activity prediction, and functional-group reasoning, replacing several task-specific systems.
Liquid Foundation Model backbone: Uses the LFM2 hybrid architecture (gated short convolutions plus grouped-query attention), giving strong efficiency and on-device/on-premise deployability relative to comparable transformer LLMs.
MMAI Gym specialization: Trained with curated scientific reasoning traces and benchmarking recipes across 200+ drug discovery tasks, teaching the model molecular data formats and modalities rather than relying on prompt-time reasoning.
Punches above its weight: Matches or outperforms systems roughly ten times its size; for example, it beats TxGemma-27B on 13 of 22 property-prediction tasks and sets state-of-the-art results on several.
Data sovereignty: Compact enough to deploy entirely on private pharmaceutical infrastructure, addressing confidentiality constraints around proprietary compound data.

Technical Details

LFM2-2.6B-MMAI is a fine-tune of the openly released LFM2-2.6B base model, a hybrid architecture combining gated short convolutions with grouped-query attention. Specialization was performed in the MMAI Gym environment using approximately 120 billion tokens of pharmaceutical data spanning 200+ tasks, with supervised fine-tuning operating over an aggregate context on the order of millions of tokens per optimization step. On benchmarks the model is competitive with or superior to much larger systems: it improves on TxGemma-27B for the majority of ADME/PK property-prediction tasks (for instance, a Caco-2 permeability MAE of 0.347 versus 0.401) and achieves near specialist-level performance on retrosynthesis, drug-target activity, and molecular optimization. The preprint was released on arXiv on 3 March 2026 (arXiv:2603.03517) under a CC-BY-NC-ND 4.0 license.

Applications

The model targets practical medicinal-chemistry and lead-optimization workflows: predicting ADMET/pharmacokinetic and toxicity properties of candidate compounds, proposing retrosynthetic routes, estimating drug-target binding/activity, and performing multi-parameter molecular optimization. Its small footprint makes it suitable for deployment inside a pharma company's own secure environment, where it can serve as a unified reasoning engine across the discovery pipeline rather than requiring teams to stitch together multiple specialist tools or expose confidential chemical libraries to third-party cloud services.

Impact

LFM2-2.6B-MMAI is a notable data point in the broader shift toward small, domain-specialized foundation models that rival or exceed much larger general-purpose LLMs on scientific tasks — here demonstrated against TxGemma-27B and comparably sized systems. It also operationalizes Liquid Foundation Models, a non-transformer-centric architecture, for a demanding scientific domain. Availability is the main caveat: the underlying LFM2-2.6B base weights are openly published on Hugging Face, but the MMAI-specialized checkpoint and the MMAI Gym training environment are offered through Insilico's membership/access program rather than as open weights, and no public training code repository accompanies the preprint. Independent replication of the reported benchmark gains is therefore limited at release.

Citation

MMAI Gym for Science: Training Liquid Foundation Models for Drug Discovery

Preprint

Kuznetsov, M., et al. (2026) MMAI Gym for Science: Training Liquid Foundation Models for Drug Discovery.

DOI: 10.48550/arXiv.2603.03517

Recent citations

Papers that recently cited this model.

The End of Aging Clocks: Training Foundation Models to Reason in Aging and Longevity
A. Zhavoronkov, V. Aladinskiy, A. Aliper, et al.
bioRxiv · Mar 2026
0

Top citations

The most-cited papers that cite this model.

The End of Aging Clocks: Training Foundation Models to Reason in Aging and Longevity
A. Zhavoronkov, V. Aladinskiy, A. Aliper, et al.
bioRxiv · Mar 2026
0

Citations

Total Citations1

Influential0

References43

HuggingFace

Downloads7.3K

Likes188

Last Modified3mo ago

Pipelinetext-generation

Fields of citing research

Biology100%
Computer Science100%
Medicine100%

Share of papers citing this model.

Openness

bio.rodeo opennessClosed · low usability and reproducibility

7Closed

Usability — can I run it?10

Reproducibility — can I retrain it?2

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

Research Paper Official Website HuggingFace Model

Key Features

Single multi-task checkpoint: One 2.6B-parameter model handles molecular optimization, ADMET property prediction, retrosynthesis, drug-target activity prediction, and functional-group reasoning, replacing several task-specific systems.

Liquid Foundation Model backbone: Uses the LFM2 hybrid architecture (gated short convolutions plus grouped-query attention), giving strong efficiency and on-device/on-premise deployability relative to comparable transformer LLMs.

MMAI Gym specialization: Trained with curated scientific reasoning traces and benchmarking recipes across 200+ drug discovery tasks, teaching the model molecular data formats and modalities rather than relying on prompt-time reasoning.

Punches above its weight: Matches or outperforms systems roughly ten times its size; for example, it beats TxGemma-27B on 13 of 22 property-prediction tasks and sets state-of-the-art results on several.

Data sovereignty: Compact enough to deploy entirely on private pharmaceutical infrastructure, addressing confidentiality constraints around proprietary compound data.

Technical Details

Applications

Impact

MMAI Liquid Foundation Model (LFM2-2.6B-MMAI)

Key Features

Technical Details

Applications

Impact

Citation

MMAI Gym for Science: Training Liquid Foundation Models for Drug Discovery

Recent citations

The End of Aging Clocks: Training Foundation Models to Reason in Aging and Longevity

Top citations

The End of Aging Clocks: Training Foundation Models to Reason in Aging and Longevity

Citations

HuggingFace

Fields of citing research

Openness

Tags

Resources

MMAI Liquid Foundation Model (LFM2-2.6B-MMAI)

Key Features

Technical Details

Applications

Impact

Citation

MMAI Gym for Science: Training Liquid Foundation Models for Drug Discovery

Recent citations

The End of Aging Clocks: Training Foundation Models to Reason in Aging and Longevity

Top citations

The End of Aging Clocks: Training Foundation Models to Reason in Aging and Longevity

Citations

HuggingFace

Fields of citing research

Openness

Tags

Resources

MMAI Liquid Foundation Model (LFM2-2.6B-MMAI)

#Key Features

#Technical Details

#Applications

#Impact

Citation

MMAI Gym for Science: Training Liquid Foundation Models for Drug Discovery

Recent citations

The End of Aging Clocks: Training Foundation Models to Reason in Aging and Longevity

Top citations

The End of Aging Clocks: Training Foundation Models to Reason in Aging and Longevity

Related models

Citations

HuggingFace

Fields of citing research

Openness

Tags

Resources

MMAI Liquid Foundation Model (LFM2-2.6B-MMAI)

#Key Features

#Technical Details

#Applications

#Impact

Citation

MMAI Gym for Science: Training Liquid Foundation Models for Drug Discovery

Recent citations

The End of Aging Clocks: Training Foundation Models to Reason in Aging and Longevity

Top citations

The End of Aging Clocks: Training Foundation Models to Reason in Aging and Longevity

Related models

Citations

HuggingFace

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact