bio.rodeo
ModelsOrganizationsLeaderboardAbout
bio.rodeo

The authoritative source for evaluating biological foundation models. No hype, just honest analysis.

AboutFAQSubmit a modelContact
© 2026 Pulsatance. All rights reserved. ~
Built by Pulsatance
Small molecule foundation models
Small moleculeLanguage model

MMAI Liquid Foundation Model (LFM2-2.6B-MMAI)

Liquid AI / Insilico Medicine

A 2.6B-parameter Liquid Foundation Model fine-tuned via Insilico's MMAI Gym to handle ADMET, retrosynthesis, drug-target activity, and molecular optimization in one checkpoint.

Released: March 2026
Parameters: 2.6 Billion

The MMAI Liquid Foundation Model (released as LFM2-2.6B-MMAI) is a compact, multi-task foundation model for small-molecule drug discovery, developed through a strategic partnership between Liquid AI and Insilico Medicine and announced in March 2026. Rather than relying on the in-context learning of very large general-purpose language models — which the authors show does not reliably deliver the scientific understanding required for chemistry — the model is purpose-trained to learn what its creators call "the language of molecules." A single 2.6-billion-parameter checkpoint covers tasks that are usually served by a patchwork of specialist models.

The model is built on LFM2, Liquid AI's hybrid Liquid Foundation Model architecture descended from liquid neural networks, and is specialized using Insilico's MMAI Gym for Science (MMAI = Multi-Modal AI). MMAI Gym is a structured training-and-evaluation environment that supplies curated molecular data formats, task-specific reasoning traces, and benchmarking recipes spanning over 1,000 pharmaceutical benchmarks. The central claim of the accompanying preprint is that a smaller, domain-trained foundation model can match or surpass models an order of magnitude larger across the drug discovery pipeline.

Because the checkpoint is small enough to run on private infrastructure, it is positioned for pharmaceutical and biotech teams that need state-of-the-art molecular reasoning without sending proprietary chemical data to external cloud APIs — a recurring barrier to AI adoption in regulated drug discovery settings.

#Key Features

  • Single multi-task checkpoint: One 2.6B-parameter model handles molecular optimization, ADMET property prediction, retrosynthesis, drug-target activity prediction, and functional-group reasoning, replacing several task-specific systems.
  • Liquid Foundation Model backbone: Uses the LFM2 hybrid architecture (gated short convolutions plus grouped-query attention), giving strong efficiency and on-device/on-premise deployability relative to comparable transformer LLMs.
  • MMAI Gym specialization: Trained with curated scientific reasoning traces and benchmarking recipes across 200+ drug discovery tasks, teaching the model molecular data formats and modalities rather than relying on prompt-time reasoning.
  • Punches above its weight: Matches or outperforms systems roughly ten times its size; for example, it beats TxGemma-27B on 13 of 22 property-prediction tasks and sets state-of-the-art results on several.
  • Data sovereignty: Compact enough to deploy entirely on private pharmaceutical infrastructure, addressing confidentiality constraints around proprietary compound data.

#Technical Details

LFM2-2.6B-MMAI is a fine-tune of the openly released LFM2-2.6B base model, a hybrid architecture combining gated short convolutions with grouped-query attention. Specialization was performed in the MMAI Gym environment using approximately 120 billion tokens of pharmaceutical data spanning 200+ tasks, with supervised fine-tuning operating over an aggregate context on the order of millions of tokens per optimization step. On benchmarks the model is competitive with or superior to much larger systems: it improves on TxGemma-27B for the majority of ADME/PK property-prediction tasks (for instance, a Caco-2 permeability MAE of 0.347 versus 0.401) and achieves near specialist-level performance on retrosynthesis, drug-target activity, and molecular optimization. The preprint was released on arXiv on 3 March 2026 (arXiv:2603.03517) under a CC-BY-NC-ND 4.0 license.

#Applications

The model targets practical medicinal-chemistry and lead-optimization workflows: predicting ADMET/pharmacokinetic and toxicity properties of candidate compounds, proposing retrosynthetic routes, estimating drug-target binding/activity, and performing multi-parameter molecular optimization. Its small footprint makes it suitable for deployment inside a pharma company's own secure environment, where it can serve as a unified reasoning engine across the discovery pipeline rather than requiring teams to stitch together multiple specialist tools or expose confidential chemical libraries to third-party cloud services.

#Impact

LFM2-2.6B-MMAI is a notable data point in the broader shift toward small, domain-specialized foundation models that rival or exceed much larger general-purpose LLMs on scientific tasks — here demonstrated against TxGemma-27B and comparably sized systems. It also operationalizes Liquid Foundation Models, a non-transformer-centric architecture, for a demanding scientific domain. Availability is the main caveat: the underlying LFM2-2.6B base weights are openly published on Hugging Face, but the MMAI-specialized checkpoint and the MMAI Gym training environment are offered through Insilico's membership/access program rather than as open weights, and no public training code repository accompanies the preprint. Independent replication of the reported benchmark gains is therefore limited at release.

Tags

admet_predictionretrosynthesisdrug_target_activity_predictionmolecular_optimizationstate_space_modeltransformerfoundation_modelmulti_taskdrug_discovery