A 2.6B-parameter Liquid Foundation Model fine-tuned via Insilico's MMAI Gym to handle ADMET, retrosynthesis, drug-target activity, and molecular optimization in one checkpoint.
The MMAI Liquid Foundation Model (released as LFM2-2.6B-MMAI) is a compact, multi-task foundation model for small-molecule drug discovery, developed through a strategic partnership between Liquid AI and Insilico Medicine and announced in March 2026. Rather than relying on the in-context learning of very large general-purpose language models — which the authors show does not reliably deliver the scientific understanding required for chemistry — the model is purpose-trained to learn what its creators call "the language of molecules." A single 2.6-billion-parameter checkpoint covers tasks that are usually served by a patchwork of specialist models.
The model is built on LFM2, Liquid AI's hybrid Liquid Foundation Model architecture descended from liquid neural networks, and is specialized using Insilico's MMAI Gym for Science (MMAI = Multi-Modal AI). MMAI Gym is a structured training-and-evaluation environment that supplies curated molecular data formats, task-specific reasoning traces, and benchmarking recipes spanning over 1,000 pharmaceutical benchmarks. The central claim of the accompanying preprint is that a smaller, domain-trained foundation model can match or surpass models an order of magnitude larger across the drug discovery pipeline.
Because the checkpoint is small enough to run on private infrastructure, it is positioned for pharmaceutical and biotech teams that need state-of-the-art molecular reasoning without sending proprietary chemical data to external cloud APIs — a recurring barrier to AI adoption in regulated drug discovery settings.
LFM2-2.6B-MMAI is a fine-tune of the openly released LFM2-2.6B base model, a hybrid architecture combining gated short convolutions with grouped-query attention. Specialization was performed in the MMAI Gym environment using approximately 120 billion tokens of pharmaceutical data spanning 200+ tasks, with supervised fine-tuning operating over an aggregate context on the order of millions of tokens per optimization step. On benchmarks the model is competitive with or superior to much larger systems: it improves on TxGemma-27B for the majority of ADME/PK property-prediction tasks (for instance, a Caco-2 permeability MAE of 0.347 versus 0.401) and achieves near specialist-level performance on retrosynthesis, drug-target activity, and molecular optimization. The preprint was released on arXiv on 3 March 2026 (arXiv:2603.03517) under a CC-BY-NC-ND 4.0 license.
The model targets practical medicinal-chemistry and lead-optimization workflows: predicting ADMET/pharmacokinetic and toxicity properties of candidate compounds, proposing retrosynthetic routes, estimating drug-target binding/activity, and performing multi-parameter molecular optimization. Its small footprint makes it suitable for deployment inside a pharma company's own secure environment, where it can serve as a unified reasoning engine across the discovery pipeline rather than requiring teams to stitch together multiple specialist tools or expose confidential chemical libraries to third-party cloud services.
LFM2-2.6B-MMAI is a notable data point in the broader shift toward small, domain-specialized foundation models that rival or exceed much larger general-purpose LLMs on scientific tasks — here demonstrated against TxGemma-27B and comparably sized systems. It also operationalizes Liquid Foundation Models, a non-transformer-centric architecture, for a demanding scientific domain. Availability is the main caveat: the underlying LFM2-2.6B base weights are openly published on Hugging Face, but the MMAI-specialized checkpoint and the MMAI Gym training environment are offered through Insilico's membership/access program rather than as open weights, and no public training code repository accompanies the preprint. Independent replication of the reported benchmark gains is therefore limited at release.