A multimodal ChatGPT-style LLM that fuses a molecular-graph GNN, a molecular-image CNN, and a Vicuna-13B backbone for interactive, free-form prediction of metabolite mechanisms and properties.
MetaboliteChat is a multimodal large language model that brings a ChatGPT-style conversational interface to metabolite analysis. Rather than predicting a single fixed endpoint, it accepts a metabolite's molecular structure together with a free-form natural-language question and returns free-text answers about the molecule's biological mechanisms, functions, and physicochemical properties. The model was developed by Zhenhao Guo and colleagues in Pengtao Xie's lab at New York University and released as a bioRxiv preprint in November 2025.
Metabolomics has historically relied on narrow, task-specific predictors that each address one property and must be retrained for every new question. This fragments the analysis workflow and makes it hard for non-specialists to interrogate a metabolite holistically. MetaboliteChat reframes the problem as instruction-following: by aligning molecular representations with a language model, it lets a researcher pose arbitrary questions in plain English and receive explanatory, multi-turn responses grounded in the molecule's structure.
The model sits within a growing family of biomedical multimodal LLMs (such as LLaVA-Med and BioGPT) but is distinctive in targeting small-molecule metabolites specifically and in fusing two complementary structural views — a molecular graph and a rendered molecular image — into a single conversational system.
MetaboliteChat follows the now-common vision-language alignment recipe (its code is adapted from MiniGPT-4 and LAVIS) but substitutes molecule-aware encoders for a generic image encoder. A GNN encodes the molecular graph derived from the SMILES string while a CNN encodes a rendered image of the molecule; their features are projected into the embedding space of a Vicuna-13B-v1.5 backbone via a trainable projection layer, with the backbone providing language understanding and generation. The instruction-tuning corpus is built from the Human Metabolome Database (HMDB), formatted as per-molecule question–answer pairs, and spans 152,222 metabolites. The authors report that MetaboliteChat outperforms both generic LLMs and task-specific baselines on metabolite analysis tasks. Inference runs from a fixed checkpoint, enabling zero-shot use on new metabolites; training and evaluation require an NVIDIA GPU with roughly 70–80 GB of memory.
MetaboliteChat is aimed at metabolomics researchers, computational chemists, and drug-discovery scientists who need to rapidly characterize metabolites without building a bespoke model for each property of interest. Because it answers free-form questions, it can serve as an exploratory assistant — summarizing a molecule's likely mechanisms, suggesting functional roles, or comparing properties across candidates — and lowers the barrier for wet-lab biologists who lack machine-learning expertise. Its zero-shot generalization is particularly useful for newly identified or poorly annotated metabolites where labeled training data is scarce.
MetaboliteChat extends the conversational, instruction-following paradigm that has reshaped protein and pathology AI into the metabolomics domain, where multimodal LLMs remain comparatively rare. By unifying structural and textual reasoning in one interactive system, it points toward more accessible, explanation-oriented tools for small-molecule analysis. As a preprint with code released under a BSD-3-Clause license and weights distributed via Google Drive, its results await peer review and independent benchmarking; the authors explicitly position it as a prototype requiring expert validation before any applied or pharmaceutical use.
Guo, Z., et al. (2025) MetaboliteChat: A Unified Multimodal Large Language Model for Interactive Metabolite Analysis and Functional Insights. bioRxiv.
DOI: 10.1101/2025.11.07.687008Papers that recently cited this model.
M. D. Mashabela, Nompumelelo R. Sibanyoni, Kamogelo Mmotla, et al.
Current Plant Biology · Jun 2026
Shuai Xiao, Suhai Liu, Wei Zhou, et al.
May 2026
Zhenhao Guo, Rachit Saluja, Tianyuan Yao, et al.
arXiv.org · Nov 2025
The most-cited papers that cite this model.
Shuai Xiao, Suhai Liu, Wei Zhou, et al.
May 2026
Zhenhao Guo, Rachit Saluja, Tianyuan Yao, et al.
arXiv.org · Nov 2025
M. D. Mashabela, Nompumelelo R. Sibanyoni, Kamogelo Mmotla, et al.
Current Plant Biology · Jun 2026
Share of papers citing this model.