Pengcheng Laboratory / Sun Yat-sen University / Tsinghua University
A DeepSeek-7B-based multi-task large reasoning model that applies chain-of-thought reasoning and reinforcement learning across ~10 molecular science task families.
Mol-Reasoning is a multi-task large reasoning model for molecular science developed by researchers at Pengcheng Laboratory, Sun Yat-sen University, and Tsinghua University, released as an arXiv preprint in March 2026. Rather than treating molecular property prediction, generation, and reaction tasks as purely data-driven pattern matching, the model is designed to emulate how a molecular scientist thinks—reasoning step by step and reflecting on intermediate conclusions before committing to an answer. It builds on the pre-trained DeepSeek-7B language model and augments it with multi-specialist modules and a chain-of-thought framework refined through reinforcement learning.
The central thesis is that embedding explicit reasoning mechanisms enables high-efficiency learning: a comparatively small 7B-parameter model can match or exceed much larger foundation models while using substantially less training data and compute. The authors report a roughly 50.3% average improvement over the DeepSeek-7B base model across their evaluation suite, and competitive or superior results against more than 20 state-of-the-art multi-task molecular LLMs, including ultra-large-parameter systems.
Mol-Reasoning fits into a growing class of "reasoning-first" chemistry LLMs that pair SMILES-aware language modeling with structured deliberation, positioning interpretability and sample efficiency as alternatives to brute-force scale. The released artifact is a fixed checkpoint intended to generalize to new inputs across its supported task families without per-task retraining.
Mol-Reasoning is built on DeepSeek-7B, a 7-billion-parameter transformer language model, extended with multi-specialist modules and a reinforcement-learning-trained chain-of-thought mechanism. Evaluation spans 10 molecular tasks and 47 metrics covering captioning, generation, name translation, property classification and regression, and reaction prediction. The authors report an average improvement of approximately 50.3% over the base DeepSeek-7B model and report outperforming more than 20 state-of-the-art multi-task LLM baselines, several of which have substantially larger parameter counts. A case study applies the model to central-nervous-system drug-candidate design, illustrating the practical reach of its reasoning-driven predictions.
Mol-Reasoning targets computational chemists and drug-discovery researchers who need a single model to handle multiple molecular tasks—predicting ADMET-relevant properties such as blood-brain-barrier penetration and toxicity, generating and describing molecules, translating between SMILES and IUPAC nomenclature, and proposing forward or retrosynthetic reactions. Its interpretable chain-of-thought outputs are particularly useful where practitioners want to inspect the reasoning behind a prediction, as demonstrated in the paper's CNS drug-candidate design case study.
Mol-Reasoning contributes to the argument that explicit reasoning can substitute for raw parameter scale in molecular science, offering a more interpretable and resource-efficient path for chemistry LLMs. As a recent (March 2026) preprint, its long-term adoption is still unproven, and openness is currently limited: the GitHub repository states the data and software are for non-commercial use only, and model weights are reported as forthcoming following publication rather than released at preprint time. These availability and licensing constraints—alongside the absence of peer review to date—should be weighed when considering the model for downstream or commercial work.