Mol-Reasoning

Pengcheng Laboratory / Sun Yat-sen University / Tsinghua University

Molecular reasoning model built on DeepSeek-7B, using chain-of-thought and reinforcement learning for property prediction, generation, and reactions.

Released: March 2026

Parameters: 7 Billion

Mol-Reasoning is a multi-task large reasoning model for molecular science developed by researchers at Pengcheng Laboratory, Sun Yat-sen University, and Tsinghua University, released as an arXiv preprint in March 2026. Rather than treating molecular property prediction, generation, and reaction tasks as purely data-driven pattern matching, the model is designed to emulate how a molecular scientist thinks—reasoning step by step and reflecting on intermediate conclusions before committing to an answer. It builds on the pre-trained DeepSeek-7B language model and augments it with multi-specialist modules and a chain-of-thought framework refined through reinforcement learning.

The central thesis is that embedding explicit reasoning mechanisms enables high-efficiency learning: a comparatively small 7B-parameter model can match or exceed much larger foundation models while using substantially less training data and compute. The authors report a roughly 50.3% average improvement over the DeepSeek-7B base model across their evaluation suite, and competitive or superior results against more than 20 state-of-the-art multi-task molecular LLMs, including ultra-large-parameter systems.

Mol-Reasoning fits into a growing class of "reasoning-first" chemistry LLMs that pair SMILES-aware language modeling with structured deliberation, positioning interpretability and sample efficiency as alternatives to brute-force scale. The released artifact is a fixed checkpoint intended to generalize to new inputs across its supported task families without per-task retraining.

Key Features

Multi-specialist modules: The architecture routes molecular problems through specialist components that supply diverse chemical expertise, rather than relying on a single monolithic head for every task.
Reasoning-enhanced reinforcement learning: A chain-of-thought framework is refined with reinforcement learning infused with molecular knowledge, encouraging structured, reflective derivations instead of single-shot answers.
Broad multi-task coverage: A single model spans roughly ten task families—including molecule captioning, SMILES generation, SMILES↔IUPAC translation, BBBP and ClinTox classification, ESOL and lipophilicity regression, and forward/retrosynthesis reaction prediction—evaluated across 47 metrics.
Sample- and compute-efficient: Reported gains are achieved with significantly fewer training examples and lower computational cost than competing large models.
Fixed, reusable checkpoint: The model applies to new inputs without task-specific fine-tuning, simplifying deployment across heterogeneous molecular workflows.

Technical Details

Mol-Reasoning is built on DeepSeek-7B, a 7-billion-parameter transformer language model, extended with multi-specialist modules and a reinforcement-learning-trained chain-of-thought mechanism. Evaluation spans 10 molecular tasks and 47 metrics covering captioning, generation, name translation, property classification and regression, and reaction prediction. The authors report an average improvement of approximately 50.3% over the base DeepSeek-7B model and report outperforming more than 20 state-of-the-art multi-task LLM baselines, several of which have substantially larger parameter counts. A case study applies the model to central-nervous-system drug-candidate design, illustrating the practical reach of its reasoning-driven predictions.

Applications

Mol-Reasoning targets computational chemists and drug-discovery researchers who need a single model to handle multiple molecular tasks—predicting ADMET-relevant properties such as blood-brain-barrier penetration and toxicity, generating and describing molecules, translating between SMILES and IUPAC nomenclature, and proposing forward or retrosynthetic reactions. Its interpretable chain-of-thought outputs are particularly useful where practitioners want to inspect the reasoning behind a prediction, as demonstrated in the paper's CNS drug-candidate design case study.

Impact

Mol-Reasoning contributes to the argument that explicit reasoning can substitute for raw parameter scale in molecular science, offering a more interpretable and resource-efficient path for chemistry LLMs. As a recent (March 2026) preprint, its long-term adoption is still unproven, and openness is currently limited: the GitHub repository states the data and software are for non-commercial use only, and model weights are reported as forthcoming following publication rather than released at preprint time. These availability and licensing constraints—alongside the absence of peer review to date—should be weighed when considering the model for downstream or commercial work.

Citation

A Multi-task Large Reasoning Model for Molecular Science

Preprint

Liu, P., et al. (2026) A Multi-task Large Reasoning Model for Molecular Science.

DOI: 10.48550/arXiv.2603.12808

Recent citations

Papers that recently cited this model.

Not enough citation data yet.

Top citations

The most-cited papers that cite this model.

Not enough citation data yet.

Citations

Total Citations2

Influential0

References55

GitHub

Stars1

Forks0

Open Issues0

Contributors1

Last Push9mo ago

LanguagePython

Fields of citing research

Not enough data

Openness

bio.rodeo opennessClosed · low usability and reproducibility

21Closed

Usability — can I run it?19

Reproducibility — can I retrain it?9

Model Openness Framework

Unclassified

Missing required components

Resources

GitHub Repository Research Paper

Key Features

Multi-specialist modules: The architecture routes molecular problems through specialist components that supply diverse chemical expertise, rather than relying on a single monolithic head for every task.

Reasoning-enhanced reinforcement learning: A chain-of-thought framework is refined with reinforcement learning infused with molecular knowledge, encouraging structured, reflective derivations instead of single-shot answers.

Broad multi-task coverage: A single model spans roughly ten task families—including molecule captioning, SMILES generation, SMILES↔IUPAC translation, BBBP and ClinTox classification, ESOL and lipophilicity regression, and forward/retrosynthesis reaction prediction—evaluated across 47 metrics.

Sample- and compute-efficient: Reported gains are achieved with significantly fewer training examples and lower computational cost than competing large models.

Fixed, reusable checkpoint: The model applies to new inputs without task-specific fine-tuning, simplifying deployment across heterogeneous molecular workflows.

Technical Details

Applications

Impact

Mol-Reasoning

Key Features

Technical Details

Applications

Impact

Citation

A Multi-task Large Reasoning Model for Molecular Science

Recent citations

Top citations

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Mol-Reasoning

Key Features

Technical Details

Applications

Impact

Citation

A Multi-task Large Reasoning Model for Molecular Science

Recent citations

Top citations

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Mol-Reasoning

#Key Features

#Technical Details

#Applications

#Impact

Citation

A Multi-task Large Reasoning Model for Molecular Science

Recent citations

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Mol-Reasoning

#Key Features

#Technical Details

#Applications

#Impact

Citation

A Multi-task Large Reasoning Model for Molecular Science

Recent citations

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact