Arc Institute / UCSF / University of Toronto / Vector Institute
A multimodal reasoning LLM that fuses protein-language-model embeddings with biological context to generate interpretable reasoning traces for protein function and GO-term annotation.
BioReason-Pro is a multimodal reasoning large language model for protein function prediction, developed by the Arc Institute together with UCSF and the Bo Wang Lab (University of Toronto / Vector Institute / University Health Network) and released as a bioRxiv preprint in March 2026. It reframes function annotation — typically a multi-label classification problem over Gene Ontology (GO) terms — as a reasoning task, generating step-by-step explanations that connect a protein's domain architecture, interaction partners, and organism context to its molecular function, biological processes, and cellular localization.
Architecturally, BioReason-Pro fuses protein embeddings from the ESM3 protein language model with the Qwen3 language model, allowing it to ground natural-language reasoning in learned structural and evolutionary representations. A companion model, GO-GPT — an autoregressive transformer over GO terms that captures the ontology's hierarchical and cross-aspect dependencies — supplies structured label priors to the reasoning pipeline.
The model addresses a long-standing weakness of black-box function predictors: their outputs are labels without rationale. By emitting explicit reasoning traces, BioReason-Pro produces annotations that human experts can scrutinize, and in a blinded comparison experts preferred its annotations over ground-truth UniProt entries in 79% of cases.
BioReason-Pro is built by coupling ESM3 protein embeddings with the Qwen3 language model. It is trained via supervised fine-tuning on synthetic reasoning traces generated by GPT-5 for over 130,000 proteins, then further optimized through reinforcement learning. On GO term prediction it reaches 73.6% Fmax, and an LLM judge assigns its functional summaries a score of 8/10. The inference pipeline runs on a single GPU and covers 200+ organisms. Weights for GO-GPT and both the SFT and RL variants of BioReason-Pro are released on HuggingFace, with precomputed predictions for 223,000+ proteins available as a dataset and through the web demo.
BioReason-Pro is aimed at researchers who need not just a predicted function but a defensible rationale — for example, annotating proteins of unknown function from new genomes and metagenomes, prioritizing targets in therapeutic discovery, and curating or auditing existing database entries. The interpretable reasoning traces make it suitable as an assistant for expert biocurators, and its single-GPU inference and 200+ organism coverage lower the barrier for routine use.
By treating protein annotation as an interpretable reasoning problem and pairing it with an open release of code, weights, and large-scale precomputed predictions, BioReason-Pro pushes function prediction toward transparency and auditability rather than opaque scoring. The finding that experts preferred its annotations over UniProt ground truth in 79% of blinded cases is a notable signal of practical quality, though such expert-preference results depend on evaluation design and merit independent replication. The work also extends the Bo Wang Lab's earlier BioReason line, which coupled DNA foundation models with LLMs, to the protein domain.