SciCore-Omics

Nanjing University / OpenBMB / Tsinghua University

Tri-modal foundation model unifying histology images, spatial transcriptomics, and language for zero-shot pathology and spatial biology reasoning.

Released: May 2026

Parameters: 8 Billion

SciCore-Omics is a tri-modal foundation model that unifies histology images, spatial transcriptomics, and biological language within a single architecture for spatial biology and pathology reasoning. Spatial biology sits at the intersection of two data types that have historically been modeled in isolation: hematoxylin-and-eosin (H&E) histology, which captures tissue morphology at high resolution, and spatial transcriptomics, which measures gene expression while preserving spatial context. SciCore-Omics treats these as complementary views of the same tissue and aligns both with natural language, so that a single model can reason across morphology, expression, and biomedical text.

Developed by researchers at Nanjing University together with the OpenBMB community and Zhiyuan Liu's THUNLP group at Tsinghua University, the model was released as a bioRxiv preprint on May 30, 2026. Its central design goal is zero-shot generalization: from one fixed, openly licensed checkpoint, SciCore-Omics performs histopathology classification, gene-expression prediction, and spatial-domain recognition without task-specific fine-tuning. This places it alongside pathology foundation models and spatial-omics models, while its joint treatment of all three modalities, rather than image-text or expression-only pairing, sets it apart.

The model and training code are released under Apache-2.0, with weights on Hugging Face and an interactive demo Space, making it one of the more openly available tri-modal spatial biology models to date.

Key Features

Tri-modal unification: Histology images, spatial transcriptomics (H5AD profiles), and biological language are aligned in a shared representation space, supporting image-only, gene-only, and joint image-gene reasoning as well as free-text biomedical interpretation.
Zero-shot generalization: A single fixed checkpoint handles histopathology classification, gene-expression prediction, and spatial-domain recognition without per-task fine-tuning.
Gene-aware language bridge: A NicheFormer gene encoder, a Gene Q-Former for embedding compression, and a Gene Projector translate transcriptomic profiles into the language-model token space.
Three-stage progressive training: Training proceeds from gene-bridge distillation through continued pretraining and supervised fine-tuning to reinforcement-learning refinement.
Open release: Apache-2.0 weights, training pipeline, and a Hugging Face demo Space lower the barrier for reuse and reproduction in research settings.
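
The three reasoning modes named in the first feature (image-only, gene-only, and joint image-gene) can be illustrated with a small input-routing sketch. This is not the released inference API: `SpotInput`, its field names, and `reasoning_mode` are hypothetical names introduced here only to show how a caller might decide which mode a given request falls into.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SpotInput:
    """Hypothetical request container; not part of the released code."""
    image_path: Optional[str] = None  # H&E histology patch, if provided
    h5ad_path: Optional[str] = None   # spatial transcriptomics profile (H5AD), if provided
    prompt: str = ""                  # free-text biomedical question


def reasoning_mode(request: SpotInput) -> str:
    """Return which of the three modes described above applies to a request."""
    has_image = request.image_path is not None
    has_genes = request.h5ad_path is not None
    if has_image and has_genes:
        return "image-gene"
    if has_image:
        return "image-only"
    if has_genes:
        return "gene-only"
    # Text alone is not one of the three modes listed in the feature description.
    raise ValueError("at least one of image_path or h5ad_path is required")


if __name__ == "__main__":
    joint = SpotInput(image_path="patch.png", h5ad_path="spot.h5ad",
                      prompt="Which spatial domain is this?")
    print(reasoning_mode(joint))
```

Routing on which inputs are present keeps a single entry point for all three modes, which mirrors the single-checkpoint design described above.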

Technical Details

SciCore-Omics is an 8-billion-parameter multimodal model built on a MiniCPM-V-style vision-language backbone. The gene-aware branch couples a NicheFormer encoder with a Gene Q-Former and a Gene Projector to map spatial transcriptomic profiles into the language model's token space, letting morphology and expression be reasoned over jointly with text. Pretraining uses 151,182 spatially paired spots — locations where histology and transcriptomic measurements are co-registered — as the supervision signal for cross-modal alignment. The three-stage progressive pipeline comprises gene-bridge distillation, Swift-based continued pretraining and supervised fine-tuning, and a GSPO/PPO-style reinforcement-learning refinement stage. The released checkpoint, training code, and inference utilities are Apache-2.0 licensed.
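
To make the gene-aware branch more concrete, the following is a minimal NumPy sketch of the general Q-Former idea: a small set of learned query vectors cross-attends over a variable-length set of gene embeddings, and a linear projector maps the compressed result to the language model's hidden size. It is an illustration under stated assumptions, not the authors' implementation. The class name `GeneBridgeSketch`, all dimensions, the single attention layer, and the random weights are hypothetical, and the random `gene_tokens` array only stands in for the output of a gene encoder such as NicheFormer.

```python
import numpy as np


def softmax(x: np.ndarray, axis: int = -1) -> np.ndarray:
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


class GeneBridgeSketch:
    """Hypothetical one-layer stand-in for a Gene Q-Former plus Gene Projector."""

    def __init__(self, d_gene: int = 32, n_queries: int = 4, d_lm: int = 64, seed: int = 0):
        rng = np.random.default_rng(seed)
        scale = 1.0 / np.sqrt(d_gene)
        # Learned queries (random here): their count fixes the output length.
        self.queries = rng.normal(size=(n_queries, d_gene))
        self.w_k = rng.normal(size=(d_gene, d_gene)) * scale
        self.w_v = rng.normal(size=(d_gene, d_gene)) * scale
        # Projector from gene-embedding width to the (assumed) LM hidden size.
        self.w_proj = rng.normal(size=(d_gene, d_lm)) * scale

    def compress(self, gene_tokens: np.ndarray) -> np.ndarray:
        """Cross-attention: (n_genes, d_gene) -> (n_queries, d_gene)."""
        keys = gene_tokens @ self.w_k
        values = gene_tokens @ self.w_v
        scores = self.queries @ keys.T / np.sqrt(keys.shape[-1])
        attn = softmax(scores, axis=-1)  # each query's weights over gene tokens
        return attn @ values

    def project(self, compressed: np.ndarray) -> np.ndarray:
        """Linear map into the language-model token space."""
        return compressed @ self.w_proj

    def __call__(self, gene_tokens: np.ndarray) -> np.ndarray:
        return self.project(self.compress(gene_tokens))


# Stand-in for encoder output: 200 gene tokens of width 32.
gene_tokens = np.random.default_rng(1).normal(size=(200, 32))
bridge = GeneBridgeSketch()
lm_tokens = bridge(gene_tokens)
print(lm_tokens.shape)  # (4, 64)
```

The point of the sketch is the shape change: however many gene tokens go in, a fixed number of vectors at the language model's width come out, which is what lets an expression profile sit alongside image and text tokens in one sequence.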

Applications

SciCore-Omics targets researchers in computational pathology and spatial biology who need to interpret tissue across modalities. Because it operates zero-shot from a fixed checkpoint, it can classify histopathology images, predict gene expression from morphology, and recognize spatial domains without assembling labeled training sets for each task. Its natural-language interface supports interactive biomedical reasoning over H&E slides and spatial transcriptomics, useful for exploratory tissue analysis and hypothesis generation. The authors emphasize that it is released for research use only and should not serve as a standalone clinical diagnostic or treatment-recommendation system.

Impact

SciCore-Omics contributes an openly licensed, fully released tri-modal model to the rapidly growing space of pathology and spatial-omics foundation models, where most prior work has unified at most two modalities. By coupling histology, spatial transcriptomics, and language under Apache-2.0 weights with public training code and a live demo, it offers a reusable base for spatial biology research and a template for bridging gene expression into vision-language models. As a 2026 preprint, its benchmark standing is still emerging, but its open release and zero-shot, multi-task design make it a notable reference point for cross-modal spatial biology.

Citation

Xiao, X., et al. (2026) SciCore-Omics: a tri-modal foundation model unifying histology, spatial transcriptomics and language for spatial biology. bioRxiv.

DOI: 10.64898/2026.05.30.728937


Citations

Total Citations: 0

Influential: 0

References: 59

GitHub

Stars: 10

Forks: 2

Open Issues: 0

Contributors: 2

Last Push: 1mo ago

Language: Python

License: Apache-2.0

HuggingFace

Downloads: 69

Likes: 10

Last Modified: 1mo ago

Pipeline: feature-extraction


Openness

bio.rodeo openness: Open weights (open weights, closed recipe)

Score: 65 (Partial)

Usability (can I run it?): 100

Reproducibility (can I retrain it?): 29

Model Openness Framework: Class III (Open Model)
