Government

Chinese Academy of Sciences

National academy and research organization of China

Location: Beijing, China
14 models(4 Single-cell, 3 Language model, 3 DNA & Gene, 2 Protein, 2 RNA, 2 Imaging, 1 Pathology)

Labs & Groups (3)

Models (14)

CellPulse

Wuhan Institute of Virology

Released April 24, 2026

A direction-aware foundation model trained on ~23M bulk RNA-seq differential-expression profiles that simulates coordinated gene dynamics in viral infection.

Single-cellLanguage model

OmniNA

Beijing Institute of Genomics / Chinese Academy of Sciences

Released April 13, 2026

217

Self-supervised generative foundation model jointly trained on 91.7M nucleotide sequences and structured annotations spanning 1.076 trillion bases, achieving SOTA on 23 nucleotide-language benchmarks.

DNA & Gene

IDiom

Chinese Academy of Sciences

Released April 11, 2026

Autoregressive language model trained on 37 million intrinsically disordered region sequences from the AlphaFold Database, generating IDR sequences conditioned on surrounding structured context.

Protein

mRNA-GPT

Chinese Academy of Sciences

Released April 2, 2026

23

Autoregressive generative model pretrained on 30 million full-length natural mRNA sequences that jointly optimizes 5' UTR, CDS, and 3' UTR for therapeutic mRNA stability and translation efficiency.

RNA

Digepath

Chinese Academy of Sciences

Released April 1, 2026

Subspecialty-specific computational pathology foundation model pretrained on 353 million multi-scale patches from 210,000 H&E slides for gastrointestinal pathology, achieving SOTA on 32 of 33 systematic downstream tasks.

Pathology

scLong

Chinese Academy of Sciences

Released April 1, 2026

21

Billion-parameter single-cell foundation model performing full self-attention across all 28,000 human genes, integrating Gene Ontology priors via GCN for long-range gene context capture in transcriptomics.

Single-cell

RegFormer

Chinese Academy of Sciences

Released April 1, 2026

8

GRN-informed single-cell foundation model combining gene regulatory hierarchy priors with long-sequence Mamba modeling for clustering, batch integration, perturbation modeling, and drug response prediction.

Single-cell

IDPForge

Chinese Academy of Sciences

Released March 25, 2026

114

Transformer-based protein language diffusion model generating all-atom intrinsically disordered protein conformational ensembles, validated against experimental NMR and SAXS data.

Protein

EVA

GENTEL Lab

Released March 24, 2026

1186

Long-context generative RNA foundation model trained on 114 million full-length RNA sequences, supporting de novo design of tRNAs, aptamers, CRISPR guide RNAs, mRNAs, and circular RNAs.

RNA

AntigenLM

Chinese Academy of Sciences / Beijing Institute of Genomics

Released February 9, 2026

A structure-aware generative DNA language model pretrained on influenza genomes that forecasts future antigenic variants across regions and subtypes.

DNA & Gene

Melody

Shandong University / University of Electronic Science and Technology of China / Chinese Academy of Sciences

Released November 23, 2025

A deep learning framework that predicts locus-specific DNA methylation across 39 human tissues from genomic sequence, with a scRNA-seq-augmented variant for unseen cell types.

DNA & Gene

EndoChat

Chinese University of Hong Kong / Huawei / Technical University of Munich / University of Strasbourg / Shandong University / Chinese Academy of Sciences

Released January 20, 2025

422050

Grounded multimodal large language model for endoscopic surgery, supporting visual dialogue, region-based question answering, and bounding-box grounding across surgical scene understanding tasks.

ImagingLanguage model

MedPLIB

Baidu / China Agricultural University / Chinese Academy of Sciences / Peking University

Released December 12, 2024

337130

Biomedical multimodal LLM with pixel-level insight, combining visual question answering, pixel-grounded prompts, and segmentation via a mixture-of-experts design.

ImagingLanguage model

GeneCompass

Chinese Academy of Sciences

Released October 8, 2024

124118

Knowledge-informed cross-species foundation model pre-trained on 101M human and mouse single-cell transcriptomes to decipher universal gene regulatory mechanisms.

Single-cell