Arc Institute
Independent research institute focused on fundamental biology
Models (8)
Generative pipeline for epitope-targeted de novo antibody (nanobody) CDR design that yields nanomolar binders from only dozens of designs per antigen.
A multimodal reasoning LLM that fuses protein-language-model embeddings with biological context to generate interpretable reasoning traces for protein function and GO-term annotation.
A tokenizer-free, hierarchical autoregressive genomic foundation model that adaptively chunks raw nucleotides, enabling efficient long-context learning and zero-shot variant and gene predictions.
Single-cell foundation model using tabular attention over context cells to enable zero-shot representation and in-context prediction of arbitrary perturbations.
Transformer model for predicting cellular responses to perturbations across diverse cell contexts, trained on over 267 million human single-cell profiles.
A family of codon-resolution language models trained on 130 million coding sequences from 20,000 species, revealing context-dependent codon grammar governing translation and mRNA stability.
Genomic foundation model trained on 9.3 trillion DNA base pairs spanning all domains of life, with 40B parameters and a 1-million-token context window.
A 7B parameter genomic foundation model using StripedHyena architecture to model prokaryotic DNA, RNA, and proteins at single-nucleotide resolution with 131k token context.