A compact 167M-parameter sequence-only protein language model using a multiplicative LSTM that achieves competitive ProteinGym fitness prediction without structure or attention.
AINN-P1 is a compact protein language model developed by Ainnocence, an AI drug discovery company, and released as a bioRxiv preprint in March 2026. It targets a practical question raised by the recent generation of very large protein language models: how much of their predictive power actually depends on billions of parameters, attention mechanisms, or multiple sequence alignments (MSAs). AINN-P1 answers this by showing that a 167M-parameter, attention-free, sequence-only model can match much larger transformers on standard protein-fitness benchmarks.
Rather than the now-dominant transformer architecture, AINN-P1 is built on a multiplicative LSTM (mLSTM), a recurrent design that scales linearly with sequence length and avoids the growing key-value caches that make transformer inference expensive on long sequences. The model is trained autoregressively on UniRef with straightforward next-token prediction, and protein fitness is read out zero-shot from the model's likelihoods, with optional frozen-encoder few-shot regression on top of its embeddings.
The work positions itself as evidence for a "sequence-only paradigm" — strong variant-effect prediction achievable without structural inputs, MSAs, or hundred-billion-parameter scale — which is attractive for cost-sensitive, high-throughput screening in drug discovery pipelines.
AINN-P1 is a 167M-parameter mLSTM trained by autoregressive next-token prediction on UniRef protein sequences. On the ProteinGym deep mutational scanning benchmark, it reports an average Spearman rho of 0.441 across four task categories and 0.625 on the stability category. The authors report that this average edges past the ProSST baseline (0.438) while using far fewer parameters than billion- and hundred-billion-scale transformer protein language models. Evaluation uses both zero-shot scoring from model likelihoods and few-shot regression on top of frozen encoder representations.
AINN-P1 is aimed at protein engineering and therapeutic discovery workflows where variant-effect and fitness prediction must be run at scale and at low cost. Because it requires only a single sequence — no structure, no MSA — and runs on a modest parameter budget, it is well suited to high-throughput in silico screening of mutant libraries, stability optimization, and prioritization of candidate variants before wet-lab validation. As a commercial model from Ainnocence, it is positioned to support the company's drug discovery platform.
AINN-P1 contributes to an ongoing debate about scaling in protein language models, offering concrete evidence that competitive fitness prediction does not strictly require massive transformers or alignment-derived signals. Its mLSTM design also revisits recurrent architectures as an efficient alternative for biological sequence modeling. The main limitation for the broader community is access: the model is a commercial release under a CC-BY-NC license, and no public weights or code accompany the preprint, so independent reproduction and downstream use are currently constrained.