Shanghai Jiao Tong University / Fudan University / East China Normal University
A reinforcement-learning-driven generative framework that unifies sequence generation with multi-objective RNA codon optimization, generalizing across six species and five RNA types without per-dataset retraining.
Designing the coding sequence of an RNA molecule is a deceptively hard optimization problem. For any given protein, a vast number of synonymous codon sequences encode the same amino acids, but these alternatives differ dramatically in properties that matter for RNA therapeutics — translational efficiency, secondary-structure stability, and species-specific codon usage among them. Existing computational methods typically follow a decoupled "generate-or-optimize" paradigm, separating candidate generation from objective optimization, and they tend to struggle on long sequences and to generalize poorly beyond the narrow dataset on which they were tuned.
RNARL, introduced in a June 2026 bioRxiv preprint by Shenggeng Lin, Yi Xiong, and colleagues at Shanghai Jiao Tong University (with collaborators at Fudan University and East China Normal University), reframes the problem as a single reinforcement-learning task. Rather than first proposing sequences and then optimizing them, RNARL directly learns a policy that generates high-performance codon sequences while simultaneously balancing multiple design objectives. The result is a unified framework that produces optimized sequences in one pass and, critically, transfers across organisms and RNA classes without retraining a new model for each setting.
The work targets the practical needs of RNA therapeutic design, where mRNA vaccines and protein-replacement therapies depend on codon sequences tuned to a particular host species and expression context. By coupling generation and optimization, RNARL aims to make this tuning faster and more broadly applicable than the per-dataset pipelines that preceded it.
RNARL is a reinforcement-learning-driven generative framework in which sequence generation and multi-objective optimization are unified within a single policy, rather than separated into distinct generation and optimization stages. The model learns to emit codon sequences directly, using its multi-objective reward to steer generation toward candidates that satisfy several design criteria simultaneously. The authors report that RNARL effectively handles coding sequences over 3,900 nucleotides and evaluate it across six species and five RNA types, reporting superior performance and universality relative to existing approaches in these settings. Detailed architecture specifications, training-data composition, parameter count, and benchmark tables are described in the preprint; as of this writing no public code repository or downloadable model weights have been located, and the web platform URL is not stated in the abstract. The preprint is released under a CC BY-NC-ND 4.0 license.
RNARL is aimed at RNA therapeutic design, where the codon sequence of an mRNA must be optimized for efficient and stable expression in a specific host. Its generalization across multiple species and RNA types makes it relevant to mRNA vaccine and protein-replacement programs that need host-tuned sequences, and to research groups exploring synonymous-sequence design for non-coding and coding RNAs alike. The freely available web platform lowers the barrier for experimental biologists, allowing them to generate optimized candidate sequences without building or training models themselves.
By unifying generation and optimization into one reinforcement-learning policy that transfers across species and RNA types, RNARL contributes to a growing line of RL-based codon-design tools — alongside contemporaries such as CodonRL and codonGPT — that move beyond single-objective, single-dataset optimization. Its emphasis on long-sequence handling and cross-setting generalization addresses two recurring limitations of earlier methods, and the accompanying public web platform broadens access for therapeutic developers. As a recent preprint, its real-world impact and benchmark standing remain to be established through peer review and independent evaluation, and the absence of released code or weights currently limits independent reproduction.
Lin, S., et al. (2026) Reinforcement learning-driven unified generative framework for multi-objective RNA codon design. bioRxiv.
DOI: 10.64898/2026.06.12.732012Papers that recently cited this model.
The most-cited papers that cite this model.
Not enough data