Models (3)
DNA & Gene
DNABERT-S
MAGICS Lab
Species-aware DNA embedding model built on DNABERT-2, using contrastive learning to cluster and differentiate genomic sequences by species without labeled data.
DNA & Gene
DNABERT-2
MAGICS Lab
Multi-species genomic foundation model that replaces k-mer tokenization with byte pair encoding (BPE), reaching performance comparable to the prior state-of-the-art model with 21x fewer parameters.
DNA & Gene
DNABERT
Northwestern University
BERT-based pre-trained model for DNA sequences using k-mer tokenization. Achieves state-of-the-art performance on promoter, splice site, and transcription factor binding prediction.