Foundation model pre-trained on 65M cryo-EM particle images via contrastive learning, enabling zero-shot classification, pose clustering, and quality assessment.
Cryo-IEF (Cryo-EM Image Evaluation Foundation) is a discriminative foundation model for cryo-electron microscopy (cryo-EM) image processing, developed at Westlake University. Pre-trained on approximately 65 million cryo-EM single-particle images using unsupervised contrastive learning, it learns general-purpose visual representations that transfer to a wide range of downstream processing tasks without requiring labeled data. The model was first posted as a bioRxiv preprint in November 2024 and subsequently published in Nature Methods in 2025.
Cryo-EM is a transformative structural biology technique capable of resolving macromolecular structures at near-atomic resolution, but processing the raw data remains expert-intensive and computationally demanding. Key bottlenecks include distinguishing real particles from background contamination, clustering particles by viewing angle to overcome preferred orientation artifacts, and assessing individual image quality at scale. Cryo-IEF addresses all three challenges with a single pre-trained visual encoder, reducing the need for task-specific labeled datasets and domain expertise.
The model also underpins CryoWizard, a fully automated single-particle cryo-EM pipeline that takes raw micrographs as input and produces a 3D reconstruction end-to-end. By combining Cryo-IEF's feature representations with K-Means++ pose clustering, CryoWizard achieves angular subsampling that mitigates the preferred orientation problem — a longstanding source of map distortion that affects a large fraction of published cryo-EM datasets.
Cryo-IEF uses a Vision Transformer (ViT) as its backbone encoder. Pre-training follows a contrastive learning framework analogous to MoCo v3: each input particle image is independently augmented twice to produce two views, which are encoded by parallel online and momentum encoders. The training objective maximizes similarity between representations of different augmented views of the same particle while minimizing similarity between distinct particles. Augmentations include random cropping, color jittering, Gaussian blurring, solarization, and rotation — choices specifically motivated by cryo-EM physics, where particle orientations in the micrograph are arbitrary and true signal is rotation-invariant.
The pre-training corpus comprises approximately 65 million single-particle images drawn from publicly deposited datasets, primarily EMPIAR (the Electron Microscopy Public Image Archive), spanning a wide diversity of protein complexes and imaging conditions. Benchmarks for CryoWizard — the downstream automated pipeline built on a fine-tuned Cryo-IEF — demonstrate substantial improvement over conventional processing on preferred-orientation datasets. On EMPIAR-10217, manual processing yielded a conical FSC anisotropy ratio (cFAR) of 0.01 (severely anisotropic); CryoWizard achieved a cFAR of 0.74 at 2.37 A resolution. On EMPIAR-10096, the cFAR improved from 0.03 (manual) to 0.34 at 2.78 A, demonstrating that Cryo-IEF-based pose clustering can recover angular diversity that conventional picking and classification workflows routinely miss.
Cryo-IEF is designed for structural biologists and cryo-EM facility staff who need to process large datasets efficiently and reproducibly. Its zero-shot classification capability is particularly valuable early in a project, where rapid screening of whether a dataset contains multiple protein species or significant contamination informs downstream decisions without requiring new labeling. For samples exhibiting preferred orientation — a pervasive problem for proteins that adsorb asymmetrically to the air-water interface — the pose clustering workflow in CryoWizard provides a practical mitigation strategy that previously required extensive manual intervention or specialized grid chemistry. Quality-based particle curation scales naturally to datasets of millions of picks, making Cryo-IEF relevant to high-throughput cryo-EM efforts at synchrotron facilities and pharmaceutical research labs.
Cryo-IEF represents a meaningful step toward fully automated cryo-EM data processing by demonstrating that a single foundation model can serve multiple quality control and data organization tasks that have historically required separate, hand-tuned algorithms. Its publication in Nature Methods reflects the significance of the approach to the structural biology community. Notable limitations temper its immediate scope: the model and CryoWizard pipeline are validated exclusively on single-particle cryo-EM, leaving cryo-electron tomography and subtomogram averaging workflows unaddressed. Automated pipelines can also underperform on highly challenging samples — very small proteins, highly flexible complexes, or cases of extreme preferred orientation — where expert judgment remains necessary. The pre-training dataset, while large, is skewed toward complexes that are well-represented in public archives, which may limit generalization to underrepresented sample classes. Model weights are freely available on HuggingFace at westlake-repl/Cryo-IEF, and the full codebase including the CryoWizard pipeline is open-source on GitHub.
Yan, Y., et al. (2025) A comprehensive foundation model for cryo-EM image processing. Nature Methods.
DOI: 10.1038/s41592-025-02916-8