cryoSENSE: Compressive Sensing Enables High-throughput Microscopy with Sparse and Generative Priors on the Protein Cryo-EM Image Manifold

Zain Shabeeb^1,† Daniel Saeedi^2,† Darin Tsui^2,† Vida Jamali^1,* Amirali Aghazadeh^2,*

¹School of Chemical and Biomolecular Engineering, Georgia Institute of Technology ²School of Electrical and Computer Engineering, Georgia Institute of Technology ^†Equal contributions ^*Correspondence to: vida@gatech.edu, amiralia@gatech.edu

3D Volume Reconstruction (compression factor C: 1.33)

Ground Truth

cryoSENSE-DCT Recovery

cryoSENSE-DDPM Recovery

Reconstructed Atomic Model (compression factor C: 1.33)

Ground Truth

cryoSENSE-DCT Recovery

cryoSENSE-DDPM Recovery

‹

›

Figure 1: cryoSENSE increases data throughput by preserving 3D structural detail under high compression factors. a, Original 3D cryo-EM volume from uncompressed particle images. b, cryoSENSE 3D reconstructions from compressively acquired images, using generative (top row) and sparse priors (bottom row). Volumes are color-coded by their FSC resolution (lower = better).

Figure 2: Overview of cryoSENSE. a, cryoSENSE employs pixel-space and Fourier-space masking strategies to obtain compressed measurements. b, sparsity priors enable image recovery via proximal gradient descent. c, generative priors learn a low-dimensional manifold of cryo-EM images and guide a diffusion process to generate images consistent with the measurements. d, cryoSENSE enables high-throughput acquisition of cryo-EM images, which are then validated through downstream biological tasks such as e, 3D volume reconstruction, atomic model building, and conformational heterogeneity analysis.

Figure 3: a, Example EMPIAR-10076 particle images used for CryoDRGN heterogeneity analysis. b, UMAP projection of CryoDRGN latent space trained on original 128×128 images, showing four distinct conformational clusters colored by GMM labels. c, Example cryoSENSE - DDPM reconstructions obtained from K = 16 downsampled images with C = 1.25. d, UMAP projection of CryoDRGN latent space trained on DDPM reconstructions, colored by original GMM cluster labels from b. e, Example cryoSENSE - DCT reconstructions obtained from K = 16 downsampled images with C = 1.25. f, UMAP projection of CryoDRGN latent space trained on DCT reconstructions, colored by original GMM cluster labels from b.

Figure 4: a, Atomic models from ModelAngelo fit into reconstructed cryo-EM 3D volumes for the Original dataset (EMPIAR-10648) and cryoSENSE reconstructions (K=2, C=1.33) using DCT and DDPM priors. Chains are colored by ModelAngelo prediction confidence scores. b, FSC curves for the three corresponding cryo-EM 3D volumes. c, Chain-level sequence alignment score vs. sequence identity for matched chains between Original vs. DDPM and Original vs. DCT; marker size reflects backbone RMSD. d, cryoSENSE - DDPM atomic model with four representative chain regions highlighted, corresponding to the same colored regions shown in g. e, First example structural comparison of the highlighted region in the Original model with its corresponding DDPM- and DCT-reconstructed regions. f, Second example structural comparison of the highlighted region in the Original model with its corresponding DDPM- and DCT-reconstructed regions. g, Original atomic model with four chain regions highlighted corresponding to the same regions as in d and h. h, cryoSENSE - DCT atomic model with four representative chain regions highlighted, corresponding to the same colored regions shown in g.

Abstract

Cryo-electron microscopy (cryo-EM) enables the atomic-resolution visualization of biomolecules; however, modern direct detectors generate data volumes that far exceed the available storage and transfer bandwidth, thereby constraining practical throughput. We introduce cryoSENSE, the computational realization of a hardware-software co-designed framework for compressive cryo-EM sensing and acquisition. We show that cryo-EM images of proteins lie on low-dimensional manifolds that can be independently represented using sparse priors in predefined bases and generative priors captured by a denoising diffusion model. cryoSENSE leverages these low-dimensional manifolds to enable faithful image reconstruction from spatial and Fourier-domain undersampled measurements while preserving downstream structural resolution. In experiments, cryoSENSE increases acquisition throughput by up to 2.5× while retaining the original 3D resolution, offering controllable trade-offs between the number of masked measurements and the level of downsampling.

High Throughput

Achieves up to 2.5× compression factor while preserving 3D structural resolution at near-atomic detail.

Dual Prior Framework

Combines handcrafted sparsity priors (DCT, wavelets, TV) with learned generative diffusion priors for robust reconstruction.

Flexible Acquisition

Supports both pixel-space masking (coded apertures) and Fourier-space masking (phase plates) strategies.

Biological Fidelity

Preserves conformational heterogeneity (80-88% cluster agreement) and enables accurate atomic model building.

Fourier Space Masking

C is the compression ratio. The higher the C, the more aggressive the compression.

C = 2

cryoSENSE: Compressive Sensing Enables High-throughput Microscopy with Sparse and Generative Priors on the Protein Cryo-EM Image Manifold

3D Volume Reconstruction (compression factor C: 1.33)

Reconstructed Atomic Model (compression factor C: 1.33)

Abstract

High Throughput

Dual Prior Framework

Flexible Acquisition

Biological Fidelity

Interactive Masking Strategies

Original Image

Current Mask (Mask 1/1)

Compression Info

Masked Image with Block Grid

Block Summation Process

Compressed Output

Matrix Representation (Compressed Output)

Fourier Space Masking

C = 2

Uniform Random

Annular Ring

Radial Spoke

C = 2.5

Uniform Random

Annular Ring

Radial Spoke

C = 3.33

Uniform Random

Annular Ring

Radial Spoke

C = 5

Uniform Random

Annular Ring

Radial Spoke

C = 10

Uniform Random

Annular Ring

Radial Spoke

From Low-Res Measurements to High-Res Reconstruction using cryoSENSE!

Block Size = 2

Diffusion Reconstruction Process

Block Size = 32

Diffusion Reconstruction Process