3D Volume Reconstruction (compression factor C: 1.33)

Ground Truth
cryoSENSE-DCT Recovery
cryoSENSE-DDPM Recovery

Reconstructed Atomic Model (compression factor C: 1.33)

Ground Truth
cryoSENSE-DCT Recovery
cryoSENSE-DDPM Recovery
cryoSENSE Figure 1

Figure 1: cryoSENSE increases data throughput by preserving 3D structural detail under high compression factors. a, Original 3D cryo-EM volume from uncompressed particle images. b, cryoSENSE 3D reconstructions from compressively acquired images, using generative (top row) and sparse priors (bottom row). Volumes are color-coded by their FSC resolution (lower = better).

cryoSENSE Figure 2

Figure 2: Overview of cryoSENSE. a, cryoSENSE employs pixel-space and Fourier-space masking strategies to obtain compressed measurements. b, sparsity priors enable image recovery via proximal gradient descent. c, generative priors learn a low-dimensional manifold of cryo-EM images and guide a diffusion process to generate images consistent with the measurements. d, cryoSENSE enables high-throughput acquisition of cryo-EM images, which are then validated through downstream biological tasks such as e, 3D volume reconstruction, atomic model building, and conformational heterogeneity analysis.

cryoSENSE Figure 3

Figure 3: a, Example EMPIAR-10076 particle images used for CryoDRGN heterogeneity analysis. b, UMAP projection of CryoDRGN latent space trained on original 128×128 images, showing four distinct conformational clusters colored by GMM labels. c, Example cryoSENSE - DDPM reconstructions obtained from K = 16 downsampled images with C = 1.25. d, UMAP projection of CryoDRGN latent space trained on DDPM reconstructions, colored by original GMM cluster labels from b. e, Example cryoSENSE - DCT reconstructions obtained from K = 16 downsampled images with C = 1.25. f, UMAP projection of CryoDRGN latent space trained on DCT reconstructions, colored by original GMM cluster labels from b.

cryoSENSE Figure 4

Figure 4: a, Atomic models from ModelAngelo fit into reconstructed cryo-EM 3D volumes for the Original dataset (EMPIAR-10648) and cryoSENSE reconstructions (K=2, C=1.33) using DCT and DDPM priors. Chains are colored by ModelAngelo prediction confidence scores. b, FSC curves for the three corresponding cryo-EM 3D volumes. c, Chain-level sequence alignment score vs. sequence identity for matched chains between Original vs. DDPM and Original vs. DCT; marker size reflects backbone RMSD. d, cryoSENSE - DDPM atomic model with four representative chain regions highlighted, corresponding to the same colored regions shown in g. e, First example structural comparison of the highlighted region in the Original model with its corresponding DDPM- and DCT-reconstructed regions. f, Second example structural comparison of the highlighted region in the Original model with its corresponding DDPM- and DCT-reconstructed regions. g, Original atomic model with four chain regions highlighted corresponding to the same regions as in d and h. h, cryoSENSE - DCT atomic model with four representative chain regions highlighted, corresponding to the same colored regions shown in g.

Abstract

Cryo-electron microscopy (cryo-EM) enables the atomic-resolution visualization of biomolecules; however, modern direct detectors generate data volumes that far exceed the available storage and transfer bandwidth, thereby constraining practical throughput. We introduce cryoSENSE, the computational realization of a hardware-software co-designed framework for compressive cryo-EM sensing and acquisition. We show that cryo-EM images of proteins lie on low-dimensional manifolds that can be independently represented using sparse priors in predefined bases and generative priors captured by a denoising diffusion model. cryoSENSE leverages these low-dimensional manifolds to enable faithful image reconstruction from spatial and Fourier-domain undersampled measurements while preserving downstream structural resolution. In experiments, cryoSENSE increases acquisition throughput by up to 2.5× while retaining the original 3D resolution, offering controllable trade-offs between the number of masked measurements and the level of downsampling.

High Throughput

Achieves up to 2.5× compression factor while preserving 3D structural resolution at near-atomic detail.

Dual Prior Framework

Combines handcrafted sparsity priors (DCT, wavelets, TV) with learned generative diffusion priors for robust reconstruction.

Flexible Acquisition

Supports both pixel-space masking (coded apertures) and Fourier-space masking (phase plates) strategies.

Biological Fidelity

Preserves conformational heterogeneity (80-88% cluster agreement) and enables accurate atomic model building.