CryoSPARC Guide
  • About CryoSPARC
  • Current Version
  • Licensing
    • Non-commercial license agreement
  • Setup, Configuration and Management
    • CryoSPARC Architecture and System Requirements
    • CryoSPARC Installation Prerequisites
    • How to Download, Install and Configure
      • Obtaining A License ID
      • Downloading and Installing CryoSPARC
      • CryoSPARC Cluster Integration Script Examples
      • Accessing the CryoSPARC User Interface
    • Deploying CryoSPARC on AWS
      • Performance Benchmarks
    • Using CryoSPARC with Cluster Management Software
    • Software Updates and Patches
    • Management and Monitoring
      • Environment variables
      • (Optional) Hosting CryoSPARC Through a Reverse Proxy
      • cryosparcm reference
      • cryosparcm cli reference
      • cryosparcw reference
    • Software System Guides
      • Guide: Updating to CryoSPARC v4
      • Guide: Installation Testing with cryosparcm test
      • Guide: Verify CryoSPARC Installation with the Extensive Validation Job (v4.3+)
      • Guide: Verify CryoSPARC Installation with the Extensive Workflow (≤v4.2)
      • Guide: Performance Benchmarking (v4.3+)
      • Guide: Download Error Reports
      • Guide: Maintenance Mode and Configurable User Facing Messages
      • Guide: User Management
      • Guide: Multi-user Unix Permissions and Data Access Control
      • Guide: Lane Assignments and Restrictions
      • Guide: Queuing Directly to a GPU
      • Guide: Priority Job Queuing
      • Guide: Configuring Custom Variables for Cluster Job Submission Scripts
      • Guide: SSD Particle Caching in CryoSPARC
      • Guide: Data Management in CryoSPARC (v4.0+)
      • Guide: Data Cleanup (v4.3+)
      • Guide: Reduce Database Size (v4.3+)
      • Guide: Data Management in CryoSPARC (≤v3.3)
      • Guide: CryoSPARC Live Session Data Management
      • Guide: Manipulating .cs Files Created By CryoSPARC
      • Guide: Migrating your CryoSPARC Instance
      • Guide: EMDB-friendly XML file for FSC plots
    • Troubleshooting
  • Application Guide (v4.0+)
    • A Tour of the CryoSPARC Interface
    • Browsing the CryoSPARC Instance
    • Projects, Workspaces and Live Sessions
    • Jobs
    • Job Views: Cards, Tree, and Table
    • Creating and Running Jobs
    • Low Level Results Interface
    • Filters and Sorting
    • View Options
    • Tags
    • Flat vs Hierarchical Navigation
    • File Browser
    • Blueprints
    • Workflows
    • Inspecting Data
    • Managing Jobs
    • Interactive Jobs
    • Upload Local Files
    • Managing Data
    • Downloading and Exporting Data
    • Instance Management
    • Admin Panel
  • Cryo-EM Foundations
    • Image Formation
      • Contrast in Cryo-EM
      • Waves as Vectors
      • Aliasing
  • Expectation Maximization in Cryo-EM
  • Processing Data in cryoSPARC
    • Get Started with CryoSPARC: Introductory Tutorial (v4.0+)
    • Tutorial Videos
    • All Job Types in CryoSPARC
      • Import
        • Job: Import Movies
        • Job: Import Micrographs
        • Job: Import Particle Stack
        • Job: Import 3D Volumes
        • Job: Import Templates
        • Job: Import Result Group
        • Job: Import Beam Shift
      • Motion Correction
        • Job: Patch Motion Correction
        • Job: Full-Frame Motion Correction
        • Job: Local Motion Correction
        • Job: MotionCor2 (Wrapper) (BETA)
        • Job: Reference Based Motion Correction (BETA)
      • CTF Estimation
        • Job: Patch CTF Estimation
        • Job: Patch CTF Extraction
        • Job: CTFFIND4 (Wrapper)
        • Job: Gctf (Wrapper) (Legacy)
      • Exposure Curation
        • Job: Micrograph Denoiser (BETA)
        • Job: Micrograph Junk Detector (BETA)
        • Interactive Job: Manually Curate Exposures
      • Particle Picking
        • Interactive Job: Manual Picker
        • Job: Blob Picker
        • Job: Template Picker
        • Job: Filament Tracer
        • Job: Blob Picker Tuner
        • Interactive Job: Inspect Particle Picks
        • Job: Create Templates
      • Extraction
        • Job: Extract from Micrographs
        • Job: Downsample Particles
        • Job: Restack Particles
      • Deep Picking
        • Guideline for Supervised Particle Picking using Deep Learning Models
        • Deep Network Particle Picker
          • T20S Proteasome: Deep Particle Picking Tutorial
          • Job: Deep Picker Train and Job: Deep Picker Inference
        • Topaz (Bepler, et al)
          • T20S Proteasome: Topaz Particle Picking Tutorial
          • T20S Proteasome: Topaz Micrograph Denoising Tutorial
          • Job: Topaz Train and Job: Topaz Cross Validation
          • Job: Topaz Extract
          • Job: Topaz Denoise
      • Particle Curation
        • Job: 2D Classification
        • Interactive Job: Select 2D Classes
        • Job: Reference Based Auto Select 2D (BETA)
        • Job: Reconstruct 2D Classes
        • Job: Rebalance 2D Classes
        • Job: Class Probability Filter (Legacy)
        • Job: Rebalance Orientations
        • Job: Subset Particles by Statistic
      • 3D Reconstruction
        • Job: Ab-Initio Reconstruction
      • 3D Refinement
        • Job: Homogeneous Refinement
        • Job: Heterogeneous Refinement
        • Job: Non-Uniform Refinement
        • Job: Homogeneous Reconstruction Only
        • Job: Heterogeneous Reconstruction Only
        • Job: Homogeneous Refinement (Legacy)
        • Job: Non-uniform Refinement (Legacy)
      • CTF Refinement
        • Job: Global CTF Refinement
        • Job: Local CTF Refinement
        • Job: Exposure Group Utilities
      • Conformational Variability
        • Job: 3D Variability
        • Job: 3D Variability Display
        • Job: 3D Classification
        • Job: Regroup 3D Classes
        • Job: Reference Based Auto Select 3D (BETA)
        • Job: 3D Flexible Refinement (3DFlex) (BETA)
      • Postprocessing
        • Job: Sharpening Tools
        • Job: DeepEMhancer (Wrapper)
        • Job: Validation (FSC)
        • Job: Local Resolution Estimation
        • Job: Local Filtering
        • Job: ResLog Analysis
        • Job: ThreeDFSC (Wrapper) (Legacy)
      • Local Refinement
        • Job: Local Refinement
        • Job: Particle Subtraction
        • Job: Local Refinement (Legacy)
      • Helical Reconstruction
        • Helical symmetry in CryoSPARC
        • Job: Helical Refinement
        • Job: Symmetry search utility
        • Job: Average Power Spectra
      • Utilities
        • Job: Exposure Sets Tool
        • Job: Exposure Tools
        • Job: Generate Micrograph Thumbnails
        • Job: Cache Particles on SSD
        • Job: Check for Corrupt Particles
        • Job: Particle Sets Tool
        • Job: Reassign Particles to Micrographs
        • Job: Remove Duplicate Particles
        • Job: Symmetry Expansion
        • Job: Volume Tools
        • Job: Volume Alignment Tools
        • Job: Align 3D maps
        • Job: Split Volumes Group
        • Job: Orientation Diagnostics
      • Simulations
        • Job: Simulate Data (GPU)
        • Job: Simulate Data (Legacy)
    • CryoSPARC Tools
    • Data Processing Tutorials
      • Case study: End-to-end processing of a ligand-bound GPCR (EMPIAR-10853)
      • Case Study: DkTx-bound TRPV1 (EMPIAR-10059)
      • Case Study: Pseudosymmetry in TRPV5 and Calmodulin (EMPIAR-10256)
      • Case Study: End-to-end processing of an inactive GPCR (EMPIAR-10668)
      • Case Study: End-to-end processing of encapsulated ferritin (EMPIAR-10716)
      • Case Study: Exploratory data processing by Oliver Clarke
      • Tutorial: Tips for Membrane Protein Structures
      • Tutorial: Common CryoSPARC Plots
      • Tutorial: Negative Stain Data
      • Tutorial: Phase Plate Data
      • Tutorial: EER File Support
      • Tutorial: EPU AFIS Beam Shift Import
      • Tutorial: Patch Motion and Patch CTF
      • Tutorial: Float16 Support
      • Tutorial: Particle Picking Calibration
      • Tutorial: Blob Picker Tuner
      • Tutorial: Helical Processing using EMPIAR-10031 (MAVS)
      • Tutorial: Maximum Box Sizes for Refinement
      • Tutorial: CTF Refinement
      • Tutorial: Ewald Sphere Correction
      • Tutorial: Symmetry Relaxation
      • Tutorial: Orientation Diagnostics
      • Tutorial: BILD files in CryoSPARC v4.4+
      • Tutorial: Mask Creation
      • Case Study: Yeast U4/U6.U5 tri-snRNP
      • Tutorial: 3D Classification
      • Tutorial: 3D Variability Analysis (Part One)
      • Tutorial: 3D Variability Analysis (Part Two)
      • Tutorial: 3D Flexible Refinement
        • Installing 3DFlex Dependencies (v4.1–v4.3)
      • Tutorial: 3D Flex Mesh Preparation
    • Webinar Recordings
  • Real-time processing in cryoSPARC Live
    • About CryoSPARC Live
    • Prerequisites and Compute Resources Setup
    • How to Access cryoSPARC Live
    • UI Overview
    • New Live Session: Start to Finish Guide
    • CryoSPARC Live Tutorial Videos
    • Live Jobs and Session-Level Functions
    • Performance Metrics
    • Managing a CryoSPARC Live Session from the CLI
    • FAQs and Troubleshooting
  • Guides for v3
    • v3 User Interface Guide
      • Dashboard
      • Project and Workspace Management
      • Create and Build Jobs
      • Queue Job, Inspect Job and Other Job Actions
      • View and Download Results
      • Job Relationships
      • Resource Manager
      • User Management
    • Tutorial: Job Builder
    • Get Started with CryoSPARC: Introductory Tutorial (v3)
    • Tutorial: Manually Curate Exposures (v3)
  • Resources
    • Questions and Support
Powered by GitBook
On this page
  • Pre-Processing
  • Particle Picking
  • Particle Curation
  • Reconstruction & Refinement
  • Heterogeneity Analysis
  • Common Failure Modes
  • Useful Discussion Threads
  • Citations
  1. Processing Data in cryoSPARC
  2. Data Processing Tutorials

Tutorial: Tips for Membrane Protein Structures

Helpful hints for processing cryo-EM data of membrane proteins.

PreviousCase Study: Exploratory data processing by Oliver ClarkeNextTutorial: Common CryoSPARC Plots

Last updated 2 years ago

Membrane proteins are an increasingly important class of targets for cryo-EM in academia and industry. These targets are often small (<100kDA in molecular weight), flexible, and have a large micelle in the transmembrane region. Here, we list a few suggested tips for working with these targets in CryoSPARC, sorted by the different stages of processing.

Pre-Processing

We find that per-particle CTF refinements (post-3D refinement) rarely improve final structures due to the low amount of signal present per-particle in the micrograph for small membrane targets. Nevertheless, per-particle CTF refinement may be useful to try once a sufficiently detailed structure is refined.

Particle Picking

Particle picking can be one of the most challenging parts of working with membrane proteins.

For crowded micrographs, the following two parameters can substantially affect picking performance:

  • Particle diameter

  • Min. separation dist

    • Reducing this for crowded datasets may help pick out more true particles.

Other potentially useful picking jobs:

We suggest using an extraction box size that is approximately 2-3 times larger than the particle diameter.

To account for signal displacement caused by the CTF, Rosenthal and Henderson (2003) suggest a box size of:

D+2R=D+2(λΔF/d),D + 2 R = D + 2 (\lambda \Delta F / d),D+2R=D+2(λΔF/d),

where DDD is the diameter of the particle, λ\lambdaλ is the electron wavelength, ΔF\Delta FΔF is the defocus value and ddd is the resolution. Note that the radius of displacement, RRR, is not a function of the particle diameter, and therefore this formula may result in a box size that is 4-5 times larger than the particle diameter when DDD is relatively small (as is typically the case for membrane proteins).

Many of the CryoSPARC algorithms (e.g., 2D classification, ab-initio), however, are tuned for particle images with a box size that is 2-3 times larger than the extent of the particle. Furthermore, substantial computational savings can be achieved by using a smaller box size at early stages of processing where there are potentially many particles (millions). With small membrane proteins at reasonable concentrations, it is common to have many particles (several hundred) per micrograph and therefore very large particle sets in initial classification.

To address this, a suggested pipeline is:

  1. Extract particles with smaller box size (1.5X - 2X particle extent),

  2. Perform multiple rounds of 2D classification, ab-initio, 3D classification, and initial (heterogeneous) refinements, selecting the best particles to carry forward

  3. Re-extract surviving particles with a larger box size (2X - 3X particle extent) to reasonably account for all the information spread due to the CTF, and finally

  4. Perform high resolution refinement(s).

Particle Curation

During 2D classification, a number of parameter changes can help improve performance for membrane targets:

  • Force Max over poses/shifts

    • By turning this off, 2D classification will automatically marginalize over the poses and shifts of each particle. For small particles, the uncertainty over poses and shifts can be substantial, and account for this through marginalization over these unknowns can be beneficial. Marginalization will add computational cost, but can help improve classification results in general when SNR is low. When this option is used, 2D classes will appear more “radially blurred” with less streaky or noisy artefacts towards the periphery.

  • Number of iterations

    • Increasing the default value of 20 may help improve classes.

  • Batch size

    • Empirically, users have found that doubling the initial value to 400 is sometimes beneficial.

  • Circular mask diameter

    • This can help account for crowding by masking out any information outside of a circular region in each particle image. For small particles with a lot of crowding, this can be necessary to ensure classification is based on view/conformation rather than arrangement of neighbours.

Reconstruction & Refinement

A note about masks

In general, we find that particle subtraction can only help in very specific situations. Namely, if your structure contains two very rigid subunits, one large and one small. In this case, subtracting the larger subunit can improve the resolution of the smaller unit, if particle alignments are sufficiently well resolved for this subtraction to accurately remove the larger subunit signal.

  • Initial / maximum resolution

    • For smaller membrane proteins, it is often useful to set the initial and maximum resolutions to smaller numerical values (e.g., 9Å and 7Å). This is because smaller particles appear as featureless blobs at lower resolutions and there will not be enough information to align particles and recover the structure.

Non-uniform refinement can significantly improve refinements for targets that contain micelles and for smaller proteins. Consider the following two modifications if refinement results are poor:

  • Initial lowpass

    • Empirically, increasing this resolution (e.g., to a lower numerical value such as 15Å) may improve results for smaller targets.

  • Static masking

    • For small, low-SNR particles, dynamic masking may perform poorly. Instead, supplying a soft, static mask may improve the final refinement.

Local refinement can also be quite useful for membrane targets. Note that local refinement masks must be softly padded, especially when cutting into density (even a micelle). A few salient parameters to consider:

  • Rotation/Shift search extent

    • When using smaller masks, tighter orientation search extents generally produce better results.

  • Marginalization

    • (Default on) Marginalization over poses and shifts can greatly improve alignments for smaller targets.

  • Non-uniform refine enable

    • (Default on) Non-uniform refinement can help account for disordered regions (such as micelles and flexible/floppy appendages).

  • Rotation/Shift gaussian prior widths

    • In cases of small molecules, small masks, or poor SNR, local refinement may benefit from the introduction of gaussian priors around each particle's initial orientation parameters. The utility of these priors is commensurate with the quality of the initial alignments.

Heterogeneity Analysis

CryoSPARC includes a wide assortment of tools for assessing and separating heterogeneous datasets. For high-resolution refinement of any protein, it is critical to ensure that the dataset is as homogeneous as possible; this often entails both particle curation (junk removal) and pruning of heterogeneity. In addition to 2D Classification and Ab-Initio Reconstruction, several other job types for heterogeneity analysis are highlighted below, along with important parameters to consider.

  • Force hard classification

    • Hard classification can improve results for low-SNR particles, especially when the target contains a static (well-resolved) domain connected to a flexible/heterogeneous domain (such as a micelle).

  • Force hard classification

    • Similar to Heterogeneous Refinement, force hard classification can help isolate regions of heterogeneity.

  • RMS convergence criterion

    • For low-SNR particles, the standard class switching criterion may lead to more F-EM iterations than necessary and cause processing to take longer. Consider turning this secondary criterion on to save computational cost.

3D Variability (3DVA) analysis can be an especially important tool for heterogeneity analysis of small membrane targets. The 3DVA publication includes results on the Cannabinoid Receptor 1-G GPCR, which show that 3DVA can resolve two different bending motions of the 53kDa transmembrane region of the protein.

When running 3DVA, be sure to supply a soft solvent mask to ensure that the job does not resolve variation due to the micelle. It is often advantageous to use a mask that excludes the micelle, nanodisc, or other disordered regions, in order to force the algorithm to focus only on variability within the ordered region.

Common Failure Modes

  • "Spiky" densities like the one shown below are often a sign that there are many junk particles in the dataset — this can be especially prevalent in membrane protein datasets where particle picking is difficult. In these cases, it is often helpful to further “purify” the dataset, by either:

    • performing additional 2D classification rounds, or

    • running ab-initio reconstruction with multiple classes, then using the resulting volumes (including junk classes) to initialize heterogeneous refinement or 3D classification jobs and processing all the particles. Particles that fall into intact classes where the protein density is strong can be used for further refinements and particles falling into other classes can be discarded. This “junk-sorting” in 3D can often separate junk particles more effectively than 2D classification.

Useful Discussion Threads

Citations

Kumar, Kaavya Krishna, et al. "Structure of a signaling cannabinoid receptor 1-G protein complex." Cell 176.3 (2019): 448-458

Maloney, Finn P., et al. "Structure, substrate recognition and initiation of hyaluronan synthase." Nature 604.7904 (2022): 195-201.

Rosenthal, Peter B., and Richard Henderson. "Optimal determination of particle orientation, absolute hand, and contrast loss in single-particle electron cryomicroscopy." Journal of Molecular Biology 333.4 (2003): 721-745.

Generally, pre-processing steps remain unchanged from other nominal cryoEM pipelines—namely, we recommend the use of the and jobs with no salient modifications to the parameters.

/

Neural-network-based particle picking techniques such as or can be useful when a large portion of particles are difficult to identify visually

can also be quite useful for crowded micrographs. Be sure to choose approximately 100 manual picks, focusing on picks that are ‘clumped together’ and originating from micrographs that span a wide range of defocus values.

First and foremost, all masks applied during 2D-to-3D processing should be smooth (i.e., contain no sudden 'cliffs' where the mask drops from a value near 1 to a value near 0) to avoid ringing effects. This is because sharp masks, when applied to half-maps during refinement jobs, can increase the likelihood of overfitting by introducing artifactual signal that is common to both half-maps. If you are generating masks using Chimera(X) (e.g., by following our ), be sure to use the job to add a sufficient soft padding width. As noted by the , a useful rule of thumb is to keep the mask padding width proportional to the achieved resolution in Angstroms. As long as the soft padding width is sufficient, and the mask covers the desired region of structure (while "cutting" through minimal density), the threshold value and dilation radius may be set as needed in order to generate a mask of the desired size.

Furthermore, it is especially important for membrane target masks to not be overly 'tight' to the structure. For such small proteins, a tight mask can more easily lead to a situation where a refinement 'overfits' to junk/noise (cf. ). In general, avoid creating a mask that is similar in shape to the secondary structure of the protein, and err on the side of loose (but nevertheless smooth) masks for all processing.

We strongly recommend avoiding the subtraction of micelles -- these structures are generally disordered, and it is very difficult to subtract them from particle images without removing other useful signal. Instead, consider the use of with non-uniform refinement and marginalization turned on.

Patch Motion Correction
Patch CTF
Blob Picker
Template Picker
Topaz
Deep Particle Picker
Blob Picker Tuner
Extract from Micrographs
2D classification
Particle subtraction
local refinement
Ab initio
Non-uniform refinement
Local refinement
Heterogeneous Refinement
3D Classification
3D Variability
https://discuss.cryosparc.com/t/cannot-align-small-protein-complex-particles/4674/14
https://discuss.cryosparc.com/t/ab-initio-yield-better-map-than-nu-refinment-on-a-small-membrane-protein-complex/6778/2
https://discuss.cryosparc.com/t/strange-reconstruction-failures-in-heterogeneous-refinement/8802/6
Common Failure Modes
tutorial
Volume Tools
mask generation tutorial
One type of membrane protein: the Cannabinoid Receptor 1-G GPCR complex, (Kumar et al., 2019). Data from EMPIAR-10288. Density shown at two different thresholds to illustrate the micelle regions.
A ’spiky’ hyaluronan synthase (the same density shown at two different thresholds for clarity) resolved from one class of a 3D classification job. Data from EMPIAR-11030 (Maloney et al., 2022).