CryoSPARC Guide
  • About CryoSPARC
  • Current Version
  • Licensing
    • Non-commercial license agreement
  • Setup, Configuration and Management
    • CryoSPARC Architecture and System Requirements
    • CryoSPARC Installation Prerequisites
    • How to Download, Install and Configure
      • Obtaining A License ID
      • Downloading and Installing CryoSPARC
      • CryoSPARC Cluster Integration Script Examples
      • Accessing the CryoSPARC User Interface
    • Deploying CryoSPARC on AWS
      • Performance Benchmarks
    • Using CryoSPARC with Cluster Management Software
    • Software Updates and Patches
    • Management and Monitoring
      • Environment variables
      • (Optional) Hosting CryoSPARC Through a Reverse Proxy
      • cryosparcm reference
      • cryosparcm cli reference
      • cryosparcw reference
    • Software System Guides
      • Guide: Updating to CryoSPARC v4
      • Guide: Installation Testing with cryosparcm test
      • Guide: Verify CryoSPARC Installation with the Extensive Validation Job (v4.3+)
      • Guide: Verify CryoSPARC Installation with the Extensive Workflow (≤v4.2)
      • Guide: Performance Benchmarking (v4.3+)
      • Guide: Download Error Reports
      • Guide: Maintenance Mode and Configurable User Facing Messages
      • Guide: User Management
      • Guide: Multi-user Unix Permissions and Data Access Control
      • Guide: Lane Assignments and Restrictions
      • Guide: Queuing Directly to a GPU
      • Guide: Priority Job Queuing
      • Guide: Configuring Custom Variables for Cluster Job Submission Scripts
      • Guide: SSD Particle Caching in CryoSPARC
      • Guide: Data Management in CryoSPARC (v4.0+)
      • Guide: Data Cleanup (v4.3+)
      • Guide: Reduce Database Size (v4.3+)
      • Guide: Data Management in CryoSPARC (≤v3.3)
      • Guide: CryoSPARC Live Session Data Management
      • Guide: Manipulating .cs Files Created By CryoSPARC
      • Guide: Migrating your CryoSPARC Instance
      • Guide: EMDB-friendly XML file for FSC plots
    • Troubleshooting
  • Application Guide (v4.0+)
    • A Tour of the CryoSPARC Interface
    • Browsing the CryoSPARC Instance
    • Projects, Workspaces and Live Sessions
    • Jobs
    • Job Views: Cards, Tree, and Table
    • Creating and Running Jobs
    • Low Level Results Interface
    • Filters and Sorting
    • View Options
    • Tags
    • Flat vs Hierarchical Navigation
    • File Browser
    • Blueprints
    • Workflows
    • Inspecting Data
    • Managing Jobs
    • Interactive Jobs
    • Upload Local Files
    • Managing Data
    • Downloading and Exporting Data
    • Instance Management
    • Admin Panel
  • Cryo-EM Foundations
    • Image Formation
      • Contrast in Cryo-EM
      • Waves as Vectors
      • Aliasing
  • Expectation Maximization in Cryo-EM
  • Processing Data in cryoSPARC
    • Get Started with CryoSPARC: Introductory Tutorial (v4.0+)
    • Tutorial Videos
    • All Job Types in CryoSPARC
      • Import
        • Job: Import Movies
        • Job: Import Micrographs
        • Job: Import Particle Stack
        • Job: Import 3D Volumes
        • Job: Import Templates
        • Job: Import Result Group
        • Job: Import Beam Shift
      • Motion Correction
        • Job: Patch Motion Correction
        • Job: Full-Frame Motion Correction
        • Job: Local Motion Correction
        • Job: MotionCor2 (Wrapper) (BETA)
        • Job: Reference Based Motion Correction (BETA)
      • CTF Estimation
        • Job: Patch CTF Estimation
        • Job: Patch CTF Extraction
        • Job: CTFFIND4 (Wrapper)
        • Job: Gctf (Wrapper) (Legacy)
      • Exposure Curation
        • Job: Micrograph Denoiser (BETA)
        • Job: Micrograph Junk Detector (BETA)
        • Interactive Job: Manually Curate Exposures
      • Particle Picking
        • Interactive Job: Manual Picker
        • Job: Blob Picker
        • Job: Template Picker
        • Job: Filament Tracer
        • Job: Blob Picker Tuner
        • Interactive Job: Inspect Particle Picks
        • Job: Create Templates
      • Extraction
        • Job: Extract from Micrographs
        • Job: Downsample Particles
        • Job: Restack Particles
      • Deep Picking
        • Guideline for Supervised Particle Picking using Deep Learning Models
        • Deep Network Particle Picker
          • T20S Proteasome: Deep Particle Picking Tutorial
          • Job: Deep Picker Train and Job: Deep Picker Inference
        • Topaz (Bepler, et al)
          • T20S Proteasome: Topaz Particle Picking Tutorial
          • T20S Proteasome: Topaz Micrograph Denoising Tutorial
          • Job: Topaz Train and Job: Topaz Cross Validation
          • Job: Topaz Extract
          • Job: Topaz Denoise
      • Particle Curation
        • Job: 2D Classification
        • Interactive Job: Select 2D Classes
        • Job: Reference Based Auto Select 2D (BETA)
        • Job: Reconstruct 2D Classes
        • Job: Rebalance 2D Classes
        • Job: Class Probability Filter (Legacy)
        • Job: Rebalance Orientations
        • Job: Subset Particles by Statistic
      • 3D Reconstruction
        • Job: Ab-Initio Reconstruction
      • 3D Refinement
        • Job: Homogeneous Refinement
        • Job: Heterogeneous Refinement
        • Job: Non-Uniform Refinement
        • Job: Homogeneous Reconstruction Only
        • Job: Heterogeneous Reconstruction Only
        • Job: Homogeneous Refinement (Legacy)
        • Job: Non-uniform Refinement (Legacy)
      • CTF Refinement
        • Job: Global CTF Refinement
        • Job: Local CTF Refinement
        • Job: Exposure Group Utilities
      • Conformational Variability
        • Job: 3D Variability
        • Job: 3D Variability Display
        • Job: 3D Classification
        • Job: Regroup 3D Classes
        • Job: Reference Based Auto Select 3D (BETA)
        • Job: 3D Flexible Refinement (3DFlex) (BETA)
      • Postprocessing
        • Job: Sharpening Tools
        • Job: DeepEMhancer (Wrapper)
        • Job: Validation (FSC)
        • Job: Local Resolution Estimation
        • Job: Local Filtering
        • Job: ResLog Analysis
        • Job: ThreeDFSC (Wrapper) (Legacy)
      • Local Refinement
        • Job: Local Refinement
        • Job: Particle Subtraction
        • Job: Local Refinement (Legacy)
      • Helical Reconstruction
        • Helical symmetry in CryoSPARC
        • Job: Helical Refinement
        • Job: Symmetry search utility
        • Job: Average Power Spectra
      • Utilities
        • Job: Exposure Sets Tool
        • Job: Exposure Tools
        • Job: Generate Micrograph Thumbnails
        • Job: Cache Particles on SSD
        • Job: Check for Corrupt Particles
        • Job: Particle Sets Tool
        • Job: Reassign Particles to Micrographs
        • Job: Remove Duplicate Particles
        • Job: Symmetry Expansion
        • Job: Volume Tools
        • Job: Volume Alignment Tools
        • Job: Align 3D maps
        • Job: Split Volumes Group
        • Job: Orientation Diagnostics
      • Simulations
        • Job: Simulate Data (GPU)
        • Job: Simulate Data (Legacy)
    • CryoSPARC Tools
    • Data Processing Tutorials
      • Case study: End-to-end processing of a ligand-bound GPCR (EMPIAR-10853)
      • Case Study: DkTx-bound TRPV1 (EMPIAR-10059)
      • Case Study: Pseudosymmetry in TRPV5 and Calmodulin (EMPIAR-10256)
      • Case Study: End-to-end processing of an inactive GPCR (EMPIAR-10668)
      • Case Study: End-to-end processing of encapsulated ferritin (EMPIAR-10716)
      • Case Study: Exploratory data processing by Oliver Clarke
      • Tutorial: Tips for Membrane Protein Structures
      • Tutorial: Common CryoSPARC Plots
      • Tutorial: Negative Stain Data
      • Tutorial: Phase Plate Data
      • Tutorial: EER File Support
      • Tutorial: EPU AFIS Beam Shift Import
      • Tutorial: Patch Motion and Patch CTF
      • Tutorial: Float16 Support
      • Tutorial: Particle Picking Calibration
      • Tutorial: Blob Picker Tuner
      • Tutorial: Helical Processing using EMPIAR-10031 (MAVS)
      • Tutorial: Maximum Box Sizes for Refinement
      • Tutorial: CTF Refinement
      • Tutorial: Ewald Sphere Correction
      • Tutorial: Symmetry Relaxation
      • Tutorial: Orientation Diagnostics
      • Tutorial: BILD files in CryoSPARC v4.4+
      • Tutorial: Mask Creation
      • Case Study: Yeast U4/U6.U5 tri-snRNP
      • Tutorial: 3D Classification
      • Tutorial: 3D Variability Analysis (Part One)
      • Tutorial: 3D Variability Analysis (Part Two)
      • Tutorial: 3D Flexible Refinement
        • Installing 3DFlex Dependencies (v4.1–v4.3)
      • Tutorial: 3D Flex Mesh Preparation
    • Webinar Recordings
  • Real-time processing in cryoSPARC Live
    • About CryoSPARC Live
    • Prerequisites and Compute Resources Setup
    • How to Access cryoSPARC Live
    • UI Overview
    • New Live Session: Start to Finish Guide
    • CryoSPARC Live Tutorial Videos
    • Live Jobs and Session-Level Functions
    • Performance Metrics
    • Managing a CryoSPARC Live Session from the CLI
    • FAQs and Troubleshooting
  • Guides for v3
    • v3 User Interface Guide
      • Dashboard
      • Project and Workspace Management
      • Create and Build Jobs
      • Queue Job, Inspect Job and Other Job Actions
      • View and Download Results
      • Job Relationships
      • Resource Manager
      • User Management
    • Tutorial: Job Builder
    • Get Started with CryoSPARC: Introductory Tutorial (v3)
    • Tutorial: Manually Curate Exposures (v3)
  • Resources
    • Questions and Support
Powered by GitBook
On this page
  • At a Glance
  • Description
  • Inputs
  • Particles
  • 2D Class Averages
  • Commonly Adjusted Parameters
  • Rebalance factor
  • Number of superclasses or templates (integer)
  • Override maximum superclass size
  • Outputs
  • Particles selected
  • Templates
  • Particles excluded
  • Plots
  • Common Next Steps
  • Recommended Alternatives
  • References
  1. Processing Data in cryoSPARC
  2. All Job Types in CryoSPARC
  3. Particle Curation

Job: Rebalance 2D Classes

PreviousJob: Reconstruct 2D ClassesNextJob: Class Probability Filter (Legacy)

Last updated 1 month ago

At a Glance

Group 2D class averages into superclusters and, optionally, balance the number of particles in each supercluster.

Description

Inputs

Particles

Particles must have been 2D Classified, and should come from the same job as the 2D Class Averages.

2D Class Averages

Particles and 2D class averages should come from the same job. Note that Rebalance 2D Classes analyzes the class averages and not the particles themselves, so results will be better with higher-quality (i.e., less noisy) class averages.

Commonly Adjusted Parameters

Rebalance factor

Particles will be dropped from each superclass such that the smallest class is, at smallest, this fraction of the largest class. A Rebalance factor of 0.0 does not discard any particles. A Rebalance factor of 1.0 randomly discards particles from all classes (except the smallest class) until all classes are the same size.

Number of superclasses or templates (integer)

This should optimally be set to the number of unique views in the 2D class averages. This number is typically not known precisely, and so some experimentation is often necessary. Note that if Do superclassification is turned off this number must equal the number of 2D class averages in the input.

Override maximum superclass size

Provided the Rebalance factor is not 0.0, this parameter will determine the maximum superclass size rather than the Rebalance factor. Setting this parameter to some integer N is functionally equivalent to setting the Rebalance factor to $n/N$, where $n$ is the number of particles in the smallest class.

Outputs

Particles selected

Particles remaining in the dataset after rebalancing classes according to the Rebalance factor (or Override maximum superclass size).

Templates

The templates are unchanged from the input.

Particles excluded

Particles excluded from the dataset after rebalancing classes according to Rebalance factor (or Override maximum superclass size).

Plots

Rebalance 2D Classes creates an Affinity Matrix which displays how similar 2D classes are to each other. It is this affinity that is used to group the class averages into the requested number of superclasses.

Say we start with ten class averages and we want to group them into two superclasses.

First, we calculate the affinity of the classes for each other. The affinity is a measure of how similar the two classes look, and varies from 0.0 (not similar at all) to 1.0 (identical).

We can map the pairwise affinities on a matrix, where the row and column represent a specific class, and each cell is colored by the similarity between the class in its row and the class in its column.

If we rearrange the rows and columns such that classes with a high affinity for each other are adjacent, we can easily see the superclasses as square structure in the matrix. This structure arises naturally in a well-clustered matrix, since a group of rows and columns all have high affinity for each other and low affinity for other classes.

If the matrix does not have a clear pattern of squares for each superclass, or if the superclasses have members which “project” darker colors in their row and column, it may be that a different number of superclasses is needed.

Common Next Steps

This job is most useful as a diagnostic to assess distribution of particles among views before moving to 3D, and often the outputs are not directly used in following jobs.

The improved map may be useful for downstream tasks or for repeating particle picking, if the underrepresented views are present but not being picked. If, however, underrepresented views are simply not present in the micrograph it is unlikely that this technique (or any other) will recover an isotropic map.

Recommended Alternatives

References

  1. Wong, Wilson, et al. "Cryo-EM structure of the Plasmodium falciparum 80S ribosome bound to the anti-protozoan drug emetine." Elife 3 (2014): e03080.

  2. Tan, Y. Z. et al. Addressing preferred specimen orientation in single-particle cryo-EM through tilting. Nature Methods 14, 793–796 (2017).

  3. Campbell, Melody G., et al. "2.8 Å resolution reconstruction of the Thermoplasma acidophilum 20S proteasome using cryo-electron microscopy." Elife 4 (2015): e06380.

Rebalance 2D Classes analyses the 2D class averages produced by jobs to produce superclusters of similar class averages. Optionally, particles can be randomly excluded from the more populated superclasses to balance the number of particles in each superclass.

In some rare cases, rebalancing particles among views can improve initial results of in the case of severe orientation bias. If your Ab initio reconstruction shows evidence of severe bias (such as a flat map or a map with severe streaking), setting the Rebalance factor relatively high (e.g., 0.8) can improve results slightly.

If a 3D refinement of the particles exists, will provide quantitative description of orientation bias that may or may not exist in the particles, and whether or not that bias results in a significantly anisotropic map.

Similarly, if a 3D refinement of the particles exists, will directly rebalance the particles based on viewing direction rather than by 2D Class.

2D Classification
Ab initio reconstruction
Orientation Diagnostics
Rebalance Orientations
Class averages produced from EMPIAR 10028 data (Wong et al. 2014).
Class averages from EMPIAR 10096 (Tan et al. 2017)
Example Ab Initio maps produced from EMPIAR 10025 (Campbell et al. 2015)