Accurately placing macromolecular assemblies in the cellular context is important in understanding their mechanistic role inside the cell. Previously, we developed a 2D template-matching (2DTM) approach (Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]; Lucas et al., 2021[Lucas, B. A., Himes, B. A., Xue, L., Grant, T., Mahamid, J. & Grigorieff, N. (2021). eLife, 10, e68946.]) in cisTEM (Grant et al., 2018[Grant, T., Rohou, A. & Grigorieff, N. (2018). eLife, 7, e35383.]) to detect targets in cellular cryo-EM images with high positional and orientational accuracy. 2DTM not only detects targets such as ribosomes in cryo-EM images but also provides data that enable the in situ classification and high-resolution reconstruction of these targets (Lucas et al., 2022[Lucas, B. A., Zhang, K., Loerch, S. & Grigorieff, N. (2022). eLife, 11, e79272.], 2023[Lucas, B. A., Himes, B. A. & Grigorieff, N. (2023). eLife, 12, 12RP90486.]; Elferich et al., 2022[Elferich, J., Schiroli, G., Scadden, D. T. & Grigorieff, N. (2022). eLife, 11, e80980.]). Building on these successes, this work aims to improve the 2DTM framework to detect more challenging targets in various environments.
A 2DTM search yields a signal-to-noise ratio (SNR) for every location in the cryo-EM image that depends on the cross-correlation between the template and the image (Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]). A target is detected when the SNR value exceeds a statistically defined threshold that limits the average false positives to one per image, based on the assumption that the cryo-EM image is dominated by noise and cellular background and that the cross-correlation values observed across the image after whitening the noise/background follow a Gaussian distribution. The 2DTM SNR can be further normalized by subtracting the mean and dividing by the standard deviation of cross-correlations calculated across all sampled orientations at each location in the image (Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]). This step is often referred to as `z-score' normalization (Spiegel & Stephens, 1999[Spiegel, M. R. & Stephens, L. J. (1999). Schaum's Outline of Theory and Problems of Statistics, 3rd ed. New York: McGraw-Hill.]). Using the z-score instead of the SNR improves the detection of capsomers in rotavirus double-layered particles (DLPs; Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]) and ribosomes in a crowded cellular environment (Lucas et al., 2022[Lucas, B. A., Zhang, K., Loerch, S. & Grigorieff, N. (2022). eLife, 11, e79272.]). In the following, we will refer to the outputs of 2DTM as the 2DTM SNR and 2DTM z-score, respectively.
Previous applications of 2DTM have shown that the 2DTM SNR and z-score function differently depending on the characteristics of the sample and target. For example, when low-resolution features were suppressed by using a near-focus image setting (70 nm), the 2DTM SNR map showed a flat background with sharp peaks indicating the locations of apoferritins, even in a dense protein (bovine serum albumin) background (Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]). On the other hand, low-resolution features from the target itself when strongly defocused (>2000 nm), or from the background structural noise, can result in broader peaks or an uneven background in the SNR map, complicating target detection (Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]; Lucas et al., 2022[Lucas, B. A., Zhang, K., Loerch, S. & Grigorieff, N. (2022). eLife, 11, e79272.]). The misleading low-resolution background can be suppressed by calculating the 2DTM z-score (Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]), which removes spurious correlations between the template and the structural noise in the image, thereby flattening the background and improving the detectability of targets in cellular environments (Rickgauer et al., 2020[Rickgauer, J. P., Choi, H., Lippincott-Schwartz, J. & Denk, W. (2020). bioRxiv, 2020.04.22.053868.]; Lucas et al., 2022[Lucas, B. A., Zhang, K., Loerch, S. & Grigorieff, N. (2022). eLife, 11, e79272.]). In Fig. 1[link](a), a segment of a previously published micrograph of a yeast lamella near the nucleus is presented (Lucas et al., 2022[Lucas, B. A., Zhang, K., Loerch, S. & Grigorieff, N. (2022). eLife, 11, e79272.]). This image section contains various cellular compartments located from left to right, including the vacuole, cytoplasm and nucleus. Using the mature 60S as a search template, 2DTM outputs a 2DTM SNR map and a 2DTM z-score map [Figs. 1[link](b) and 1[link](c)]. The bright spots in the 2DTM SNR map are locations with high correlation values, indicating 60S ribosomes. However, the peaks are surrounded by halos of increased SNR values extending to other low-resolution features in the image, such as membranes. The z-score map removes these halos and spurious matches of high-contrast features, thereby reducing the number of false detections (membranes or partial overlap with ribosomes) while preserving locations with high-resolution matches from the riboso