689 Publications

InstaMap: instant-NGP for cryo-EM density maps

Geoffrey Woollard, P. Cossio, S. Hanson, et al.

Despite the parallels between problems in computer vision and cryo-electron microscopy (cryo-EM), many state-of-the-art approaches from computer vision have yet to be adapted for cryo-EM. Within the computer-vision research community, implicits such as neural radiance fields (NeRFs) have enabled the detailed reconstruction of 3D objects from few images at different camera-viewing angles. While other neural implicits, specifically density fields, have been used to map conformational heterogeneity from noisy cryo-EM projection images, most approaches represent volume with an implicit function in Fourier space, which has disadvantages compared with solving the problem in real space, complicating, for instance, masking, constraining physics or geometry, and assessing local resolution. In this work, we build on a recent development in neural implicits, a multi-resolution hash-encoding framework called instant-NGP, that we use to represent the scalar volume directly in real space and apply it to the cryo-EM density-map reconstruction problem (InstaMap). We demonstrate that for both synthetic and real data, InstaMap for homogeneous reconstruction achieves higher resolution at shorter training stages than five other real-spaced representations. We propose a solution to noise overfitting, demonstrate that InstaMap is both lightweight and fast to train, implement masking from a user-provided input mask and extend it to molecular-shape heterogeneity via bending space using a per-image vector field.

Show Abstract

Simulation-based inference of single-molecule experiments

Lars Dingeldein, P. Cossio, Roberto Covino

Single-molecule experiments are a unique tool to characterize the structural dynamics of biomolecules. However, reconstructing molecular details from noisy single-molecule data is challenging. Simulation-based inference (SBI) integrates statistical inference, physics-based simulators, and machine learning and is emerging as a powerful framework for analysing complex experimental data. Recent advances in deep learning have accelerated the development of new SBI methods, enabling the application of Bayesian inference to an ever-increasing number of scientific problems. Here, we review the nascent application of SBI to the analysis of single-molecule experiments. We introduce parametric Bayesian inference and discuss its limitations. We then overview emerging deep-learning-based SBI methods to perform Bayesian inference for complex models encoded in computer simulators. We illustrate the first applications of SBI to single-molecule force-spectroscopy and cryo-electron microscopy experiments. SBI allows us to leverage powerful computer algorithms modeling complex biomolecular phenomena to connect scientific models and experiments in a principled way.

Show Abstract

Active Hydrodynamic Theory of Euchromatin and Heterochromatin

Alex Rautu, Alexandra Zidovska, David Saintillan, M. Shelley

The genome contains genetic information essential for cell's life. The genome's spatial organization inside the cell nucleus is critical for its proper function including gene regulation. The two major genomic compartments -- euchromatin and heterochromatin -- contain largely transcriptionally active and silenced genes, respectively, and exhibit distinct dynamics. In this work, we present a hydrodynamic framework that describes the large-scale behavior of euchromatin and heterochromatin, and accounts for the interplay of mechanical forces, active processes, and nuclear confinement. Our model shows contractile stresses from cross-linking proteins lead to the formation of heterochromatin droplets via mechanically driven phase separation. These droplets grow, coalesce, and in nuclear confinement, wet the boundary. Active processes, such as gene transcription in euchromatin, introduce non-equilibrium fluctuations that drive long-range, coherent motions of chromatin as well as the nucleoplasm, and thus alter the genome's spatial organization. These fluctuations also indirectly deform heterochromatin droplets, by continuously changing their shape. Taken together, our findings reveal how active forces, mechanical stresses and hydrodynamic flows contribute to the genome's organization at large scales and provide a physical framework for understanding chromatin organization and dynamics in live cells.

Show Abstract
March 26, 2025

Coarse-graining bacterial diffusion in disordered media to surface states

Bacterial motility in spatially structured environments impacts a variety of natural and engineering processes. Constructing models to predict, control, and design bacterial motility for these processes remains challenging because bacteria and active swimmers have complex interactions with surfaces and because the precise environment geometry is unknown. Here, we present a method for deriving bacterial diffusion coefficients in disordered media in terms of cell and environmental parameters. The approach abstracts the dynamics in the full geometry to “surface states,” which encode how cells interact with surfaces in the environment. Then, a long-time diffusion equation can be derived analytically from the state model. Applying this method to a run-and-tumble particle in a 2D Lorentz gas environment provides analytical predictions that show good agreement with particle simulations. Like past studies, we observe that the diffusivity depends nonmonotonically on the cell’s run length. Using the analytical expressions, we derive the optimal run length, revealing an intuitive dependence on environmental length scales. Furthermore, we find that rescaling length and time by the average distance and time between trap events collapses all of the diffusivities onto a single curve, which we derive analytically. Thus, our approach extracts interpretable, macroscopic diffusive behavior from complex microscopic dynamics, and provides tools and intuitions for understanding bacterial diffusion in disordered media.

Show Abstract

Recent Advances in Membrane Protein Simulations

James C. Gumbart, S. Hanson

imulating membrane proteins accurately combines two challenges into one: properly capturing the structure and dynamics of proteins as well as correctly representing the membrane environment in which they are usually embedded. Beginning with pioneering efforts in the 1980s and 1990s,1−7 both challenges have been met with increasing success over the years. Simulations of membrane proteins in realistic cellular contexts over many microseconds are now common.Concomitant advances in the determination of membrane protein structures, with over 50 unique structures determined 8 annually have further expanded the reach of simulations in this area. This Special Issue highlights a number of recent molecular dynamics (MD) simulations of membrane proteins and covers a wide range of applications and specialized techniques.

Show Abstract

PLUMED Tutorials: a collaborative, community-driven learning ecosystem

Gareth A. Tribello, Massimiliano Bonomi, P. Cossio, et al.

In computational physics, chemistry, and biology, the implementation of new techniques in shared and open-source software lowers barriers to entry and promotes rapid scientific progress. However, effectively training new software users presents several challenges. Common methods like direct knowledge transfer and in-person workshops are limited in reach and comprehensiveness. Furthermore, while the COVID-19 pandemic highlighted the benefits of online training, traditional online tutorials can quickly become outdated and may not cover all the software’s functionalities. To address these issues, here we introduce “PLUMED Tutorials,” a collaborative model for developing, sharing, and updating online tutorials. This initiative utilizes repository management and continuous integration to ensure compatibility with software updates. Moreover, the tutorials are interconnected to form a structured learning path and are enriched with automatic annotations to provide broader context. This paper illustrates the development, features, and advantages of PLUMED Tutorials, aiming to foster an open community for creating and sharing educational resources.

Show Abstract

Self-organized dynamics of a viscous drop with interfacial nematic activity

M. Firouznia , David Saintillan

We study emergent dynamics in a viscous drop subject to interfacial nematic activity. Using hydrodynamic simulations, we show how the interplay of nematodynamics, activity-driven flows in the fluid bulk, and surface deformations gives rise to a sequence of self-organized behaviors of increasing complexity, from periodic braiding motions of topological defects to chaotic defect dynamics and active turbulence, along with spontaneous shape changes and translation. Our findings recapitulate qualitative features of experiments and shed light on the mechanisms underpinning morphological dynamics in active interfaces.

Show Abstract

A model for boundary-driven tissue morphogenesis

Daniel S. Alber, Alexandre O. Jacinto, S. Shvartsman, et al.

Tissue deformations during morphogenesis can be active, driven by internal processes, or passive, resulting from stresses applied at their boundaries. Here, we introduce the Drosophila hindgut primordium as a model for studying boundary-driven tissue morphogenesis. We characterize its deformations and show that its complex shape changes can be a passive consequence of the deformations of the active regions of the embryo that surround it. First, we find an intermediate characteristic triangular shape in the 3D deformations of the hindgut. We construct a minimal model of the hindgut primordium as an elastic ring deformed by active midgut invagination and germ band extension on an ellipsoidal surface, which robustly captures the symmetry-breaking into this triangular shape. We then quantify the 3D kinematics of the tissue by a set of contours and discover that the hindgut deforms in two stages: an initial translation on the curved embryo surface followed by a rapid breaking of shape symmetry. We extend our model to show that the contour kinematics in both stages are consistent with our passive picture. Our results suggest that the role of in-plane deformations during hindgut morphogenesis is to translate the tissue to a region with anisotropic embryonic curvature and show that uniform boundary conditions are sufficient to generate the observed nonuniform shape change. Our work thus provides a possible explanation for the various characteristic shapes of blastopore-equivalents in different organisms and a framework for the mechanical emergence of global morphologies in complex developmental systems.

Show Abstract
March 5, 2025

A new statistical metric for robust target detection in cryo-EM using 2D template match

Kexin Zhang, P. Cossio, et al.

Accurately placing macromolecular assemblies in the cellular context is important in understanding their mechanistic role inside the cell. Previously, we developed a 2D template-matching (2DTM) approach (Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]; Lucas et al., 2021[Lucas, B. A., Himes, B. A., Xue, L., Grant, T., Mahamid, J. & Grigorieff, N. (2021). eLife, 10, e68946.]) in cisTEM (Grant et al., 2018[Grant, T., Rohou, A. & Grigorieff, N. (2018). eLife, 7, e35383.]) to detect targets in cellular cryo-EM images with high positional and orientational accuracy. 2DTM not only detects targets such as ribosomes in cryo-EM images but also provides data that enable the in situ classification and high-resolution reconstruction of these targets (Lucas et al., 2022[Lucas, B. A., Zhang, K., Loerch, S. & Grigorieff, N. (2022). eLife, 11, e79272.], 2023[Lucas, B. A., Himes, B. A. & Grigorieff, N. (2023). eLife, 12, 12RP90486.]; Elferich et al., 2022[Elferich, J., Schiroli, G., Scadden, D. T. & Grigorieff, N. (2022). eLife, 11, e80980.]). Building on these successes, this work aims to improve the 2DTM framework to detect more challenging targets in various environments.

A 2DTM search yields a signal-to-noise ratio (SNR) for every location in the cryo-EM image that depends on the cross-correlation between the template and the image (Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]). A target is detected when the SNR value exceeds a statistically defined threshold that limits the average false positives to one per image, based on the assumption that the cryo-EM image is dominated by noise and cellular background and that the cross-correlation values observed across the image after whitening the noise/background follow a Gaussian distribution. The 2DTM SNR can be further normalized by subtracting the mean and dividing by the standard deviation of cross-correlations calculated across all sampled orientations at each location in the image (Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]). This step is often referred to as `z-score' normalization (Spiegel & Stephens, 1999[Spiegel, M. R. & Stephens, L. J. (1999). Schaum's Outline of Theory and Problems of Statistics, 3rd ed. New York: McGraw-Hill.]). Using the z-score instead of the SNR improves the detection of capsomers in rotavirus double-layered particles (DLPs; Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]) and ribosomes in a crowded cellular environment (Lucas et al., 2022[Lucas, B. A., Zhang, K., Loerch, S. & Grigorieff, N. (2022). eLife, 11, e79272.]). In the following, we will refer to the outputs of 2DTM as the 2DTM SNR and 2DTM z-score, respectively.

Previous applications of 2DTM have shown that the 2DTM SNR and z-score function differently depending on the characteristics of the sample and target. For example, when low-resolution features were suppressed by using a near-focus image setting (70 nm), the 2DTM SNR map showed a flat background with sharp peaks indicating the locations of apoferritins, even in a dense protein (bovine serum albumin) background (Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]). On the other hand, low-resolution features from the target itself when strongly defocused (>2000 nm), or from the background structural noise, can result in broader peaks or an uneven background in the SNR map, complicating target detection (Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]; Lucas et al., 2022[Lucas, B. A., Zhang, K., Loerch, S. & Grigorieff, N. (2022). eLife, 11, e79272.]). The misleading low-resolution background can be suppressed by calculating the 2DTM z-score (Rickgauer et al., 2017[Rickgauer, J. P., Grigorieff, N. & Denk, W. (2017). eLife, 6, e25648.]), which removes spurious correlations between the template and the structural noise in the image, thereby flattening the background and improving the detectability of targets in cellular environments (Rickgauer et al., 2020[Rickgauer, J. P., Choi, H., Lippincott-Schwartz, J. & Denk, W. (2020). bioRxiv, 2020.04.22.053868.]; Lucas et al., 2022[Lucas, B. A., Zhang, K., Loerch, S. & Grigorieff, N. (2022). eLife, 11, e79272.]). In Fig. 1[link](a), a segment of a previously published micrograph of a yeast lamella near the nucleus is presented (Lucas et al., 2022[Lucas, B. A., Zhang, K., Loerch, S. & Grigorieff, N. (2022). eLife, 11, e79272.]). This image section contains various cellular compartments located from left to right, including the vacuole, cytoplasm and nucleus. Using the mature 60S as a search template, 2DTM outputs a 2DTM SNR map and a 2DTM z-score map [Figs. 1[link](b) and 1[link](c)]. The bright spots in the 2DTM SNR map are locations with high correlation values, indicating 60S ribosomes. However, the peaks are surrounded by halos of increased SNR values extending to other low-resolution features in the image, such as membranes. The z-score map removes these halos and spurious matches of high-contrast features, thereby reducing the number of false detections (membranes or partial overlap with ribosomes) while preserving locations with high-resolution matches from the riboso

Show Abstract
March 1, 2025

CyclicCAE: A Conformational Autoencoder for Efficient Heterochiral Macrocyclic Backbone Sampling

Andrew C. Powers, D. Renfrew, Parisa Hosseinzadeh, V. Mulligan

Macrocycles are a promising therapeutic class. The incorporation of heterochiral and non-natural chemical building-blocks presents challenges for rational design, however. With no existing machine learning methods tailored for heterochiral macrocycle design, we developed a novel convolutional autoencoder model to rapidly generate energetically favorable macrocycle backbones for heterochiral design and structure prediction. Our approach surpasses the current state-of-the-art method, Generalized Kinematic loop closure (GenKIC) in the Rosetta software suite. Given the absence of large, available macrocycle datasets, we created a custom dataset in-house and in silico. Our model, CyclicCAE, produces energetically stable backbones and designable structures more rapidly than GenKIC. It enables users to perform energy minimization, generate structurally similar or diverse inputs via MCMC, and conduct inpainting with fixed anchors or motifs. We propose that this novel method will accelerate the development of stable macrocycles, speeding up macrocycle drug design pipelines.

Show Abstract
February 27, 2025
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates