481 Publications

Counting particles could give wrong probabilities in Cryo-Electron Microscopy

L. Evans, Lars Dingeldein, P. Cossio, et al.

Cryo-electron microscopy (cryo-EM) experiments take 2D snapshots of individual proteins. In principle, these snapshots contain not only the main biomolecular conformations but also scarcely populated states and rare transitions between intermediates. This makes cryo-EM a powerful tool, not only for investigating the structure of biomolecules at high resolution but also for inferring the entire conformational ensemble distribution. Some recent works have reported conformational state populations by counting particle-images from cryo-EM. We wish to caution the community that these measurements are highly susceptible to noise and should not be relied upon as a precise estimate of the thermodynamic landscape of a biomolecule for understanding its biological function. Here, we demonstrate that the extremely noisy nature of cryo-EM images and uncertainty in the viewing orientations of biomolecules lead to ambiguities when assigning images to structures. If ignored, this ambiguity can introduce inherent bias when determining the populations of conformational states through individual particle assignment. We further show that modeling the conformational probability distribution using the entire image dataset mitigates these biases

Show Abstract
March 28, 2025

Stabilizing the calculation of the self-energy in dynamical mean-field theory using constrained residual minimization

Harrison LaBollita, J. Kaye, Alexander Hampel

We propose a simple and efficient method to calculate the electronic self-energy in dynamical mean-field theory (DMFT), addressing a numerical instability often encountered when solving the Dyson equation. Our approach formulates the Dyson equation as a constrained optimization problem with a simple quadratic objective. The constraints on the self-energy are obtained via direct measurement of the leading order terms of its asymptotic expansion within a continuous time quantum Monte Carlo framework, and the use of the compact discrete Lehmann representation of the self-energy yields an optimization problem in a modest number of unknowns. We benchmark our method for the non-interacting Bethe lattice, as well as DMFT calculations for both model systems and \textit{ab-initio} applications.

Show Abstract

Recent Advances in Membrane Protein Simulations

James C. Gumbart, S. Hanson

Simulating membrane proteins accurately combines two challenges into one: properly capturing the structure and dynamics of proteins as well as correctly representing the membrane environment in which they are usually embedded. Beginning with pioneering efforts in the 1980s and 1990s,1−7 both challenges have been met with increasing success over the years. Simulations of membrane proteins in realistic cellular contexts over many microseconds are now common.Concomitant advances in the determination of membrane protein structures, with over 50 unique structures determined 8 annually have further expanded the reach of simulations in this area. This Special Issue highlights a number of recent molecular dynamics (MD) simulations of membrane proteins and covers a wide range of applications and specialized techniques.

Show Abstract

Random Batch Ewald Method for Dielectrically Confined Coulomb Systems

Zecheng Gan , X. Gao, J. Liang, Zhenli Xu

Quasi two-dimensional (quasi-2D) Coulomb systems have drawn widespread interest. The reduced symmetry of these systems leads to complex collective behaviors yet simultaneously poses significant challenges for particle-based simulations. In this paper, a novel method is presented for efficiently simulating a collection of (N) charges confined in doubly periodic slabs, with the extension to scenarios involving dielectric jumps at slab boundaries. Unlike existing methods, the method is insensitive to the aspect ratio of the simulation box, and it achieves optimal ( O(N)) complexity and strong parallel scalability, thanks to the random batch Ewald (RBE) approach. Moreover, the additional cost for polarization contributions, represented as image reflection series, is reduced to a negligible cost via combining the RBE with an efficient structure factor coefficient recalibration technique in (k)-space. Explicit formulas for optimal parameter choices of the algorithm are provided through error estimates, together with a rigorous proof. Finally, we demonstrate the accuracy, efficiency, and scalability of our method, called RBE2D, via numerical tests across a variety of prototype systems. An excellent agreement between RBE2D and the particle-particle particle-mesh method (PPPM) is observed, with a significant reduction in the computational cost and improved strong scalability, demonstrating that it is a promising method for a broad range of charged systems under quasi-2D confinement.

Show Abstract

Sampling From Multiscale Densities With Delayed Rejection Generalized Hamiltonian Monte Carlo

Hamiltonian Monte Carlo (HMC) is the mainstay of applied Bayesian inference for differentiable models. However, HMC still struggles to sample from hierarchical models that induce densities with multiscale geometry: a large step size is needed to efficiently explore low curvature regions while a small step size is needed to accurately explore high curvature regions. We introduce the delayed rejection generalized HMC (DR-G-HMC) sampler that overcomes this challenge by employing dynamic step size selection, inspired by differential equation solvers. In generalized HMC, each iteration does a single leapfrog step. DR-G-HMC sequentially makes proposals with geometrically decreasing step sizes upon rejection of earlier proposals. This simulates Hamiltonian dynamics that can adjust its step size along a (stochastic) Hamiltonian trajectory to deal with regions of high curvature. DR-G-HMC makes generalized HMC competitive by decreasing the number of rejections which otherwise cause inefficient backtracking and prevents directed movement. We present experiments to demonstrate that DR-G-HMC (1) correctly samples from multiscale densities, (2) makes generalized HMC methods competitive with the state of the art No-U-Turn sampler, and (3) is robust to tuning parameters.

Show Abstract

PLUMED Tutorials: a collaborative, community-driven learning ecosystem

Gareth A. Tribello, Massimiliano Bonomi, P. Cossio, et al.

In computational physics, chemistry, and biology, the implementation of new techniques in shared and open-source software lowers barriers to entry and promotes rapid scientific progress. However, effectively training new software users presents several challenges. Common methods like direct knowledge transfer and in-person workshops are limited in reach and comprehensiveness. Furthermore, while the COVID-19 pandemic highlighted the benefits of online training, traditional online tutorials can quickly become outdated and may not cover all the software’s functionalities. To address these issues, here we introduce “PLUMED Tutorials,” a collaborative model for developing, sharing, and updating online tutorials. This initiative utilizes repository management and continuous integration to ensure compatibility with software updates. Moreover, the tutorials are interconnected to form a structured learning path and are enriched with automatic annotations to provide broader context. This paper illustrates the development, features, and advantages of PLUMED Tutorials, aiming to foster an open community for creating and sharing educational resources.

Show Abstract

Level Set Teleportation: An Optimization Perspective

Aaron Mishkin, A. Bietti, R. M. Gower

We study level set teleportation, an optimization routine which tries to accelerate gradient descent (GD) by maximizing the gradient norm over a level set of the objective. While teleportation intuitively speeds-up GD via bigger steps, current work lacks convergence theory for convex functions, guarantees for solving the teleportation operator, and even clear empirical evidence showing this acceleration. We resolve these open questions. For convex functions satisfying Hessian stability, we prove that GD with teleportation obtains a combined sub-linear/linear convergence rate which is strictly faster than GD when the optimality gap is small. This is in sharp contrast to the standard (strongly) convex setting, where teleportation neither improves nor worsens convergence. To evaluate teleportation in practice, we develop a projected-gradient method requiring only Hessian-vector products. We use this to show that gradient methods with access to a teleportation oracle out-perform their standard versions on a variety of problems. We also find that GD with teleportation is faster than truncated Newton methods, particularly for non-convex optimization.

Show Abstract

Chirped amplitude mode in photo-excited superconductors

Thomas Blommel, J. Kaye, Yuta Murakami, Emanuel Gull, Denis Golež

Using a state-of-the-art numerical scheme, we show that the Higgs mode under excitation exhibits chirped oscillations and exponential decay when fluctuations are included. This is in stark contrast to conventional BCS collisionless dynamics which predict power-law decay and the absence of chirping. The chirped amplitude mode enables us to determine the local modification of the effective potential even when the system is in a long-lived prethermal state. We then show that this chirped amplitude mode is an experimentally observable quantity since the photoinduced (super)current in pump-probe experiments serves as an efficient proxy for the order parameter dynamics, including the chirped dynamics. Our result is based on the attractive Hubbard model using dynamical mean-field theory within the symmetry-broken state after a excitation across the superconducting gap. Since the collective response involves long timescales, we extend the hierarchical low-rank compression method for nonequilibrium Green's functions to symmetry-broken states and show that it serves as an efficient representation despite long-lived memory kernels.

Show Abstract

Fast algorithm for quasi-2D Coulomb systems

Zecheng Gan, X. Gao, J. Liang, Zhenli Xu

Quasi-2D Coulomb systems are of fundamental importance and have attracted much attention in many areas nowadays. Their reduced symmetry gives rise to interesting collective behaviors, but also brings great challenges for particle-based simulations. Here, we propose a novel algorithm framework to address the O(N2) simulation complexity associated with the long-range nature of Coulomb interactions. First, we introduce an efficient Sum-of-Exponentials (SOE) approximation for the long-range kernel associated with Ewald splitting, achieving uniform convergence in terms of inter-particle distance, which reduces the complexity to O(N7/5). We then introduce a random batch sampling method in the periodic dimensions, the stochastic approximation is proven to be both unbiased and with reduced variance via a tailored importance sampling strategy, further reducing the computational cost to O(N). The performance of our algorithm is demonstrated via various numerical examples. Notably, it achieves a speedup of 2∼3 orders of magnitude comparing with Ewald2D method, enabling molecular dynamics (MD) simulations with up to 106 particles on a single core. The present approach is therefore well-suited for large-scale particle-based simulations of Coulomb systems under confinement, making it possible to investigate the role of Coulomb interaction in many practical situations.

Show Abstract

Understanding Factual Recall in Transformers via Associative Memories

Eshaan Nichani, Jason D. Lee, A. Bietti

Large language models have demonstrated an impressive ability to perform factual recall. Prior work has found that transformers trained on factual recall tasks can store information at a rate proportional to their parameter count. In our work, we show that shallow transformers can use a combination of associative memories to obtain such near optimal storage capacity. We begin by proving that the storage capacities of both linear and MLP associative memories scale linearly with parameter count. We next introduce a synthetic factual recall task, and prove that a transformer with a single layer of self-attention followed by an MLP can obtain 100% accuracy on the task whenever either the total number of self-attention parameters or MLP parameters scales (up to log factors) linearly with the number of facts. In particular, the transformer can trade off between using the value matrices or the MLP as an associative memory to store the dataset of facts. We complement these expressivity results with an analysis of the gradient flow trajectory of a simplified linear attention model trained on our factual recall task, where we show that the model exhibits sequential learning behavior.

Show Abstract
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates