CCM: Publications

Low-rank Green’s function representations applied to dynamical mean-field theory

Nan Sheng , Alexander Hampel, Sophie Beck, Olivier Parcollet, Nils Wentzell, J. Kaye, Kun Chen

Several recent works have introduced highly compact representations of single-particle Green's functions in the imaginary time and Matsubara frequency domains, as well as efficient interpolation grids used to recover the representations. In particular, the intermediate representation with sparse sampling and the discrete Lehmann representation (DLR) make use of low rank compression techniques to obtain optimal approximations with controllable accuracy. We consider the use of the DLR in dynamical mean-field theory (DMFT) calculations, and in particular show that the standard full Matsubara frequency grid can be replaced by the compact grid of DLR Matsubara frequency nodes. We test the performance of the method for a DMFT calculation of Sr$_2$RuO$_4$ at temperature $50$K using a continuous-time quantum Monte Carlo impurity solver, and demonstrate that Matsubara frequency quantities can be represented on a grid of only 36 nodes with no reduction in accuracy, or increase in the number of self-consistent iterations, despite the presence of significant Monte Carlo noise.

Show Abstract

Ensemble Reweighting Using Cryo-EM Particle Images

W. S. Wai Shing, David Silva-Sánchez, Julian Giraldo-Barreto, B. Carpenter, S. Hanson, A. Barnett, E. Thiede, P. Cossio

Cryo-electron microscopy (cryo-EM) has recently become a leading method for obtaining high-resolution structures of biological macromolecules. However, cryo-EM is limited to biomolecular samples with low conformational heterogeneity, where most conformations can be well-sampled at various projection angles. While cryo-EM provides single-molecule data for heterogeneous molecules, most existing reconstruction tools cannot retrieve the ensemble distribution of possible molecular conformations from these data. To overcome these limitations, we build on a previous Bayesian approach and develop an ensemble refinement framework that estimates the ensemble density from a set of cryo-EM particle images by reweighting a prior conformational ensemble, e.g., from molecular dynamics simulations or structure prediction tools. Our work provides a general approach to recovering the equilibrium probability density of the biomolecule directly in conformational space from single-molecule data. To validate the framework, we study the extraction of state populations and free energies for a simple toy model and from synthetic cryo-EM particle images of a simulated protein that explores multiple folded and unfolded conformations.

Show Abstract

Algebraically rigorous quaternion framework for the neural network pose estimation problem

C. Lin, Andrew J. Hanson, S. Hanson

The 3D pose estimation problem – aligning pairs of noisy 3D point clouds – is a problem with a wide variety of real- world applications. Here we focus on the use of quaternion- based neural network approaches to this problem and ap- parent anomalies that have arisen in previous efforts to re- solve them. In addressing these anomalies, we draw heav- ily from the extensive literature on closed-form methods to solve this problem. We suggest that the major concerns that have been put forward could be resolved using a sim- ple multi-valued training target derived from rigorous theo- retical properties of the rotation-to-quaternion map of Bar- Itzhack. This multi-valued training target is then demon- strated to have good performance for both simulated and ModelNet targets. We provide a comprehensive theoretical context, using the quaternion adjugate, to confirm and es- tablish the necessity of replacing single-valued quaternion functions by quaternions treated in the extended domain of multiple-charted manifolds.

Show Abstract

Conditionally Strongly Log-Concave Generative Models

Florentin Guth, Etienne Lempereur, Joan Bruna, S. Mallat

There is a growing gap between the impressive results of deep image generative models and classical algorithms that offer theoretical guarantees. The former suffer from mode collapse or memorization issues, limiting their application to scientific data. The latter require restrictive assumptions such as log-concavity to escape the curse of dimensionality. We partially bridge this gap by introducing conditionally strongly log-concave (CSLC) models, which factorize the data distribution into a product of conditional probability distributions that are strongly log-concave. This factorization is obtained with orthogonal projectors adapted to the data distribution. It leads to efficient parameter estimation and sampling algorithms, with theoretical guarantees, although the data distribution is not globally logconcave. We show that several challenging multiscale processes are conditionally log-concave using wavelet packet orthogonal projectors. Numerical results are shown for physical fields such as the φ4 model and weak lensing convergence maps with higher resolution than in previous works.

Show Abstract

A Stochastic Proximal Polyak Step Size

Fabian Schaipp, R. M. Gower, Michael Ulbrich

Recently, the stochastic Polyak step size (SPS) has emerged as a competitive adaptive step size scheme for stochastic gradient descent. Here we develop ProxSPS, a proximal variant of SPS that can handle regularization terms. Developing a proximal variant of SPS is particularly important, since SPS requires a lower bound of the objective function to work well. When the objective function is the sum of a loss and a regularizer, available estimates of a lower bound of the sum can be loose. In contrast, ProxSPS only requires a lower bound for the loss which is often readily available. As a consequence, we show that ProxSPS is easier to tune and more stable in the presence of regularization. Furthermore for image classification tasks, ProxSPS performs as well as AdamW with little to no tuning, and results in a network with smaller weight parameters. We also provide an extensive convergence analysis for ProxSPS that includes the non-smooth, smooth, weakly convex and strongly convex setting.

Show Abstract

Learning multi-scale local conditional probability models of images

Zahra Kadkhodaie, Florentin Guth, S. Mallat, Eero P Simoncelli

Deep neural networks can learn powerful prior probability models for images, as evidenced by the high-quality generations obtained with recent score-based diffusion methods. But the means by which these networks capture complex global statistical structure, apparently without suffering from the curse of dimensionality, remain a mystery. To study this, we incorporate diffusion methods into a multi-scale decomposition, reducing dimensionality by assuming a stationary local Markov model for wavelet coefficients conditioned on coarser-scale coefficients. We instantiate this model using convolutional neural networks (CNNs) with local receptive fields, which enforce both the stationarity and Markov properties. Global structures are captured using a CNN with receptive fields covering the entire (but small) low-pass image. We test this model on a dataset of face images, which are highly non-stationary and contain large-scale geometric structures.
Remarkably, denoising, super-resolution, and image synthesis results all demonstrate that these structures can be captured with significantly smaller conditioning neighborhoods than required by a Markov model implemented in the pixel domain. Our results show that score estimation for large complex images can be reduced to low-dimensional Markov conditional models across scales, alleviating the curse of dimensionality.

Show Abstract

Complex-to-Real Sketches for Tensor Products with Applications to the Polynomial Kernel

Jonas Wacker, R. Ohana, Maurizio Filippone

Randomized sketches of a tensor product of pvectors follow a tradeoff between statistical efficiency and computational acceleration. Commonly used approaches avoid computing the high-dimensional tensor product explicitly, resulting in a suboptimal dependence of O(3p) in the embedding dimension. We propose a simple Complex-to-Real (CtR) modification of well-known sketches that replaces real random projections by complex ones, incurring a lower O(2p)factor in the embedding dimension. The output of our sketches is real-valued, which renders
their downstream use straightforward. In particular, we apply our sketches to p-fold self-tensored inputs corresponding to the feature maps of the
polynomial kernel. We show that our method achieves state-of-the-art performance in terms of accuracy and speed compared to other randomized approximations from the literature.

Show Abstract

A Cheat Sheet for Bayesian Prediction

Bertrand Clarke, Y. Yao

This paper reviews the growing field of Bayesian prediction. Bayes point and interval prediction are defined and exemplified and situated in statistical prediction more generally. Then, four general approaches to Bayes prediction are defined and we turn to predictor selection. This can be done predictively or non-predictively and predictors can be based on single models or multiple models. We call these latter cases unitary predictors and model average predictors, respectively. Then we turn to the most recent aspect of prediction to emerge, namely prediction in the context of large observational data sets and discuss three further classes of techniques. We conclude with a summary and statement of several current open problems.

Show Abstract

Simulation-based inference of single-molecule force spectroscopy

Lars Dingeldein, P. Cossio, Roberto Covino

Single-molecule force spectroscopy (smFS) is a powerful approach to studying molecular self-organization. However, the coupling of the molecule with the ever-present experimental device introduces artifacts, that complicate the interpretation of these experiments. Performing statistical inference to learn hidden molecular properties is challenging because these measurements produce non-Markovian time series, and even minimal models lead to intractable likelihoods. To overcome these challenges, we developed a computational framework built on novel statistical methods called simulation-based inference (SBI). SBI enabled us to directly estimate the Bayesian posterior, and extract reduced quantitative models from smFS, by encoding a mechanistic model into a simulator in combination with probabilistic deep learning. Using synthetic data, we could systematically disentangle the measurement of hidden molecular properties from experimental artifacts. The integration of physical models with machine-learning density estimation is general, transparent, easy to use, and broadly applicable to other types of biophysical experiments.

Show Abstract

Simulation-based inference of single-molecule force spectroscopy

Lars Dingeldein, P. Cossio, Roberto Covino

Single-molecule force spectroscopy (smFS) is a powerful approach to studying molecular self-organization. However, the coupling of the molecule with the ever-present experimental device introduces artifacts, that complicate the interpretation of these experiments. Performing statistical inference to learn hidden molecular properties is challenging because these measurements produce non-Markovian time series, and even minimal models lead to intractable likelihoods. To overcome these challenges, we developed a computational framework built on novel statistical methods called simulation-based inference (SBI). SBI enabled us to directly estimate the Bayesian posterior, and extract reduced quantitative models from smFS, by encoding a mechanistic model into a simulator in combination with probabilistic deep learning. Using synthetic data, we could systematically disentangle the measurement of hidden molecular properties from experimental artifacts. The integration of physical models with machine-learning density estimation is general, transparent, easy to use, and broadly applicable to other types of biophysical experiments.

Show Abstract