2795 Publications

The Inaugural Flatiron Institute Cryo-EM Conformational Heterogeneity Challenge

M. Astore, P. Cossio, S. Hanson, et al.

Despite the rise of single particle cryo-electron microscopy (cryo-EM) as a premier method for resolving macromolecular structures at atomic resolution, methods to address molecular heterogeneity in vitrified samples have yet to reach maturity. With an increasing number of new methods to analyze the multitude of heterogeneous states captured in single particle images, a systematic approach to validation in this field is needed. With this motivation, we issued a challenge to the community to analyze two cryo-EM particle image sets of thyroglobulin that exhibit continuous conformational heterogeneity. The first dataset was experimental and the second was generated with a simulator, allowing control over the distribution of molecular structures and enabled direct comparison between participantsโ€™ submissions and the ground truth molecular structures and distributions. Participants were asked to submit 80 volumes representing the heterogeneous ensemble and estimate their respective populations in the image sets provided. Participation of the research community in the challenge was strong, with submissions from nearly all developers of heterogeneity methods, resulting in 41 submissions across both datasets. Submissions qualitatively exceeded expectations, with the molecular motions identified by methods resembling both each other and the ground truth motion. However, quantitatively assessing these similarities was a challenge in and of itself. In the process of assessing the submissions, we developed several validation metrics, most of which require reference to the underlying ground truth volumes. However, we have also explored the use of metrics that do not necessarily reference ground truth. This is particularly apt for experimental datasets where ground truth is inaccessible. These approaches allowed us to assess the similarity and accuracy in volume quality, molecular motions, and conformational distribution of di!erent submissions. These metrics and the e!orts of all participants help chart a path forward for the improvements of heterogeneity methods for cryo-EM and for future challenges to validate these new methods as they continue to be developed by the community.

Show Abstract

Lipid packing and local geometry influence septin curvature sensing

Brandy N. Curtis, Ellysa J.D. Vogt, C. Edelmaier, et al.

Septins can assemble into scaffolds at the plasma membrane to regulate cell morphology. While septins preferentially bind convex membranes via amphipathic helices, their assembly on varied geometries in cells suggests additional localization cues. We tested the hypothesis that lipid composition directs septin assembly through the property of lipid packing. We used pharmacological perturbations that alter fatty acid chain saturation to manipulate lipid packing and found septin structures were selectively disrupted at flat regions of the plasma membrane. To determine whether lipid packing is sufficient to impact septin assembly, molecular dynamics simulations were used to design lipid mixtures with varied packing to monitor septin adsorption in vitro. Septins strongly favored loosely packed lipid bilayers, but additional geometrical cues act in conjunction with this membrane property. This work demonstrates that packing defects and geometry jointly regulate septin localization, highlighting how distinct membrane properties are integrated to organize the septin cytoskeleton.

Show Abstract

Functions on Symmetric Matrices and Point Clouds via Lightweight Invariant Features from Galois Theory

Ben Blum-Smith, T. Huang, Marco Cuturi, S. Villar

In this work, we present a mathematical formulation for machine learning of (1) functions on symmetric matrices that are invariant with respect to the action of permutations by conjugation, and (2) functions on point clouds that are invariant with respect to rotations, reflections, and permutations of the points. To achieve this, we provide a construction of generically separating invariant features using ideas inspired by Galois theory. We construct ๐‘‚โก(๐‘›2) invariant features derived from generators for the field of rational functions on ๐‘› ร—๐‘› symmetric matrices that are invariant for joint permutations of rows and columns. We show that these invariant features can separate all distinct orbits of symmetric matrices, except for a measure zero set; such features can be used to universally approximate invariant functions on almost all weighted graphs. For point clouds in a fixed dimension, we prove that the number of invariant features can be reduced, generically without losing expressivity, to ๐‘‚โก(๐‘›), where ๐‘› is the number of points. We combine these invariant features with DeepSets to learn functions on symmetric matrices and point clouds with varying sizes. We empirically demonstrate the feasibility of our approach on molecule property regression and point cloud distance prediction.

Show Abstract

CosmoBench: A Multiscale, Multiview, Multitask Cosmology Benchmark for Geometric Deep Learning

T. Huang, R. Stiskalek, Jun-Young Lee, A. E. Bayer, Charles Margossian, Et al.

Cosmological simulations provide a wealth of data in the form of point clouds and directed trees. A crucial goal is to extract insights from this data that shed light on the nature and composition of the Universe. In this paper we introduce CosmoBench, a benchmark dataset curated from state-of-the-art cosmological simulations whose runs required more than 41 million core-hours and generated over two petabytes of data. CosmoBench is the largest dataset of its kind: it contains 34 thousand point clouds from simulations of dark matter halos and galaxies at three different length scales, as well as 25 thousand directed trees that record the formation history of halos on two different time scales. The data in CosmoBench can be used for multiple tasks -- to predict cosmological parameters from point clouds and merger trees, to predict the velocities of individual halos and galaxies from their collective positions, and to reconstruct merger trees on finer time scales from those on coarser time scales. We provide several baselines on these tasks, some based on established approaches from cosmological modeling and others rooted in machine learning. For the latter, we study different approaches -- from simple linear models that are minimally constrained by symmetries to much larger and more computationally-demanding models in deep learning, such as graph neural networks. We find that least-squares fits with a handful of invariant features sometimes outperform deep architectures with many more parameters and far longer training time. Still there remains tremendous potential to improve these baselines by combining machine learning and cosmology to fully exploit the data. CosmoBench sets the stage for bridging cosmology and geometric deep learning at scale. We invite the community to push the frontier of scientific discovery by engaging with this dataset, available at this https URL

Show Abstract

Evaluating Selective Quality Control in Mammalian Oogenesis: Evidence and Opportunities

Jay W. Zussman1, D. Skinner, S. Shvartsman, et al.

The formation and maintenance of the finite mammalian ovarian reserve are critical for fertility and species survival. Genetic and developmental studies have uncovered various mechanisms underlying oocyte development and maturation, revealing two curious features of the ovarian germline: (a) The establishment of the follicle reserve involves an initial massive overproduction of oocyte precursors, and (b) the total number of ovulated oocytes across an animal's fertile lifetime is a very small proportion of the initial ovarian reserve. Many have proposed that this indicates the existence of selective quality control to ensure gamete fitness. Here, we review the findings underlying the hypotheses for germline quality control during prepubertal development, homeostatic fertility, and reproductive aging. We evaluate whether the existing evidence base distinguishes the active selection of specific germ cell subsets from neutral dynamics. Throughout, we discuss strategies for applying statistical frameworks to evaluate selection in oogenesis and the implications of neutrality versus selection at various points in oocyte development.

Show Abstract

Photoinduced twist and untwist of moirรฉ superlattices

Two-dimensional moirรฉ materials are formed by artificially stacking atomically thin monolayers. Correlated and topological quantum phases can be engineered by precise choice of stacking geometry1--3. These designer electronic properties depend crucially on interlayer coupling and atomic registry4,5. An open question is how the atomic registry responds on ultrafast timescales to optical excitation and whether the moirรฉ geometry can be dynamically reconfigured to tune emergent phenomena in real time. Here we show that femtosecond photoexcitation drives a coherent twist--untwist motion of the moirรฉ superlattice in 2ยฐ and 57ยฐ twisted WSe2/MoSe2 heterobilayers, resolved directly by ultrafast electron diffraction. On above-band-gap photoexcitation, the moirรฉ superlattice diffraction features are enhanced within 1ps and subsequently suppressed several picoseconds after, deviating markedly from typical photoinduced lattice heating. Kinetic diffraction analysis, supported by simulations of the sample dynamics, indicates a peak-to-trough local twist angle modulation of 0.6ยฐ, correlated with a sub-THz frequency moirรฉ phonon. This motion is driven by ultrafast charge transfer that transiently increases interlayer attraction. Our results could lead to ultrafast control of moirรฉ periodic lattice distortions and, by extension, the local moirรฉ potential that shapes excitons, polarons and correlation-driven behaviours.
Show Abstract
November 1, 2025

Higher-order continuum models for twisted bilayer graphene

S. Quinn, Tianyu Kong, M. Luskin, Alexander B. Watson

The first-order continuum partial differential equation (PDE) model proposed by Bistritzer and MacDonald [Proc. Natl. Acad. Sci. U. S. A. 108, 12233โ€“12237 (2011)] accurately describes the single-particle electronic properties of twisted bilayer graphene at small twist angles. In this paper, we obtain higher-order corrections to the Bistritzerโ€“MacDonald (BM) model via a systematic multiple-scales expansion. We prove that the solution of the resulting higher-order PDE model accurately approximates the corresponding tight-binding wave function under a natural choice of parameters and given initial conditions that are spectrally localized to the monolayer Dirac points. Numerical simulations of tight-binding and continuum dynamics demonstrate the validity of the higher-order continuum model. Symmetries of the higher-order models are also discussed. This work extends the analysis from Watson et al., J. Math. Phys. 64, 031502 (2023), which rigorously established the validity of the (first-order) BM model.

Show Abstract

Innate immune molecular landscape following controlled human influenza virus infection

William Thistlethwaite, Xi Chen, O. Troyanskaya

Viral infections can induce prolonged changes in innate immunity. Here, we use blood samples from a human influenza H3N2 challenge study (NCT03883113) to perform comprehensive multi-omics analyses. We detect remodeling of immune programs in circulating innate immune cells that persist after resolution of the infection. We find changes associated with suppressed inflammation, including decreased cytokine and AP-1 gene expression as well as decreased accessibility at AP-1 targets and interleukin-related gene promoter regions. We also find decreased histone deacetylase gene expression, increased MAP kinase gene expression, and increased accessibility at interferon-related gene promoter regions. Genes involved in inflammation and methylation remodeling show modulation of gene-chromatin site regulatory circuit activity. These results reveal a coordinated rewiring of the molecular landscape in innate immune cells induced by mild influenza virus infection.

Show Abstract

Modeling Neural Activity with Conditionally Linear Dynamical Systems

Victor Geadah, A. Nejatbakhsh, D. Lipshutz, J. Pillow, A. Williams

Neural population activity exhibits complex, nonlinear dynamics, varying in time, over trials, and across experimental conditions. Here, we develop Conditionally Linear Dynamical System (CLDS) models as a general-purpose method to characterize these dynamics. These models use Gaussian Process (GP) priors to capture the nonlinear dependence of circuit dynamics on task and behavioral variables. Conditioned on these covariates, the data is modeled with linear dynamics. This allows for transparent interpretation and tractable Bayesian inference. We find that CLDS models can perform well even in severely data-limited regimes (e.g. one trial per condition) due to their Bayesian formulation and ability to share statistical power across nearby task conditions. In example applications, we apply CLDS to model thalamic neurons that nonlinearly encode heading direction and to model motor cortical neurons during a cued reaching task

Show Abstract

Spectral Analysis of Representational Similarity with Limited Neurons

Hyunmo Kang, A. Canatar, S. Chung

Measuring representational similarity between neural recordings and computational models is challenging due to constraints on the number of
neurons that can be recorded simultaneously. In this work, we investigate how such limitations affect similarity measures, focusing on Canonical Correlation Analysis (CCA) and Centered Kernel Alignment (CKA). Leveraging tools from Random Matrix Theory, we develop a predictive spectral framework for these measures and demonstrate that finite neuron sampling systematically underestimates similarity due to eigenvector de-
localization. To overcome this, we introduce a denoising method to infer population-level similarity, enabling accurate analysis even with small
neuron samples. Our theory is validated on synthetic and real datasets, offering practical strategies for interpreting neural data under finite sampling constraints.

Show Abstract
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates