689 Publications

3D chromatin structures precede genome activation in Drosophila embryogenesis

Gabriel A. Dolsten, Evan Cofer, O. Troyanskaya, et al.

3D chromatin structure is critical for the regulation of gene expression during development. Here we used Micro-C assays at 100-bp resolution to map genome organization in Drosophila melanogaster throughout the first half of embryogenesis. These high-resolution contact maps reveal fine-scale features such as loops and boundaries delineating topologically associating domains. Notably, we observe that 3D chromatin structures form prior to zygotic genome activation and persist during successive mitotic cycles. Integrative analysis with 149 public chromatin immunoprecipitation sequencing (ChIP-seq) datasets identifies four classes of chromatin structuring elements, including a distinct group enriched for GAGA-associated factor (GAF) and Zelda binding, associated with developmental-gene regulation. These elements are mitotically retained and exhibit sequence and structure similarity between D. melanogaster and D. virilis. We propose that 3D chromatin organization in the pre-cellular embryo facilitates deployment of developmentally regulated genes during Drosophila embryogenesis.

Show Abstract
November 12, 2025

The Inaugural Flatiron Institute Cryo-EM Conformational Heterogeneity Challenge

M. Astore, P. Cossio, S. Hanson, et al.

Despite the rise of single particle cryo-electron microscopy (cryo-EM) as a premier method for resolving macromolecular structures at atomic resolution, methods to address molecular heterogeneity in vitrified samples have yet to reach maturity. With an increasing number of new methods to analyze the multitude of heterogeneous states captured in single particle images, a systematic approach to validation in this field is needed. With this motivation, we issued a challenge to the community to analyze two cryo-EM particle image sets of thyroglobulin that exhibit continuous conformational heterogeneity. The first dataset was experimental and the second was generated with a simulator, allowing control over the distribution of molecular structures and enabled direct comparison between participants’ submissions and the ground truth molecular structures and distributions. Participants were asked to submit 80 volumes representing the heterogeneous ensemble and estimate their respective populations in the image sets provided. Participation of the research community in the challenge was strong, with submissions from nearly all developers of heterogeneity methods, resulting in 41 submissions across both datasets. Submissions qualitatively exceeded expectations, with the molecular motions identified by methods resembling both each other and the ground truth motion. However, quantitatively assessing these similarities was a challenge in and of itself. In the process of assessing the submissions, we developed several validation metrics, most of which require reference to the underlying ground truth volumes. However, we have also explored the use of metrics that do not necessarily reference ground truth. This is particularly apt for experimental datasets where ground truth is inaccessible. These approaches allowed us to assess the similarity and accuracy in volume quality, molecular motions, and conformational distribution of di!erent submissions. These metrics and the e!orts of all participants help chart a path forward for the improvements of heterogeneity methods for cryo-EM and for future challenges to validate these new methods as they continue to be developed by the community.

Show Abstract
November 6, 2025

Lipid packing and local geometry influence septin curvature sensing

Brandy N. Curtis, Ellysa J.D. Vogt, C. Edelmaier, et al.

Septins can assemble into scaffolds at the plasma membrane to regulate cell morphology. While septins preferentially bind convex membranes via amphipathic helices, their assembly on varied geometries in cells suggests additional localization cues. We tested the hypothesis that lipid composition directs septin assembly through the property of lipid packing. We used pharmacological perturbations that alter fatty acid chain saturation to manipulate lipid packing and found septin structures were selectively disrupted at flat regions of the plasma membrane. To determine whether lipid packing is sufficient to impact septin assembly, molecular dynamics simulations were used to design lipid mixtures with varied packing to monitor septin adsorption in vitro. Septins strongly favored loosely packed lipid bilayers, but additional geometrical cues act in conjunction with this membrane property. This work demonstrates that packing defects and geometry jointly regulate septin localization, highlighting how distinct membrane properties are integrated to organize the septin cytoskeleton.

Show Abstract

Evaluating Selective Quality Control in Mammalian Oogenesis: Evidence and Opportunities

Jay W. Zussman1, D. Skinner, S. Shvartsman, et al.

The formation and maintenance of the finite mammalian ovarian reserve are critical for fertility and species survival. Genetic and developmental studies have uncovered various mechanisms underlying oocyte development and maturation, revealing two curious features of the ovarian germline: (a) The establishment of the follicle reserve involves an initial massive overproduction of oocyte precursors, and (b) the total number of ovulated oocytes across an animal's fertile lifetime is a very small proportion of the initial ovarian reserve. Many have proposed that this indicates the existence of selective quality control to ensure gamete fitness. Here, we review the findings underlying the hypotheses for germline quality control during prepubertal development, homeostatic fertility, and reproductive aging. We evaluate whether the existing evidence base distinguishes the active selection of specific germ cell subsets from neutral dynamics. Throughout, we discuss strategies for applying statistical frameworks to evaluate selection in oogenesis and the implications of neutrality versus selection at various points in oocyte development.

Show Abstract

Innate immune molecular landscape following controlled human influenza virus infection

William Thistlethwaite, Xi Chen, O. Troyanskaya

Viral infections can induce prolonged changes in innate immunity. Here, we use blood samples from a human influenza H3N2 challenge study (NCT03883113) to perform comprehensive multi-omics analyses. We detect remodeling of immune programs in circulating innate immune cells that persist after resolution of the infection. We find changes associated with suppressed inflammation, including decreased cytokine and AP-1 gene expression as well as decreased accessibility at AP-1 targets and interleukin-related gene promoter regions. We also find decreased histone deacetylase gene expression, increased MAP kinase gene expression, and increased accessibility at interferon-related gene promoter regions. Genes involved in inflammation and methylation remodeling show modulation of gene-chromatin site regulatory circuit activity. These results reveal a coordinated rewiring of the molecular landscape in innate immune cells induced by mild influenza virus infection.

Show Abstract

cryoJAX: A Cryo-electron Microscopy Image Simulation Library In JAX

Michael J. O'Brien, S. Hanson, D. Needleman, et al.

While cryo-electron microscopy (cryo-EM) has come to prominence in the last decade due to its ability to resolve biomolecular complexes at atomic resolution, advancements in experimental and computational methods have made cryo-EM promising for investigating intracellular organization and heterogeneous molecular states. A primary challenge for these alternative applications is the development of techniques for cryo-EM data analysis, which are very computationally demanding. To this end, it is advantageous to leverage advanced scientific computing frameworks for statistical analysis. One such framework is JAX, an emerging array-oriented Python numerical computing package for automatic differentiation and vectorization with a growing ecosystem for statistical inference and machine learning. We have developed cryoJAX, a cryo-EM image simulation library for building computational data analysis applications in JAX. CryoJAX is a flexible modeling language for cryo-EM image formation and therefore can support a wide range of data analysis downstream. By integrating with the JAX ecosystem, cryoJAX enables the development and deployment of algorithms for the growing breadth of scientific applications for cryo-EM.

Show Abstract
October 24, 2025

Examining Age-Bias and Stereotypes of Aging in LLMs

Sherwin Dewan , Ismail Shaikh, A. Sahoo

Large Language Models (LLMs) are increasingly being used across applications, ranging from content generation to decision-making, raising concerns about biases embedded in them. While biases related to gender, race, and culture have been studied extensively, understanding age-bias and stereotypes of aging in LLMs remain underexplored. This study analyzes LLM-generated responses to prompts related to aging, revealing stereotypical biases about aging pertaining to technology proficiency, cognitive and physical decline, and job roles. We noted that even responses without explicit age bias also had mentions of ageist stereotypes. We discuss considerations for involving older adults’ perspectives through human-in-the-loop methodologies yet exercising caution about aspects like internalized ageism.

Show Abstract
October 22, 2025

Disrupted developmental signaling induces novel transcriptional states

Aleena Patel, Vanessa Gonzalez, S. Shvartsman, M. Avdeeva

Signaling pathways induce stereotyped transcriptional changes as stem cells progress into mature cell types during embryogenesis. Signaling perturbations are necessary to discover which genes are responsive or insensitive to pathway activity. However, gene regulation is additionally dependent on cell state-specific factors like chromatin modifications or transcription factor binding. Thus, transcriptional profiles need to be assayed in single cells to identify potentially multiple, distinct perturbation responses among heterogeneous cell states in an embryo. In perturbation studies, comparing heterogeneous transcriptional states among experimental conditions often requires samples to be collected over multiple independent experiments, which can introduce confounding batch effects. We present Design-Aware Integration of Single Cell ExpEriments (DAISEE), a new algorithm that models perturbation responses in single-cell datasets collected according to complex experimental designs. We demonstrate that DAISEE improves upon a previously available integrative nonnegative matrix factorization framework, more efficiently separating perturbation responses from confounding variation. We use DAISEE to integrate newly collected single-cell RNA sequencing datasets from 5-h-old zebrafish embryos expressing optimized photoswitchable MEK (psMEK), which globally activates the extracellular signal-regulated kinase (ERK), a signaling molecule involved in many cell specification events. psMEK drives some cells that are normally not exposed to ERK signals toward other wild type states and induces novel states expressing early-acting endothelial genes. Overactive signaling is therefore capable of producing unexpected gene expression states in developing embryos.

Show Abstract

Using Time Dependent Rate Analysis to Evaluate the Quality of Machine Learned Reaction Coordinates for Biasing and Computing Kinetics

Nicodemo Mazzaferro , Suemin Lee, P. Cossio, et al.

Having an accurate reaction coordinate (RC) is essential for reliable kinetic characterization of molecular processes, but there are few quantitative metrics to evaluate RC quality. In this study, we consider the dimensionless γ metric from the Exponential Average Time-dependent Rate (EATR) method, which represents the fraction of a biasing potential along the RC that contributes to increasing the rate constant. We demonstrate that γ can be used to test whether the utility of a RC for predicting kinetics with a Metadynamics bias improves as the coordinate is iteratively updated to include new data. We evaluate RCs approximated via the iterative State Predictive Information Bottleneck (SPIB) approach, which was previously shown to be accurate across six protein–ligand dissociation systems. For these same systems, we compute γ values and mean accelerated times τ̅accel. After systematically scanning over fitting parameters, the results show that γ increases closer to 1, while τ̅accel decreases, revealing a consistent inverse correlation. These results demonstrate that γ serves as a practical criterion for RC evaluation and offers guidance for selecting SPIB–derived coordinates yielding quantitative kinetic predictions.

Show Abstract

Atlas of Glomerular Disease-Specific Genetic Effects on Gene Regulation in Blood Empowers New Gene Discovery Studies

Lilil Liu , Chen Wang, O. Troyanskaya, et al.

IgA nephropathy (IgAN), focal segmental glomerulosclerosis (FSGS), membranous nephropathy (MN), and minimal change disease (MCD) account for the majority of idiopathic glomerulopathies (GN). However, there are no powered transcriptomic datasets coupled to genetic data to investigate the genetic mechanisms underlying gene regulation in the context of GN.

Show Abstract
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates