2596 Publications

The Fruit Fly Auxodrome: a computer vision setup for longitudinal studies of Drosophila development

Changyuan Wang , Denis F Faerberg , S. Shvartsman, Robert A Marmion

Studies in Drosophila have contributed a great deal to our understanding of developmental mechanisms. Indeed, familiar names of critical signaling components, such as Hedgehog and Notch, have their origins in the readily identifiable morphological phenotypes of Drosophila. Most studies that led to the identification of these and many other highly conserved genes were based on the end-point phenotypes, such as the larval cuticle or the adult wing. Additional information can be extracted from longitudinal studies, which can reveal how the phenotypes emerge over time. Here we present the Fruit Fly Auxodrome, an experimental setup that enables monitoring and quantitative analysis of the entirety of development of 96 individually housed Drosophila from hatching to eclosion. The Auxodrome combines an inexpensive live imaging setup and a computer vision pipeline that provides access to a wide range of quantitative information, such as the times of hatching and pupation, as well as dynamic patterns of larval activity. We demonstrate the Auxodrome in action by recapitulating several previously reported features of wild-type development as well as developmental delay in a Drosophila model of a human disease. The scalability of the presented design makes it readily suitable for large-scale longitudinal studies in multiple developmental contexts.

Show Abstract

Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry

Integrating task-relevant information into neural representations is a fundamental ability of both biological and artificial intelligence systems. Recent theories have categorized learning into two regimes: the rich regime, where neural networks actively learn task-relevant features, and the lazy regime, where networks behave like random feature models. Yet this simple lazy-rich dichotomy overlooks a diverse underlying taxonomy of feature learning, shaped by differences in learning algorithms, network architectures, and data properties. To address this gap, we introduce an analysis framework to study feature learning via the geometry of neural representations. Rather than inspecting individual learned features, we characterize how task-relevant representational manifolds evolve throughout the learning process. We show, in both theoretical and empirical settings, that as networks learn features, task-relevant manifolds untangle, with changes in manifold geometry revealing distinct learning stages and strategies beyond the lazy-rich dichotomy. This framework provides novel insights into feature learning across neuroscience and machine learning, shedding light on structural inductive biases in neural circuits and the mechanisms underlying out-of-distribution generalization.

Show Abstract
July 11, 2025

The open-source Masala software suite: Facilitating rapid methods development for synthetic heteropolymer design

Tristan Zaborniak, B. Turzo, D. Renfrew, V. Mulligan, et al.

Although canonical protein design has benefited from machine learning methods trained on databases of protein sequences and structures, synthetic heteropolymer design still relies heavily on physics-based methods. The Rosetta software, which provides diverse physics-based methods for designing sequences, exploring conformations, docking molecules, and performing analysis, has proven invaluable to this field. Nevertheless, Rosetta’s aging architecture, monolithic structure, non-open source code, and steep development learning curve are beginning to hinder new methods development. Here, we introduce the Masala software suite, a free, open-source set of C++ libraries intended to extend Rosetta and other software, and ultimately to be a successor to Rosetta. Masala is structured for modern computing hardware, and its build system automates the creation of application programming interface (API) layers, permitting Masala’s use as an extension library for existing software, including Rosetta. Masala features modular architecture in which it is easy for novice developers to add new plugin modules, which can be independently compiled and loaded at runtime, extending functionality of software linking Masala without source code alteration. Here, we describe implementation of Masala modules that accelerate protein and synthetic peptide design. We describe the implementation of Masala real-valued local optimizers and cost function network optimizers that can be used as drop-in replacements for Rosetta’s minimizer and packer when designing heteropolymers. We explore design-centric guidance terms for promoting desirable features, such as hydrogen bond networks, or discouraging undesirable features, such as unsatisfied buried hydrogen bond donors and acceptors, which we have re-implemented far more efficiently in Masala, providing up to two orders of magnitude of speedup in benchmarks. Finally, we discuss development goals for future versions of Masala.

Show Abstract

Variations in neuronal selectivity create efficient representational geometries for perception

Our visual capabilities depend on neural response properties in visual areas of our brains. Neurons exhibit a wide variety of selective response properties, but the reasons for this diversity are unknown. Here, we related the distribution of neuronal tuning properties to the information capacity of the population. Our results from theory, simulations, and analysis of recordings from macaque primary visual cortex (V1) reveal that diversity of amplitude and bandwidth drive complementary changes to the representational geometry of a population. Amplitude diversity pushes the centers of the representations further apart, whereas bandwidth heterogeneity decorrelates the center locations. These geometric changes separate out representations for distinct stimuli, creating more efficient encoding. We study how both types of diversity affect the population code for two different perceptual tasks: discrimination and identification. While both types of diversity improve encoding for both tasks, their distinct impacts on geometry make each more beneficial for one of the two tasks. Amplitude diversity impacts coding efficiency more for discrimination than it does for identification, while bandwidth diversity has a stronger impact on identification. These complementary effects indicate the importance of both types of diversity for perception. Finally, because tuning diversity exists across species and brain areas, our results suggest a fundamental neural coding strategy that may be applicable to a wide range of behavior.

Show Abstract

Sequestration of ribosome biogenesis factors in HSV- 1 nuclear aggregates revealed by spatially resolved thermal profiling

Peter J. Metzger , Tavis J. Reed , O. Troyanskaya

Viruses exploit host cell reliance on compartmentalization to facilitate their replication. Herpes simplex virus type 1 (HSV-1) modulates the subcellular localization of host proteins to suppress immune activation, license viral gene expression, and achieve translational shutoff. To spatially resolve dynamic protein-protein interaction (PPI) networks during infection with an immunostimulatory HSV-1 strain, we integrated nuclear/cytoplasmic fractionation with thermal proximity coaggregation analysis (N/C-TPCA). The resulting expanded depth and spatial resolution of PPIs charted compartment-specific assemblies of protein complexes throughout infection. We find that a broader suite of host chaperones than previously anticipated exhibits nuclear recruitment to form condensates known as virus-induced chaperone-enriched (VICE) domains. Monitoring protein and RNA constituents and ribosome activity, we establish that VICE domains sequester ribosome biogenesis factors from ribosomal RNA, accompanying a cell-wide defect in ribosome supply. These findings highlight infection-driven VICE domains as nodes of translational remodeling and demonstrate the utility of N/C-TPCA to study dynamic biological contexts.

Show Abstract

A live-cell biosensor of in vivo receptor tyrosine kinase activity reveals feedback regulation of a developmental gradient

Emily K. Ho , Rebecca P. Kim-Yip , S. Shvartsman, et al.

A lack of tools for detecting receptor activity in vivo has limited our ability to fully explore receptor-level control of developmental patterning. Here, we extend phospho-tyrosine tag (pYtag) biosensors to visualize endogenous receptor tyrosine kinase (RTK) activity in Drosophila. We build biosensors for three RTKs that function across developmental stages and tissues. By characterizing Torso::pYtag during embryonic terminal patterning, we find that Torso activity differs from downstream extracellular signal-regulated kinase (ERK) activity in two surprising ways: Torso activity is narrowly restricted to the poles but produces a broader gradient of ERK and decreases over developmental time, while ERK activity is sustained, an effect mediated by ERK pathway-dependent negative feedback. Our results suggest that a narrow domain of Torso activity, tuned in amplitude by negative feedback, locally activates signaling effectors, which diffuse through the syncytial embryo to form the ERK gradient. Altogether, the results of this work highlight the usefulness of pYtags for investigating receptor-level regulation of developmental patterning.

Show Abstract

Microtubules in Martini: Parameterizing a heterogeneous elastic-network towards a mechanically accurate microtubule

Microtubules are essential cytoskeletal filaments involved in cell motility, division, and intracellular transport, exhibiting complex structural dynamics governed by diverse biophysical factors. Atomistic simulations of microtubule assemblies remain challenging due to their extensive spatiotemporal scales. To address this, we present a multiscale approach combining the primarily top-down Martini 3 coarse-grained (CG) model with an appropriately parameterized heterogeneous elastic network to capture microtubule mechanics and molecular detail efficiently. By iteratively tuning the elastic network, we matched the structural fluctuations of CG heterodimeric building blocks to atomistic reference data, reproducing experimentally consistent mechanical properties. This framework helped us identify stabilizing long-lived interactions between charged C-terminal tails and the folded domain of neighboring tubulin subunits, offering insight into sequence-specific contributions to lattice stability. Our efforts culminated in the construction of a 200 nm microtubule composed of million interaction centers, enabling exploration of large-scale microtubule-associated processes with amino acid-level resolution. This work bridges the gap between molecular specificity and computational scalability, offering a platform for simulating biophysical processes across cellular length and time scales.

Show Abstract

Nonlinear spontaneous flow instability in active nematics

I. Lavi, Ricard Alert, Jean-François Joanny, Jaume Casademunt

Active nematics exhibit spontaneous flows through a well-known linear instability of the uniformly aligned quiescent state. Here, we show that even a linearly stable uniform state can experience a nonlinear instability, resulting in a discontinuous transition to spontaneous flows. In this case, quiescent and flowing states may coexist. Through a weakly nonlinear analysis and a numerical study, we trace the bifurcation diagram of striped patterns and show that the underlying pitchfork bifurcation switches from supercritical (continuous) to subcritical (discontinuous) by varying the flow-alignment parameter. We predict that the discontinuous spontaneous flow transition occurs for a wide range of parameters, including systems of contractile flow-aligning rods. Our predictions are relevant to active nematic turbulence and can potentially be tested in experiments on either cell layers or active cytoskeletal suspensions.

Show Abstract

Learning normalized image densities via dual score matching

Learning probability models from data is at the heart of many machine learning endeavors, but is notoriously difficult due to the curse of dimensionality. We introduce a new framework for learning normalized energy (log probability) models that is inspired from diffusion generative models, which rely on networks optimized to estimate the score. We modify a score network architecture to compute an energy while preserving its inductive biases. The gradient of this energy network with respect to its input image is the score of the learned density, which can be optimized using a denoising objective. Importantly, the gradient with respect to the noise level provides an additional score that can be optimized with a novel secondary objective, ensuring consistent and normalized energies across noise levels. We train an energy network with this dual score matching objective on the ImageNet64 dataset, and obtain a cross-entropy (negative log likelihood) value comparable to the state of the art. We further validate our approach by showing that our energy model strongly generalizes: estimated log probabilities are nearly independent of the specific images in the training set. Finally, we demonstrate that both image probability and dimensionality of local neighborhoods vary significantly with image content, in contrast with traditional assumptions such as concentration of measure or support on a low-dimensional manifold.

Show Abstract

Generation of fate patterns via intercellular forces

H. Nunley , Xufeng Xue, Jianping Fu, David K. Lubensky

Studies of fate patterning during development typically emphasize cell-cell communication via diffusible chemical signals. Recent experiments on stem cell colonies, however, suggest that in some cases mechanical stresses, rather than secreted chemicals, enable long-ranged cell-cell interactions that specify positional information and pattern cell fates. These findings inspire a model of mechanical patterning: fate affects cell contractility, and pressure in the cell layer biases fate. Cells at the colony edge, more contractile than cells at the center, seed a pattern that propagates via force transmission. Strikingly, our model implies that the width of the outer fate domain varies nonmonotonically with substrate stiffness, a prediction that we confirm experimentally; we argue that a similar dependence on substrate stiffness can be achieved by a chemical morphogen only if strong constraints on the signaling pathway's mechanobiology are met. Our findings thus support the idea that mechanical stress can mediate patterning in the complete absence of chemical morphogens, even in nonmotile cell layers, thus expanding the repertoire of possible roles for mechanical signals in development and morphogenesis. Future tests of additional model predictions, like the effect of anisotropic substrate rigidity, will further broaden the range of achievable fate patterns.

Show Abstract
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates

privacy consent banner

Privacy preference

We use cookies to provide you with the best online experience. By clicking "Accept All," you help us understand how our site is used and enhance its performance. You can change your choice at any time here. To learn more, please visit our Privacy Policy.