689 Publications

A robust and versatile computational peptide design pipeline to inform wet-lab experiments

V. Mulligan, Tristan Zaborniak , Benjamin P. Brown , D. Renfrew

Since Merrifield’s development of solid-phase peptide synthesis, we have seen explosive growth in the number of synthetic building-blocks that can be incorporated into peptides. This has created a problem: the number of possible molecules that could be synthesized is many orders of magnitude greater than the largest conceivable combinatorial libraries. Computational design, based on combinatorial optimization algorithms, addresses this problem by proposing sequences likely to have desired folds and functions. These computational methods complement experiments by reducing astronomically large numbers of combinatorial possibilities to experimentally tractable shortlists. This presentation describes our robust, versatile methods, made available to peptide scientists in the Rosetta and Masala software suites, for designing peptides that fold into rigid conformations. Our physics-based methods generalize to exotic chemical building blocks poorly amenable to machine learning-based methods for want of training data. Our pipeline has produced experimentally-validated mixed-chirality peptides that bind to targets of therapeutic interest, and peptides that diffuse across cell membranes. Ongoing research is mapping the sequence optimization problem (which grows intractable even for supercomputers as the number of candidate chemical building blocks grows very large) to current and near-future quantum computers, allowing use of quantum algorithms in the context of the existing, widely-used design protocols.

Show Abstract

Nuclear instance segmentation and tracking for preimplantation mouse embryos

H. Nunley , Binglun Shao, Prateek Grover, A. Watters, S. Shvartsman, L. M. Brown, et al.

For investigations into fate specification and morphogenesis in time-lapse images of preimplantation embryos, automated 3D instance segmentation and tracking of nuclei are invaluable. Low signal-to-noise ratio, high voxel anisotropy, high nuclear density, and variable nuclear shapes can limit the performance of segmentation methods, while tracking is complicated by cell divisions, low frame rates, and sample movements. Supervised machine learning approaches can radically improve segmentation accuracy and enable easier tracking, but they often require large amounts of annotated 3D data. Here, we first report a previously unreported mouse line expressing near-infrared nuclear reporter H2B-miRFP720. We then generate a dataset (termed BlastoSPIM) of 3D images of H2B-miRFP720-expressing embryos with ground truth for nuclear instances. Using BlastoSPIM, we benchmark seven convolutional neural networks and identify Stardist-3D as the most accurate instance segmentation method. With our BlastoSPIM-trained Stardist-3D models, we construct a complete pipeline for nuclear instance segmentation and lineage tracking from the eight-cell stage to the end of preimplantation development (>100 nuclei). Finally, we demonstrate the usefulness of BlastoSPIM as pre-train data for related problems, both for a different imaging modality and for different model systems.

Show Abstract

Dynamic allostery drives autocrine and paracrine TGF-β signaling

Mingliang Jin, Robert I. Seed, P. Cossio, et al.

TGF-β, essential for development and immunity, is expressed as a latent complex (L-TGF-β) non-covalently associated with its prodomain and presented on immune cell surfaces by covalent association with GARP. Binding to integrin αvβ8 activates L-TGF-β1/GARP. The dogma is that mature TGF-β must physically dissociate from L-TGF-β1 for signaling to occur. Our previous studies discovered that αvβ8-mediated TGF-β autocrine signaling can occur without TGF-β1 release from its latent form. Here, we show that mice engineered to express TGF-β1 that cannot release from L-TGF-β1 survive without early lethal tissue inflammation, unlike those with TGF-β1 deficiency. Combining cryogenic electron microscopy with cell-based assays, we reveal a dynamic allosteric mechanism of autocrine TGF-β1 signaling without release where αvβ8 binding redistributes the intrinsic flexibility of L-TGF-β1 to expose TGF-β1 to its receptors. Dynamic allostery explains the TGF-β3 latency/activation mechanism and why TGF-β3 functions distinctly from TGF-β1, suggesting that it broadly applies to other flexible cell surface receptor/ligand systems.

Show Abstract

Multimodal Learning for Embryo Viability Prediction in Clinical IVF

Junsik Kim, Zhiyi Shi, D. Needleman

In clinical In-Vitro Fertilization (IVF), identifying the most viable embryo for transfer is important to increasing the likelihood of a successful pregnancy. Traditionally, this process involves embryologists manually assessing embryos’ static morphological features at specific intervals using light microscopy. This manual evaluation is not only time-intensive and costly, due to the need for expert analysis, but also inherently subjective, leading to variability in the selection process. To address these challenges, we develop a multimodal model that leverages both time-lapse video data and Electronic Health Records (EHRs) to predict embryo viability. A key challenge of our research is to effectively combine time-lapse video and EHR data, given their distinct modality characteristic. We comprehensively analyze our multimodal model with various modality inputs and integration approaches. Our approach will enable fast and automated embryo viability predictions in scale for clinical IVF.

Show Abstract

Automated single-cell omics end-to-end framework with data-driven batch inference

Yun Wang, O. Troyanskaya, X. Chen, et al.

To facilitate single-cell multi-omics analysis and improve reproducibility, we present single-cell pipeline for end-to-end data integration (SPEEDI), a fully automated end-to-end framework for batch inference, data integration, and cell-type labeling. SPEEDI introduces data-driven batch inference and transforms the often heterogeneous data matrices obtained from different samples into a uniformly annotated and integrated dataset. Without requiring user input, it automatically selects parameters and executes pre-processing, sample integration, and cell-type mapping. It can also perform downstream analyses of differential signals between treatment conditions and gene functional modules. SPEEDI’s data-driven batch-inference method works with widely used integration and cell-typing tools. By developing data-driven batch inference, providing full end-to-end automation, and eliminating parameter selection, SPEEDI improves reproducibility and lowers the barrier to obtaining biological insight from these valuable single-cell datasets. The SPEEDI interactive web application can be accessed at https://speedi.princeton.edu/. A record of this paper’s transparent peer review process is included in the supplemental information.

Show Abstract

Heterogeneity in ligand-bound TRPV1: A comparison of methods in cryo-EM and molecular dynamics simulation

M. Astore, R. Blackwell, David Silva-Sánchez, P. Cossio, S. Hanson

Cryogenic electron microscopy (cryo-EM) has emerged as a powerful method for resolving the atomistic details of cellular components. In recent years, several computational methods have been developed to study the heterogeneity of molecules in single-particle cryo-EM. In this study, we analyzed a publicly available single-particle dataset of TRPV1 using five of these methods: 3D Flexible Refinement, 3D Variability Analysis, cryoDRGN, ManifoldEM, and Bayesian ensemble reweighting. Beyond what we initially expected, we have found that this dataset contains significant heterogeneity— indicating that single particle datasets likely contain a rich spectrum of biologically relevant states. Further, we have found that different methods are best suited to studying different kinds of heterogeneity, with some methods being more sensitive to either compositional or conformational heterogeneity. We also apply a combination of Bayesian ensemble reweighting and molecular dynamics as supporting evidence for the presence of these rarer states within the sample. Finally, we developed a quantitative metric based on the analysis of the singular value decomposition and power spectra to compare the resulting volumes from each method. This work represents a detailed view of the variable outcomes of different heterogeneity methods used to analyze a single real dataset and presents a pathway to a deeper understanding of the biology of complex macromolecules like the TRPV1 ion channel.

Show Abstract
October 11, 2024

Mechanics and morphology of proliferating cell collectives with self-inhibiting growth

S. Weady, Bryce Palmer, A. Lamson, Taeyoon Kim, R. Farhadifar, M. Shelley

We study the dynamics of proliferating cell collectives whose microscopic constituents’ growth is inhibited by macroscopic growth-induced stress. Discrete particle simulations of a growing collective show the emergence of concentric-ring patterns in cell size whose spatiotemporal structure is closely tied to the individual cell’s stress response. Motivated by these observations, we derive a multiscale continuum theory whose parameters map directly to the discrete model. Analytical solutions of this theory show the concentric patterns arise from anisotropically accumulated resistance to growth over many cell cycles. This Letter shows how purely mechanical processes can affect the internal patterning and morphology of cell collectives, and provides a concise theoretical framework for connecting the micro- to macroscopic dynamics of proliferating matter.

Show Abstract
October 10, 2024

Inferring biological processes with intrinsic noise from cross-sectional data

S. Maddu, V. Chardès, M. Shelley

Inferring dynamical models from data continues to be a significant challenge in computational biology, especially given the stochastic nature of many biological processes. We explore a common scenario in omics, where statistically independent cross-sectional samples are available at a few time points, and the goal is to infer the underlying diffusion process that generated the data. Existing inference approaches often simplify or ignore noise intrinsic to the system, compromising accuracy for the sake of optimization ease. We circumvent this compromise by inferring the phase-space probability flow that shares the same time-dependent marginal distributions as the underlying stochastic process. Our approach, probability flow inference (PFI), disentangles force from intrinsic stochasticity while retaining the algorithmic ease of ODE inference. Analytically, we prove that for Ornstein-Uhlenbeck processes the regularized PFI formalism yields a unique solution in the limit of well-sampled distributions. In practical applications, we show that PFI enables accurate parameter and force estimation in high-dimensional stochastic reaction networks, and that it allows inference of cell differentiation dynamics with molecular noise, outperforming state-of-the-art approaches.

Show Abstract
October 10, 2024

Self-propulsion, flocking and chiral active phases from particles spinning at intermediate Reynolds numbers

Panyu Chen, S. Weady, M. Shelley, et al.

Vorticity, a measure of the local rate of rotation of a fluid element, is the driver of incompressible flow. In viscous fluids, powering bulk flows requires the continuous injection of vorticity from boundaries to counteract the diffusive effects of viscosity. Here we power a flow from within by suspending approximately cylindrical particles and magnetically driving them to rotate at Reynolds numbers in the intermediate range. We find that a single particle generates a localized three-dimensional region of vorticity around it—which we call a vortlet—that drives a number of remarkable behaviours. Slight asymmetries in the particle shape can deform the vortlet and cause the particle to self-propel. Interactions between vortlets are similarly rich, generating bound dynamical states. When a large number of vortlets interact, they spontaneously form collectively moving flocks. These flocks remain coherent while propelling, splitting and merging. If enough particles are added so as to saturate the flow chamber, a homogeneous three-dimensional active chiral fluid of vortlets is formed, which can be manipulated with gravity or flow chamber boundaries, leading to lively collective dynamics. Our findings demonstrate an inertial regime for synthetic active matter, provide a controlled physical system for the quantitative study of three-dimensional flocking in non-sentient systems and establish a platform for the study of three-dimensional active chiral fluids.

Show Abstract
October 8, 2024

Spatiotemporal dynamics of nucleocytoplasmic transport

A. Rautu, Alexandra Zidovska, M. Shelley

Nucleocytoplasmic transport is essential for cellular function, presenting a canonical example of rapid molecular sorting inside cells. It consists of a coordinated interplay between import/export of molecules in/out the cell nucleus. Here, we investigate the role of spatiotemporal dynamics of the nucleocytoplasmic transport and its regulation. We develop a biophysical model that captures the main features of the nucleocytoplasmic transport, in particular, its regulation through the Ran cycle. Our model yields steady-state profiles for the molecular components of the Ran cycle, their relaxation times, as well as the nuclear-to-cytoplasmic molecule ratio. We show that these quantities are affected by their spatial dynamics and heterogeneity within the nucleus. Specifically, we find that the spatial nonuniformity of Ran guanine exchange factor (RanGEF)—particularly its proximity to the nuclear envelope—increases the Ran content in the nucleus. We further show that RanGEF's accumulation near the nuclear envelope results from its intrinsic dynamics as a nuclear cargo, transported by the Ran cycle itself. Overall, our work highlights the critical role of molecular spatial dynamics in cellular processes and proposes new avenues for theoretical and experimental inquiries into the nucleocytoplasmic transport.

Show Abstract
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates