605 Publications

A robust and versatile computational peptide design pipeline to inform wet-lab experiments

V. Mulligan, Tristan Zaborniak , Benjamin P. Brown , D. Renfrew

Since Merrifield’s development of solid-phase peptide synthesis, we have seen explosive growth in the number of synthetic building-blocks that can be incorporated into peptides. This has created a problem: the number of possible molecules that could be synthesized is many orders of magnitude greater than the largest conceivable combinatorial libraries. Computational design, based on combinatorial optimization algorithms, addresses this problem by proposing sequences likely to have desired folds and functions. These computational methods complement experiments by reducing astronomically large numbers of combinatorial possibilities to experimentally tractable shortlists. This presentation describes our robust, versatile methods, made available to peptide scientists in the Rosetta and Masala software suites, for designing peptides that fold into rigid conformations. Our physics-based methods generalize to exotic chemical building blocks poorly amenable to machine learning-based methods for want of training data. Our pipeline has produced experimentally-validated mixed-chirality peptides that bind to targets of therapeutic interest, and peptides that diffuse across cell membranes. Ongoing research is mapping the sequence optimization problem (which grows intractable even for supercomputers as the number of candidate chemical building blocks grows very large) to current and near-future quantum computers, allowing use of quantum algorithms in the context of the existing, widely-used design protocols.

Show Abstract

Nuclear instance segmentation and tracking for preimplantation mouse embryos

H. Nunley , Binglun Shao, Prateek Grover, A. Watters, S. Shvartsman, L. M. Brown, et al.

For investigations into fate specification and morphogenesis in time-lapse images of preimplantation embryos, automated 3D instance segmentation and tracking of nuclei are invaluable. Low signal-to-noise ratio, high voxel anisotropy, high nuclear density, and variable nuclear shapes can limit the performance of segmentation methods, while tracking is complicated by cell divisions, low frame rates, and sample movements. Supervised machine learning approaches can radically improve segmentation accuracy and enable easier tracking, but they often require large amounts of annotated 3D data. Here, we first report a previously unreported mouse line expressing near-infrared nuclear reporter H2B-miRFP720. We then generate a dataset (termed BlastoSPIM) of 3D images of H2B-miRFP720-expressing embryos with ground truth for nuclear instances. Using BlastoSPIM, we benchmark seven convolutional neural networks and identify Stardist-3D as the most accurate instance segmentation method. With our BlastoSPIM-trained Stardist-3D models, we construct a complete pipeline for nuclear instance segmentation and lineage tracking from the eight-cell stage to the end of preimplantation development (>100 nuclei). Finally, we demonstrate the usefulness of BlastoSPIM as pre-train data for related problems, both for a different imaging modality and for different model systems.

Show Abstract

Multimodal Learning for Embryo Viability Prediction in Clinical IVF

Junsik Kim, Zhiyi Shi, D. Needleman

In clinical In-Vitro Fertilization (IVF), identifying the most viable embryo for transfer is important to increasing the likelihood of a successful pregnancy. Traditionally, this process involves embryologists manually assessing embryos’ static morphological features at specific intervals using light microscopy. This manual evaluation is not only time-intensive and costly, due to the need for expert analysis, but also inherently subjective, leading to variability in the selection process. To address these challenges, we develop a multimodal model that leverages both time-lapse video data and Electronic Health Records (EHRs) to predict embryo viability. A key challenge of our research is to effectively combine time-lapse video and EHR data, given their distinct modality characteristic. We comprehensively analyze our multimodal model with various modality inputs and integration approaches. Our approach will enable fast and automated embryo viability predictions in scale for clinical IVF.

Show Abstract

Automated single-cell omics end-to-end framework with data-driven batch inference

Yun Wang, O. Troyanskaya, X. Chen, et al.

To facilitate single-cell multi-omics analysis and improve reproducibility, we present single-cell pipeline for end-to-end data integration (SPEEDI), a fully automated end-to-end framework for batch inference, data integration, and cell-type labeling. SPEEDI introduces data-driven batch inference and transforms the often heterogeneous data matrices obtained from different samples into a uniformly annotated and integrated dataset. Without requiring user input, it automatically selects parameters and executes pre-processing, sample integration, and cell-type mapping. It can also perform downstream analyses of differential signals between treatment conditions and gene functional modules. SPEEDI’s data-driven batch-inference method works with widely used integration and cell-typing tools. By developing data-driven batch inference, providing full end-to-end automation, and eliminating parameter selection, SPEEDI improves reproducibility and lowers the barrier to obtaining biological insight from these valuable single-cell datasets. The SPEEDI interactive web application can be accessed at https://speedi.princeton.edu/. A record of this paper’s transparent peer review process is included in the supplemental information.

Show Abstract

Mechanics and morphology of proliferating cell collectives with self-inhibiting growth

S. Weady, Bryce Palmer, A. Lamson, Taeyoon Kim, R. Farhadifar, M. Shelley

We study the dynamics of proliferating cell collectives whose microscopic constituents’ growth is inhibited by macroscopic growth-induced stress. Discrete particle simulations of a growing collective show the emergence of concentric-ring patterns in cell size whose spatiotemporal structure is closely tied to the individual cell’s stress response. Motivated by these observations, we derive a multiscale continuum theory whose parameters map directly to the discrete model. Analytical solutions of this theory show the concentric patterns arise from anisotropically accumulated resistance to growth over many cell cycles. This Letter shows how purely mechanical processes can affect the internal patterning and morphology of cell collectives, and provides a concise theoretical framework for connecting the micro- to macroscopic dynamics of proliferating matter.

Show Abstract
October 10, 2024

Inferring biological processes with intrinsic noise from cross-sectional data

Inferring dynamical models from data continues to be a significant challenge in computational biology, especially given the stochastic nature of many biological processes. We explore a common scenario in omics, where statistically independent cross-sectional samples are available at a few time points, and the goal is to infer the underlying diffusion process that generated the data. Existing inference approaches often simplify or ignore noise intrinsic to the system, compromising accuracy for the sake of optimization ease. We circumvent this compromise by inferring the phase-space probability flow that shares the same time-dependent marginal distributions as the underlying stochastic process. Our approach, probability flow inference (PFI), disentangles force from intrinsic stochasticity while retaining the algorithmic ease of ODE inference. Analytically, we prove that for Ornstein-Uhlenbeck processes the regularized PFI formalism yields a unique solution in the limit of well-sampled distributions. In practical applications, we show that PFI enables accurate parameter and force estimation in high-dimensional stochastic reaction networks, and that it allows inference of cell differentiation dynamics with molecular noise, outperforming state-of-the-art approaches.

Show Abstract
October 10, 2024

Self-propulsion, flocking and chiral active phases from particles spinning at intermediate Reynolds numbers

Panyu Chen, S. Weady, M. Shelley, et al.

Vorticity, a measure of the local rate of rotation of a fluid element, is the driver of incompressible flow. In viscous fluids, powering bulk flows requires the continuous injection of vorticity from boundaries to counteract the diffusive effects of viscosity. Here we power a flow from within by suspending approximately cylindrical particles and magnetically driving them to rotate at Reynolds numbers in the intermediate range. We find that a single particle generates a localized three-dimensional region of vorticity around it—which we call a vortlet—that drives a number of remarkable behaviours. Slight asymmetries in the particle shape can deform the vortlet and cause the particle to self-propel. Interactions between vortlets are similarly rich, generating bound dynamical states. When a large number of vortlets interact, they spontaneously form collectively moving flocks. These flocks remain coherent while propelling, splitting and merging. If enough particles are added so as to saturate the flow chamber, a homogeneous three-dimensional active chiral fluid of vortlets is formed, which can be manipulated with gravity or flow chamber boundaries, leading to lively collective dynamics. Our findings demonstrate an inertial regime for synthetic active matter, provide a controlled physical system for the quantitative study of three-dimensional flocking in non-sentient systems and establish a platform for the study of three-dimensional active chiral fluids.

Show Abstract
October 8, 2024

Spatiotemporal dynamics of nucleocytoplasmic transport

A. Rautu, Alexandra Zidovska, M. Shelley

Nucleocytoplasmic transport is essential for cellular function, presenting a canonical example of rapid molecular sorting inside cells. It consists of a coordinated interplay between import/export of molecules in/out the cell nucleus. Here, we investigate the role of spatiotemporal dynamics of the nucleocytoplasmic transport and its regulation. We develop a biophysical model that captures the main features of the nucleocytoplasmic transport, in particular, its regulation through the Ran cycle. Our model yields steady-state profiles for the molecular components of the Ran cycle, their relaxation times, as well as the nuclear-to-cytoplasmic molecule ratio. We show that these quantities are affected by their spatial dynamics and heterogeneity within the nucleus. Specifically, we find that the spatial nonuniformity of Ran guanine exchange factor (RanGEF)—particularly its proximity to the nuclear envelope—increases the Ran content in the nucleus. We further show that RanGEF's accumulation near the nuclear envelope results from its intrinsic dynamics as a nuclear cargo, transported by the Ran cycle itself. Overall, our work highlights the critical role of molecular spatial dynamics in cellular processes and proposes new avenues for theoretical and experimental inquiries into the nucleocytoplasmic transport.

Show Abstract

8574 Single cell multiomics suggests tumor cell energy metabolism heterogeneity in non-functioning gonadotroph pituitary adenomas

F M Ruf-Zamojski, X. Chen, O. Troyanskaya, R. Sealfon, et al.

Pituitary adenomas cause hormonal dysregulation and severe morbidity. The absence of clear biomarkers for prognosis/treatment and the high risk of recurrence make management challenging. Non-functioning gonadotroph pituitary adenomas (NFPAs) represent proliferation of gonadotroph lineage cells without an increase in secretion of gonadotropins. Detection is often delayed until mass effects cause visual defects. Higher resolution molecular study may improve classification, diagnosis, treatment, and provide insight into adenoma biology. We analyzed four NFPAs using unbiased genome-wide same-cell single nucleus 10X Genomics multiome assay (transcriptome and chromatin accessibility) to define individual cell identity/states in comparison with normal gonadotrope cell references from twelve age/sex-matched human post-mortem pituitaries. In total, we generated high quality multiomics single-cell data for 33,832 adenoma cells and 5,451 normal gonadotropes. We identified tumor, immune, vascular, and proliferative cells confirming cellular heterogeneity. Multidimensional scaling analysis using Manhattan distances between cell types showed that NFPA gonadotropes were closer in gene expression to healthy gonadotropes among all pituitary cell types. In addition, we observed differences in the macrophage and endothelial cell populations between tumors and healthy pituitaries. Next, we studied gene regulatory circuitry (regulated gene, modulating transcription factor and its chromatin interaction site) using the new Control of Regulation Extracted from Multiomics Assays (CREMA, [1]) multiome analysis method which showed high variation between normal and tumor gonadotropes. Within each adenoma, the NFPA tumor cells encompassed two main subgroups of cells, one group (GT+ marker tumor cells) expressing established gonadotrope markers and another group not (GT- marker tumor cells). GT+ and GT- tumor cells formed a cluster distinct from healthy gonadotropes. GT+ and GT- tumor cells differed by their level of expression of mitochondrial genes, suggesting decreased aerobic capacity associated with the GT- tumor subgroup. Interestingly, differential analyses between the healthy and the NFPA gonadotropes from same-cell multiome data highlighted dissimilar gene expression and chromatin accessibility patterns.Our newly collected single cell data characterize the difference between NFPA tumor cells and normal gonadotrope cells. Our results suggest a Warburg-effect like difference in aerobic respiratory capacity among tumor cells within the same pituitary adenoma. Overall, this study brings new insights into the molecular characteristics of NFPAs, including tumor cell heterogeneity, and represents a new resource for the field.

Show Abstract

Minimal motifs for habituating systems

M. Smart, S. Shvartsman, Martin Mönnigmann

Habituation—a phenomenon in which a dynamical system exhibits a diminishing response to repeated stimulations that eventually recovers when the stimulus is withheld—is universally observed in living systems from animals to unicellular organisms. Despite its prevalence, generic mechanisms for this fundamental form of learning remain poorly defined. Drawing inspiration from prior work on systems that respond adaptively to step inputs, we study habituation from a nonlinear dynamics perspective. This approach enables us to formalize classical hallmarks of habituation that have been experimentally identified in diverse organisms and stimulus scenarios. We use this framework to investigate distinct dynamical circuits capable of habituation. In particular, we show that driven linear dynamics of a memory variable with static nonlinearities acting at the input and output can implement numerous hallmarks in a mathematically interpretable manner. This work establishes a foundation for understanding the dynamical substrates of this primitive learning behavior and offers a blueprint for the identification of habituating circuits in biological systems.

Show Abstract
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates