531 Publications

Combining machine learning with structure-based protein design to predict and engineer post-translational modifications of proteins

Moritz Ertelt, V. Mulligan, et al.

Post-translational modifications (PTMs) of proteins play a vital role in their function and stability. These modifications influence protein folding, signaling, protein-protein interactions, enzyme activity, binding affinity, aggregation, degradation, and much more. To date, over 400 types of PTMs have been described, representing chemical diversity well beyond the genetically encoded amino acids. Such modifications pose a challenge to the successful design of proteins, but also represent a major opportunity to diversify the protein engineering toolbox. To this end, we first trained artificial neural networks (ANNs) to predict eighteen of the most abundant PTMs, including protein glycosylation, phosphorylation, methylation, and deamidation. In a second step, these models were implemented inside the computational protein modeling suite Rosetta, which allows flexible combination with existing protocols to model the modified sites and understand their impact on protein stability as well as function. Lastly, we developed a new design protocol that either maximizes or minimizes the predicted probability of a particular site being modified. We find that this combination of ANN prediction and structure-based design can enable the modification of existing, as well as the introduction of novel, PTMs. The potential applications of our work include, but are not limited to, glycan masking of epitopes, strengthening protein-protein interactions through phosphorylation, as well as protecting proteins from deamidation liabilities. These applications are especially important for the design of new protein therapeutics where PTMs can drastically change the therapeutic properties of a protein. Our work adds novel tools to Rosetta’s protein engineering toolbox that allow for the rational design of PTMs.

Show Abstract

Nuclear instance segmentation and tracking for preimplantation mouse embryos

H. Nunley , Binglun Shao, Prateek Grover, A. Watters, S. Shvartsman, L. M. Brown, et al.

For investigations into fate specification and cell rearrangements in live images of preimplantation embryos, automated and accurate 3D instance segmentation of nuclei is invaluable; however, the performance of segmentation methods is limited by the images' low signal-to-noise ratio and high voxel anisotropy and the nuclei's dense packing and variable shapes. Supervised machine learning approaches have the potential to radically improve segmentation accuracy but are hampered by a lack of fully annotated 3D data. In this work, we first establish a novel mouse line expressing near-infrared nuclear reporter H2B-miRFP720. H2B-miRFP720 is the longest wavelength nuclear reporter in mice and can be imaged simultaneously with other reporters with minimal overlap. We then generate a dataset, which we call BlastoSPIM, of 3D microscopy images of H2B-miRFP720-expressing embryos with ground truth for nuclear instance segmentation. Using BlastoSPIM, we benchmark the performance of five convolutional neural networks and identify Stardist-3D as the most accurate instance segmentation method across preimplantation development. Stardist-3D, trained on BlastoSPIM, performs robustly up to the end of preimplantation development (> 100 nuclei) and enables studies of fate patterning in the late blastocyst. We, then, demonstrate BlastoSPIM's usefulness as pre-train data for related problems. BlastoSPIM and its corresponding Stardist-3D models are available at: blastospim.flatironinstitute.org.

Show Abstract

To be or not to be: orb, the fusome and oocyte specification in Drosophila

In the fruit fly Drosophila melanogaster, two cells in a cyst of 16 interconnected cells have the potential to become the oocyte, but only one of these will assume an oocyte fate as the cysts transition through regions 2a and 2b of the germarium. The mechanism of specification depends on a polarized microtubule network, a dynein dependent Egl:BicD mRNA cargo complex, a special membranous structure called the fusome and its associated proteins, and the translational regulator orb. In this work, we have investigated the role of orb and the fusome in oocyte specification. We show here that specification is a stepwise process. Initially, orb mRNAs accumulate in the two pro-oocytes in close association with the fusome. This association is accompanied by the activation of the orb autoregulatory loop, generating high levels of Orb. Subsequently, orb mRNAs become enriched in only one of the pro-oocytes, the presumptive oocyte, and this is followed, with a delay, by Orb localization to the oocyte. We find that fusome association of orb mRNAs is essential for oocyte specification in the germarium, is mediated by the orb 3′ UTR, and requires Orb protein. We also show that the microtubule minus end binding protein Patronin functions downstream of orb in oocyte specification. Finally, in contrast to a previously proposed model for oocyte selection, we find that the choice of which pro-oocyte becomes the oocyte does not seem to be predetermined by the amount of fusome material in these two cells, but instead depends upon a competition for orb gene products.

Show Abstract
February 12, 2024

Modulation of Aβ 16–22 aggregation by glucose

Meenal Jain , A. Sahoo, Silvina Matysiak

The self-assembly of amyloid-beta (Aβ) peptides into fibrillar structures in the brain is a signature of Alzheimer's disease. Recent studies have reported correlations between Alzheimer's disease and type-2 diabetes. Structurally, hyperglycemia induces covalent protein crosslinkings by advanced glycation end products (AGE), which can affect the stability of Aβ oligomers. In this work, we leverage physics-based coarse-grained molecular simulations to probe alternate thermodynamic pathways that affect peptide aggregation propensities at varying concentrations of glucose molecules. Similar to previous experimental reports, our simulations show a glucose concentration-dependent increase in Aβ aggregation rates, without changes in the overall secondary structure content. We discovered that glucose molecules prefer partitioning onto the aggregate–water interface at a specific orientation, resulting in a loss of molecular rotational entropy. This effectively hastens the aggregation rates, as peptide self-assembly can reduce the available surface area for peptide–glucose interactions. This work introduces a new thermodynamic-driven pathway, beyond chemical cross-linking, that can modulate Aβ aggregation.

Show Abstract

Influence of surface viscosities on the electrodeformation of a prolate viscous drop

H. Nganguia, Y.-N. Young, et al.

Contaminants and other agents are often present at the interface between two fluids, giving rise to rheological properties such as surface shear and dilatational viscosities. The dynamics of viscous drops with interfacial viscosities has attracted greater interest in recent years, due to the influence of surface rheology on deformation and the surrounding flows. We investigate the effects of shear and dilatational viscosities on the electro-deformation of a viscous drop using the Taylor–Melcher leaky dielectric model. We use a large deformation analysis to derive an ordinary differential equation for the drop shape. Our model elucidates the contributions of each force to the overall deformation of the drop and reveals a rich range of dynamic behaviors that show the effects of surface viscosities and their dependence on rheological and electrical properties of the system. We also examine the physical mechanisms underlying the observed behaviors by analyzing the surface dilatation and surface deformation.

Show Abstract
December 23, 2023

Prebifurcation enhancement of imbibition-drainage hysteresis cycles

I. Lavi, et al.

The efficient transport of fluids through disordered media requires a thorough understanding of how the driving rate affects two-phase interface propagation. Despite our understanding of front dynamics in homogeneous environments, as well as how medium heterogeneities shape fluid interfaces at rest, little is known about the effects of localized topographical variations on large-scale interface dynamics. To gain physical insights into this problem, we study here oil-air displacements through an “imperfect” Hele-Shaw cell. Combining experiments, numerical simulations, and theory, we show that the flow rate dramatically alters the interface response to a porous constriction as one approaches the Saffman-Taylor instability, strictly under stable conditions. This gives rise to asymmetric imbibition–drainage hysteresis cycles that feature divergent extensions and nonlocal effects, all of which are aptly captured and explained by a minimal free boundary model.

Show Abstract

Interpretable neural architecture search and transfer learning for understanding CRISPR–Cas9 off-target enzymatic reactions

Finely-tuned enzymatic pathways control cellular processes, and their dysregulation can lead to disease. Creating predictive and interpretable models for these pathways is challenging because of the complexity of the pathways and of the cellular and genomic contexts. Here we introduce Elektrum, a deep learning framework which addresses these challenges with data-driven and biophysically interpretable models for determining the kinetics of biochemical systems. First, it uses in vitro kinetic assays to rapidly hypothesize an ensemble of high-quality Kinetically Interpretable Neural Networks (KINNs) that predict reaction rates. It then employs a novel transfer learning step, where the KINNs are inserted as intermediary layers into deeper convolutional neural networks, fine-tuning the predictions for reaction-dependent in vivo outcomes. Elektrum makes effective use of the limited, but clean in vitro data and the complex, yet plentiful in vivo data that captures cellular context. We apply Elektrum to predict CRISPR-Cas9 off-target editing probabilities and demonstrate that Elektrum achieves state-of-the-art performance, regularizes neural network architectures, and maintains physical interpretability

Show Abstract

Peak-agnostic high-resolution cis-regulatory circuitry mapping using single cell multiome data

Zidong Zhang, X. Chen, O. Troyanskaya, et al.

Single same cell RNAseq/ATACseq multiome data provide unparalleled potential to develop high resolution maps of the cell-type specific transcriptional regulatory circuitry underlying gene expression. We present CREMA, a framework that recovers the full cis-regulatory circuitry by modeling gene expression and chromatin activity in individual cells without peak-calling or cell type labeling constraints. We demonstrate that CREMA overcomes the limitations of existing methods that fail to identify about half of functional regulatory elements which are outside the called chromatin ‘peaks’. These circuit sites outside called peaks are shown to be important cell type specific functional regulatory loci, sufficient to distinguish individual cell types. Analysis of mouse pituitary data identifies a Gata2-circuit for the gonadotrope-enriched disease-associated Pcsk1 gene, which is experimentally validated by reduced gonadotrope expression in a gonadotrope conditional Gata2-knockout model. We present a web accessible human immune cell regulatory circuit resource, and provide CREMA as an R package.

Show Abstract

Soft matter roadmap

Jean-Louis Barrat , Andrea J Liu

Soft materials are usually defined as materials made of mesoscopic entities, often self-organised, sensitive to thermal fluctuations and to weak perturbations. Archetypal examples are colloids, polymers, amphiphiles, liquid crystals, foams. The importance of soft materials in everyday commodity products, as well as in technological applications, is enormous, and controlling or improving their properties is the focus of many efforts. From a fundamental perspective, the possibility of manipulating soft material properties, by tuning interactions between constituents and by applying external perturbations, gives rise to an almost unlimited variety in physical properties. Together with the relative ease to observe and characterise them, this renders soft matter systems powerful model systems to investigate statistical physics phenomena, many of them relevant as well to hard condensed matter systems. Understanding the emerging properties from mesoscale constituents still poses enormous challenges, which have stimulated a wealth of new experimental approaches, including the synthesis of new systems with, e.g. tailored self-assembling properties, or novel experimental techniques in imaging, scattering or rheology. Theoretical and numerical methods, and coarse-grained models, have become central to predict physical properties of soft materials, while computational approaches that also use machine learning tools are playing a progressively major role in many investigations. This Roadmap intends to give a broad overview of recent and possible future activities in the field of soft materials, with experts covering various developments and challenges in material synthesis and characterisation, instrumental, simulation and theoretical methods as well as general concepts.

Show Abstract

Building a “trap model” of glassy dynamics from a local structural predictor of rearrangements

S. A. Ridout, I. Tah, A. J. Liu

Here we introduce a variation of the trap model of supercooled liquids based on softness, a particle-based variable identified by machine learning that quantifies the local structural environment and energy barrier for the particle to rearrange. As in the trap model, we assume that each particle's softness, and hence energy barrier, evolves independently. We show that our model makes qualitatively reasonable predictions of behaviors such as the dependence of fragility on density in a model supercooled liquid. We also show failures of the model, indicating in some cases signs that softness may be missing important information, and in other cases features that may only be explained by correlations neglected in the trap model.

Show Abstract
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates