531 Publications

Shrinkage improves estimation of microbial associations under different normalization methods

M Badri, Z Kurtz, R. Bonneau, C. Müller

Consistent estimation of associations in microbial genomic survey count data is fundamental to microbiome research. Technical limitations, including compositionality, low sample sizes, and technical variability, obstruct standard application of association measures and require data normalization prior to estimating associations. Here, we investigate the interplay between data normalization and microbial association estimation by a comprehensive analysis of statistical consistency. Leveraging the large sample size of the American Gut Project (AGP), we assess the consistency of the two prominent linear association estimators, correlation and proportionality, under different sample scenarios and data normalization schemes, including RNA-seq analysis work flows and log-ratio transformations. We show that shrinkage estimation, a standard technique in high-dimensional statistics, can universally improve the quality of association estimates for microbiome data. We find that large-scale association patterns in the AGP data can be grouped into five normalization-dependent classes. Using microbial association network construction and clustering as examples of exploratory data analysis, we show that variance-stabilizing and log-ratio approaches provide for the most consistent estimation of taxonomic and structural coherence. Taken together, the findings from our reproducible analysis workflow have important implications for microbiome studies in multiple stages of analysis, particularly when only small sample sizes are available.

Show Abstract
April 4, 2020

Machine learning, the kidney, and genotype–phenotype analysis

R. Sealfon, L Mariani, M Kretzler, O. Troyanskaya

With biomedical research transitioning into data-rich science, machine learning provides a powerful toolkit for extracting knowledge from large-scale biological data sets. The increasing availability of comprehensive kidney omics compendia (transcriptomics, proteomics, metabolomics, and genome sequencing), as well as other data modalities such as electronic health records, digital nephropathology repositories, and radiology renal images, makes machine learning approaches increasingly essential for analyzing human kidney data sets. Here, we discuss how machine learning approaches can be applied to the study of kidney disease, with a particular focus on how they can be used for understanding the relationship between genotype and phenotype.

Show Abstract

Flexible filaments buckle into helicoidal shapes in strong compressional flows

B. Chakrabarti, Y. Liu, J. LaGrone, R. Cortez, L. Fauci, O. du Roure, D. Saintillan, A. Linder

The occurrence of coiled or helical morphologies is common in nature, from plant roots to DNA packaging into viral capsids, as well as in applications such as oil drilling processes. In many examples, chiral structures result from the buckling of a straight fibre either with intrinsic twist or to which end moments have been applied in addition to compression forces. Here, we elucidate a generic way to form regular helicoidal shapes from achiral straight filaments transported in viscous flows with free ends. Through a combination of experiments using fluorescently labelled actin filaments in microfluidic divergent flows and two distinct sets of numerical simulations, we demonstrate the robustness of helix formation. A nonlinear stability analysis is performed, and explains the emergence of such chiral structures from the nonlinear interaction of perpendicular planar buckling modes, an effect that solely requires a strong compressional flow, independent of the exact nature of the fibre or type of flow field. The fundamental mechanism for the uncovered morphological transition and characterization of the emerging conformations advance our understanding of several biological and industrial processes and can also be exploited for the controlled microfabrication of chiral objects.

Show Abstract

Classification of the Molecular Defects Associated with Pathogenic Variants of the SLC6A8 Creatine Transporter

M Salazar, N Zelt, R Saldivar, C Kuntz, S Chen, W Penn, R. Bonneau, J. Koehler, J Schlebach

More than 80 loss-of-function (LOF) mutations in the SLC6A8 creatine transporter (hCRT1) are responsible for cerebral creatine deficiency syndrome (CCDS), which gives rise to a spectrum of neurological defects, including intellectual disability, epilepsy, and autism spectrum disorder. To gain insight into the nature of the molecular defects caused by these mutations, we quantitatively profiled the cellular processing, trafficking, expression, and function of eight pathogenic CCDS variants in relation to the wild type (WT) and one neutral isoform. All eight CCDS variants exhibit measurable proteostatic deficiencies that likely contribute to the observed LOF. However, the magnitudes of their specific effects on the expression and trafficking of hCRT1 vary considerably, and we find that the LOF associated with two of these variants primarily arises from the disruption of the substrate-binding pocket. In conjunction with an analysis of structural models of the transporter, we use these data to suggest mechanistic classifications for these variants. To evaluate potential avenues for therapeutic intervention, we assessed the sensitivity of these variants to temperature and measured their response to the proteostasis regulator 4-phenylbutyrate (4-PBA). Only one of the tested variants (G132V) is sensitive to temperature, though its response to 4-PBA is negligible. Nevertheless, 4-PBA significantly enhances the activity of WT hCRT1 in HEK293T cells, which suggests it may be worth evaluating as a therapeutic for female intellectual disability patients carrying a single CCDS mutation. Together, these findings reveal that pathogenic SLC6A8 mutations cause a spectrum of molecular defects that should be taken into consideration in future efforts to develop CCDS therapeutics.

Show Abstract

Rapid Dynamics of Signal-Dependent Transcriptional Repression by Capicua

S. Keenan, S. Blythe, R. Marmion, N. Djabrayan, E. Wieschaus, S. Shvartsman

Optogenetic perturbations, live imaging, and time-resolved ChIP-seq assays in Drosophila embryos were used to dissect the ERK-dependent control of the HMG-box repressor Capicua (Cic), which plays critical roles in development and is deregulated in human spinocerebellar ataxia and cancers. We established that Cic target genes are activated before significant downregulation of nuclear localization of Cic and demonstrated that their activation is preceded by fast dissociation of Cic from the regulatory DNA. We discovered that both Cic-DNA binding and repression are rapidly reinstated in the absence of ERK activation, revealing that inductive signaling must be sufficiently sustained to ensure robust transcriptional response. Our work provides a quantitative framework for the mechanistic analysis of dynamics and control of transcriptional repression in development.

Show Abstract

Characterization of antibiotic resistance and host-microbiome interactions in the human upper respiratory tract during influenza infection

L Zhang, C Forst, A Gordon, G Gussin, A Gerber , P Fernandez , T Ding, L Lashua, M Wang, A Balmaseda, R. Bonneau, B Zhang, E Ghedin

Background
The abundance and diversity of antibiotic resistance genes (ARGs) in the human respiratory microbiome remain poorly characterized. In the context of influenza virus infection, interactions between the virus, the host, and resident bacteria with pathogenic potential are known to complicate and worsen disease, resulting in coinfection and increased morbidity and mortality of infected individuals. When pathogenic bacteria acquire antibiotic resistance, they are more difficult to treat and of global health concern. Characterization of ARG expression in the upper respiratory tract could help better understand the role antibiotic resistance plays in the pathogenesis of influenza-associated bacterial secondary infection.

Results
Thirty-seven individuals participating in the Household Influenza Transmission Study (HITS) in Managua, Nicaragua, were selected for this study. We performed metatranscriptomics and 16S rRNA gene sequencing analyses on nasal and throat swab samples, and host transcriptome profiling on blood samples. Individuals clustered into two groups based on their microbial gene expression profiles, with several microbial pathways enriched with genes differentially expressed between groups. We also analyzed antibiotic resistance gene expression and determined that approximately 25% of the sequence reads that corresponded to antibiotic resistance genes mapped to Streptococcus pneumoniae and Staphylococcus aureus. Following construction of an integrated network of ARG expression with host gene co-expression, we identified several host key regulators involved in the host response to influenza virus and bacterial infections, and host gene pathways associated with specific antibiotic resistance genes.

Conclusions
This study indicates the host response to influenza infection could indirectly affect antibiotic resistance gene expression in the respiratory tract by impacting the microbial community structure and overall microbial gene expression. Interactions between the host systemic responses to influenza infection and antibiotic resistance gene expression highlight the importance of viral-bacterial co-infection in acute respiratory infections like influenza.

Show Abstract
March 17, 2020

Designing Peptides on a Quantum Computer

V. Mulligan, H Melo, H Merritt, S Slocum, B Weitzner, A Watkins, D. Renfrew, C Pelissier, P Arora, R. Bonneau

Although a wide variety of quantum computers are currently being developed, actual computational results have been largely restricted to contrived, artificial tasks. Finding ways to apply quantum computers to useful, real-world computational tasks remains an active research area. Here we describe our mapping of the protein design problem to the D-Wave quantum annealer. We present a system whereby Rosetta, a state-of-the-art protein design software suite, interfaces with the D-Wave quantum processing unit to find amino acid side chain identities and conformations to stabilize a fixed protein backbone. Our approach, which we call the QPacker, uses a large side-chain rotamer library and the full Rosetta energy function, and in no way reduces the design task to a simpler format. We demonstrate that quantum annealer-based design can be applied to complex real-world design tasks, producing designed molecules comparable to those produced by widely adopted classical design approaches. We also show through large-scale classical folding simulations that the results produced on the quantum annealer can inform wet-lab experiments. For design tasks that scale exponentially on classical computers, the QPacker achieves nearly constant runtime performance over the range of problem sizes that could be tested. We anticipate better than classical performance scaling as quantum computers mature.

Show Abstract
March 11, 2020

Metabolome-Informed Microbiome Analysis Refines Metadata Classifications and Reveals Unexpected Medication Transfer in Captive Cheetahs

J. Gauglitz, J. Morton, A. Tripathi, S. Hansen, M. Gaffney, C. Carpenter, K. Weldon, R. Shah, A. Parampil, A. Fidgett, A. Swafford, R. Knight, P. Dorrenstein

Topological defects determine the structure and function of physical and biological matter over a wide range of scales, from the turbulent vortices in planetary atmospheres, oceans or quantum fluids to bioelectrical signalling in the heart1,2,3 and brain4, and cell death5. Many advances have been made in understanding and controlling the defect dynamics in active6,7,8,9 and passive9,10 non-equilibrium fluids. Yet, it remains unknown whether the statistical laws that govern the dynamics of defects in classical11 or quantum fluids12,13,14 extend to the active matter7,15,16 and information flows17,18 in living systems. Here, we show that a defect-mediated turbulence underlies the complex wave propagation patterns of Rho-GTP signalling protein on the membrane of starfish egg cells, a process relevant to cytoskeletal remodelling and cell proliferation19,20. Our experiments reveal that the phase velocity field extracted from Rho-GTP concentration waves exhibits vortical defect motions and annihilation dynamics reminiscent of those seen in quantum systems12,13, bacterial turbulence15 and active nematics7. Several key statistics and scaling laws of the defect dynamics can be captured by a minimal Helmholtz–Onsager point vortex model21 as well as a generic complex Ginzburg–Landau22 continuum theory, suggesting a close correspondence between the biochemical signal propagation on the surface of a living cell and a widely studied class of two-dimensional turbulence23 and wave22 phenomena.

Show Abstract
March 10, 2020

Inference of Multisite Phosphorylation Rate Constants and Their Modulation by Pathogenic Mutations

E. Yeung, S. McFann, L. Marsh, E. Dufresne, S. Filippi, H. Harrington, S. Shvartsman, M. Wühr

Multisite protein phosphorylation plays a critical role in cell regulation [1, 2, 3]. It is widely appreciated that the functional capabilities of multisite phosphorylation depend on the order and kinetics of phosphorylation steps, but kinetic aspects of multisite phosphorylation remain poorly understood [4, 5, 6]. Here, we focus on what appears to be the simplest scenario, when a protein is phosphorylated on only two sites in a strict, well-defined order. This scenario describes the activation of ERK, a highly conserved cell-signaling enzyme. We use Bayesian parameter inference in a structurally identifiable kinetic model to dissect dual phosphorylation of ERK by MEK, a kinase that is mutated in a large number of human diseases [7, 8, 9, 10, 11, 12]. Our results reveal how enzyme processivity and efficiencies of individual phosphorylation steps are altered by pathogenic mutations. The presented approach, which connects specific mutations to kinetic parameters of multisite phosphorylation mechanisms, provides a systematic framework for closing the gap between studies with purified enzymes and their effects in the living organism.

Show Abstract

Inference of Multisite Phosphorylation Rate Constants and Their Modulation by Pathogenic Mutations

E. Yeung, S. McFann, L. Marsh, E. Dufresne, S. Fillipi, H. Harrington, S. Shvartsman, M. Wühr

Multisite protein phosphorylation plays a critical role in cell regulation [1, 2, 3]. It is widely appreciated that the functional capabilities of multisite phosphorylation depend on the order and kinetics of phosphorylation steps, but kinetic aspects of multisite phosphorylation remain poorly understood [4, 5, 6]. Here, we focus on what appears to be the simplest scenario, when a protein is phosphorylated on only two sites in a strict, well-defined order. This scenario describes the activation of ERK, a highly conserved cell-signaling enzyme. We use Bayesian parameter inference in a structurally identifiable kinetic model to dissect dual phosphorylation of ERK by MEK, a kinase that is mutated in a large number of human diseases [7, 8, 9, 10, 11, 12]. Our results reveal how enzyme processivity and efficiencies of individual phosphorylation steps are altered by pathogenic mutations. The presented approach, which connects specific mutations to kinetic parameters of multisite phosphorylation mechanisms, provides a systematic framework for closing the gap between studies with purified enzymes and their effects in the living organism.

Show Abstract
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates