CCB: Publications

Multiple abiotic stimuli are integrated in the regulation of rice gene expression under field conditions

A. Plessis, C. Hafemeister, O. Wilkins, Z.J. Gonzaga, R.S. Meyer, I. Pires, C. Müller, E.M. Septiningsih, R. Bonneau

Plants rely on transcriptional dynamics to respond to multiple climatic fluctuations and contexts in nature. We analyzed the genome-wide gene expression patterns of rice (Oryza sativa) growing in rainfed and irrigated fields during two distinct tropical seasons and determined simple linear models that relate transcriptomic variation to climatic fluctuations. These models combine multiple environmental parameters to account for patterns of expression in the field of co-expressed gene clusters. We examined the similarities of our environmental models between tropical and temperate field conditions, using previously published data. We found that field type and macroclimate had broad impacts on transcriptional responses to environmental fluctuations, especially for genes involved in photosynthesis and development. Nevertheless, variation in solar radiation and temperature at the timescale of hours had reproducible effects across environmental contexts. These results provide a basis for broad-based predictive modeling of plant gene expression in the field.

Show Abstract

Toward rational thermostabilization of Aspergillus oryzae cutinase: Insights into catalytic and structural stability

A.N. Shirke, D. Basore, G.L. Butterfoss, R. Bonneau, C. Bystroff, R.A. Gross

Cutinases are powerful hydrolases that can cleave ester bonds of polyesters such as poly(ethylene terephthalate) (PET), opening up new options for enzymatic routes for polymer recycling and surface modification reactions. Cutinase from Aspergillus oryzae (AoC) is promising owing to the presence of an extended groove near the catalytic triad which is important for the orientation of polymeric chains. However, the catalytic efficiency of AoC on rigid polymers like PET is limited by its low thermostability; as it is essential to work at or over the glass transition temperature (Tg) of PET, that is, 70°C. Consequently, in this study we worked toward the thermostabilization of AoC. Use of Rosetta computational protein design software in conjunction with rational design led to a 6°C improvement in the thermal unfolding temperature (Tm) and a 10-fold increase in the half-life of the enzyme activity at 60°C. Surprisingly, thermostabilization did not improve the rate or temperature optimum of enzyme activity. Three notable findings are presented as steps toward designing more thermophilic cutinase: (a) surface salt bridge optimization produced enthalpic stabilization, (b) mutations to proline reduced the entropy loss upon folding, and (c) the lack of a correlative increase in the temperature optimum of catalytic activity with thermodynamic stability suggests that the active site is locally denatured at a temperature below the Tm of the global structure. Proteins 2016; 84:60–72. © 2015 Wiley Periodicals, Inc.

Show Abstract

An experimentally supported model of the Bacillus subtilis global transcriptional regulatory network

M.L. Arrieta‐Ortiz, C. Hafemeister, A.R. Bate, T. Chu, A. Greenfield, B. Shuster, S.N. Barry, M. Gallitto, B. Liu, T. Kacmarczyk, F. Santoriello, J. Chen, C.D.A Rodrigues, T. Sato, D.Z. Rudner, A. Driks, R. Bonneau, P. Eichenberger

Organisms from all domains of life use gene regulation networks to control cell growth, identity, function, and responses to environmental challenges. Although accurate global regulatory models would provide critical evolutionary and functional insights, they remain incomplete, even for the best studied organisms. Efforts to build comprehensive networks are confounded by challenges including network scale, degree of connectivity, complexity of organism–environment interactions, and difficulty of estimating the activity of regulatory factors. Taking advantage of the large number of known regulatory interactions in Bacillus subtilis and two transcriptomics datasets (including one with 38 separate experiments collected specifically for this study), we use a new combination of network component analysis and model selection to simultaneously estimate transcription factor activities and learn a substantially expanded transcriptional regulatory network for this bacterium. In total, we predict 2,258 novel regulatory interactions and recall 74% of the previously known interactions. We obtained experimental support for 391 (out of 635 evaluated) novel regulatory edges (62% accuracy), thus significantly increasing our understanding of various cell processes, such as spore formation.

Show Abstract

Positive-Unlabeled Learning in the Face of Labeling Bias

N. Youngs, D. Shasha, R. Bonneau

Positive-Unlabeled (PU) learning scenarios are a class of semi-supervised learning where only a fraction of the data is labeled, and all available labels are positive. The goal is to assign correct (positive and negative) labels to as much data as possible. Several important learning problems fall into the PU-learning domain, as in many cases the cost and feasibility of obtaining negative examples is prohibitive. In addition to the positive-negative disparity the overall cost of labeling these datasets typically leads to situations where the number of unlabeled examples greatly outnumbers the labeled. Accordingly, we perform several experiments, on both synthetic and real-world datasets, examining the performance of state of the art PU-learning algorithms when there is significant bias in the labeling process. We propose novel PU algorithms and demonstrate that they outperform the current state of the art on a variety of benchmarks. Lastly, we present a methodology for removing the costly parameter-tuning step in a popular PU algorithm.

Show Abstract

An IL-23R/IL-22 Circuit Regulates Epithelial Serum Amyloid A to Promote Local Effector Th17 Responses

T. Sano, W. Huang, J.A. Hall, Yi Yang, A. Chen, S.J. Gavzy, J.-Y. Lee, J.W. Ziel, E. Miraldi, A.I. Domingos, R. Bonneau

RORγt+ Th17 cells are important for mucosal defenses but also contribute to autoimmune disease. They accumulate in the intestine in response to microbiota and produce IL-17 cytokines. Segmented filamentous bacteria (SFB) are Th17-inducing commensals that potentiate autoimmunity in mice. RORγt+ T cells were induced in mesenteric lymph nodes early after SFB colonization and distributed across different segments of the gastrointestinal tract. However, robust IL-17A production was restricted to the ileum, where SFB makes direct contact with the epithelium and induces serum amyloid A proteins 1 and 2 (SAA1/2), which promote local IL-17A expression in RORγt+ T cells. We identified an SFB-dependent role of type 3 innate lymphoid cells (ILC3), which secreted IL-22 that induced epithelial SAA production in a Stat3-dependent manner. This highlights the critical role of tissue microenvironment in activating effector functions of committed Th17 cells, which may have important implications for how these cells contribute to inflammatory disease.

Show Abstract

Cohesin loss alters adult hematopoietic stem cell homeostasis, leading to myeloproliferative neoplasms

J. Mullenders, B. Aranda-Orgilles, P. Lhoumaud, M. Keller, J. Pae, K. Wang, C. Kayembe, P.P Rocha, R. Raviram, Y. Gong, P.K. Premsrirut, A. Tsirigos, R. Bonneau, J.A. Skok, L. Cimmino, D. Hoehn, I. Aifantis

The cohesin complex (consisting of Rad21, Smc1a, Smc3, and Stag2 proteins) is critically important for proper sister chromatid separation during mitosis. Mutations in the cohesin complex were recently identified in a variety of human malignancies including acute myeloid leukemia (AML). To address the potential tumor-suppressive function of cohesin in vivo, we generated a series of shRNA mouse models in which endogenous cohesin can be silenced inducibly. Notably, silencing of cohesin complex members did not have a deleterious effect on cell viability. Furthermore, knockdown of cohesin led to gain of replating capacity of mouse hematopoietic progenitor cells. However, cohesin silencing in vivo rapidly altered stem cells homeostasis and myelopoiesis. Likewise, we found widespread changes in chromatin accessibility and expression of genes involved in myelomonocytic maturation and differentiation. Finally, aged cohesin knockdown mice developed a clinical picture closely resembling myeloproliferative disorders/neoplasms (MPNs), including varying degrees of extramedullary hematopoiesis (myeloid metaplasia) and splenomegaly. Our results represent the first successful demonstration of a tumor suppressor function for the cohesin complex, while also confirming that cohesin mutations occur as an early event in leukemogenesis, facilitating the potential development of a myeloid malignancy.

Show Abstract

The dynamics of microtubule/motor-protein assemblies in biology and physics

M. Shelley

Many important processes in the cell are mediated by stiff microtubule polymers and the active motor proteins moving on them. This includes the transport of subcellular structures (nuclei, chromosomes, organelles) and the self-assembly and positioning of the mitotic spindle. Little is understood of these processes, but they present fascinating problems in fluid-structure interactions. Microtubules and motor proteins are also the building blocks of new biosynthetic active suspensions driven by motor-protein activity. These reduced systems can be probed—and modeled—more easily than can the fully biological ones and demonstrate their own aspects of self-assembly and complex dynamics. I review recent work modeling such systems as fluid-structure interaction problems and as multiscale complex fluids.

Show Abstract

Interactive Big Data Resource to Elucidate Human Immune Pathways and Diseases

D. Gorenshteyn, et al.

Many functionally important interactions between genes and proteins involved in immunological diseases and processes are unknown. The exponential growth in public high-throughput data offers an opportunity to expand this knowledge. To unlock human-immunology-relevant insight contained in the global biomedical research effort, including all public high-throughput datasets, we performed immunological-pathway-focused Bayesian integration of a comprehensive, heterogeneous compendium comprising 38,088 genome-scale experiments. The distillation of this knowledge into immunological networks of functional relationships between molecular entities (ImmuNet), and tools to mine this resource, are accessible to the public at http://immunet.princeton.edu. The predictive capacity of ImmuNet, established by rigorous statistical validation, is easily accessed by experimentalists to generate data-driven hypotheses. We demonstrate the power of this approach through the identification of unique host-virus interaction responses, and we show how ImmuNet complements genetic studies by predicting disease-associated genes. ImmuNet should be widely beneficial for investigating the mechanisms of the human immune system and immunological diseases.

Show Abstract

Predicting effects of noncoding variants with deep learning–based sequence model

J. Zhou, O. Troyanskaya

Identifying functional effects of noncoding variants is a major challenge in human genetics. To predict the noncoding-variant effects de novo from sequence, we developed a deep learning–based algorithmic framework, DeepSEA (http://deepsea.princeton.edu/), that directly learns a regulatory sequence code from large-scale chromatin-profiling data, enabling prediction of chromatin effects of sequence alterations with single-nucleotide sensitivity. We further used this capability to improve prioritization of functional variants including expression quantitative trait loci (eQTLs) and disease-associated variants.

Show Abstract

Tweeting From Left to Right

P. Barberá, J.T. Jost, J. Nagler, J.A. Tucker, R. Bonneau

We estimated ideological preferences of 3.8 million Twitter users and, using a data set of nearly 150 million tweets concerning 12 political and nonpolitical issues, explored whether online communication resembles an “echo chamber” (as a result of selective exposure and ideological segregation) or a “national conversation.” We observed that information was exchanged primarily among individuals with similar ideological preferences in the case of political issues (e.g., 2012 presidential election, 2013 government shutdown) but not many other current events (e.g., 2013 Boston Marathon bombing, 2014 Super Bowl). Discussion of the Newtown shootings in 2012 reflected a dynamic process, beginning as a national conversation before transforming into a polarized exchange. With respect to both political and nonpolitical issues, liberals were more likely than conservatives to engage in cross-ideological dissemination; this is an important asymmetry with respect to the structure of communication that is consistent with psychological theory and research bearing on ideological differences in epistemic, existential, and relational motivation. Overall, we conclude that previous work may have overestimated the degree of ideological segregation in social-media usage.

Show Abstract