CCB: Publications

Probabilistic Modelling of Chromatin Code Landscape Reveals Functional Diversity of Enhancer-like Chromatin States

Interpreting the functional state of chromatin from the combinatorial binding patterns of chromatin factors, that is, the chromatin codes, is crucial for decoding the epigenetic state of the cell. Here we present a systematic map of Drosophila chromatin states derived from data-driven probabilistic modelling of dependencies between chromatin factors. Our model not only recapitulates enhancer-like chromatin states as indicated by widely used enhancer marks but also divides these states into three functionally distinct groups, of which only one specific group possesses active enhancer activity. Moreover, we discover a strong association between one specific enhancer state and RNA Polymerase II pausing, linking transcription regulatory potential and chromatin organization. We also observe that with the exception of long-intron genes, chromatin state transition positions in transcriptionally active genes align with an absolute distance to their corresponding transcription start site, regardless of gene length. Using our method, we provide a resource that helps elucidate the functional and spatial organization of the chromatin code landscape.

Show Abstract

Actomyosin-driven left-right asymmetry: from molecular torques to chiral self organization.

S. Naganathan, T. Middelkoop, S. Fürthauer, S. Grill

Chirality or mirror asymmetry is a common theme in biology found in organismal body plans, tissue patterns and even in individual cells. In many cases the emergence of chirality is driven by actin cytoskeletal dynamics. Although it is well established that the actin cytoskeleton generates rotational forces at the molecular level, we are only beginning to understand how this can result in chiral behavior of the entire actin network in vivo. In this review, we will give an overview of actin driven chiralities across different length scales known until today. Moreover, we evaluate recent quantitative models demonstrating that chiral symmetry breaking of cells can be achieved by properly aligning molecular-scale torque generation processes in the actomyosin cytoskeleton.

Show Abstract

A Miniature Protein Stabilized by a Cation− π Interaction Network

T Craven, M Cho, N Traaseth, R. Bonneau, K Kirschenbaum

The design of folded miniature proteins is predicated on establishing noncovalent interactions that direct the self-assembly of discrete thermostable tertiary structures. In this work, we describe how a network of cation−π interactions present in proteins containing “WSXWS motifs” can be emulated to stabilize the core of a miniature protein. This 19-residue protein sequence recapitulates a set of interdigitated arginine and tryptophan residues that stabilize a distinctive β-strand:loop:PPII-helix topology. Validation of the compact fold determined by NMR was carried out by mutagenesis of the cation−π network and by comparison to the corresponding disulfide-bridged structure. These results support the involvement of a coordinated set of cation−π interactions that stabilize the tertiary structure.

Show Abstract

Text Classification for Automatic Detection of E-Cigarette Use and Use for Smoking Cessation from Twitter: A Feasibility Pilot

Y. Aphinyanaphongs, A. Lulejian, D.P. Brown, R. Bonneau, P. Krebs

Rapid increases in e-cigarette use and potential exposure to harmful byproducts have shifted public health focus to e-cigarettes as a possible drug of abuse. Effective surveillance of use and prevalence would allow appropriate regulatory responses. An ideal surveillance system would collect usage data in real time, focus on populations of interest, include populations unable to take the survey, allow a breadth of questions to answer, and enable geo-location analysis. Social media streams may provide this ideal system. To realize this use case, a foundational question is whether we can detect ecigarette use at all. This work reports two pilot tasks using text classification to identify automatically Tweets that indicate e-cigarette use and/or e-cigarette use for smoking cessation. We build and define both datasets and compare performance of 4 state of the art classifiers and a keyword search for each task. Our results demonstrate excellent classifier performance of up to 0.90 and 0.94 area under the curve in each category. These promising initial results form the foundation for further studies to realize the ideal surveillance solution.

Show Abstract

Antibiotic perturbation of the murine gut microbiome enhances the adiposity, insulin resistance, and liver disease associated with high-fat diet

D Mahana, C Trent, Z Kurtz, N Bokulich, T Battaglia, J Chung, C. Müller, H Li, R. Bonneau, M Blaser

Background
Obesity, type 2 diabetes, and non-alcoholic fatty liver disease (NAFLD) are serious health concerns, especially in Western populations. Antibiotic exposure and high-fat diet (HFD) are important and modifiable factors that may contribute to these diseases.

Methods
To investigate the relationship of antibiotic exposure with microbiome perturbations in a murine model of growth promotion, C57BL/6 mice received lifelong sub-therapeutic antibiotic treatment (STAT), or not (control), and were fed HFD starting at 13 weeks. To characterize microbiota changes caused by STAT, the V4 region of the 16S rRNA gene was examined from collected fecal samples and analyzed.

Results
In this model, which included HFD, STAT mice developed increased weight and fat mass compared to controls. Although results in males and females were not identical, insulin resistance and NAFLD were more severe in the STAT mice. Fecal microbiota from STAT mice were distinct from controls. Compared with controls, STAT exposure led to early conserved diet-independent microbiota changes indicative of an immature microbial community. Key taxa were identified as STAT-specific and several were found to be predictive of disease. Inferred network models showed topological shifts concurrent with growth promotion and suggest the presence of keystone species.

Conclusions
These studies form the basis for new models of type 2 diabetes and NAFLD that involve microbiome perturbation.

Show Abstract

Breaking TADs: insights into hierarchical genome organization

P.P Rocha, R. Raviram, R. Bonneau, J.A. Skok

The 3D organization of chromosomes enables cells to balance the biophysical constraints of the crowded nucleus with the functional dynamics of gene regulation. Physical contacts between genes and their regulatory elements are essential for proper transcriptional control and maintenance of these interactions is critical for preventing aberrations in physiological processes that could manifest as disease states. The first insights into global nuclear organization came from imaging studies using FISH (fluorescent in-situ hybridization) analyses, which demonstrated that chromosomes occupy individual territories in the nucleus with minimal intermingling between them [1]. The development of chromosome conformation capture (3C) in which chromatin fragments in close physical proximity can be detected enabled the characterization of molecular interactions between different loci [2]. When 3C-based techniques incorporated massive parallel sequencing (such as in Hi-C) the description of molecular chromatin interactions at a genome-wide scale was finally possible [3]. Hi-C was the first unbiased approach aimed at capturing all interactions in the nucleus thereby providing a snapshot of nuclear organization at the global scale. The first Hi-C study revealed that each chromosomal territory is further divided into large domains of 5–10Mb that physically separate two compartments (A and B), which strongly correlate with active and inactive chromatin, respectively [3]. Furthermore, this study demonstrated that interactions between loci in the same compartment occur at a higher frequency than between loci in different compartments [3]. With the progressive decrease in sequencing costs, higher-resolution Hi-C revealed a new level of nuclear organization where compartments A and B can be further divided into “topologically associated domains” (TADs) [4–6]. In mammalian cells these domains range in size from a few 100kbs to 5Mbs in size (with an average of 1MB). Since they exhibit a high degree of conservation between cell types and species it was proposed that TADs represent the fundamental unit of physical organization of the genome [5].

Show Abstract

Biophysically Motivated Regulatory Network Inference: Progress and Prospects

R. Bonneau

Thanks to the confluence of genomic technology and computational developments, the possibility of network inference methods that automatically learn large comprehensive models of cellular regulation is closer than ever. This perspective focuses on enumerating the elements of computational strategies that, when coupled to appropriate experimental designs, can lead to accurate large-scale models of chromatin state and transcriptional regulatory structure and dynamics. We highlight 4 research questions that require further investigation in order to make progress in network inference: (1) using overall constraints on network structure such as sparsity, (2) use of informative priors and data integration to constrain individual model parameters, (3) estimation of latent regulatory factor activity under varying cell conditions, and (4) new methods for learning and modeling regulatory factor interactions. We conclude that methods combining advances in these 4 categories of required effort with new genomic technologies will result in biophysically motivated dynamic genome-wide regulatory network models for several of the best-studied organisms and cell types.

Show Abstract

Bacillus subtilis Systems Biology: Applications of -Omics Techniques to the Study of Endospore Formation

A.R. Bate, R. Bonneau, P. Eichenberger

The principal B. subtilis laboratory strain, strain 168, is derived from a parent strain isolated in Marburg, Germany, following a mutagenesis procedure (1). The popularity of this strain arose after it was shown to be competent for genetic transformation (2, 3), which paved the way for myriad molecular genetics analyses that led to a detailed understanding of the biology of B. subtilis and related Gram-positive bacteria.

Show Abstract

Active contraction of microtubule networks

P. Foster, S. Fürthauer, M. Shelley, D. J. Needleman

Many cellular processes are driven by cytoskeletal assemblies. It remains unclear how cytoskeletal filaments and motor proteins organize into cellular scale structures and how molecular properties of cytoskeletal components affect the large-scale behaviors of these systems. Here, we investigate the self-organization of stabilized microtubules in Xenopus oocyte extracts and find that they can form macroscopic networks that spontaneously contract. We propose that these contractions are driven by the clustering of microtubule minus ends by dynein. Based on this idea, we construct an active fluid theory of network contractions, which predicts a dependence of the timescale of contraction on initial network geometry, a development of density inhomogeneities during contraction, a constant final network density, and a strong influence of dynein inhibition on the rate of contraction, all in quantitative agreement with experiments. These results demonstrate that the motor-driven clustering of filament ends is a generic mechanism leading to contraction.

Show Abstract

The Critical Periphery in the Growth of Social Protests

P. Barberá, N. Wang, R. Bonneau, J. Nagler, J. Tucker, S. González-Bailón

Social media have provided instrumental means of communication in many recent political protests. The efficiency of online networks in disseminating timely information has been praised by many commentators; at the same time, users are often derided as “slacktivists” because of the shallow commitment involved in clicking a forwarding button. Here we consider the role of these peripheral online participants, the immense majority of users who surround the small epicenter of protests, representing layers of diminishing online activity around the committed minority. We analyze three datasets tracking protest communication in different languages and political contexts through the social media platform Twitter and employ a network decomposition technique to examine their hierarchical structure. We provide consistent evidence that peripheral participants are critical in increasing the reach of protest messages and generating online content at levels that are comparable to core participants. Although committed minorities may constitute the heart of protest movements, our results suggest that their success in maximizing the number of online citizens exposed to protest messages depends, at least in part, on activating the critical periphery. Peripheral users are less active on a per capita basis, but their power lies in their numbers: their aggregate contribution to the spread of protest messages is comparable in magnitude to that of core participants. An analysis of two other datasets unrelated to mass protests strengthens our interpretation that core-periphery dynamics are characteristically important in the context of collective action events. Theoretical models of diffusion in social networks would benefit from increased attention to the role of peripheral nodes in the propagation of information and behavior.

Show Abstract