CCB: Publications

SARS-CoV-2 Outbreak Dynamics in an Isolated US Military Recruit Training Center With Rigorous Prevention Measures

Rhonda A. Lizewski, R. Sealfon, O. Troyanskaya, et al.

Marine recruits training at Parris Island experienced an unexpectedly high rate of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection, despite preventive measures including a supervised, 2-week, pre-entry quarantine. We characterize SARS-CoV-2 transmission in this cohort.

Identifying genes and pathways linking astrocyte regional specificity to Alzheimer’s disease susceptibility

Ran Zhang, Margarete Knudsen, O. Troyanskaya, et al.

Astrocytes have been shown to play a central role in Alzheimer’s disease (AD). However, the genes and biological pathways underlying disease manifestation are unknown, and it is unclear whether regional molecular differences among astrocytes contribute to regional specificity of disease. Here, we began to address these challenges with integrated experimental and computational approaches. We constructed a human astrocyte-specific functional gene network using Bayesian integration of a large compendium of human functional genomics data, as well as regional astrocyte gene expression profiles we generated in the mouse. This network identifies likely region-specific astrocyte pathways that operate in healthy brains. We leveraged our findings to compile genome-wide astrocyte-associated disease-gene predictions, employing a novel network-guided differential expression analysis (NetDIFF). We also used these data to predict a list of astrocyte-expressed genes mediating region-specific human disease, using a network-guided shortest path method (NetPATH). Both the network and our results are publicly available through an interactive web interface at http://astrocyte.princeton.edu. Our experimental and computational studies propose a strategy for disease gene and pathway prediction that may be applied to a host of human neurological disorders.

Pre-infection antiviral innate immunity contributes to sex differences in SARS-CoV-2 infection

N. Sauerwald, Z. Zhang, W. Mao, R. Sealfon, O. Troyanskaya, et al.

Male sex is a major risk factor for SARS-CoV-2 infection severity. To understand the basis for this sex difference, we studied SARS-CoV-2 infection in a young adult cohort of United States Marine recruits. Among 2,641 male and 244 female unvaccinated and seronegative recruits studied longitudinally, SARS-CoV-2 infections occurred in 1,033 males and 137 females. We identified sex differences in symptoms, viral load, blood transcriptome, RNA splicing, and proteomic signatures. Females had higher pre-infection expression of antiviral interferon-stimulated gene (ISG) programs. Causal mediation analysis implicated ISG differences in number of symptoms, levels of ISGs, and differential splicing of CD45 lymphocyte phosphatase during infection. Our results indicate that the antiviral innate immunity set point causally contributes to sex differences in response to SARS-CoV-2 infection. A record of this paper’s transparent peer review process is included in the supplemental information.

A sequence-based global map of regulatory activity for deciphering human genetics

Kathleen Chen, A. Wong, O. Troyanskaya, Jian Zhou

Epigenomic profiling has enabled large-scale identification of regulatory elements, yet we still lack a systematic mapping from any sequence or variant to regulatory activities. We address this challenge with Sei, a framework for integrating human genetics data with sequence information to discover the regulatory basis of traits and diseases. Sei learns a vocabulary of regulatory activities, called sequence classes, using a deep learning model that predicts 21,907 chromatin profiles across >1,300 cell lines and tissues. Sequence classes provide a global classification and quantification of sequence and variant effects based on diverse regulatory activities, such as cell type-specific enhancer functions. These predictions are supported by tissue-specific expression, expression quantitative trait loci and evolutionary constraint data. Furthermore, sequence classes enable characterization of the tissue-specific, regulatory architecture of complex traits and generate mechanistic hypotheses for individual regulatory pathogenic mutations. We provide Sei as a resource to elucidate the regulatory basis of human health and disease.

A reference tissue atlas for the human kidney

Jens Hansen, R. Sealfon, O. Troyanskaya, et al.

The Kidney Precision Medicine Project (KPMP) is building a spatially specified human kidney tissue atlas in health and disease with single-cell resolution. Here, we describe the construction of an integrated reference map of cells, pathways, and genes using unaffected regions of nephrectomy tissues and undiseased human biopsies from 56 adult subjects. We use single-cell/nucleus transcriptomics, subsegmental laser microdissection transcriptomics and proteomics, near-single-cell proteomics, 3D and CODEX imaging, and spatial metabolomics to hierarchically identify genes, pathways, and cells. Integrated data from these different technologies coherently identify cell types/subtypes within different nephron segments and the interstitium. These profiles describe cell-level functional organization of the kidney following its physiological functions and link cell subtypes to genes, proteins, metabolites, and pathways. They further show that messenger RNA levels along the nephron are congruent with the subsegmental physiological activity. This reference atlas provides a framework for the classification of kidney disease when multiple molecular mechanisms underlie convergent clinical phenotypes.

Molecular Characterization of Membranous Nephropathy

R. Sealfon, Laura Mariani, J. Funk, A. Wong, O. Troyanskaya

Although membranous nephropathy (MN) is one of the most common causes of nephrotic syndrome, the molecular characteristics of the kidney damage in MN remain poorly defined. In this study, the authors applied a machine-learning framework to predict diagnosis on the basis of gene expression in microdissected kidney tissue from patients with glomerulonephropathies. They found that MN has a glomerular transcriptional signature that distinguishes it from other glomerulonephropathies and identified a set of MN-specific genes differentially expressed across two independent cohorts and robustly recovered in an additional validation cohort. They also found the MN-specific genes are enriched in targets of transcription factor NF-κB and are predominantly expressed in podocytes. This work provides a molecular snapshot of MN and elucidates transcriptional alterations specific to this disease.

The proto-oncogene DEK regulates neuronal excitability and tau accumulation in Alzheimer’s disease vulnerable neurons

Patricia Rodriguez-Rodriguez, O. Troyanskaya

Neurons from layer II of the entorhinal cortex (ECII) are the first to accumulate tau protein aggregates and degenerate during prodromal Alzheimer’s disease. Here, we use a data-driven functional genomics approach to model ECII neurons in silico and identify the proto-oncogene DEK as a potential driver of tau pathology. By modulating DEK levels in EC neurons in vitro and in vivo, we first validate the accuracy and cell-type specificity of our network predictions. We then show that Dek silencing changes the inducibility of immediate early genes and alters neuron excitability, leading to dysregulation of neuronal plasticity genes. We further find that loss of function of DEK leads to tau accumulation in the soma of ECII neurons, reactivity of surrounding microglia, and eventually microglia-mediated neuron loss. This study validates a pathological gene discovery tool that opens new therapeutic avenues and sheds light on a novel pathway driving tau pathology in vulnerable neurons.

An activation to memory differentiation trajectory of tumor-infiltrating lymphocytes informs metastatic melanoma outcomes

Abhinav Jaiswal, Akanksha Verma, O. Troyanskaya, et al.

There is a need for better classification and understanding of tumor-infiltrating lymphocytes (TILs). Here, we applied advanced functional genomics to interrogate 9,000 human tumors and multiple single-cell sequencing sets using benchmarked T cell states, comprehensive T cell differentiation trajectories, human and mouse vaccine responses, and other human TILs. Compared with other T cell states, enrichment of T memory/resident memory programs was observed across solid tumors. Trajectory analysis of single-cell melanoma CD8+ TILs also identified a high fraction of memory/resident memory-scoring TILs in anti-PD-1 responders, which expanded post therapy. In contrast, TILs scoring highly for early T cell activation, but not exhaustion, associated with non-response. Late/persistent, but not early, activation signatures prognosticate melanoma survival and co-express with dendritic cell and IFN-γ response programs. These data identify an activation-like state associated with poor response and suggest that successful memory conversion, above resuscitation of exhaustion, is an under-appreciated aspect of successful anti-tumoral immunity.

Asymptomatic SARS-CoV-2 Infection Is Associated With Higher Levels of Serum IL-17C, Matrix Metalloproteinase 10 and Fibroblast Growth Factors Than Mild Symptomatic COVID-19

Alessandra Soares-Schanoski, N. Sauerwald, O. Troyanskaya

Young adults infected with SARS-CoV-2 are frequently asymptomatic or develop only mild disease. Because capturing representative mild and asymptomatic cases requires active surveillance, they are less characterized than moderate or severe cases of COVID-19. However, a better understanding of SARS-CoV-2 asymptomatic infections might shed light on the immune mechanisms associated with the control of symptoms and protection. To this aim, we have determined the temporal dynamics of the humoral immune response, as well as the serum inflammatory profile, of mild and asymptomatic SARS-CoV-2 infections in a cohort of 172 initially seronegative prospectively studied United States Marine recruits, 149 of whom were subsequently found to be SARS-CoV-2 infected. The participants had blood samples taken, symptoms surveyed and PCR tests for SARS-CoV-2 performed periodically for up to 105 days. We found similar dynamics in the profiles of viral load and in the generation of specific antibody responses in asymptomatic and mild symptomatic participants. A proteomic analysis using an inflammatory panel including 92 analytes revealed a pattern of three temporal waves of inflammatory and immunoregulatory mediators, and a return to baseline for most of the inflammatory markers by 35 days post-infection. We found that 23 analytes were significantly higher in those participants that reported symptoms at the time of the first positive SARS-CoV-2 PCR compared with asymptomatic participants, including mostly chemokines and cytokines associated with inflammatory response or immune activation (e.g., TNF-α, TNF-β, CXCL10, IL-8). Notably, we detected 7 analytes (IL-17C, MMP-10, FGF-19, FGF-21, FGF-23, CXCL5 and CCL23) that were higher in asymptomatic participants than in participants with symptoms; these are known to be involved in tissue repair and may be related to the control of symptoms.
Overall, we found a serum proteomic signature that differentiates asymptomatic and mild symptomatic infections in young adults, including potential targets for developing new therapies and prognostic tests.

Single nucleus transcriptome and chromatin accessibility of postmortem human pituitaries reveal diverse stem cell regulatory mechanisms

Zidong Zhang, Michel Zamojski, O. Troyanskaya, et al.

Despite their importance in tissue homeostasis and renewal, human pituitary stem cells (PSCs) are incompletely characterized. We describe a human single nucleus RNA-seq and ATAC-seq resource from pediatric, adult, and aged postmortem pituitaries (snpituitaryatlas.princeton.edu) and characterize cell-type-specific gene expression and chromatin accessibility programs for all major pituitary cell lineages. We identify uncommitted PSCs, committing progenitor cells, and sex differences. Pseudotime trajectory analysis indicates that early-life PSCs are distinct from the other age groups. Linear modeling of same-cell multiome data identifies regulatory domain accessibility sites and transcription factors that are significantly associated with gene expression in PSCs compared with other cell types and within PSCs. We identify distinct deterministic mechanisms that contribute to heterogeneous marker expression within PSCs. These findings characterize human stem cell lineages and reveal diverse mechanisms regulating key PSC genes and cell type identity.
