Promoter sequence and architecture determine expression variability and confer robustness to genetic variants
Abstract
Genetic and environmental exposures cause variability in gene expression. Although most genes are affected in a population, their effect sizes vary greatly, indicating the existence of regulatory mechanisms that could amplify or attenuate expression variability. Here, we investigate the relationship between the sequence and transcription start site architectures of promoters and their expression variability across human individuals. We find that expression variability can be largely explained by a promoter's DNA sequence and its binding sites for specific transcription factors. We show that promoter expression variability reflects the biological process of a gene, demonstrating a selective trade-off between stability for metabolic genes and plasticity for responsive genes and those involved in signaling. Promoters with a rigid transcription start site architecture are more prone to have variable expression and to be associated with genetic variants with large effect sizes, while a flexible usage of transcription start sites within a promoter attenuates expression variability and limits genotypic effects. Our work provides insights into the variable nature of responsive genes and reveals a novel mechanism for supplying transcriptional and mutational robustness to essential genes through multiple transcription start site regions within a promoter.
Data availability
Sequencing data have been deposited into the GEO database: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE188131
-
GTEx Analysis V8 (dbGaP Accession phs000424.v8.p2)dbGaP Accession phs000424.v8.p2.
Article and author information
Author details
Funding
Danmarks Frie Forskningsfond (6108-00038)
- Robin Andersson
European Research Council (638173)
- Robin Andersson
Novo Nordisk Fonden (NNF18OC0052570)
- Robin Andersson
Novo Nordisk Fonden (NNF20OC0059796)
- Robin Andersson
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Copyright
© 2022, Einarsson et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Chromosomes and Gene Expression
RNA interference (RNAi) is a conserved pathway that utilizes Argonaute proteins and their associated small RNAs to exert gene regulatory function on complementary transcripts. While the majority of germline-expressed RNAi proteins reside in perinuclear germ granules, it is unknown whether and how RNAi pathways are spatially organized in other cell types. Here, we find that the small RNA biogenesis machinery is spatially and temporally organized during Caenorhabditis elegans embryogenesis. Specifically, the RNAi factor, SIMR-1, forms visible concentrates during mid-embryogenesis that contain an RNA-dependent RNA polymerase, a poly-UG polymerase, and the unloaded nuclear Argonaute protein, NRDE-3. Curiously, coincident with the appearance of the SIMR granules, the small RNAs bound to NRDE-3 switch from predominantly CSR-class 22G-RNAs to ERGO-dependent 22G-RNAs. NRDE-3 binds ERGO-dependent 22G-RNAs in the somatic cells of larvae and adults to silence ERGO-target genes; here we further demonstrate that NRDE-3-bound, CSR-class 22G-RNAs repress transcription in oocytes. Thus, our study defines two separable roles for NRDE-3, targeting germline-expressed genes during oogenesis to promote global transcriptional repression, and switching during embryogenesis to repress recently duplicated genes and retrotransposons in somatic cells, highlighting the plasticity of Argonaute proteins and the need for more precise temporal characterization of Argonaute-small RNA interactions.
-
- Chromosomes and Gene Expression
- Genetics and Genomics
A new method for mapping torsion provides insights into the ways that the genome responds to the torsion generated by RNA polymerase II.