Robust and annotation-free analysis of alternative splicing across diverse cell types in mice

  1. Gonzalo Benegas
  2. Jonathan Fischer
  3. Yun S Song  Is a corresponding author
  1. University of California, Berkeley, United States
  2. University of Florida, United States

Abstract

Although alternative splicing is a fundamental and pervasive aspect of gene expression in higher eukaryotes, it is often omitted from single-cell studies due to quantification challenges inherent to commonly used short-read sequencing technologies. Here, we undertake the analysis of alternative splicing across numerous diverse murine cell types from two large-scale single-cell datasets-the Tabula Muris and BRAIN Initiative Cell Census Network-while accounting for understudied technical artifacts and unannotated events. We find strong and general cell-type-specific alternative splicing, complementary to total gene expression but of similar discriminatory value, and identify a large volume of novel splicing events. We specifically highlight splicing variation across different cell types in primary motor cortex neurons, bone marrow B cells, and various epithelial cells, and we show that the implicated transcripts include many genes which do not display total expression differences. To elucidate the regulation of alternative splicing, we build a custom predictive model based on splicing factor activity, recovering several known interactions while generating new hypotheses, including potential regulatory roles for novel alternative splicing events in critical genes like Khdrbs3 and Rbfox1. We make our results available using public interactive browsers to spur further exploration by the community.

Data availability

All data analyzed in this study are publicly available and URL links are provided in the Materials and Methods section of our manuscript.Our source code as well as all results represented in figures and tables are publicly available on our lab's GitHub repositories:https://github.com/songlab-cal/scquint andhttps://github.com/songlab-cal/scquint-analysis

The following previously published data sets were used

Article and author information

Author details

  1. Gonzalo Benegas

    Graduate Group in Computational Biology, University of California, Berkeley, Berkeley, United States
    Competing interests
    The authors declare that no competing interests exist.
  2. Jonathan Fischer

    Department of Biostatistics, University of Florida, Gainesville, United States
    Competing interests
    The authors declare that no competing interests exist.
  3. Yun S Song

    Computer Science Division, University of California, Berkeley, Berkeley, United States
    For correspondence
    yss@berkeley.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-0734-9868

Funding

National Institutes of Health (R35-GM134922)

  • Gonzalo Benegas
  • Yun S Song

Chan Zuckerberg Initiative (CZF2019-002449)

  • Gonzalo Benegas
  • Yun S Song

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

© 2022, Benegas et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 5,909
    views
  • 512
    downloads
  • 12
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Gonzalo Benegas
  2. Jonathan Fischer
  3. Yun S Song
(2022)
Robust and annotation-free analysis of alternative splicing across diverse cell types in mice
eLife 11:e73520.
https://doi.org/10.7554/eLife.73520

Share this article

https://doi.org/10.7554/eLife.73520

Further reading

    1. Chromosomes and Gene Expression
    2. Genetics and Genomics
    Omid Gholamalamdari, Tom van Schaik ... Andrew S Belmont
    Research Article

    Models of nuclear genome organization often propose a binary division into active versus inactive compartments yet typically overlook nuclear bodies. Here, we integrated analysis of sequencing and image-based data to compare genome organization in four human cell types relative to three different nuclear locales: the nuclear lamina, nuclear speckles, and nucleoli. Although gene expression correlates mostly with nuclear speckle proximity, DNA replication timing correlates with proximity to multiple nuclear locales. Speckle attachment regions emerge as DNA replication initiation zones whose replication timing and gene composition vary with their attachment frequency. Most facultative LADs retain a partially repressed state as iLADs, despite their positioning in the nuclear interior. Knock out of two lamina proteins, Lamin A and LBR, causes a shift of H3K9me3-enriched LADs from lamina to nucleolus, and a reciprocal relocation of H3K27me3-enriched partially repressed iLADs from nucleolus to lamina. Thus, these partially repressed iLADs appear to compete with LADs for nuclear lamina attachment with consequences for replication timing. The nuclear organization in adherent cells is polarized with nuclear bodies and genomic regions segregating both radially and relative to the equatorial plane. Together, our results underscore the importance of considering genome organization relative to nuclear locales for a more complete understanding of the spatial and functional organization of the human genome.

    1. Chromosomes and Gene Expression
    Ashwin Govindan, Nicholas K Conrad
    Research Article

    O-GlcNAcylation is the reversible post-translational addition of β-N-acetylglucosamine to serine and threonine residues of nuclear and cytoplasmic proteins. It plays an important role in several cellular processes through the modification of thousands of protein substrates. O-GlcNAcylation in humans is mediated by a single essential enzyme, O-GlcNAc transferase (OGT). OGT, together with the sole O-GlcNAcase OGA, form an intricate feedback loop to maintain O-GlcNAc homeostasis in response to changes in cellular O-GlcNAc using a dynamic mechanism involving nuclear retention of its fourth intron. However, the molecular mechanism of this dynamic regulation remains unclear. Using an O-GlcNAc responsive GFP reporter cell line, we identify SFSWAP, a poorly characterized splicing factor, as a trans-acting factor regulating OGT intron detention. We show that SFSWAP is a global regulator of retained intron splicing and exon skipping that primarily acts as a negative regulator of splicing. In contrast, knockdown of SFSWAP leads to reduced inclusion of a ‘decoy exon’ present in the OGT retained intron which may mediate its role in OGT intron detention. Global analysis of decoy exon inclusion in SFSWAP and UPF1 double knockdown cells indicate altered patterns of decoy exon usage. Together, these data indicate a role for SFSWAP as a global negative regulator of pre-mRNA splicing and positive regulator of intron retention.