Complementary evolution of coding and noncoding sequence underlies mammalian hairlessness

  1. Amanda Kowalczyk
  2. Maria Chikina
  3. Nathan L Clark  Is a corresponding author
  1. University of Pittsburgh, United States
  2. University of Utah, United States

Abstract

Body hair is a defining mammalian characteristic, but several mammals, such as whales, naked mole-rats, and humans, have notably less hair than others. To find the genetic basis of reduced hair quantity, we used our evolutionary-rates-based method, RERconverge, to identify coding and noncoding sequences that evolve at significantly different rates in so-called hairless mammals compared to hairy mammals. Using RERconverge, we performed a genome-wide scan over 62 mammal species using 19,149 genes and 343,598 conserved noncoding regions to find genetic elements that evolve at significantly different rates in hairless mammals compared to hairy mammals. We show that these rate shifts resulted from relaxation of evolutionary constraint on hair-related sequences in hairless species. In addition to detecting known and potential novel hair-related genes, we also discovered hundreds of putative hair-related regulatory elements. Computational investigation revealed that genes and their associated noncoding regions show different evolutionary patterns and influence different aspects of hair growth and development. Many genes under accelerated evolution are associated with the structure of the hair shaft itself, while evolutionary rate shifts in noncoding regions also included the dermal papilla and matrix regions of the hair follicle that contribute to hair growth and cycling. Genes that were top-ranked for coding sequence acceleration included known hair and skin genes KRT2, KRT35, PKP1, and PTPRM that surprisingly showed no signals of evolutionary rate shifts in nearby noncoding regions. Conversely, accelerated noncoding regions are most strongly enriched near regulatory hair-related genes and microRNAs, such as mir205, ELF3, and FOXC1, that themselves do not show rate shifts in their protein-coding sequences. Such dichotomy highlights the interplay between the evolution of protein sequence and regulatory sequence to contribute to the emergence of a convergent phenotype.

Data availability

All data generated or analysed during this study are included in the manuscript and supporting file; Source Data files have been provided for all Figures. Code files are deposited in GitHub at https://github.com/nclark-lab/hairlessness

The following previously published data sets were used

Article and author information

Author details

  1. Amanda Kowalczyk

    Department of Computational Biology, University of Pittsburgh, Pittsburgh, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-9061-1336
  2. Maria Chikina

    Department of Computational Biology, University of Pittsburgh, Pittsburgh, United States
    Competing interests
    The authors declare that no competing interests exist.
  3. Nathan L Clark

    Department of Human Genetics, University of Utah, Salt Lake City, United States
    For correspondence
    nclark@utah.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-0006-8374

Funding

National Institutes of Health (HG009299)

  • Amanda Kowalczyk
  • Maria Chikina
  • Nathan L Clark

National Institutes of Health (EY030546)

  • Amanda Kowalczyk
  • Maria Chikina
  • Nathan L Clark

The funding agencies did not have a role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

© 2022, Kowalczyk et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 7,910
    views
  • 952
    downloads
  • 22
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Amanda Kowalczyk
  2. Maria Chikina
  3. Nathan L Clark
(2022)
Complementary evolution of coding and noncoding sequence underlies mammalian hairlessness
eLife 11:e76911.
https://doi.org/10.7554/eLife.76911

Share this article

https://doi.org/10.7554/eLife.76911

Further reading

  1. Mammals without body hair evolved this trait independently, but relied on the same set of genes to guide the process

    1. Chromosomes and Gene Expression
    2. Evolutionary Biology
    Timothy Fuqua, Yiqiao Sun, Andreas Wagner
    Research Article

    Gene regulation is essential for life and controlled by regulatory DNA. Mutations can modify the activity of regulatory DNA, and also create new regulatory DNA, a process called regulatory emergence. Non-regulatory and regulatory DNA contain motifs to which transcription factors may bind. In prokaryotes, gene expression requires a stretch of DNA called a promoter, which contains two motifs called –10 and –35 boxes. However, these motifs may occur in both promoters and non-promoter DNA in multiple copies. They have been implicated in some studies to improve promoter activity, and in others to repress it. Here, we ask whether the presence of such motifs in different genetic sequences influences promoter evolution and emergence. To understand whether and how promoter motifs influence promoter emergence and evolution, we start from 50 ‘promoter islands’, DNA sequences enriched with –10 and –35 boxes. We mutagenize these starting ‘parent’ sequences, and measure gene expression driven by 240,000 of the resulting mutants. We find that the probability that mutations create an active promoter varies more than 200-fold, and is not correlated with the number of promoter motifs. For parent sequences without promoter activity, mutations created over 1500 new –10 and –35 boxes at unique positions in the library, but only ~0.3% of these resulted in de-novo promoter activity. Only ~13% of all –10 and –35 boxes contribute to de-novo promoter activity. For parent sequences with promoter activity, mutations created new –10 and –35 boxes in 11 specific positions that partially overlap with preexisting ones to modulate expression. We also find that –10 and –35 boxes do not repress promoter activity. Overall, our work demonstrates how promoter motifs influence promoter emergence and evolution. It has implications for predicting and understanding regulatory evolution, de novo genes, and phenotypic evolution.