Coevolution-based prediction of key allosteric residues for protein function regulation

  1. Juan Xie
  2. Weilin Zhang
  3. Xiaolei Zhu
  4. Minghua Deng
  5. Luhua Lai  Is a corresponding author
  1. Peking University, China
  2. Anhui Agricultural University, China

Abstract

Allostery is fundamental to many biological processes. Due to the distant regulation nature, how allosteric mutations, modifications and effector binding impact protein function is difficult to forecast. In protein engineering, remote mutations cannot be rationally designed without large-scale experimental screening. Allosteric drugs have raised much attention due to their high specificity and possibility of overcoming existing drug-resistant mutations. However, optimization of allosteric compounds remains challenging. Here, we developed a novel computational method KeyAlloSite to predict allosteric site and to identify key allosteric residues (allo-residues) based on the evolutionary coupling model. We found that protein allosteric sites are strongly coupled to orthosteric site compared to non-functional sites. We further inferred key allo-residues by pairwise comparing the difference of evolutionary coupling scores of each residue in the allosteric pocket with the functional site. Our predicted key allo-residues are in accordance with previous experimental studies for typical allosteric proteins like BCR-ABL1, Tar and PDZ3, as well as key cancer mutations. We also showed that KeyAlloSite can be used to predict key allosteric residues distant from the catalytic site that are important for enzyme catalysis. Our study demonstrates that weak coevolutionary couplings contain important information of protein allosteric regulation function. KeyAlloSite can be applied in studying the evolution of protein allosteric regulation, designing and optimizing allosteric drugs, performing functional protein design and enzyme engineering.

Data availability

All data that support the results of this study are included in the manuscript, supplementary files, and GitHub repository(https://github.com/huilan1210/KeyAlloSite). Source Data files have been provided for all Figures(except Figure 6 and Figure 1-figure supplement 1).

The following previously published data sets were used

Article and author information

Author details

  1. Juan Xie

    Center for Quantitative Biology, Peking University, Beijing, China
    Competing interests
    The authors declare that no competing interests exist.
  2. Weilin Zhang

    College of Chemistry and Molecular Engineering, Peking University, Beijing, China
    Competing interests
    The authors declare that no competing interests exist.
  3. Xiaolei Zhu

    School of Sciences, Anhui Agricultural University, Hefei, China
    Competing interests
    The authors declare that no competing interests exist.
  4. Minghua Deng

    Center for Quantitative Biology, Peking University, Beijing, China
    Competing interests
    The authors declare that no competing interests exist.
  5. Luhua Lai

    College of Chemistry and Molecular Engineering, Peking University, Beijing, China
    For correspondence
    lhlai@pku.edu.cn
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-8343-7587

Funding

National Key R&D Program of China (2022YFA1303700)

  • Luhua Lai

National Natural Science Foundation of China (21633001,22237002)

  • Luhua Lai

Chinese Academy of Medical Sciences (2021-I2M-5-014)

  • Luhua Lai

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

© 2023, Xie et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 2,713
    views
  • 465
    downloads
  • 15
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Juan Xie
  2. Weilin Zhang
  3. Xiaolei Zhu
  4. Minghua Deng
  5. Luhua Lai
(2023)
Coevolution-based prediction of key allosteric residues for protein function regulation
eLife 12:e81850.
https://doi.org/10.7554/eLife.81850

Share this article

https://doi.org/10.7554/eLife.81850

Further reading

    1. Biochemistry and Chemical Biology
    Bernd K Gilsbach, Franz Y Ho ... Christian Johannes Gloeckner
    Research Article

    The Parkinson’s disease (PD)-linked protein Leucine-Rich Repeat Kinase 2 (LRRK2) consists of seven domains, including a kinase and a Roc G domain. Despite the availability of several high-resolution structures, the dynamic regulation of its unique intramolecular domain stack is nevertheless still not well understood. By in-depth biochemical analysis, assessing the Michaelis–Menten kinetics of the Roc G domain, we have confirmed that LRRK2 has, similar to other Roco protein family members, a KM value of LRRK2 that lies within the range of the physiological GTP concentrations within the cell. Furthermore, the R1441G PD variant located within a mutational hotspot in the Roc domain showed an increased catalytic efficiency. In contrast, the most common PD variant G2019S, located in the kinase domain, showed an increased KM and reduced catalytic efficiency, suggesting a negative feedback mechanism from the kinase domain to the G domain. Autophosphorylation of the G1+2 residue (T1343) in the Roc P-loop motif is critical for this phosphoregulation of both the KM and the kcat values of the Roc-catalyzed GTP hydrolysis, most likely by changing the monomer–dimer equilibrium. The LRRK2 T1343A variant has a similar increased kinase activity in cells compared to G2019S and the double mutant T1343A/G2019S has no further increased activity, suggesting that T1343 is crucial for the negative feedback in the LRRK2 signaling cascade. Together, our data reveal a novel intramolecular feedback regulation of the LRRK2 Roc G domain by a LRRK2 kinase-dependent mechanism. Interestingly, PD mutants differently change the kinetics of the GTPase cycle, which might in part explain the difference in penetrance of these mutations in PD patients.

    1. Biochemistry and Chemical Biology
    2. Structural Biology and Molecular Biophysics
    Katherine A Senn, Karli A Lipinski ... Aaron A Hoskins
    Research Article

    Pre-mRNA splicing is catalyzed in two steps: 5ʹ splice site (SS) cleavage and exon ligation. A number of proteins transiently associate with spliceosomes to specifically impact these steps (first and second step factors). We recently identified Fyv6 (FAM192A in humans) as a second step factor in Saccharomyces cerevisiae; however, we did not determine how widespread Fyv6’s impact is on the transcriptome. To answer this question, we have used RNA sequencing (RNA-seq) to analyze changes in splicing. These results show that loss of Fyv6 results in activation of non-consensus, branch point (BP) proximal 3ʹ SS transcriptome-wide. To identify the molecular basis of these observations, we determined a high-resolution cryo-electron microscopy (cryo-EM) structure of a yeast product complex spliceosome containing Fyv6 at 2.3 Å. The structure reveals that Fyv6 is the only second step factor that contacts the Prp22 ATPase and that Fyv6 binding is mutually exclusive with that of the first step factor Yju2. We then use this structure to dissect Fyv6 functional domains and interpret results of a genetic screen for fyv6Δ suppressor mutations. The combined transcriptomic, structural, and genetic studies allow us to propose a model in which Yju2/Fyv6 exchange facilitates exon ligation and Fyv6 promotes usage of consensus, BP distal 3ʹ SS.