AIVE: accurate predictions of SARS-CoV-2 infectivity from comprehensive analysis

  1. Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul 06591, Republic of Korea
  2. Graduate School of Medical Science and Engineering, Korea Advanced Institute and Technology (KAIST), Daejeon 34141, Republic of Korea
  3. Department of Systems Biology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Republic of Korea
  4. Department of Microbiology and Immunology, Seoul National University College of Medicine, Seoul 03080, Republic of Korea
  5. Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul 03080, Republic of Korea
  6. School of Chemical and Biological Engineering, Seoul National University, Seoul 08826, Republic of Korea
  7. Department of Medical Life Sciences, College of Medicine, The Catholic University of Korea, Seoul 06591, Republic of Korea
  8. Seoul National University Bundang Hospital, Seongnam, Gyeonggi-do 13620, Republic of Korea
  9. Precision Medicine Research Center, College of Medicine, The Catholic University of Korea, Seoul 06591, Republic of Korea
  10. Cancer Evolution Research Center, College of Medicine, The Catholic University of Korea, Seoul 06591, Republic of Korea
  11. CMC Institute for Basic Medical Science, the Catholic Medical Center of The Catholic University of Korea, Seoul 06591, Republic of Korea

Peer review process

Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, and public reviews.

Read more about eLife’s peer review process.

Editors

  • Reviewing Editor
    Alvin Wei Tian Ng
  • Senior Editor
    Murim Choi
    Seoul National University, Seoul, Korea, the Republic of

Reviewer #1 (Public review):

Summary:

Park et al. conducted various analyses attempting to elucidate the biological significance of SARS-CoV-2 mutations. However, the study lacks a clear objective. The specific goals of the analyses in each subsection are unclear, as is how the results from these subsections are interconnected. Compiling results from unrelated analyses into a single paper can be confusing for readers. Clarifying the objective and narrowing down the topics would make the paper's purpose clearer.

The logic of the study is also unclear. For instance, the authors developed an evaluation score, APESS, for analyzing viral sequences. Although they state that the APESS score correlates with viral infectivity, there is no explanation in the results section about why this is the case.

The structure of the paper should be reconsidered.

Reviewer #2 (Public review):

Summary:

The authors have developed a machine learning tool AIVE to predict the infectivity of SARS-CoV-2 variants and also a scoring metric to measure infectivity. A large number of virus sequences were used with a very detailed analysis that incorporates hydrophobic, hydrophilic, acid, and alkaline characteristics. The protein structures were also considered to measure infectivity and search for core mutations. The study especially focused on the S protein of SARS-CoV-2. The contents of this study would be of interest to many researchers related to this area and the web service would be helpful to easily analyze such data without in-depth bioinformatics expertise.

Strengths:

- Analysis of large-scale data.

- Experimental validation on a partial set of searched mutations.

- A user-friendly web-based analysis platform that is made public.

Weaknesses:

- Complexity of the research.

  1. Howard Hughes Medical Institute
  2. Wellcome Trust
  3. Max-Planck-Gesellschaft
  4. Knut and Alice Wallenberg Foundation