NICEdrug.ch, a workflow for rational drug design and systems-level analysis of drug metabolism

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information

Abstract

The discovery of a drug requires over a decade of intensive research and financial investments – and still has a high risk of failure. To reduce this burden, we developed the NICEdrug.ch resource, which incorporates 250,000 bioactive molecules, and studied their enzymatic metabolic targets, fate, and toxicity. NICEdrug.ch includes a unique fingerprint that identifies reactive similarities between drug–drug and drug–metabolite pairs. We validated the application, scope, and performance of NICEdrug.ch over similar methods in the field on golden standard datasets describing drugs and metabolites sharing reactivity, drug toxicities, and drug targets. We use NICEdrug.ch to evaluate inhibition and toxicity by the anticancer drug 5-fluorouracil, and suggest avenues to alleviate its side effects. We propose shikimate 3-phosphate for targeting liver-stage malaria with minimal impact on the human host cell. Finally, NICEdrug.ch suggests over 1300 candidate drugs and food molecules to target COVID-19 and explains their inhibitory mechanism for further experimental screening. The NICEdrug.ch database is accessible online to systematically identify the reactivity of small molecules and druggable enzymes with practical applications in lead discovery and drug repurposing.

Introduction

To assure effective therapies for previously untreated illness, emerging diseases, and personalized medicine, new small molecules are needed. However, the process to develop new drugs is complex, costly, and time consuming. This is especially problematic considering that about 90% of drug candidates in clinical trials are discarded due to unexpected toxicity or other secondary effects. This inefficiency threatens our health care system and economy (Wong et al., 2019). Improving how we discover and design new drugs could reduce the time and costs involved in the developmental pipeline and hence is of primary importance to define efficient medical therapies.

Current drug discovery techniques often involve high-throughput screens with candidates and a set of target enzymes presumably involved in a disease, which leads to the selection for those candidates with the preferred activity. However, the biochemical space of small molecules and possible targets in the cell is huge, which limits the possible experimental testing. Computational methods for drug pre-screening and discovery are therefore promising. In silico, one can systematically search the maximum biochemical space for targets and molecules with desired structures and functions to narrow down the molecules to test experimentally.

There are two main in silico strategies for drug discovery: a data-driven approach based on machine learning or a mechanistic approach based on the available biochemical knowledge. Machine learning (ML) has been successfully used in all stages of drug discovery, from the prediction of targets to the discovery of drug candidates, as shown in some recent studies (Reker et al., 2020; Shilo et al., 2020; Stokes et al., 2020; Vamathevan et al., 2019). However, ML approaches require big, high-quality data sets of drug activity and associated physiology (Vamathevan et al., 2019), which might be challenging to obtain when studying drug action mechanisms and side effects in humans. ML also uses trained neural networks, which can lack interpretability and repeatability. This can make it difficult to explain why the neural networks has chosen a specific result, why it unexpectedly failed for an unseen dataset, and the final results may vary (Vamathevan et al., 2019).

Mechanistic-based approaches can also rationally identify small molecules in a desired system and do not require such large amounts of data. Such methods commonly screen based on structural similarity to a native enzyme substrate (anti-metabolite) or to a known drug (for drug repurposing), considering the complete structure of a molecule to extract information about protein–ligand fitness (Jarvis and Ouvry, 2019; Verlinde and Hol, 1994). However, respecting enzymatic catalysis, the reactive sites, and neighboring atoms play a more important role than the rest of the molecule when assessing molecular reactivity (Hadadi et al., 2019). Indeed, reactive-site-centric information might allow to identify (1) the metabolic fate and neighbors of a small molecule (Javdan et al., 2020), including metabolic precursors or prodrugs and products of metabolic degradation, (2) small molecules sharing reactivity (Lim et al., 2010), and (3) competitively inhibited enzymes (Ghattas et al., 2016). Furthermore, neither ML nor mechanistic-based approaches consider the metabolism of the patient, even though the metabolic fate of the drug and the existence of additional targets in the cell might give rise to toxicity. To our knowledge, no available method accounts for human biochemistry when refining the search for drugs.

In this study, we present the development of the NICEdrug.ch database using a more holistic and updated approach to a traditional mechanistic-based screen by (1) adding a more detailed analysis of drug molecular structures and enzymatic targets based on structural aspects of enzymatic catalysis and (2) accounting for drug metabolism in the context of human biochemistry. NICEdrug.ch assesses the similarity of the reactivity between a drug candidate and a native substrate of an enzyme based on their common reactive sites and neighboring atoms (i.e., the NICEdrug score) in an analogous fashion as the computational tool BridgIT (Hadadi et al., 2019). It also identifies all biochemical transformations in the cellular metabolism that can modify and degrade a drug candidate using a previously developed reaction-prediction tool, termed Biochemical Network Integrated Computational Explorer (BNICE.ch) (Hatzimanikatis et al., 2005; Soh and Hatzimanikatis, 2010) and the ATLAS of Biochemistry (Hadadi et al., 2016; Hafner et al., 2020). With NICEdrug.ch, we automatically analyzed the functional, reactive, and physicochemical properties of around 250,000 small molecules to suggest the action mechanism, metabolic fate, toxicity, and possibility of drug repurposing for each compound.

To prove the predictive power of NICEdrug.ch in large-scale analysis, we collected and tested over 70,000 drug–enzyme pair inhibition data from available bioassays and high-throughput screening studies. Our comparison of predicted and experimentally tested drug–enzyme pairs shows that NICEdrug.ch predictive accuracy is over 70%. Remarkably, half of the drugs in this comparison show 100% accuracy. We have listed five potential sources of disagreement for the remaining half (accuracy of 65%) including drugs acting through non-competitive inhibition, which are out of the scope of NICEdrug.ch. Moreover, we have evaluated the accuracy of NICEdrug.ch predictions on drugs and metabolites that share reactivity, drug toxicity, and drug targets using golden standard datasets, i.e., a set of experimentally observed drug metabolites (Flynn et al., 2020; Kirchmair et al., 2015), a collection of cytotoxicity bioassay records from PubChem (Svensson et al., 2017; Webel et al., 2020; Yin et al., 2019), and a collection of drug–protein interactions reported in PubChem bioassays (Kim et al., 2021; Wang et al., 2012).

We apply NICEdrug.ch to study drug action mechanisms and identify drugs for repurposing related to four diseases: cancer, high cholesterol, malaria, and COVID-19. We also sought for molecules in food, as available in fooDB the largest database of food constituents (Scalbert et al., 2011), with putative anti SARS-CoV-2 activity. Finally, we provide NICEdrug.ch as an online resource (https://lcsb-databases.epfl.ch/pathways/Nicedrug/). Overall, NICEdrug.ch combines knowledge of molecular structures, enzymatic reaction mechanisms (as included in BNICE.ch; Finley et al., 2009; Hadadi and Hatzimanikatis, 2015; Hatzimanikatis et al., 2005; Henry et al., 2010; Soh and Hatzimanikatis, 2010; Tokic et al., 2018), and cellular biochemistry (currently human, Plasmodium, and Escherichia coli metabolism) to provide a promising and innovative resource to accelerate the discovery and design of novel drugs.

Results

NICEdrug.ch discovers 200,000 bioactive molecules one reaction away from known drugs in a human cell

To build the initial NICEdrug.ch database, we gathered over 70,000 existing small molecules presumed suitable for treating human diseases from three source databases: KEGG, ChEMBL, and DrugBank (Figure 1—figure supplement 1, method). We eliminated duplicate molecules, curated available information, computed thermodynamic properties, and applied the Lipinski rules (Lipinski et al., 2001) to keep only the molecules that have drug-like properties in NICEdrug.ch (Figure 1, ‘Materials and methods’). NICEdrug.ch currently includes 48,544 unique small molecules from the source databases.

Figure 1 with 3 supplements see all

Download asset Open asset

Pipeline to construct and use the NICEdrug.ch database.

NICEdrug.ch (1) curates available information and calculates the properties of an input compound; (2) identifies the reactive sites of that compound; (3) explores the hypothetical metabolism of the compound in a cell; (4) stores all functional, reactive, bio-, and physico-chemical properties in open-source database; and (5) allows generation of reports to evaluate (5a) reactivity of a small molecule, (5b) drug repurposing, and (5c) druggability of an enzymatic target. See also Figure 1—figure supplement 1; Figure 1—figure supplement 2; Figure 1—figure supplement 3, and Supplementary file 1 .

To evaluate the reactivity of the 48,544 drugs and drug candidates, we searched for all possible reactive sites on each molecule with BNICE.ch (Hatzimanikatis et al., 2005; Figure 1, ‘Materials and methods’). All of the 48,544 molecules contain at least one reactive site and hence might be reactive in a cell. In total, we identified more than 5 million potential reactive sites (183 k unique) on the 48,544 molecules and matched them to a corresponding enzyme by assigning them to an Enzyme Commission (EC) number. All of these enzymes belong to the human metabolic network (Supplementary file 1, ‘Materials and methods’). Interestingly, 10.4% of identified reactive sites correspond to the p450 class of enzymes, which are responsible for breaking down compounds in the human body by introducing reactive groups on those compounds, also known as phase I of drug metabolism (Figure 1—figure supplement 2A). The sites that were identified varied greatly from simple and small (i.e., comprising a minimum number of one atom) to more complex sites that covered a large part of the molecule. The biggest reactive site includes 30 atoms (Figure 1—figure supplement 2B).

Given the important role of metabolism in the biochemical transformations and toxicity of drugs (Dumoulin et al., 2020), we investigated the metabolism of the 48,544 input molecules in human cells. We predicted the hypothetical biochemical neighborhoods of all NICEdrug.ch small molecules in a human cell (i.e., reacting with known human metabolites and cofactors) using a retro-biosynthetic analysis with BNICE.ch (Figure 1—figure supplement 1, ‘Materials and methods’). With this approach, we discovered 197,246 unique compounds connected to the input drugs and drug candidates via one step or reaction (products of the first generation), and the associated hypothetical biochemical neighborhood consists of 630,449 reactions (Figure 1—figure supplement 2). The 197,246 unique compounds are part of a new set of bioactive molecules in NICEdrug.ch that might act as drugs or prodrugs in a human cell. We stored the total number of 245,790 small molecules (including the curated set of 48,544 drugs and drug candidates and the new set of 197,246 bioactive compounds), their calculated properties, and biochemistry in our open-access database of drug metabolism, NICEdrug.ch.

To use NICEdrug.ch to identify drug-drug or drug–metabolite pairs that have shared reactivity and target enzymes, we developed a new metric called the NICEdrug score (Figure 1—figure supplement 3). The NICEdrug score uses information about the structure of the reactive site and its surroundings (as computed using the BridgIT methodology) and is stored in the form of a fingerprint (‘Materials and methods’). The fingerprint of a molecule’s reactive site and the neighborhood around this reactive site—termed the reactive site-centric fingerprint—serves to compare this site-specific similarity with other molecules. We recently showed that the reactive site-centric fingerprint of a reaction provides a better predictive measure of similar reactivity than the overall molecular structure, as the overall structure can be much larger than the reactive site and skew the results by indicating high similarities when the reactivity is actually quite different (Hadadi et al., 2019). Here, we generated reactive site-centric fingerprints for all 20 million reactive sites identified in the 48,544 drug–drug candidates and 197,246 one-step-away molecules included in NICEdrug.ch. The 20 million reactive site-centric fingerprints for the total 245,790 small molecules are available in NICEdrug.ch to be used in similarity comparisons and classifying molecules (‘Materials and methods’).

We propose the usage of NICEdrug.ch to generate reports that define the hypothetical reactivity of a molecule, the molecule’s reactive sites as identified by target enzymes, and the NICEdrug score between drug–drug and drug–metabolite pairs. The NICEdrug.ch reports can be used for three main applications: (1) to identify the metabolism of small molecules; (2) to suggest drug repurposing; and (3) to evaluate the druggability of an enzyme in a desired cell or organism (Figure 1), as we show in the next sections. Currently, NICEdrug.ch includes metabolic information for human cells, a malaria parasite, and Escherichia coli, and it is easily extendible to other organisms in the future.

Validation of NICEdrug.ch against biochemical assays

To prove the potential of NICEdrug.ch to predict the druggability (through competitive inhibition) of an enzyme by a small molecule, we compare a set of 70 k NICEdrug.ch drug–enzyme pair predictions with available biochemical assays and high-throughput compound screenings (Supplementary file 2, ‘Materials and methods’). The set of 70 k drug–enzyme pairs involves all available active and inactive inhibition data for 2570 small molecules and 198 enzymes in the PubChem Bioassays database (Wang et al., 2012). A comparison between the drugs’ predicted and measured bioactivity against enzymes results in a predictive accuracy of NICEdrug.ch of 0.73. Interestingly, we identify two clusters of drugs: a set of 1269 small molecules for which the NICEdrug.ch predictions are 100% accurate and a set of 1301 drugs with 65% accuracy. We investigated the reasons for the mismatches and identify five explanations (Supplementary file 2, ‘Materials and methods’).

We have also compared the scope and application of NICEdrug.ch and other available computational drug discovery tools (Supplementary file 2), and we show how NICEdrug.ch outperforms the scope and predictive potential of all these tools (Supplementary file 2, ‘Materials and methods’).

We also quantitatively compared the accuracy of NICEdrug.ch predictions over similar methods in the field using golden standard datasets. These experimental datasets describe drugs and metabolites that share reactivity, drug toxicities, and drug targets. Hence, they serve to evaluate the NICEdrug.ch reactivity, druggability, and repurposing reports.

Evaluation of NICEdrug.ch reactivity report

To evaluate the NICEdrug.ch reactivity report, we first used an experimental set including 29 small molecules and their 55 unique metabolic products (labeled in public databases) (Flynn et al., 2020). We compared the predictive accuracy of NICEdrug.ch with other tools predicting reactivity, i.e., XenoNet (Flynn et al., 2020), GLORY (de Bruyn Kops et al., 2021; de Bruyn Kops et al., 2019), SyGMa (Ridder and Wagener, 2008), and BioTransformer (Djoumbou-Feunang et al., 2019). NICEdrug.ch predicted 53 of the 55 metabolic products from the small molecule dataset, rendering a sensitivity score of 0.96. The two metabolites missing are venetoclax and SCHEMBL18637099, which are produced through at least one reaction with an unknown reaction mechanism and hence are out of the scope of NICEdrug.ch. The tools XenoNet, GLORY, SyGMa, and BioTransformer showed a sensitivity score of 0.89, 0.83, 0.74, and 0.72 on the same dataset. To this end, not only NICEdrug.ch outperforms previous tools, but it also provides information on the metabolic pathways and reaction mechanisms involved in the production of each metabolic product (see ‘Materials and methods’, Supplementary file 2). We next evaluated the NICEdrug.ch reactivity with a second dataset including 16 pairs of drugs and metabolite that share reactivity (Kirchmair et al., 2015) (‘Materials and methods’, Supplementary file 2). NICEdrug.ch correctly identified the pathways metabolizing 15 of the drugs and the associated metabolites sharing reactivity (Supplementary file 2).

Evaluation of NICEdrug.ch toxicity report

As done before (Svensson et al., 2017; Webel et al., 2020; Yin et al., 2019), we used cytotoxicity bioassay records from PubChem (Svensson et al., 2017) involving 1777 drugs to evaluate the NICEdrug.ch toxicity report. Other available tools predict drug toxicity using machine learning. The accuracy of the machine-learning-based methods ranged from 0.67 to 0.78, as previously reported (Svensson et al., 2017; Yin et al., 2019). For the same dataset, NICEdrug.ch shows an accuracy of 0.94, with a precision, recall, and F1 of 0.94, 0.92, and 0.96, respectively (‘Materials and methods’, Supplementary file 2).

Evaluation of NICEdrug.ch druggability report

We compared the NICEdrug.ch druggability report with the widely used ‘network-based inference (NBI)’ tool for drug–target interaction (DTI) prediction. As a basis for this comparison, we used the high-quality drug–enzyme bioassay data from PubChem (Kim et al., 2021; Wang et al., 2012), which includes 651 records reporting the inhibition of 78 enzymes by 297 molecules. The area under the curve (AUC), a commonly used criterion for assessing computational target prediction methods (Mayr et al., 2018), quantified a remarkable improvement in the overall performance of NICEdrug.ch (0.85) over the NBI tool (0.61). Further analysis found that the optimal druggability scores is 0.46, with precision, recall, and F1 values of 0.88, 0.89, and 0.89, respectively (‘Materials and methods’, Supplementary file 2).

NICEdrug.ch suggests inhibitory mechanisms of the anticancer drug 5-FU and avenues to alleviate its toxicity

As a case study, we used NICEdrug.ch to investigate the mode of action and metabolic fate of one of the most commonly used drugs to treat cancer, 5-fluorouracil (5-FU), by exploring its reactivity and the downstream products or intermediates that are formed during the cascade of biochemical transformations. 5-FU interferes with DNA synthesis as an anti-metabolite (Longley et al., 2003), meaning that its various intermediates like 5-fluorodeoxyuridine monophosphate (FdUMP) are similar enough to naturally occurring substrates and they can act as competitive inhibitors in the cell.

We therefore used NICEdrug.ch to study the intermediates of 5-FU that occurred between one to four reaction steps away from 5-FU (Supplementary file 3), which is a reasonable range to occur in the body after 5-FU treatment (Testa, 2010). This analysis identified 407 compounds (90 biochemical and 317 chemical molecules) that have the biochemical potential to inhibit certain enzymes. Because the NICEdrug score that analyses reactive site and neighborhood similarities can serve as a better predictor of metabolite similarity, we assessed the NICEdrug score of the intermediates compared to human metabolites. This resulted in a wide range of NICEdrug scores between the different 5-FU intermediates and human metabolites, ranging from no similarity at a NICEdrug score of 0 to the equivalent substructure on a compound at a NICEdrug score of 1. More importantly, some of the 407 metabolite inhibitors (as explained next) were known compounds that have been investigated for their effects on 5-FU toxicity, but most of these compounds were newly identified by NICEdrug.ch and could therefore serve as avenues for future research into alleviating the side effects of this drug.

We investigated these 407 compounds in more detail, looking first at the set of already validated metabolite inhibitors. 5-Fluorouridine (two steps away from 5-FU) and UDP-L-arabinofuranose (four steps away from 5-FU) are very similar to uridine, with NICEdrug scores of 0.95 and 1, respectively. Uridine is recognized as a substrate by two human enzymes: cytidine deaminase (EC: 3.5.4.5) and 5'-nucleotidase (EC: 3.1.3.5) (Figure 2). Therefore, NICEdrug.ch predictions show that the degradation metabolism of 5-FU generates downstream molecules similar to uridine, which likely leads to the inhibition of these two enzymes. This effect has already been investigated as a potential method for reducing the toxicity of 5-FU, wherein it was proposed that high concentrations of uridine could compete with the toxic 5‐FU metabolites (Ma et al., 2017).

Figure 2

Download asset Open asset

Similarity in reactive site and neighborhood defines para-metabolites in 5-FU metabolism and inhibited human metabolic enzymes.

Eight para-metabolites in the 5-FU metabolic neighborhood (represented as defined in ‘Materials and methods’). We show the most similar native human metabolites, inhibited enzymes, and native products of the reactions. See also Supplementary file 3 .

NICEdrug.ch also identified a few potential metabolites that have not been previously studied for their effects. These metabolites share a reactive site with native human metabolites and differ in the reactive site neighborhood, and we refer to them as para-metabolites (Sartorelli and Johns, 2013). 6-Methyl-2'-deoxyadenosine, purine-deoxyribonucleoside, and 2′-deoxyisoguanosine structurally resemble the reactive site neighborhood of deoxyadenosine, with respective NICEdrug scores of 1, 1, and 0.91. Similarly, 2-aminoadenosine, 2-chloroadenosine, and 2-methylaminoadenosine (four steps from 5-FU) have the same reactive site neighborhood as adenosine, with NICEdrug scores of 1, 1, and 0.96, respectively. Adenosine and deoxyadenosine are both native substrates of the adenosine kinase (EC: 2.7.1.20) and 5′-nucleotidase (EC: 3.1.3.5) (Figure 2). Therefore, we suggest that the 5-FU derivatives 2-aminoadenosine and 2-chloroadenosine are competitive inhibitors for the two enzymes adenosine kinase and 5′-nucleotidase. With these new insights from NICEdrug.ch, we hypothesize that co-administering adenosine or deoxyadenosine and uridine (Figure 2) with 5-FU might be required to reduce its toxic effects and hopefully alleviate the side effects of the 5-FU cancer treatment.

Metabolic degradation of 5-FU leads to compounds with Fluor in their reactive site that are less reactive and more toxic than other intermediates

In the previous case study, we showed inhibitors that contain the identical active site to the native enzyme. However, a slightly different reactive site might still be able to bind to an enzyme and compete with a native substrate, also defined as anti-metabolite (Matsuda et al., 2014). We explored this scenario by defining relaxed constraints in two steps. We first identified all atoms around a reactive site to compare the binding characteristics between the native molecule and putative inhibitor. Next, we compared the reactive site of the native molecule and putative inhibitor and scored the latter based on similarity (‘Materials and methods’). Following these two steps, we assessed the similarity between intermediates in the 5-FU metabolic neighborhood and human metabolites. Among all 407 compounds in the 5-FU metabolism (Supplementary file 3), we found eight that show a close similarity to human metabolites (NICEdrug score above 0.9, Figure 3) that might be competitive inhibitors or anti-metabolites. Inside the reactive site, the original hydrogen atom is bioisosterically replaced by fluorine. F–C bonds are extremely stable and therefore block the active site by forming a stable complex with the enzyme. The inhibitory effect of the intermediates tegafur, 5-fluorodeoxyuridine, and FdUMP (one to two reaction steps away) has been confirmed in studies by Kobayakawa and Kojima, 2011 and Bielas et al., 2009. In addition, NICEdrug.ch also predicts that 5flurim, 5-fluorodeoxyuridine triphosphate, 5-fluorodeoxyuridine triphosphate, 5-fluorouridine diphosphate, and 5-fluorouridine triphosphate, some of which occur further downstream in the 5-FU metabolism, also act as anti-metabolites (Figure 3). Based on the insights from NICEdrug.ch, we suggest the inhibitory and side effect of 5-FU treatment might be more complex than previously thought. 5-FU downstream products are structurally close to human metabolites and might form stable complexes with native enzymes. This knowledge could serve to further refine the pharmacokinetic and pharmacodynamic models of 5-FU and ultimately the dosage administered during treatment.

Figure 3

Download asset Open asset

A different reactive site but similar neighborhood defines top anti-metabolites in 5-FU metabolism and inhibited human metabolic enzyme.

Eight anti-metabolites of dUMP in the 5-FU metabolic neighborhood (represented as defined in ‘Materials and methods’). Note that the reactive site of the anti-metabolites is different than the one of the native human metabolite, but the neighborhood is highly similar, which determines the high NICEdrug score (value in parenthesis). We show the inhibited human enzyme (dTMP synthase) and reaction, and its native product. See also Supplementary file 3

NICEdrug.ch identifies toxic alerts in the anticancer drug 5-FU and its products from metabolic degradation

The concept of drug toxicity refers not to overdoses but instead to the toxic effects at medical doses (Guengerich, 2011), which often occur due to the degradation products generated through drug metabolism. Extensive efforts have been expended to identify toxic molecules or, more generally, to extract the substructures that are responsible for toxicity (called structural alerts). The Liver Toxicity Knowledge Base (LTKB) and the super toxic database include 1036 and about 60 k toxic molecules, respectively (Schmidt et al., 2009; Thakkar et al., 2018). ToxAlert provides around 1200 alerts related to different forms of toxicity (Sushko et al., 2012). However, the number of molecules that are analyzed and labeled as toxic in databases is disproportionally low compared to the space of compounds. Additionally, structural alerts are indicated for many compounds, and current alerts might identify redundant and over-specific substructures, which questions their reliability (Yang et al., 2017).

To quantify the toxicity of downstream products of drugs in NICEdrug.ch, we collected all of the molecules cataloged as toxic in the LTKB and super toxic databases (approved toxic molecules) along with their lethal dose (LC₅₀), as well as the existing structural alerts provided by ToxAlert. We measured the similarity of an input molecule with all approved toxic molecules using the reactive site-centric fingerprints implemented in BridgIT and the NICEdrug score (‘Materials and methods’). Next, we scanned both the toxic reference molecule and the input molecule for structural hints of toxicity, referred to here as NICEdrug toxic alerts. We kept common NICEdrug toxic alerts between the reference, which is a confirmed toxic compound, and input molecule. With this procedure in place, NICEdrug.ch finds for each input molecule the most similar toxic molecules, along with their common toxic alerts, and serves to assess the toxicity of a new molecule based on the mapped toxic alerts. Additionally, the NICEdrug toxic alerts and toxicity level of drug intermediates can be traced with NICEdrug.ch through the whole degradation pathway to reveal the origin of the toxicity.

As an example, we herein tested the ability of NICEdrug.ch to identify the toxicity in 5-FU metabolism. First, we queried the toxicity profile of all intermediates in the 5-FU metabolic neighborhood, integrating both known and hypothetical human reactions (‘Materials and methods’). In this analysis, we generated all compounds up to four steps away from 5-FU. Based on the toxicity report of each potential degradation product, we calculated a relative toxicity metric that adds the LC₅₀ value, NICEdrug score, and number of common NICEdrug toxic alerts with all approved toxic drugs (‘Materials and methods’). We generated the metabolic neighborhood around 5-FU and labeled each compound with our toxicity metric (Supplementary file 3). Interestingly, we show that the top most toxic intermediates match the list of known three toxic intermediates in 5-FU metabolism (Figure 4; Krauß and Bracher, 2018). Based on the toxicity analysis in NICEdrug.ch for 5-FU, we hypothesize there are highly toxic products of 5-FU drug metabolism that had not been identified either experimentally or computationally and it might be necessary to experimentally evaluate their toxicity to recalibrate the dosage of 5-FU treatment.

Figure 4

Download asset Open asset

Comparing downstream products to known toxic molecules and analyzing their common structural toxic alerts explains metabolic toxicity of 5-FU.

Example of six suggested toxic molecules in the 5-FU metabolic neighborhood (represented as defined in ‘Materials and methods’). We show toxic compounds from the supertoxic and hepatotoxic databases that lead to the highest NICEdrug toxicity score (number under toxic intermediate name, ‘Materials and methods’). We highlight functional groups linked to five NICEdrug toxic alerts (legend bottom right). See also Supplementary file 3.

The nicedrug.ch reactive site-centric fingerprint accurately clusters statins of type I and II and guides drug repurposing

Because potential side effects of a drug are documented when the drug passes the approval process, repurposing approved drugs for other diseases can reduce the medical risks and development expenses (Himmelstein et al., 2017). For instance, the antitussive noscapine has been repurposed to treat some types of cancers (Mahmoudian and Rahimi-Moghaddam, 2009; Rajesh, 2011). Because NICEdrug.ch can search for functional (i.e., reactivity), structural (i.e., size), and physicochemical (i.e., solubility) similarities between molecules while accounting for human biochemistry, we wanted to determine whether NICEdrug.ch could therefore suggest drug repurposing strategies.

As a case study, we investigated the possibility of drug repurposing to replace statins, which are a class of drugs often prescribed to lower blood cholesterol levels and to treat cardiovascular disease. Indeed, data from the National Health and Nutrition Examination Survey indicate that nearly half of adults 75 years and older in the United States use prescription cholesterol-lowering statins (Bibbins-Domingo et al., 2016). Since some patients do not tolerate these drugs and many still do not reach a safe blood cholesterol level (Kong et al., 2004), there is a need for alternatives. Being competitive inhibitors of the cholesterol biosynthesis enzyme 3-hydroxy-3-methyl-glutaryl-coenzyme A reductase (HMG-CoA reductase) (Jiang et al., 2018; Mulhaupt et al., 2003), all statins share the same reactive site. BNICE.ch labeled this reactive site, in a linear or circular form, as corresponding to an EC number of 4.2.1.- (Istvan and Deisenhofer, 2001). NICEdrug.ch includes 254 molecules with the same reactive site that are recognized by enzymes of EC class 4.2.1.-, ten of which are known statins. We used the NICEdrug score to cluster the 254 molecules into different classes (Supplementary file 4, Figure 5). Two of the classes correspond to all currently known statins, which are classified based on their activity into types 1 and 2, wherein statins of type two are less active and their reactive site is more stable compared to type 1. This property is well distinguished in the clustering based on the NICEdrug score (Figure 5A).

Figure 5 with 1 supplement see all

Download asset Open asset

Clustering of molecules with statin reactive sites based on NICEdrug score suggests drugs for repurposing.

(A) Pairwise NICEdrug score between all molecules with statin reactive sites (heat map) and number of metabolic reactions in which they participate (right). We highlight clusters of statins of type 1 (cluster a) and type 2 (cluster b), and clusters of most similar molecules to type one statins (cluster c) and type two statins (cluster d). Within the metabolic reactions, we indicate the total number of reactions (dark color) and the number of reactions that involve the statin reactive site (light color). (B) Examples of statins and Mevastatin analogues of type one from cluster c (blue) and of type two from cluster d (gold). We left the known statins unmarked, which are appropriately clustered together based on the NICEdrug score, and we mark with * new molecules that cluster with statins and that NICEdrug.ch suggests could be repurposed to act as statins. Reactive sites in type one statins and type two statins are colored in blue and orange, respectively. The reactive site neighborhood as considered in the NICEdrug score is also marked. See also; Figure 5—figure supplement 1 , and Supplementary file 4.

In addition to properly classifying the 10 known statins (Figure 5B,C, molecules non-marked), we identified seven other NICEdrug.ch molecules that clustered tightly with these statins (Figure 5B,C, molecules marked with *). These new molecules share the same reactive site and physicochemical properties, and they have the highest similarity with known statins in atoms neighboring the reactive site. In a previous study by Endo and Hasumi, 1993, these seven NICEdrug.ch molecules were introduced as Mevastatin analogues for inhibiting cholesterol biosynthesis. Therefore, they were already suggested as possible candidates for treating high blood cholesterol and could be a good option for repurposing. Furthermore, we found eight known drugs not from the statin family among the 254 scanned molecules (Supplementary file 4). One of them, acetyl-l-carnitine (Figure 5C, molecule marked with **), is mainly used for treating neuropathic pain (Li et al., 2015), though Tanaka et al., 2004 have already confirmed that it also has a cholesterol-reducing effect.

Overall, NICEdrug.ch was able to characterize all known enzymatic reactions that metabolize statins, including proposed alternatives and new hypothetical reactions that could be involved in their metabolism within human cells (Figure 5A, Figure 5—figure supplement 1). The identification of seven drugs that clustered around the statins and were already designed as alternatives to statins confirms the power of NICEdrug.ch and the NICEdrug score to search large databases for similar compounds in structure and function. Furthermore, the discovery of the eight compounds unrelated to known statins offer multiple candidate drugs for repurposing along with a map of their metabolized intermediates for the treatment of high cholesterol, though further preclinical experiments would be required to verify their clinical benefits.

NICEdrug.ch suggests over 500 drugs and drug candidates to target liver-stage malaria and simultaneously minimize side effects in human cells, with shikimate 3-phosphate as a top candidate

Efficiently targeting malaria remains a global health challenge. Malaria parasites (Plasmodium) are developing resistance to all known drugs, and antimalarials cause many side effects (World Health Organization, 2018). We applied NICEdrug.ch to identify drug candidates that target liver-stage developing malaria parasites and lessen or avoid side effects in human cells.

We previously reported 178 essential genes and enzymes for liver-stage development in the malaria parasite Plasmodium berghei (Stanway et al., 2019; Supplementary file 5, ‘Materials and methods’). Of 178 essential Plasmodium enzymes, 32 enzymes are not essential in human cells (Wang et al., 2015; Supplementary file 5, ‘Materials and methods’). We extracted all molecules catalyzed by these 32 enzymes uniquely essential in Plasmodium, which resulted in 68 metabolites and 157 unique metabolite–enzyme pairs (Supplementary file 5, ‘Materials and methods’). We used NICEdrug.ch to examine the druggability of the 32 essential Plasmodium enzymes with the curated 48,544 drugs and drug candidates (Figure 1) and the possibility of repurposing them to target malaria.

We considered as candidates for targeting liver-stage malaria as the drugs or their metabolic neighbors that show a good NICEdrug score (NICEdrug score above 0.5) with any of the 157 Plasmodium metabolite–enzyme pairs. We identified 516 such drug candidates, targeting 16 essential Plasmodium enzymes (Supplementary file 6, ‘Materials and methods’). Furthermore, 1164 other drugs appear in the metabolic neighborhood of the 516 identified drugs (between one and three reaction steps away). Interestingly, of the 516 identified drug candidates, digoxigenin, estradiol-17beta, and estriol have been previously validated as antimalarials (Antonova-Koch et al., 2018), and NICEdrug.ch suggests their antimalarial activity relies on the competitive inhibition of the KRC enzyme (Figure 6). This enzyme is part of both the steroid metabolism and the fatty acid elongation metabolism, which we recently showed is essential for Plasmodium liver-stage development (Stanway et al., 2019). Among the 516 NICEdrug.ch antimalarial candidates, there are also 89 molecules present in the metabolic neighborhood of antimalarial drugs approved by Antonova-Koch et al., 2018, which suggests these antimalarials might be prodrugs (Supplementary file 6).

Figure 6

Download asset Open asset

NICEdrug.ch suggests shikimate 3-phosphate as a top candidate to target liver-stage malaria and minimize side effects in host human cells.

(A) Schema of ideal scenario to target malaria, wherein a drug efficiently inhibits an essential enzyme for malaria parasite survival and does not inhibit essential enzymes in the host human cell to prevent side effects. (B) Shikimate 3-phosphate inhibits enzymes in the *Plasmodium* shikimate metabolism, which is essential for liver-stage development of the parasite. Shikimate 3-phosphate does not inhibit any enzyme in the human host cell since it is not a native human metabolite, and it does not show similarity to any native human metabolite. (C) Mechanistic details of inhibition of aroC by shikimate 3-phosphate and other NICEdrug candidates. See also Supplementary file 5; Supplementary file 6.

Being an intracellular parasite, antimalarial treatments should be efficient at targeting Plasmodium as well as assure the integrity of the host cell (Figure 6A). To tackle this challenge, we identified 1497 metabolites participating in metabolic reactions catalyzed with essential human enzymes (Supplementary file 5, ‘Materials and methods’) and excluded the antimalarial drug candidates that shared reactive site-centric similarity with the extracted human metabolite set (to satisfy NICEdrug score below 0.5). Of all 516 drug candidates that might target liver-stage Plasmodium, a reduced set of 64 molecules minimize the inhibition of essential human enzymes (Supplementary file 6, ‘Materials and methods’) and are hence optimal antimalarial candidates.

Among our set of 64 optimal antimalarial candidates, a set of 14 drugs targeting the Plasmodium shikimate metabolism, whose function is essential for liver-stage malaria development (Stanway et al., 2019), arose as the top candidate because of its complete absence in human cells. The set of drug candidates targeting shikimate metabolism include 40 prodrugs (between one and three reaction steps away) that have been shown to have antimalarial activity (Antonova-Koch et al., 2018; Supplementary file 6). NICEdrug.ch identified molecules among the prodrugs with a high number of toxic alerts, like nitrofen. It also identified four molecules with scaffolds similar (two or three steps away) to the 1-(4-chlorobenzoyl)pyrazolidin-3-one of shikimate and derivatives. This result suggests that downstream compounds of the 40 prodrugs might target the Plasmodium shikimate pathway, but also might cause side effects in humans (Supplementary file 6).

To this end, NICEdrug.ch identified shikimate 3-phosphate as a top candidate antimalarial drug. We propose that shikimate 3-phosphate inhibits the essential Plasmodium shikimate biosynthesis pathway without side effects in the host cell (Figure 6, Supplementary file 6). Excitingly, shikimate 3-phosphate has been used to treat E. coli and Streptococcus infections without appreciable toxicity for patients (Díaz-Quiroz et al., 2018). Furthermore, recent studies have shown that inhibiting the shikimate pathway using 7-deoxy-sedoheptulose is an attractive antimicrobial and herbicidal strategy with no cytotoxic effects on mammalian cells (Brilisauer et al., 2019). Experimental studies should now validate the capability of shikimate 3-phosphate to efficiently and safely target liver malaria, and could further test other NICEdrug.ch antimalarial candidates (Supplementary file 6).

NICEdrug.ch identifies over 1300 molecules to fight COVID-19, with N-acetylcysteine as a top candidate

SARS-CoV-2 is responsible for the currently on-going COVID-19 pandemic and the death of over three million people (as of today, 11 May 2021 [Dong et al., 2020]), and there is currently no confirmed treatment for it. Attacking the host factors that allow replication and spread of the virus is an attractive strategy to treat viral infections like COVID-19. A recent study has identified 332 interactions between SARS-CoV-2 proteins and human proteins, which involve 332 hijacked human proteins or host factors (Gordon et al., 2020). Here, we first used NICEdrug.ch to identify inhibitors of enzymatic host factors of SARS-CoV-2. Targeting such human enzymes prevents interactions between human and viral proteins (PPI) (‘Materials and methods’, Figure 7A). Of the 332 hijacked human proteins, we identified 97 enzymes (‘Materials and methods’, Supplementary file 7) and evaluated their druggability by inhibitors among the 250,000 small molecules in NICEdrug.ch and 80,000 molecules in food (‘Materials and methods’, Figure 7A). NICEdrug.ch suggests 22 hijacked human enzymes can be drug targets and proposed 1301 potential competitive inhibitors from the NICEdrug.ch database. Of 1301 potential inhibitors, 465 are known drugs, 712 are active metabolic products of 1419 one-step-away prodrugs, and 402 are molecules in fooDB (Supplementary file 7). We found among the top anti SARS-CoV-2 drug candidates the known reverse transcriptase inhibitor didanosine (Figure 7B, Supplementary file 7), which other in silico screenings have also suggested as a potential treatment for COVID-19 (Alakwaa, 2020; Cava et al., 2020). Among others, NICEdrug.ch also identified: (1) actodigin, which belongs to the family of cardiotonic molecules proven to be effective against MERS-CoV but without mechanistic knowledge (Ko et al., 2020), (2) three molecules in ginger (6-paradol, 10-gingerol, and 6-shogaol) inhibiting catechol methyltransferase, and (3) brivudine, a DNA polymerase inhibitor that has been used to treat herpes zoster (Wassilew, 2005) and prevent MERS-CoV infection (Park et al., 2019), and NICEdrug.ch suggests it for repurposing (Figure 7—figure supplement 1, Supplementary file 7).

Figure 7 with 2 supplements see all

Download asset Open asset

NICEdrug.ch strategy to fight COVID-19, and NICEdrug.ch candidate inhibitors of SARS-CoV-2 host factors: reverse transcriptase and HDAC2.

(A) Schema of NICEdrug strategy to target COVID-19, wherein a drug (top-left) or molecules in food (top-right) efficiently inhibit a human enzyme hijacked by SARS-CoV-2. Inhibition of this host factor reduces or abolishes protein–protein interactions (PPI) with a viral protein and prevents SARS-CoV-2 proliferation. (B) Inhibition of the reverse transcriptase (EC: 1.1.1.205 or P12268) and the PPI with SARS-CoV-nsp14 by didanosine based on NICEdrug.ch. (C) Inhibition of the HDAC2 (EC: 3.5.1.98) and the PPI with SARS-CoV-nsp5 by molecules containing acetyl moiety (like melatonin, N-acetylcysteine, and N8-acetylspermidine), and molecules containing carboxylate moiety (like valproate, stains, and butyrate) based on NICEdrug.ch. See also Figure 7—figure supplement 1; Figure 7—figure supplement 2, Supplementary file 7; Supplementary file 8.

Drugs like remdesivir, EIDD-2801, favipiravir, and inhibitors of angiotensin converting enzyme 2 (ACE2) have been used to treat COVID-19 (Jeon et al., 2020), and act through a presumably effective inhibitory mechanism (Figure 7—figure supplement 2). For instance, the three drugs remdesivir, EIDD-2801, and favipiravir are believed to inhibit the DNA-directed RNA polymerase (EC: 2.7.7.6). Here, we used the NICEdrug.ch reactive site-centric fingerprint to search for alternative small molecules in NICEdrug.ch and fooDB that could be repurposed to target ACE2 and DNA-directed RNA polymerase. NICEdrug.ch identified a total of 215 possible competitive inhibitors of ACE2. Among those is captopril, a known ACE2 inhibitor (Kim et al., 2003), and D-leucyl-N-(4-carbamimidoylbenzyl)-l-prolinamide, a NICEdrug.ch suggestion for drug repurposing to treat COVID-19. We also found 39 food-based molecules with indole-3-acetyl-proline (a molecule in soybean) as top ACE2 inhibitor candidate (Figure 7—figure supplement 2, Supplementary file 8). To target the same enzyme as remdesivir, EIDD-2801, and favipiravir, NICEdrug.ch identified 1115 inhibitors of the DNA-directed RNA polymerase, like the drug vidarabine, which shows broad spectrum activity against DNA viruses in cell cultures and significant antiviral activity against infections like the herpes viruses, the vaccinia virus, and varicella zoster virus (Suzuki et al., 2006). We further found 556 molecules in food that might inhibit DNA-directed RNA polymerase, like trans-zeatin riboside triphosphate (FDB031217) (Supplementary file 8).

One of the host factors identified by Gordon et al., 2020 is the histone deacetylase 2 (HDAC2), which acetylates proteins and is an important transcriptional and epigenetic regulator. The acetyl and carboxylate moieties are the reactive sites of the forward (N6-acetyl-l-lysyl-[histone]) and reverse (acetate) biotransformation of HDAC2, respectively (Figure 7). NICEdrug.ch recognized a total of 640 drugs for repurposing that can inhibit HDAC2, including 311 drugs sharing the acetyl moiety and showing a NICEdrug score above 0.5 with respect to N6-acetyl-l-lysyl-[histone], and 329 drugs sharing the carboxylate moiety and presenting a NICEdrug score above 0.5 with acetate (‘Materials and methods’). Among the drugs sharing the acetyl reactive site, we identified the known HDAC2 inhibitor melatonin (Wu et al., 2018), and to our knowledge new candidates like N-acetylhistamine and N-acetylcysteine. We also located 22 molecules in food with potential HDAC2 inhibitory activity, like N8-acetylspermidine (FDB022894) (Figure 7C, Supplementary file 8). Drugs sharing the carboxylate reactive site (as identified with NICEdrug) include the known HDAC2 inhibitors valproate, butyrate, phenyl butyrate (Abdel-Atty et al., 2014) and statins (Kong et al., 2004; Figure 7C, Supplementary file 8). Interestingly, statins have been shown to have protective activity against SARS-CoV-2 (Lodigiani et al., 2020; Zhang et al., 2020). In addition, the NICEdrug.ch candidate N-acetylcysteine is a commonly used mucolytic drug that is sometimes considered as a dietary supplement and has putative antioxidant properties. Indeed, N-acetylcysteine is believed for long to be a precursor of the cellular antioxidant glutathione (Mårtensson et al., 1989), but has unknown pharmacological action. NICEdrug.ch suggests that N-acetylcysteine might present a dual antiviral activity: firstly, N-acetylcysteine is converted to cysteine by HDAC2 and by that means, it is competitively inhibiting the native function of HDAC2 and interactions with viral proteins (Figure 7C, Supplementary file 8). Cysteine next fuels the glutathione biosynthesis pathway and produces glutathione in two steps.

Given the high coverage of validated molecules with activity against SARS-CoV-2 that NICEdrug.ch captured in this unbiased and reactive site-centric analysis, we suggest there might be other molecules in the set of 1300 NICEdrug.ch candidates that could also fight COVID-19. Excitingly, there are many molecules that can be directly tested since these are drugs that have already passed all safety regulations or are molecules present in food, like N-acetylcysteine for which we further reveal an action mechanism behind its potential anti-SARS-CoV-2 activity. Other new candidates for which no safety data is available should be further validated experimentally and clinically. The mechanistic analyses provided by NICEdrug.ch could also guide new pharmacokinetic and pharmacodynamic models simulating SARS-CoV-2 infection and treatment.

Discussion

To systematically illuminate the metabolism and all enzymatic targets (competitively inhibited) of known drugs and hypothetical prodrugs to aid in the development of new therapeutic compounds, we used a proven reaction–prediction tool BNICE.ch (Hatzimanikatis et al., 2005) and an analysis of neighboring atoms of reactive sites analogous to BridgIT (Hadadi et al., 2019) and performed the first large-scale computational analysis of drug biochemistry and toxicity in the context of human metabolism. The analysis involved over 250,000 small molecules, and curation and computation of bio- and physico-chemical drug properties that we assembled in an open-source drug database NICEdrug.ch that can generate detailed drug metabolic reports and can be easily accessed and used by researchers, clinicians, and industry partners. NICEdrug.ch revealed 20 million potential reactive sites at the 250,000 small molecules of the database, and there exist over 3000 enzymes in the human metabolism that can be inhibited with the 250,000 molecules. This is because NICEdrug.ch can identify potential metabolic intermediates of a drug and scans these molecules for substructures that can interact with catalytic sites across all enzymes in a desired cell.

NICEdrug.ch adapts the metric previously developed for reactions in BridgIT (Hadadi et al., 2019) to precisely compare drug–drug and drug–metabolite pairs based on similarity of reactive site and the neighborhood around this reactive site, which we have recently shown outperforms previously defined molecular comparison metrics (Hadadi et al., 2019). Since NICEdrug.ch shows high specificity in the identification of such reactive sites and neighborhood, it provides a better mechanistic understanding than currently available methods (Robertson, 2005). Despite these advances, it remains challenging to systematically identify non-competitive inhibition or targeting of non-enzymatic biological processes. We suggest coupling NICEdrug.ch drug metabolic reports with other in silico and experimental analyses accounting for signaling induction of small molecules and other non-enzymatic biological processes like transport of metabolites in a cell. The combined analysis of drug effects on different possible biological targets (not uniquely enzymes) will ultimately increase the coverage of molecules for which a mechanistic understanding of their mode of action is assigned.

A better understanding of the mechanisms of interactions and the specific nodes where the compounds act can help re-evaluate pharmacokinetic and pharmacodynamic models, dosage, and treatment. Such understanding can be used in the future to build models that correlate the pharmacodynamic information with specific compounds and chemical substructures in a manner similar to the one used for correlating compound structures with transcriptomic responses. We have shown for one of the most commonly used anticancer drugs, 5-FU, that NICEdrug.ch identifies and ranks alternative sources of toxicity and hence can guide the design of updated models and treatments to alleviate the drug’s side effects.

The mechanistic understanding will also further promote the development of drugs for repurposing. While current efforts in repurposing capitalize on the accepted status of known drugs, some of the issues with side effects and unknown interactions limit their development as drugs for new diseases. Given that drug repurposing will require new dosage and administration protocols, the understanding of their interactions with the human metabolism will be very important in identifying, developing, and interpreting unanticipated side effects and physiological responses. We evaluated the possibility of drug repurposing with NICEdrug.ch as a substitute for statins, which are broadly used to reduce cholesterol but have many side effects. NICEdrug.ch and its reactive site-centric comparison accurately cluster both family types of statins, even though they are similar in overall molecular structure and show different reactivity. In addition, NICEdrug.ch suggests a set of new molecules with hypothetically less side effects (Endo and Hasumi, 1993; Tanaka et al., 2004) that share reactive sites with statins.

A better mechanistic understanding of drug targets can guide the design of treatments against infectious diseases, for which we need effective drugs that target pathogens without side effects in the host cell. This is arguably the most challenging type of problem in drug design, and indeed machine learning has continuously failed to guide such designs given the difficulty in quantifying side effects – not to mention in acquiring large, consistent, and high-quality data sets from human patients. To demonstrate the power of NICEdrug.ch for tackling this problem, we sought to identify drugs that target liver-stage malaria parasites and minimize the impact on the human host cell. We identified over 500 drugs that inhibit essential Plasmodium enzymes in the liver stages and minimize the impact on the human host cell. Our top drug candidate is shikimate 3-phosphate targeting the parasite’s shikimate metabolism, which we recently identified as essential in a high-throughput gene knockout screening in Plasmodium (Stanway et al., 2019). Excitingly, our suggested antimalarial candidate shikimate 3-phosphate has already been used for Escherichia and Streptococcus infections without appreciable side effects (Díaz-Quiroz et al., 2018).

Finally, minimizing side effects becomes especially challenging in the treatment of viral infections, since viruses fully rely on the host cell to replicate. As a last demonstration of the potential of NICEdrug.ch, we sought to target COVID-19 by identifying inhibitors of 22 known enzymatic host factors of SARS-CoV-2 (Gordon et al., 2020). NICEdrug.ch identified over 1300 molecules that might target the 22 host factors and prevent SARS-CoV-2 replication. As a validation, NICEdrug.ch correctly identified known inhibitors of those enzymes and further suggested safe drugs for repurposing and other food molecules with activity against SARS-CoV-2. Among the NICEdrug.ch suggestions for COVID-19, based on the knowledge on its mechanism and safety, we highlight N-acetylcysteine as an inhibitor of HDAC2 and SARS-CoV-2.

Overall, we believe that a system-level or metabolic network analysis, coupled with an investigation of reactive sites, will likely accelerate the discovery of new drugs and provide additional understanding regarding metabolic fate, action mechanisms, and side effects and can complement on-going experimental effects to understand drug metabolism (Javdan et al., 2020). To fully capture, understand, and predict drug metabolism, it is necessary to evaluate two aspects: (1) the metabolic fate of small molecules and (2) the absorption and distribution of small molecules to the actual target cells and enzymes. This study concentrates on the first aspect, whereas the second aspect will be addressed in future work on NICEdrug.ch.

We suggest the generation of drug metabolic reports to understand the reactivity of new small molecules, the possibility of drug repurposing, and the druggability of enzymes. Our results and high predictive accuracy (above 70%) using NICEdrug.ch suggest that this database can be a novel avenue towards the systematic pre-screening and identification of drugs and antimicrobials. In addition to human metabolic information, NICEdrug.ch currently includes information for the metabolism of P. berghei and E. coli. Because we are making it publicly available (https://lcsb-databases.epfl.ch/pathways/Nicedrug/), our hope is that scientists and decision takers in pharmaceutical industry alike can make use of this unique database to better inform their research and clinical decisions – saving time, money, and ultimately lives.

Materials and methods

Key resources table

Reagent type (species) or resource	Designation	Source or reference	Additional information
Software, algorithm	OpenBabel 2.4.1	doi:10.1186/1758-2946-3-33
Software, algorithm	BridgIT	doi:10.1073/pnas.1818877116
Software, algorithm	ATLAS of biochemistry	doi:10.1021/acssynbio.6b00054; doi:10.1021/acssynbio.0c00052
Software, algorithm	MORPHEUS	https://clue.io/morpheus
Software, algorithm	NICEdrug.ch (curated bioactive molecules and analysis of drug metabolism)	This paper; http://nicedrug.ch/	See Materials and methods

Share this article

Cite this article

Pipeline to construct and use the NICEdrug.ch database.

Similarity in reactive site and neighborhood defines para-metabolites in 5-FU metabolism and inhibited human metabolic enzymes.

A different reactive site but similar neighborhood defines top anti-metabolites in 5-FU metabolism and inhibited human metabolic enzyme.

Comparing downstream products to known toxic molecules and analyzing their common structural toxic alerts explains metabolic toxicity of 5-FU.

Clustering of molecules with statin reactive sites based on NICEdrug score suggests drugs for repurposing.

NICEdrug.ch suggests shikimate 3-phosphate as a top candidate to target liver-stage malaria and minimize side effects in host human cells.

NICEdrug.ch strategy to fight COVID-19, and NICEdrug.ch candidate inhibitors of SARS-CoV-2 host factors: reverse transcriptase and HDAC2.

Quantitative comparison of drug toxicity (A) and drug–enzyme pairs (B, C).

Author details

Homa MohammadiPeyhani

Contribution

Competing interests

Anush Chiappino-Pepe

Present address

Contribution

Contributed equally with

Competing interests

Kiandokht Haddadi

Present address

Contribution

Contributed equally with

Competing interests

Jasmin Hafner

Present address

Contribution

Competing interests

Noushin Hadadi

Present address

Contribution

Competing interests

Vassily Hatzimanikatis

Contribution

For correspondence

Competing interests

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organisms

Further reading