Next Article in Journal
Morpho-Functional Characterisation of the Rat Ventral Caudal Nerve in a Model of Axonal Peripheral Neuropathy
Previous Article in Journal
Antimicrobial and Virucidal Potential of Morpholinium-Based Ionic Liquids
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

CLC-Pred 2.0: A Freely Available Web Application for In Silico Prediction of Human Cell Line Cytotoxicity and Molecular Mechanisms of Action for Druglike Compounds

by
Alexey A. Lagunin
1,2,*,
Anastasia V. Rudik
1,
Pavel V. Pogodin
1,
Polina I. Savosina
1,
Olga A. Tarasova
1,
Alexander V. Dmitriev
1,
Sergey M. Ivanov
1,2,
Nadezhda Y. Biziukova
1,
Dmitry S. Druzhilovskiy
1,
Dmitry A. Filimonov
1 and
Vladimir V. Poroikov
1,*
1
Department of Bioinformatics, Institute of Biomedical Chemistry, 119435 Moscow, Russia
2
Department of Bioinformatics, Pirogov Russian National Research Medical University, 117997 Moscow, Russia
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2023, 24(2), 1689; https://doi.org/10.3390/ijms24021689
Submission received: 1 December 2022 / Revised: 4 January 2023 / Accepted: 12 January 2023 / Published: 14 January 2023
(This article belongs to the Section Molecular Toxicology)

Abstract

:
In vitro cell-line cytotoxicity is widely used in the experimental studies of potential antineoplastic agents and evaluation of safety in drug discovery. In silico estimation of cytotoxicity against hundreds of tumor cell lines and dozens of normal cell lines considerably reduces the time and costs of drug development and the assessment of new pharmaceutical agent perspectives. In 2018, we developed the first freely available web application (CLC-Pred) for the qualitative prediction of cytotoxicity against 278 tumor and 27 normal cell lines based on structural formulas of 59,882 compounds. Here, we present a new version of this web application: CLC-Pred 2.0. It also employs the PASS (Prediction of Activity Spectra for Substance) approach based on substructural atom centric MNA descriptors and a Bayesian algorithm. CLC-Pred 2.0 provides three types of qualitative prediction: (1) cytotoxicity against 391 tumor and 47 normal human cell lines based on ChEMBL and PubChem data (128,545 structures) with a mean accuracy of prediction (AUC), calculated by the leave-one-out (LOO CV) and the 20-fold cross-validation (20F CV) procedures, of 0.925 and 0.923, respectively; (2) cytotoxicity against an NCI60 tumor cell-line panel based on the Developmental Therapeutics Program’s NCI60 data (22,726 structures) with different thresholds of IG50 data (100, 10 and 1 nM) and a mean accuracy of prediction from 0.870 to 0.945 (LOO CV) and from 0.869 to 0.942 (20F CV), respectively; (3) 2170 molecular mechanisms of actions based on ChEMBL and PubChem data (656,011 structures) with a mean accuracy of prediction 0.979 (LOO CV) and 0.978 (20F CV). Therefore, CLC-Pred 2.0 is a significant extension of the capabilities of the initial web application.

1. Introduction

Cancer is a major public health problem worldwide [1] and the discovery of new antitumor agents is one of the main directions of modern drug development. Despite significant efforts in the new antitumor agent research, due to the great diversity of tumors, their individual peculiarities and the emergence of drug resistance, there is a great need for new antitumor compounds. In vitro screening cytotoxicity of drug candidates against cell lines significantly improves the process of drug discovery. It allows the study of the efficacy against tumor cell lines and the estimation toxicity in normal cell lines to select the most effective and safe compounds. At the present time, over a thousand cell lines are used in the study of cytotoxicity (e.g., the Center for Molecular Therapeutics included a panel with 1200 human cancer cell lines [2]). NCI60 is one of the first known panels of tumor cell lines which has been used for the screening of antitumor drugs [3]. Even though cell lines have been used to assess cytotoxicity for several decades, only a few hundred thousand compounds have been tested so far. Many experimental results of cell-line cytotoxicity studies are kept in freely available chemical databases (e.g., PubChem [4] and ChEMBL [5]) or sources related to some screening programs (e.g., the Genomics of Drug Sensitivity in Cancer project [6] or Developmental Therapeutics Program NCI60 [7]). They can be used to create in silico tools for the prediction of cell-line cytotoxicity that are based on the analysis of structure–activity relationships (SAR). Such tools are supposed to estimate the potential cytotoxic effects of drug candidates or new druglike compounds before their experimental studies. They may also be of help in the studies of natural compounds and drug repositioning.
A number of studies aimed at creating computational tools for the prediction of cell-line cytotoxicity based on structural formulae of compounds, but most of them either were limited by the narrow chemical space due to the small number of compounds used [8,9] or made prediction for the cell lines related to the appropriate tissue [10,11]. Therefore, in 2018 we introduced a freely available web application, CLC-Pred (Cell-Line Cytotoxicity Predictor, https://www.way2drug.com/cell-line/, accessed on 1 January 2023), for the in silico prediction of cytotoxic effect of compounds in 278 cancer and 27 normal cell lines based on their structural formula [12]. It was based on the PASS Online approach [13,14,15] and the experimental data from ChEMBL’s 23 databases for 59,882 compounds. Since then, more than 80 papers have been published with studies using CLC-Pred to evaluate the cytotoxicity of new chemical compounds [16,17], natural compounds [18,19,20], as well as in studies aimed at finding new antitumor substances [21,22]. In the present paper, we offer a new version of the web application, CLC-Pred 2.0, that is based on significantly bigger data on the cytotoxicity of compounds in relation to cell lines. We also extended the capabilities of our web application by adding the ability to predict cytotoxicity with different thresholds of GI50 data (100, 10 and 1 nM) against NCI60 cell lines based on the data from the Developmental Therapeutics Program (DTP) NCI60 [7], where more than 20,000 compounds were tested against 60 cell lines using a single protocol. Moreover, we added the ability to predict over 2,100 molecular mechanisms of action (MoA) in relation to human proteins, which, in combination with the phenotypic assessment of cellular cytotoxicity, will allow a better planning of experimental studies and suggest possible molecular mechanisms of cytotoxicity for the studied compounds.

2. Results

2.1. Creation of Classification Models for Cell-Line Cytotoxicity Prediction

The training set with 128,545 unique structures of compounds tested against 1162 cell lines was created based on the experimental data (379,767 experimental values) related to cytotoxic studies from the ChEMBL and PubChem databases. After PASS training and the creation of classification models of “structure–cytotoxicity” relationships, 438 human cell lines (391 tumor and 47 normal cell lines) with an accuracy of prediction (AUC) higher than 0.8 were selected (Table S1). Most of the cell lines were related to lung, blood, skin, colon, breast, ovarian, hematopoietic and lymphoid tissues (Figure 1).
The average accuracy of prediction (AUC) calculated by the leave-one-out (LOO CV) and 20-fold cross-validation (20-fold CV) procedures was 0.925 and 0.923, respectively. It displays the robustness of the cytotoxicity prediction for the selected cell lines.
The training sets based on the DTP’s NCI60 data included 22,726 unique structures tested against 60 cell lines with 1,262,878 experimental values. These training sets were created based on different thresholds of IG50 values (100, 10 and 1 nM). The average AUC calculated by the LOO CV and 20-fold CV procedures was (1) 0.870 and 0.869 for 100 nM; (2) 0.898 and 0.897 for 10 nM; (3) 0.945 and 0.942 for a 1 nM threshold, respectively. The accuracy of the cytotoxicity prediction against all NCI60 cell lines was higher than 0.80. In contrast to the previous training set based on ChEMBL and PubChem data, in these training sets, the differentiation between active and inactive compounds was made according to the nanomolar thresholds of GI50 values. The number of cell lines and the average accuracy of prediction (for a 1 nM IG50 value threshold) according to the origin are shown in Figure 2a and Figure 2b, respectively. Most cell lines were associated with melanoma, nonsmall-cell lung carcinoma and kidneys. The most accurate SAR models were received for cell lines from the central nervous system (CNS), colon and melanoma. The detailed data on the prediction accuracy, the number of active compounds and other characteristics from CellMiner DTP NCI60 metadata for appropriate cell lines are presented in Supplementary Materials, Table S2.

2.2. Creation of Classification Models to Predict Molecular Mechanisms of Action

The training set to predict the molecular mechanisms of action related to the interaction of compounds with human proteins was created based on ChEMBL and PubChem data. The training set included 656,011 unique structures and 957,545 records describing experimental results related to 2876 molecular mechanisms of action. After completing the training procedure and selection of activities with the accuracy of prediction over 0.8, 2170 molecular mechanisms of action were selected. The average accuracy of the prediction calculated by the LOO CV and the 20-fold CV procedures was 0.979 and 0.978, respectively. The distribution of the mechanisms of action based on the ChEMBL family classification of proteins is shown in Figure 3. Most MoA are related to enzymes, membrane receptors (mostly G protein-coupled receptors) and ion channels (Figure 3a). Most enzymes are kinases, proteases, transferases and hydrolases (Figure 3b). The detailed data on the training set, the accuracy of prediction, numbers of active compounds and ChEMBL protein family classifications are presented in Supplementary Materials, Table S3.

2.3. CLC-Pred 2.0 Web Application

The CLC-Pred 2.0 web application (https://www.way2drug.com/CLC-pred, accessed on 1 January 2023) was developed based on the created SAR models for the prediction of human cell lines’ cytotoxicity and molecular mechanisms of actions related to human proteins. The web application allows one to use SMILES, the names of approved drugs, MDL Molfile or Marvin JS drawn structures as an input. The prediction results consist of three tabs—“Cell-line”, ”TDP NCI-60 (1 nM)”, ”TDP NCI-60 (10 nM)”, ”TDP NCI-60 (100 nM)” and ”Target”. Inside each tab, the user can see the results of the prediction with selectable rows. The results of the prediction contain three main characteristics, the names of activities, Pa (the probability that the compound will be active) and Pi (the probability that the compound will be inactive). In addition to these characteristics, one can see the detailed description of the activity. The prediction results are downloadable as PDF, CSV, Excel files and can be copied to the clipboard. In the section “Training set”, the user sees the total list of the predicted activities with the accuracy of prediction calculated by the LOO CV and 20-fold CV procedures. The interface of CLC-Pred 2.0 with prediction results for the EGFR inhibitor erlotinib is shown on Figure 4. According to Drugbank [23], erlotinib is used in the treatment of nonsmall-cell lung cancer, pancreatic cancer and several other types of cancer. The prediction results also demonstrate high and medium probabilities of cytotoxicity against EKVX (nonsmall-cell lung carcinoma) and several other cell lines related to the lungs (NCI-H358, SK-LU-1 and CL97).
The prediction results of cell-line cytotoxicity based on the TDP’s NCI-60 data for erlotinib showed that it had the potential to be very active against renal carcinoma cell line TK-10 at 1 and 10 nM (Figure 5a,b). The cytotoxicity against TK-10 was also predicted at 100 nM threshold (it is not displayed in Figure 5c because Pa = 0.255). Figure 5c shows the prediction of the cytotoxicity against another kidney cell line—UO-31 (Pa = 0.267). Such results also coincide with the prediction of the medium probability to reveal cytotoxicity in the kidney sarcoma cell line OS-RC-2 (Figure 4). The successful treatment of kidney cancer patients with the combination of erlotinib and bevacizumab was also confirmed by a recent publication [24]. The use of erlotinib in kidney cancer patients was also proposed by the independent bioinformatics analysis of OMICS data [25].
The prediction results for possible molecular mechanisms of action include the prediction of EGFR antagonism, which is the main mechanism of the anticancer activity of erlotinib, as seen in Figure 6, wherein the value of the probability for erlotinib to be an EGFR antagonist is higher than the one for other predicted mechanisms of action. One can also see that erlotinib has a high probability to act as an inhibitor of several kinases. Such probability may become the basis for studying experimentally the interaction of erlotinib with the appropriate kinases.

3. Discussion

In this study, we significantly expanded the capabilities of CLC-Pred web application. The number of structures of tested compounds studied for cell-line cytotoxicity and used for the creation of SAR models was increased by up to two times. The number of cell lines, both tumor and normal, for which the cytotoxicity of compounds can be predicted, was almost doubled. The data on cell-line cytotoxicity based on the TDP’s NCI60 provided the opportunity to create highly accurate SAR models and search for very active compounds with different thresholds of activity: 1, 10 and 100 nM of GI50 values. In CLC-Pred 2.0, the toxicity prediction coupled with the ability to predict molecular mechanisms of action enables researchers to obtain both the phenotypic information and molecular effects of the studied compounds. The prediction of both cell-line cytotoxicity and known mechanisms of antineoplastic action considerably increases the possibility to find anticancer compounds. Moreover, the prediction of the molecular mechanisms of action will help to estimate the possible action of the studied compound on the proteins related to ADME and side effects. CLC-Pred 2.0 provides the possibility to predict more than 2100 mechanisms of action. The prediction of cell-line cytotoxicity against normal cells will help to evaluate the possible toxic effect of the compounds studied in different therapeutic fields, with the treatment outcomes for patients with a tumor included. We believe that CLC-Pred 2.0 will also add to the studies related to drug repositioning and extracts. The results of the prediction provided by CLC-Pred 2.0 will help medicinal chemists, pharmacologists and toxicologists to select more promising drug candidates and plan experimental studies based only on the structural formula of compounds. It will lead to a more rational use of time and resources in drug development.

4. Materials and Methods

4.1. Datasets

4.1.1. Creation of the Training Set with Tumor and Normal Cell Lines’ Cytotoxicity Data

The experimental data (IG50, IC50 and % inhibition values) and structural formulas related to cell-line cytotoxicity studies were extracted from the PubChem (February 2022) and ChEMBL (version 29) databases. The compounds were considered active if their IG50 or IC50 values were less than 10,000 nM or if the percent of inhibition exceeded 50%. All compounds were considered as inactive for the appropriate cell line if they were not active for the cell line according to the above-mentioned criteria. The names of cell lines were standardized. Cell lines’ descriptions, tissue specificity and tumor types were used according to ChEMBL, Cellosaurus and ATCC (American Type Culture Collection) data.

4.1.2. Creation of Training Set Based on the DTP’s NCI60 Data

The initial data from CellMiner [7] on cytotoxicity assessment according to drug activity levels expressed as 50% growth-inhibitory levels (GI50) at 48 h using the sulforhodamine B assay [26] were used for the preparation of the DTP’s NCI60 training sets. All data with labels in the column “Failure reason” were deleted. The compounds were considered active against the appropriate cell line if its medium negative log10 of the GI50 value was equal or over 9 (1 nM), 8 (10 nM) or 7 (100 nM) for the appropriate training sets. Additional data related with the characteristics of NCI60 cell lines were given from the table with the metadata on the NCI60 panel of cancer cell lines that is available on CellMiner.

4.1.3. Creation of Training Set with Data on Mechanisms of Action

The experimental data (Ki, IC50, Kact and % inhibition values) and structural formulas related to the action on human proteins were extracted from the PubChem (February 2022) and ChEMBL (version 29) databases. The compounds were considered active if their Ki or IC50 values were less than 10,000 nM or if the percent of inhibition was higher than 50%. All compounds were considered inactive for the appropriate mechanism of action if they were not active according to the above-mentioned criteria. The canonical names of proteins from the UniProt database were used to create the names of the mechanisms of action. The data for protein targets from PubChem and ChEMBL were coupled based on UniProt IDs.

4.2. PASS Approach

The PASS (Prediction of Activity Spectra for Substances) software tool simultaneously predicts many types of biological activities (activity spectra) for compounds based on their structures [12,13,14,15]. Biological activities are described qualitatively (active or inactive) in PASS. The cytotoxicity of compounds against the appropriate cell lines and the mechanisms of action are also considered as biological activities. The sets of unique substructural atom-centric Multilevel Neighborhoods of Atoms (MNA) descriptors are used for the representation of the molecular structures of compounds. These descriptors are a linear notation of atom-centered fragments in the structure of an organic molecule. They are based on the molecular structure representation that includes the hydrogen atoms according to the valences and partial charges of the atoms and does not specify the types of bonds. MNA descriptors do not represent the stereochemical peculiarities of the molecule, the substances with stereochemical different structures are formally considered to be equivalent [15].
The algorithm for revealing structure–activity relationships is based on the naive Bayes approach with some significant enhancements [14,15].
The leave-one-out and 20-fold cross-validation procedures for all predictable types of biological activity and all substances are used during the training procedure. The invariant accuracy of prediction (IAP) is calculated for accuracy estimations. IAP values are numerically equal to ROC AUC (the area under the ROC curve) values. IAP is the estimation of the probability that positive and negative examples (active and inactive compounds) that are arbitrarily chosen from a validation set may be classified correctly by the prediction.
The prediction results of PASS are given as a list of activities with probabilities “to be active” Pa and “to be inactive” Pi. The list of predicted activities is arranged in a descending order according to Pa–Pi values. Thus, the more probable activity types are put at the top of the list. If the user chooses a higher Pa value as a cutoff for the selection of probable activities, the chance to confirm the predicted activities by the experiment is also high; however, many existing activities will be lost. For instance, if Pa > 0.5 is used as a threshold, about a half of the real activities will be lost; for Pa > 0.7, the portion of lost activities is 70%, etc. Moreover, a high Pa value shows the similarity of the queried compound structure and the typical compound structures of similar activity. Thus, experimental studies are mostly likely to confirm such an activity. The Pa value being smaller but still higher than the Pi value means that the chance to confirm the activity is lower. However, if the activity is discovered in the experiment, the compounds may be a new chemical entity for it.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms24021689/s1.

Author Contributions

Conceptualization and methodology, A.A.L. and V.V.P.; software, A.V.R. and D.A.F.; validation, formal analysis and investigation, A.A.L.; resources, D.S.D.; data curation, P.V.P., P.I.S., O.A.T., A.V.D., S.M.I. and N.Y.B.; writing—original draft preparation, A.A.L., A.V.R.; writing—review and editing V.V.P. and D.A.F.; visualization, A.A.L. and A.V.R.; supervision, A.A.L. and V.V.P.; project administration, D.A.F.; funding acquisition, D.A.F. All authors have read and agreed to the published version of the manuscript.

Funding

The work was supported by Russian Science Foundation grant 19-15-00396, https://rscf.ru/project/19-15-00396/.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Siegel, R.L.; Miller, K.D.; Fuchs, H.E.; Jemal, A. Cancer statistics, 2022. CA Cancer J. Clin. 2022, 72, 7–33. [Google Scholar] [CrossRef]
  2. Sharma, S.V.; Haber, D.A.; Settleman, J. Cell line-based platforms to evaluate the therapeutic efficacy of candidate anticancer agents. Nat. Rev. Cancer 2010, 10, 241–253. [Google Scholar] [CrossRef]
  3. Shoemaker, R.H. The NCI60 human tumour cell line anticancer drug screen. Nat. Rev. Cancer 2006, 6, 813–823. [Google Scholar] [CrossRef] [PubMed]
  4. Kim, S.; Chen, J.; Cheng, T.; Gindulyte, A.; He, J.; He, S.; Li, Q.; Shoemaker, B.A.; Thiessen, P.A.; Yu, B.; et al. PubChem in 2021: New data content and improved web interfaces. Nucleic Acids Res. 2021, 49, D1388–D1395. [Google Scholar] [CrossRef]
  5. Mendez, D.; Gaulton, A.; Bento, A.P.; Chambers, J.; De Veij, M.; Félix, E.; Magariños, M.P.; Mosquera, J.F.; Mutowo, P.; Nowotka, M.; et al. ChEMBL: Towards direct deposition of bioassay data. Nucleic Acids Res. 2019, 47, D930–D940. [Google Scholar] [CrossRef] [PubMed]
  6. Yang, W.; Soares, J.; Greninger, P.; Edelman, E.J.; Lightfoot, H.; Forbes, S.; Bindal, N.; Beare, D.; Smith, J.A.; Thompson, I.R.; et al. Genomics of Drug Sensitivity in Cancer (GDSC): A resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Res. 2013, 41, D955–D961. [Google Scholar] [CrossRef] [Green Version]
  7. Reinhold, W.C.; Sunshine, M.; Varma, S.; Doroshow, J.H.; Pommier, Y. Using CellMiner 1.6 for Systems Pharmacology and Genomic Analysis of the NCI-60. Clin. Cancer Res. 2015, 21, 3841–3852. [Google Scholar] [CrossRef] [Green Version]
  8. Menden, M.P.; Iorio, F.; Garnett, M.; McDermott, U.; Benes, C.H.; Ballester, P.J.; Saez-Rodriguez, J. Machine learning prediction of cancer cell sensitivity to drugs based on genomic and chemical properties. PLoS ONE 2013, 8, e61318. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Cadow, J.; Born, J.; Manica, M.; Oskooei, A.; Rodríguez Martínez, M. PaccMann: A web service for interpretable anticancer compound sensitivity prediction. Nucleic Acids Res. 2020, 48, W502–W508. [Google Scholar] [CrossRef]
  10. Speck-Planche, A.; Kleandrova, V.V.; Luan, F.; Cordeiro, M.N. Rational drug design for anti-cancer chemotherapy: Multi-target QSAR models for the in silico discovery of anti-colorectal cancer agents. Bioorg. Med. Chem. 2012, 20, 4848–4855. [Google Scholar] [CrossRef]
  11. Bonnet, M.; Flanagan, J.U.; Chan, D.A.; Lai, E.W.; Nguyen, P.; Giaccia, A.J.; Hay, M.P. SAR studies of 4-pyridyl heterocyclic anilines that selectively induce autophagic cell death in von Hippel-Lindau-deficient renal cell carcinoma cells. Bioorg. Med. Chem. 2011, 19, 3347–3356. [Google Scholar] [CrossRef] [Green Version]
  12. Lagunin, A.A.; Dubovskaja, V.I.; Rudik, A.V.; Pogodin, P.V.; Druzhilovskiy, D.S.; Gloriozova, T.A.; Filimonov, D.A.; Sastry, G.N.; Poroikov, V.V. CLC-Pred: A freely available web-service for in silico prediction of human cell line cytotoxicity for drug-like compounds. PLoS ONE 2018, 13, e0191838. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Lagunin, A.; Stepanchikova, A.; Filimonov, D.; Poroikov, V. PASS: Prediction of activity spectra for biologically active substances. Bioinformatics 2000, 16, 747–748. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Filimonov, D.A.; Lagunin, A.A.; Gloriozova, T.A.; Rudik, A.V.; Druzhilovskii, D.S.; Pogodin, P.V.; Poroikov, V.V. Prediction of the Biological Activity Spectra of Organic Compounds Using the Pass Online Web Resource. Chem. Heterocycl. Compd. 2014, 50, 444–457. [Google Scholar] [CrossRef]
  15. Poroikov, V.V.; Filimonov, D.A.; Gloriozova, T.A.; Lagunin, A.A.; Druzhilovskiy, D.S.; Stolbov, L.A.; Dmitriev, A.V.; Tarasova, O.A.; Ivanov, S.M.; Pogodin, P.V. Computer-aided prediction of biological activity spectra for organic compounds: The possibilities and limitations. Russ. Chem. Bull. 2019, 68, 2143–2154. [Google Scholar] [CrossRef]
  16. Bojarska, J.; Breza, M.; Remko, M.; Czyz, M.; Gajos-Michniewicz, A.; Zimecki, M.; Kaczmarek, K.; Madura, I.D.; Wojciechowski, J.M.; Wolf, W.M. Structural and Biofunctional Insights into the Cyclo(Pro-Pro-Phe-Phe-) Scaffold from Experimental and In Silico Studies: Melanoma and Beyond. Int. J. Mol. Sci. 2022, 23, 7173. [Google Scholar] [CrossRef]
  17. Dos Santos, I.V.F.; Borges, R.S.; Silva, G.M.; de Lima, L.R.; Bastos, R.S.; Ramos, R.S.; Silva, L.B.; da Silva, C.H.T.P.; Dos Santos, C.B.R. Hierarchical Virtual Screening Based on Rocaglamide Derivatives to Discover New Potential Anti-Skin Cancer Agents. Front. Mol. Biosci. 2022, 9, 836572. [Google Scholar] [CrossRef]
  18. Luo, L.; Wang, Q.; Liao, Y. The Inhibitors of CDK4/6 from a Library of Marine Compound Database: A Pharmacophore, ADMET, Molecular Docking and Molecular Dynamics Study. Mar. Drugs 2022, 20, 319. [Google Scholar] [CrossRef]
  19. Hoque, N.; Afroz, F.; Khatun, F.; Rony, S.R.; Hasan, C.M.; Rana, M.S.; Sohrab, M.H. Physicochemical, Pharmacokinetic and Cytotoxicity of the Compounds Isolated from an Endophyte Fusarium oxysporum: In Vitro and In Silico Approaches. Toxins 2022, 14, 159. [Google Scholar] [CrossRef]
  20. Ahmed, S.; Moni, D.A.; Sonawane, K.D.; Paek, K.Y.; Shohael, A.M. A comprehensive in silico exploration of pharmacological properties, bioactivities and COX-2 inhibitory potential of eleutheroside B from Eleutherococcus senticosus (Rupr. & Maxim.) Maxim. J. Biomol. Struct. Dyn. 2021, 39, 6553–6566. [Google Scholar] [CrossRef]
  21. Raducka, A.; Świątkowski, M.; Korona-Głowniak, I.; Kaproń, B.; Plech, T.; Szczesio, M.; Gobis, K.; Szynkowska-Jóźwik, M.I.; Czylkowska, A. Zinc Coordination Compounds with Benzimidazole Derivatives: Synthesis, Structure, Antimicrobial Activity and Potential Anticancer Application. Int. J. Mol. Sci. 2022, 23, 6595. [Google Scholar] [CrossRef]
  22. Wadood, A.; Ajmal, A.; Junaid, M.; Rehman, A.U.; Uddin, R.; Azam, S.S.; Khan, A.Z.; Ali, A. Machine Learning-based Virtual Screening for STAT3 Anticancer Drug Target. Curr. Pharm. Des. 2022, 28, 3023–3032. [Google Scholar] [CrossRef] [PubMed]
  23. Wishart, D.S.; Feunang, Y.D.; Guo, A.C.; Lo, E.J.; Marcu, A.; Grant, J.R.; Sajed, T.; Johnson, D.; Li, C.; Sayeeda, Z.; et al. DrugBank 5.0: A major update to the DrugBank database for 2018. Nucleic Acids Res. 2018, 46, D1074–D1082. [Google Scholar] [CrossRef] [PubMed]
  24. Choi, Y.; Keam, B.; Kim, M.; Yoon, S.; Kim, D.; Choi, J.G.; Seo, J.Y.; Park, I.; Lee, J.L. Bevacizumab Plus Erlotinib Combination Therapy for Advanced Hereditary Leiomyomatosis and Renal Cell Carcinoma-Associated Renal Cell Carcinoma: A Multicenter Retrospective Analysis in Korean Patients. Cancer Res. Treat. 2019, 51, 1549–1556. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Zhang, Y.; Tang, M.; Guo, Q.; Xu, H.; Yang, Z.; Li, D. The value of erlotinib related target molecules in kidney renal cell carcinoma via bioinformatics analysis. Gene 2022, 816, 146173. [Google Scholar] [CrossRef]
  26. Rubinstein, L.V.; Shoemaker, R.H.; Paull, K.D.; Simon, R.M.; Tosini, S.; Skehan, P.; Scudiero, D.A.; Monks, A.; Boyd, M.R. Comparison of in vitro anticancer-drug-screening data generated with a tetrazolium assay versus a protein assay against a diverse panel of human tumor cell lines. J. Natl. Cancer Inst. 1990, 82, 1113–1118. [Google Scholar] [CrossRef]
Figure 1. Distribution of 438 cell lines by organs and tissue.
Figure 1. Distribution of 438 cell lines by organs and tissue.
Ijms 24 01689 g001
Figure 2. (a) Distribution of cell lines from DTP NCI60 by organs or general types; (b) average accuracy of cytotoxicity prediction for grouped cell lines from DTP NCI60 at a 1 nM IG50 value threshold. NS Cell Lung—nonsmall-cell lung.
Figure 2. (a) Distribution of cell lines from DTP NCI60 by organs or general types; (b) average accuracy of cytotoxicity prediction for grouped cell lines from DTP NCI60 at a 1 nM IG50 value threshold. NS Cell Lung—nonsmall-cell lung.
Ijms 24 01689 g002
Figure 3. Distribution of predicted mechanisms of action by protein families: (a) number of predicted MoA classifying by ChEMBL protein family level 1; (b) number of predicted MoA classified by ChEMBL protein family level 2 for the enzyme protein family.
Figure 3. Distribution of predicted mechanisms of action by protein families: (a) number of predicted MoA classifying by ChEMBL protein family level 1; (b) number of predicted MoA classified by ChEMBL protein family level 2 for the enzyme protein family.
Ijms 24 01689 g003
Figure 4. Interface of CLC-Pred 2.0 and prediction results of general cell-line cytotoxicity for erlotinib.
Figure 4. Interface of CLC-Pred 2.0 and prediction results of general cell-line cytotoxicity for erlotinib.
Ijms 24 01689 g004
Figure 5. CLC-Pred 2.0 results of cytotoxicity prediction based on the TDP’s NCI60 data for erlotinib at different threshold of cytotoxicity: (a) at 1 nM, (b) at 10 nM; (c) at 100 nM.
Figure 5. CLC-Pred 2.0 results of cytotoxicity prediction based on the TDP’s NCI60 data for erlotinib at different threshold of cytotoxicity: (a) at 1 nM, (b) at 10 nM; (c) at 100 nM.
Ijms 24 01689 g005aIjms 24 01689 g005b
Figure 6. Interface of CLC-Pred 2.0 with prediction results of molecular mechanisms of action for erlotinib.
Figure 6. Interface of CLC-Pred 2.0 with prediction results of molecular mechanisms of action for erlotinib.
Ijms 24 01689 g006
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lagunin, A.A.; Rudik, A.V.; Pogodin, P.V.; Savosina, P.I.; Tarasova, O.A.; Dmitriev, A.V.; Ivanov, S.M.; Biziukova, N.Y.; Druzhilovskiy, D.S.; Filimonov, D.A.; et al. CLC-Pred 2.0: A Freely Available Web Application for In Silico Prediction of Human Cell Line Cytotoxicity and Molecular Mechanisms of Action for Druglike Compounds. Int. J. Mol. Sci. 2023, 24, 1689. https://doi.org/10.3390/ijms24021689

AMA Style

Lagunin AA, Rudik AV, Pogodin PV, Savosina PI, Tarasova OA, Dmitriev AV, Ivanov SM, Biziukova NY, Druzhilovskiy DS, Filimonov DA, et al. CLC-Pred 2.0: A Freely Available Web Application for In Silico Prediction of Human Cell Line Cytotoxicity and Molecular Mechanisms of Action for Druglike Compounds. International Journal of Molecular Sciences. 2023; 24(2):1689. https://doi.org/10.3390/ijms24021689

Chicago/Turabian Style

Lagunin, Alexey A., Anastasia V. Rudik, Pavel V. Pogodin, Polina I. Savosina, Olga A. Tarasova, Alexander V. Dmitriev, Sergey M. Ivanov, Nadezhda Y. Biziukova, Dmitry S. Druzhilovskiy, Dmitry A. Filimonov, and et al. 2023. "CLC-Pred 2.0: A Freely Available Web Application for In Silico Prediction of Human Cell Line Cytotoxicity and Molecular Mechanisms of Action for Druglike Compounds" International Journal of Molecular Sciences 24, no. 2: 1689. https://doi.org/10.3390/ijms24021689

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop