Elsevier

Computational Biology and Chemistry

Volume 78, February 2019, Pages 317-329
Computational Biology and Chemistry

Research Article
Virtual high-throughput screens identifying hPK-M2 inhibitors: Exploration of model extrapolation

https://doi.org/10.1016/j.compbiolchem.2018.12.006Get rights and content

Highlights

  • Trained predictive SVM classification/regression models with available hPK-M2 data.

  • Screened PubChem Compound Database, ∼72 million compounds, for lead candidates.

  • Select candidates identified and purchased for activity validation experiments.

  • Candidate screening hit-rate: 1st round – 1/12 (8.13%), 2nd round – 6/11 (54.5%).

  • 2nd round result analysis for extrapolation effects: stepwise extrapolation advised.

Abstract

Glycolysis with PK-M2 occurs typically in anaerobic conditions and atypically in aerobic conditions, which is known as the Warburg effect. The Warburg effect is found in many oncogenic situations and is believed to provide energy and biomass for oncogenesis to persist. The work presented targets human PK-M2 (hPK-M2) in a virtual high-throughput screen to identify new inhibitors and leads for further study. In the initial screen, one of the 12 candidates selected for experimental validation showed biological activity (hit-rate = 8.13%). In the second screen with retrained models, six of 11 candidates selected for experimental validation showed biological activity (hit-rate: 54.5%). Additionally, four different scaffolds were identified for further analysis when examining the tested candidates and compounds in the training data. Finally, extrapolation was necessary to identify a sufficient number of candidates to test in the second screen. Examination of the results suggested stepwise extrapolation to maximize efficiency.

Introduction

Pyruvate kinase (PK; EC 2.7.1.40) is one of the final enzymes participating in glycolysis: it transfers phosphate groups from phosphoenolpyruvate (PEP) to adenosine diphosphate (ADP) to produce pyruvate and adenosine triphosphate (ATP). Two genes encode pyruvate kinase, each producing different isoenzymes: the PKL gene produces tissue-specific PK in the liver (PKL) and erythrocytes (PKR) (Noguchi et al., 1987), while the PKM gene produces the PK-M1 and PK-M2 isoform depending on the exon splicing, 9 or 10 respectively (Noguchi et al., 1986). Furthermore, PK-M2 can be an active tetramer, allosterically activated by fructose-1,6-bisphosphate (Ashizawa et al., 1991), or a nearly-inactive dimer (Eigenbrodt et al., 1992). Glycolysis with PK-M1 occurs in aerobic conditions, yielding much more energy (net production of 36 ATP/glucose), in mature differentiated muscle and brain cells (Nelson and Cox, 2012, Takenaka et al., 1991). Glycolysis with tetrameric PK-M2 occurs in anaerobic environments, yielding less energy (net production of 2 ATP/glucose), predominantly in dividing and embryonic/fetal tissue (Nelson and Cox, 2012, Takenaka et al., 1991, Eigenbrodt et al., 1992). During normal development, the M2 isoenzyme is highly-expressed initially and gradually replaced by the other three isoenzymes (Dombrauckas et al., 2005, Ao et al., 2017).

In tumors and oncogenic environments, PK-M2 has two different roles. The first is ATP production by the active tetramer in oxygen-rich environments (Elbers et al., 1991, Hacker et al., 1998), known as the Warburg effect (Warburg, 1926). The second is biomass accumulation as PK-M2 tetramers breakdown to less active dimers in hypoxic and acidic environments (Kumar et al., 2010). The accumulation of byproducts shunts biomass for use in processes critical for cell proliferation and development, such as the production of nucleic acids, proteins, and lipids (Mazurek et al., 1997, Mazurek et al., 2001, Vander Heiden et al., 2009; Li et al., 2014). Furthermore, PK-M2 may play a role in anti-cancer drug resistance (Yoo et al., 2004, Martinez-Balibrea et al., 2009). When PK-M2 is replaced with PK-M1 (Trialists et al., 2005, Wang et al., 2012) or absent (Ao et al., 2017), cancer cells cannot sustain growth or start apoptosis (Ao et al., 2017, Wang et al., 2012).

Given its presence and functionality in different cancer types such as gastrointestinal (Schneider and Schulze, 2003, Hardt et al., 2000, Oremek et al., 1997), kidney (Brinck et al., 1994), ovarian (Chao et al., 2017), lung (Schneider et al., 2003), and breast (Ibsen et al., 1982) cancer, it is increasingly considered a target for cancer treatments. Exploration of possible PK-M2 treatments include: small-molecule activators (Auld et al., 2018, Boxer et al., 2011, Kung et al., 2012), small-molecule inhibitors (Spoden et al., 2008, Auld et al., 2018, Vander Heiden et al., 2010, Chen et al., 2011, Anastasiou et al., 2012, Yacovan et al., 2012, Guo et al., 2013, Parnell et al., 2013, Xu et al., 2014), down-regulation (Pandita et al., 2014, Wong et al., 2014, Wang et al., 2012), and switching the expressed isoenzyme form to PK-M1 (Liu et al., 2014).

There are two relatively common methods to identify hPK-M2 treatment candidates: high-throughput screening (HTS) and computational analysis. HTS was often used to identify drug leads in compound libraries (Auld et al., 2018, Boxer et al., 2009, Kung et al., 2012, Xu et al., 2014, Guo et al., 2013, Vander Heiden et al., 2010) while computational analysis was used to probe structure and interaction (Kalyaanamoorthy and Chen, 2011, Anastasiou et al., 2012, Guo et al., 2013). Eventually, HTS and computational analysis were used complementarily (Yacovan et al., 2012): computational analysis via virtual HTS (vHTS) identified candidates to focus HTS resources on while HTS-confirmed leads are computationally analyzed identify ways to enhance activity via structural modification. However, much of the work is proprietary (Yacovan et al., 2012) and unavailable for review in the open literature. The presented work uses open-sourced, published software and methodologies that take advantage of the technological advancements since 2012 to screen a more extensive database (PubChem Compound: 72 million).

This work is one in a series to characterize and test the effectiveness of the authors’ previously presented approach (Chen and Visco, 2016, Chen and Visco, 2017, Chen et al., 2018) with different protein/ligand systems and datasets of varying sizes, activity classification distributions, and quality. Previously studied targets include Cathepsin L (Chen and Visco, 2016), clotting factor XIIa (Chen and Visco, 2017), and complement factor C1s (Chen et al., 2018). PK-M2 was chosen for this work because of its prevalence and role in tumors and cancers metabolism (Vander Heiden et al., 2009). Additionally, this is the first instance where activators, instead of inhibitors, are identified by the approach.

The approach uses structural features in compounds to create classification- and activity-predicting models. This does not mean the approach identifies structural causes of biological activity. However, inferences may be drawn by examining the structural features used in the models (Kayello et al., 2014) and found in the experimentally tested candidates (Chen and Visco, 2016, Chen and Visco, 2017).

Section snippets

Approach considerations

An essential consideration in vHTS is the number of potential candidates, which increases exponentially with each additional factor (e.g., identity and types of atoms, the configuration of bonds, branching, cyclization, etc.) (Bohacek et al., 1996) Defined subsets of candidates can still be extremely substantial: there are 1060 possible 30-atom molecules consisting of C, N, O, and S atoms only (Bohacek et al., 1996). Not all 1060 are physically viable or currently synthesizable, but it

First round classification and QSAR model creation, vHTS and validation results

AID 2533 contained 202 compounds (86 actives and 116 inactives) (Vander Heiden, 2012). Candidates were tested up to 50 μM (Vander Heiden, 2012). After removing PAINS from the dataset, 183 compounds remained (85 active, 98 inactives). The PAINS-free dataset yielded 521 atomic Signatures of heights 0, 1, and 2. PCA identified the atomic Signatures contributing the most to capturing variance for use in creating and training GA-SVM models. Two different kinds of models were created from the

Model discussion

In previous applications of this approach, several trends emerged. One was the increase in training and cross-validation error between the first and second round of models (Chen and Visco, 2016, Chen and Visco, 2017). However, that was not observed in this work. One explanation may be that all the compounds tested in the first round were of the same scaffold so the information added was necessary to classify compounds containing that scaffold correctly but did not add unnecessary bias against

Conclusion

Pyruvate kinase's role in glycolysis is transferring phosphate groups from PEP to ADP and yielding pyruvate and ATP. The M2 isoform is present in tissues during early development and when the tissue has become cancerous. It is believed M2 confers metabolic flexibility, switching between the active tetramer and inactive dimer as needed to produce more ATP (tetramer) or biomass necessary for cell development and division. One avenue of treatment under active investigation is the activation of M2:

Author contributions

Authors contributed equally in this work.

Conflict of interest

None.

Acknowledgments

The authors would like to acknowledge Dr. Nic Leipzig for lab access to perform the experiments and The University of Akron's Integrated Biosciences program for supporting the endeavor.

References (93)

  • C. Kung et al.

    Small molecule activation of pkm2 in cancer cells induces serine auxotrophy

    Chem. Biol.

    (2012)
  • Z. Li et al.

    The multifaceted regulation and functions of PKM2 in tumor progression

    Biochim. Biophys. Acta – Rev. Cancer

    (2014)
  • T. Noguchi et al.

    The M1- and M2-type isozymes of rat pyruvate kinase are produced from the same gene by alternative RNA splicing

    J. Biol. Chem.

    (1986)
  • T. Noguchi et al.

    The L-and R-type isozymes of rat pyruvate kinase are produced from a single gene by use of different promoters

    J. Biol. Chem.

    (1987)
  • D.A. Pereira et al.

    Origin and evolution of high throughput screening

    Br. J. Pharmacol.

    (2007)
  • J. Schneider et al.

    Tumor m2-pyruvate kinase in the follow-up of inoperable lung cancer patients: a pilot study

    Cancer Lett.

    (2003)
  • M.G. Vander Heiden et al.

    Identification of small molecule inhibitors of pyruvate kinase m2

    Biochem. Pharmacol.

    (2010)
  • D.C. Weis et al.

    Computer-aided molecular design using the signature molecular descriptor: application to solvent selection

    Comput. Chem. Eng.

    (2010)
  • D.C. Weis et al.

    Data mining pubchem using a support vector machine with the signature molecular descriptor: classification of factor xia inhibitors

    J. Mol. Graph. Model.

    (2008)
  • Y. Xu et al.

    Discovery of 3-(trifluoromethyl)-1h-pyrazole-5-carboxamide activators of the m2 isoform of pyruvate kinase (pkm2)

    Bioorgan. Med. Chem. Lett.

    (2014)
  • A. Yacovan et al.

    1-(Sulfonyl)-5-(arylsulfonyl) indoline as activators of the tumor cell specific m2 isoform of pyruvate kinase

    Bioorgan. Med. Chem. Lett.

    (2012)
  • J. Alvarsson et al.

    Benchmarking study of parameter variation when using signature fingerprints together with support vector machines

    J. Chem. Inf. Model.

    (2014)
  • J. Alvarsson et al.

    Ligand-based target prediction with signature fingerprints

    J. Chem. Inf. Model

    (2014)
  • D. Anastasiou et al.

    Pyruvate kinase m2 activators promote tetramer formation and suppress tumorigenesis

    Nat. Chem. Biol.

    (2012)
  • R. Ao et al.

    Effects of pkm2 gene silencing on the proliferation and apoptosis of colorectal cancer ls-147t and sw620 cells

    Cell. Physiol. Biochem.

    (2017)
  • D. Auld, M. Shen, A. P. Skoumbourdis, J.-k. Jiang, M. Boxer, N. Southall, J. Inglese, C. Thomas, Identification of...
  • J.B. Baell et al.

    New substructure filters for removal of pan assay interference compounds (PAINS) from screening libraries and for their exclusion in bioassays

    J. Med. Chem.

    (2010)
  • A. Bender et al.

    Similarity searching of chemical databases using atom environment descriptors (MOLPRINT 2D): evaluation of performance

    J. Chem. Inf. Comput. Sci.

    (2004)
  • R.S. Bohacek et al.

    The art and practice of structure-based drug design: a molecular modeling perspective

    Med. Res. Rev.

    (1996)
  • M.B. Boxer et al.

    Evaluation of substituted n, n′-diarylsulfonamides as activators of the tumor cell specific m2 isoform of pyruvate kinase

    J. Med. Chem.

    (2009)
  • M.B. Boxer et al.

    Identification of activators for the m2 isoform of human pyruvate kinase version 3

    (2011)
  • U. Brinck et al.

    L- and m2-pyruvate kinase expression in renal cell carcinomas and their metastases

    Virchows Arch.

    (1994)
  • T.-K. Chao et al.

    Pyruvate kinase m2 is a poor prognostic marker of and a therapeutic target in ovarian cancer

    PLoS One

    (2017)
  • N.G. Chemmangattuvalappil et al.

    A novel methodology for property-based molecular design using multiple topological indices

    Industrial & Engineering Chemistry Research

    (2013)
  • X. Chen et al.

    Performance of similarity measures in 2d fragment-based similarity searching: comparison of structural descriptors and similarity coefficients

    J. Chem. Inf. Comput. Sci.

    (2002)
  • J.J.F. Chen et al.

    Developing an in silico pipeline for faster drug candidate discovery: virtual high throughput screening with the signature molecular descriptor using support vector machine models

    Chem. Eng. Sci.

    (2016)
  • J. Chen et al.

    Shikonin and its analogs inhibit cancer cell glycolysis by targeting tumor pyruvate kinase-m2

    Oncogene

    (2011)
  • T. Cheng et al.

    Structure-based virtual screening for drug discovery: a problem-centric review

    AAPS J.

    (2012)
  • W.D. Cornell et al.

    A 2nd generation force-field for the simulation of proteins, nucleic-acids, and organic-molecules

    J. Am. Chem. Soc.

    (1995)
  • C. Cortes et al.

    Support-vector networks

    Mach. Learn.

    (1995)
  • C.M. Dobson

    Chemical space and biology

    Nature

    (2004)
  • J.D. Dombrauckas et al.

    Structural basis for tumor pyruvate kinase M2 allosteric regulation and catalysis

    Biochemistry

    (2005)
  • D. Douguet et al.

    Lea3d: a computer-aided ligand design for structure-based drug design

    J. Med. Chem.

    (2005)
  • J. Drews

    Drug discovery: a historical perspective

    Science

    (2000)
  • J.D. Durrant et al.

    Molecular dynamics simulations and drug discovery

    BMC Biol.

    (2011)
  • E. Eigenbrodt et al.

    Double role for pyruvate kinase type M2 in the expansion of phosphometabolite pools found in tumor cells

    Crit. Rev. Oncog.

    (1992)
  • Cited by (2)

    View full text