Abstract
The classification of human cancers constitutes to date a significant challenge in the context of microarray data analysis. The discovery of gene hallmarks for biological processes involves the examination of large gene expression matrices in a broad and massively parallel manner. In this article, a comprehensive and comparative analysis of thyroid cancer datasets is presented, including stages for feature selection, hypothesis testing, and classification. Also, datasets are integrated, and results for this integration are reported and analyzed. To conclude, text mining is used to investigate some biological information regarding the main resulting characteristic genes. Some genes found during the research, HINT3 in particular, appear to be worth to be further studied.
Supported by CONICET (112-2017-0100829) and SGCyT-UNS (24/N052).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Athar, A., et al.: Arrayexpress update - from bulk to single-cell expression data. Nucleic Acids Res. 47(Database-Issue), D711–D715 (2019)
Barros-Filho, M.C., Marchi, F.A., Pinto, C.A., Rogatto, S.R., Kowalski, L.P.: High diagnostic accuracy based on CLDN10, HMGA2, and LAMB3 transcripts in papillary thyroid carcinoma. J. Clin. Endocrinol. Metabolism 100(6), E890–E899 (2015). https://doi.org/10.1210/jc.2014-4053
Behzadi, P., Ranjbar, R.: Dna microarray technology and bioinformatic web services. Acta Microbiol. Immunol. Hung. 66(1), 19–30 (2019). https://doi.org/10.1556/030.65.2018.028
Brazma, A., et al.: Minimum information about a microarray experiment (MIAME) - towards standards for microarray data. Nat. Genet. 29, 365–371 (2001)
Chandrababu, S., Bastola, D.: A novel prediction model for discovering beneficial effects of natural compounds in drug repurposing. In: Rojas, I., Valenzuela, O., Rojas, F., Herrera, L.J., Ortuño, F. (eds.) IWBBIO 2020. LNCS, vol. 12108, pp. 811–824. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-45385-5_72
Chien, M.N., Yang, P.S., Lee, J.J., Wang, T.Y., Hsu, Y.C., Cheng, S.P.: Recurrence-associated genes in papillary thyroid cancer: an analysis of data from the cancer genome atlas. Surgery 161(6), 1642–1650 (2017). https://doi.org/10.1016/j.surg.2016.12.039
Chou, T.F., Cheng, J., Tikh, I.B., Wagner, C.R.: Evidence that human histidine triad nucleotide binding protein 3 (hint3) is a distinct branch of the histidine triad (hit) superfamily. J. Mol. Biol. 373(4), 978–989 (2007). https://doi.org/10.1016/j.jmb.2007.08.023
Chukwudozie, O.S., et al.: The relevance of bioinformatics applications in the discovery of vaccine candidates and potential drugs for covid-19 treatment. Bioinform. Biol. Insights 15 (2021). https://doi.org/10.1177/11779322211002168
Edgar, R., Domrachev, M., Lash, A.E.: Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 30(1), 207–210 (2002). https://doi.org/10.1093/nar/30.1.207
Escalera, S., Pujol, O., Radeva, P.: Error-correcting output codes library. J. Mach. Learn. Res. 11(20), 661–664 (2010)
Géron, A.: Hands-on machine learning with Scikit-Learn and TensorFlow: concepts, tools, and techniques to build intelligent systems. O’Reilly Media, Sebastopol (2017)
Guyon, I., Weston, J., Barnhill, S., Vapnik, V.: Gene selection for cancer classification using support vector machines. Mach. Learn. 46(1–3), 389–422 (2002). https://doi.org/10.1023/A:1012487302797
Haley, B., Roudnicky, F.: Functional genomics for cancer drug target discovery. Cancer Cell 38(1), 31–43 (2020). https://doi.org/10.1016/j.ccell.2020.04.006
Hu, S., Liao, Y., Chen, L.: Identification of key pathways and genes in anaplastic thyroid carcinoma via integrated bioinformatics analysis. Med. Sci. Monitor 24, 6438–6448 (2018). https://doi.org/10.12659/MSM.910088
Huang, Y., et al.: Bioinformatics analysis of key genes and latent pathway interactions based on the anaplastic thyroid carcinoma gene expression profile. Oncol. Lett. 13, 167–176 (2017). https://doi.org/10.3892/ol.2016.5447
Kitahara, C.M., Schneider, A.B., Brenner, A.V.: Thyroid Cancer, chap. 44. Oxford University Press (2017). https://doi.org/10.1093/oso/9780190238667.003.0044
Miranda-Filho, A., et al.: Thyroid cancer incidence trends by histology in 25 countries: a population-based study. Lancet Diabetes Endocrinol. 9(4), 225–234 (2021). https://doi.org/10.1016/S2213-8587(21)00027-9
Morte, B., Martínez, T., Zambrano, A., Pascual, A.: Monocyte-mediated regulation of genes by the amyloid and prion peptides in SH-SY5Y neuroblastoma cells. Neurochem. Int. 58(6), 613–619 (2011). https://doi.org/10.1016/j.neuint.2011.01.019
Rossi, E.D., Pantanowitz, L., Hornick, J.L.: A worldwide journey of thyroid cancer incidence centred on tumour histology. Lancet Diabetes Endocrinol. 9(4), 193–194 (2021). https://doi.org/10.1016/S2213-8587(21)00049-8
van Ruissen, F., Baas, F.: Serial Analysis of Gene Expression (SAGE), pp. 41–66. Humana Press, Totowa (2007)
Tomas, G., Vincent, D.: Sporadic vs. post-chernobyl papillary vs. anaplastic thyroid cancers (2012)
Tovar, H., Alvarez-Suarez, D.E., Gómez-Romero, L., Hernández-Lemus, E.: Bioinformatics of genome-wide expression studies. In: Bioinformatics and Human Genomics Research, chap. 5, pp. 73–99. CRC Press (2021)
Wang, W., Xu, G., Ding, C.L., Zhao, L.J., Zhao, P., Ren, H., Qi, Z.T.: All-trans retinoic acid protects hepatocellular carcinoma cells against serum-starvation-induced cell death by upregulating collagen 8a2. FEBS J. 280(5), 1308–1319 (2013). https://doi.org/10.1111/febs.12122
Wu, C.C., et al.: Integrated analysis of fine-needle-aspiration cystic fluid proteome, cancer cell secretome, and public transcriptome datasets for papillary thyroid cancer biomarker discovery. Oncotarget 9(15), 12079–12100 (2018). https://doi.org/10.18632/oncotarget.23951
Yan, T., Ding, F., Zhao, Y.: Integrated identification of key genes and pathways in Alzheimer’s disease via comprehensive bioinformatical analyses. Hereditas 156(25) (2019). https://doi.org/10.1186/s41065-019-0101-0
Young, A.P., Jackson, D.J., Wyeth, R.C.: A technical review and guide to RNA fluorescence in situ hybridization. PeerJ 8, March 2020. https://doi.org/10.7717/peerj.8806
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Petrini, I., Cecchini, R.L., Mascaró, M., Ponzoni, I., Carballido, J.A. (2022). Statistical Learning Analysis of Thyroid Cancer Microarray Data. In: Rojas, I., Valenzuela, O., Rojas, F., Herrera, L.J., Ortuño, F. (eds) Bioinformatics and Biomedical Engineering. IWBBIO 2022. Lecture Notes in Computer Science(), vol 13347. Springer, Cham. https://doi.org/10.1007/978-3-031-07802-6_8
Download citation
DOI: https://doi.org/10.1007/978-3-031-07802-6_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-07801-9
Online ISBN: 978-3-031-07802-6
eBook Packages: Computer ScienceComputer Science (R0)