Abstract
Disease classification based on biological data is an important area in bioinformatics and biomedical research. It helps the doctors and medical practitioners for the early detection of disease and support them as a computer-aided diagnostic tool for accurate diagnosis, prognosis, and treatment of disease. Earlier Microarray gene expression data have wide application for the classification of disease, but now Next-generation sequencing (NGS) has replaced the Microarray technology. From the last few years, RNA sequence (RNA-Seq) data are widely used for the transcriptomic analysis. Hence, RNA-Seq based classification of disease is in its infancy. In this article, we present a general framework for the classification of disease constructed on RNA-Seq data. This framework will guide the researchers to process RNA-Seq, extract relevant features and apply the appropriate classifier to classify any kind of disease.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Raza K (2016) Analysis of microarray data using artificial intelligence based techniques. In: Handbook of research on computational intelligence applications in bioinformatics. IGI Global, pp 216–239
Jabeen A, Ahmad N, Raza K (2019) Differential expression analysis of ZIKV infected human RNA sequence reveals potential genetic biomarkers. Lect Notes Bioinf 11465:283–294, Springer
Cho JH, Lee D, Park JH, Lee IB (2004) Gene selection and classification from microarray data using kernel machine. FEBS Lett 571(1–3):93–98
Wang Y, Makedon FS, Ford JC, Pearlman J (2004) HykGene: a hybrid approach for selecting marker genes for phenotype classification using microarray gene expression data. Bioinformatics 21(8):1530–1537
Wang X, Gotoh O (2009) Microarray-based cancer prediction using soft computing approach. Cancer Informat 7
Raza K (2014) Clustering analysis of cancerous microarray data. J Chem Pharm Res 6(9):488–493
Oshlack A, Wakefield MJ (2009) Transcript length bias in RNA-seq data confounds systems biology. Biol Dir 4(1):14
Richard H, Schulz MH, Sultan M, Nurnberger A, Schrinner S, Balzereit D, Haas SA (2010) Prediction of alternative isoforms from exon expression levels in RNA-Seq experiments. Nucleic Acids Res 38(10):e112–e112
Singh D, Orellana CF, Hu Y, Jones CD, Liu Y, Chiang DY, Prins JF (2011) FDM: a graph-based statistical method to detect differential transcription using RNA-seq data. Bioinformatics 27(19):2633–2640
Ning K, Fermin D, Nesvizhskii AI (2012) Comparative analysis of different label-free mass spectrometry-based protein abundance estimates and their correlation with RNA-Seq gene expression data. J Proteome Res 11(4):2261–2271
Ramsköld D, Luo S, Wang YC, Li R, Deng Q, Faridani OR, Schroth GP (2012) Full-length mRNA-Seq from single-cell levels of RNA and individual circulating tumor cells. Nat Biotechnol 30(8):777
Hänzelmann S, Castelo R, Guinney J (2013) GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics 14(1):7
Chen X, Huang YA, Wang XS, You ZH, Chan KC (2016) FMLNCSIM: fuzzy measure-based lncRNA functional similarity calculation model. Oncotarget 7(29):45948
Jabeen A, Ahmad N, Raza K (2018b) Machine learning-based state-of-the-art methods for the classification of RNA-Seq data. In: Classification in BioApps. Springer, Cham, pp 133–172
Wani N, Raza K (2019) Raw sequence to target gene prediction: an integrated inference pipeline for ChIP-Seq and RNA-Seq datasets. In: Applications of artificial intelligence techniques in engineering. Advances in intelligent systems and computing, vol 697. Springer, Singapore
FactQC A quality control tool for high throughput sequence data. https://www.bioinformatics.babraham.ac.uk/projects/fastqc/
Raza K, Ahmad S (2019) Recent advancement in next-generation sequencing techniques and its computational analysis. Int J Bioinf Res Appl 15(3):191–220, Inderscience
Robinson MD, McCarthy DJ, Smyth GK (2010) edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26(1):139–140
Trapnell C, Hendrickson DG, Sauvageau M, Goff L, Rinn JL, Pachter L (2013) Differential analysis of gene regulation at transcript resolution with RNA-seq. Nat Biotechnol 31(1):46
Love MI, Huber W, Anders S (2014) Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15(12):550
Acknowledgements
The authors acknowledge Dr. Khalid Raza, Department of Computer Science, Jamia Millia Islamia for necessary discussion and suggestion on the manuscript.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Iqbal, N., Kumar, P. (2020). A Framework for the RNA-Seq Based Classification and Prediction of Disease. In: Kumar, A., Paprzycki, M., Gunjan, V. (eds) ICDSMLA 2019. Lecture Notes in Electrical Engineering, vol 601. Springer, Singapore. https://doi.org/10.1007/978-981-15-1420-3_8
Download citation
DOI: https://doi.org/10.1007/978-981-15-1420-3_8
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1419-7
Online ISBN: 978-981-15-1420-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)