Skip to main content

A Framework for the RNA-Seq Based Classification and Prediction of Disease

  • Conference paper
  • First Online:
ICDSMLA 2019

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 601))

  • 92 Accesses

Abstract

Disease classification based on biological data is an important area in bioinformatics and biomedical research. It helps the doctors and medical practitioners for the early detection of disease and support them as a computer-aided diagnostic tool for accurate diagnosis, prognosis, and treatment of disease. Earlier Microarray gene expression data have wide application for the classification of disease, but now Next-generation sequencing (NGS) has replaced the Microarray technology. From the last few years, RNA sequence (RNA-Seq) data are widely used for the transcriptomic analysis. Hence, RNA-Seq based classification of disease is in its infancy. In this article, we present a general framework for the classification of disease constructed on RNA-Seq data. This framework will guide the researchers to process RNA-Seq, extract relevant features and apply the appropriate classifier to classify any kind of disease.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Raza K (2016) Analysis of microarray data using artificial intelligence based techniques. In: Handbook of research on computational intelligence applications in bioinformatics. IGI Global, pp 216–239

    Google Scholar 

  2. Jabeen A, Ahmad N, Raza K (2019) Differential expression analysis of ZIKV infected human RNA sequence reveals potential genetic biomarkers. Lect Notes Bioinf 11465:283–294, Springer

    Google Scholar 

  3. Cho JH, Lee D, Park JH, Lee IB (2004) Gene selection and classification from microarray data using kernel machine. FEBS Lett 571(1–3):93–98

    Article  Google Scholar 

  4. Wang Y, Makedon FS, Ford JC, Pearlman J (2004) HykGene: a hybrid approach for selecting marker genes for phenotype classification using microarray gene expression data. Bioinformatics 21(8):1530–1537

    Article  Google Scholar 

  5. Wang X, Gotoh O (2009) Microarray-based cancer prediction using soft computing approach. Cancer Informat 7

    Google Scholar 

  6. Raza K (2014) Clustering analysis of cancerous microarray data. J Chem Pharm Res 6(9):488–493

    Google Scholar 

  7. Oshlack A, Wakefield MJ (2009) Transcript length bias in RNA-seq data confounds systems biology. Biol Dir 4(1):14

    Article  Google Scholar 

  8. Richard H, Schulz MH, Sultan M, Nurnberger A, Schrinner S, Balzereit D, Haas SA (2010) Prediction of alternative isoforms from exon expression levels in RNA-Seq experiments. Nucleic Acids Res 38(10):e112–e112

    Google Scholar 

  9. Singh D, Orellana CF, Hu Y, Jones CD, Liu Y, Chiang DY, Prins JF (2011) FDM: a graph-based statistical method to detect differential transcription using RNA-seq data. Bioinformatics 27(19):2633–2640

    Google Scholar 

  10. Ning K, Fermin D, Nesvizhskii AI (2012) Comparative analysis of different label-free mass spectrometry-based protein abundance estimates and their correlation with RNA-Seq gene expression data. J Proteome Res 11(4):2261–2271

    Article  Google Scholar 

  11. Ramsköld D, Luo S, Wang YC, Li R, Deng Q, Faridani OR, Schroth GP (2012) Full-length mRNA-Seq from single-cell levels of RNA and individual circulating tumor cells. Nat Biotechnol 30(8):777

    Google Scholar 

  12. Hänzelmann S, Castelo R, Guinney J (2013) GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics 14(1):7

    Article  Google Scholar 

  13. Chen X, Huang YA, Wang XS, You ZH, Chan KC (2016) FMLNCSIM: fuzzy measure-based lncRNA functional similarity calculation model. Oncotarget 7(29):45948

    Article  Google Scholar 

  14. Jabeen A, Ahmad N, Raza K (2018b) Machine learning-based state-of-the-art methods for the classification of RNA-Seq data. In: Classification in BioApps. Springer, Cham, pp 133–172

    Google Scholar 

  15. Wani N, Raza K (2019) Raw sequence to target gene prediction: an integrated inference pipeline for ChIP-Seq and RNA-Seq datasets. In: Applications of artificial intelligence techniques in engineering. Advances in intelligent systems and computing, vol 697. Springer, Singapore

    Google Scholar 

  16. FactQC A quality control tool for high throughput sequence data. https://www.bioinformatics.babraham.ac.uk/projects/fastqc/

  17. Raza K, Ahmad S (2019) Recent advancement in next-generation sequencing techniques and its computational analysis. Int J Bioinf Res Appl 15(3):191–220, Inderscience

    Google Scholar 

  18. Robinson MD, McCarthy DJ, Smyth GK (2010) edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26(1):139–140

    Article  Google Scholar 

  19. Trapnell C, Hendrickson DG, Sauvageau M, Goff L, Rinn JL, Pachter L (2013) Differential analysis of gene regulation at transcript resolution with RNA-seq. Nat Biotechnol 31(1):46

    Article  Google Scholar 

  20. Love MI, Huber W, Anders S (2014) Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15(12):550

    Article  Google Scholar 

Download references

Acknowledgements

The authors acknowledge Dr. Khalid Raza, Department of Computer Science, Jamia Millia Islamia for necessary discussion and suggestion on the manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Naiyar Iqbal .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Iqbal, N., Kumar, P. (2020). A Framework for the RNA-Seq Based Classification and Prediction of Disease. In: Kumar, A., Paprzycki, M., Gunjan, V. (eds) ICDSMLA 2019. Lecture Notes in Electrical Engineering, vol 601. Springer, Singapore. https://doi.org/10.1007/978-981-15-1420-3_8

Download citation

Publish with us

Policies and ethics