Disease Candidate Gene Identification and Gene Regulatory Network Building Through Medical Literature Mining

Wang, Yong; Jiang, Chenyang; Cheng, Jinbiao; Wang, Xiaoqun

doi:10.1007/978-3-319-38771-0_44

Yong Wang¹⁷,
Chenyang Jiang¹⁷,
Jinbiao Cheng¹⁷ &
…
Xiaoqun Wang¹⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 455))

1295 Accesses

Abstract

Finding key genes associated with diseases is an essential problem of disease diagnosis and treatment, and drug design. Bioinformatics takes advantage of computer technology to analyze biomedical data to help finding the information about these genes. Biomedical literatures, which consists of original experimental data and results, are attracting more attention from bio-informatics researchers because literature mining technology can extract knowledge more efficiently. This paper designs an algorithm to estimate the association degree between genes according to their co-citations in biomedical literatures from PubMed database, and to further predict the causative genes associated with a disease. The paper also uses hierarchical clustering algorithm to build a specific genes regulation network. Experiments on uterine cancer shows that the proposed algorithm can identify pathogenic genes of uterine cancer accurately and rapidly.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Lander ES, Weinberg RA (2000) Genomics: journey to the center of biology. Science, U.S. 287, pp 1777–1782
Google Scholar
Jensen LJ, Saric J, Bork P (2006) Literature mining for the biologist: from information retrieval to biological discovery. Nat Rev Gen Lond 7:119–129
Article Google Scholar
AI-Mubaid H, Singh RK (2005) A new text mining approach for finding protein-to-disease associations. Am J Biochem Biotechnol 1(3):145–152
Article Google Scholar
Chun HW, Tsuruoka Y, Kim JD et al (2006) Automatic recognition of topic-classified relations between prostate cancer and genes using MEDLINE abstracts. BMC Bioinform 7(1):1–8
Article Google Scholar
Chen JY, Shen C, Sivachenko AY (2006) Mining Alzheimer disease relevant proteins from integrated protein interactome data. Pac Symp Biocomput 11:367–378
Google Scholar
Human Protein Reference Database (2009). http://www.hprd.org/
Database of Interacting Proteins (2014). http://dip.doe-mbi.ucla.edu/dip/Main.cgi
Interologous Interaction Database (2015). http://ophid.utoronto.ca/ophidv2.204/
Ozgur A, Vu TG, Radev DR (2008) Identifying gene-disease associations using centrality on a literature mined gene-interaction network. Bioinformatics 24(13):i277–i285
Article Google Scholar
Liu B, Jiang T, Ma S et al (2006) Exploring candidate genes for human brain diseases from a brain-specific gene network. Biochem Biophys Res Commun 349(4):1308C–1314
Article Google Scholar
Radivojac P, Peng K, Clark WT, Peters BJ et al (2008) An integrated approach to inferring gene-disease associations in humans. Proteins 72(3):1030–1037
Article Google Scholar
Wu X, Liu Q, Jiang R (2009) Align human interaetome with phenome to identify causative genes and networks underlying disease families. Bioinformatics 25(1):98–104
Article Google Scholar
Miozzi L, Piro RM, Rosa F, Ala U, Silengo L et al (2008) Functionnl annotation and identification of candidate disease genes by computational analysis of normal tissue gene expression data. PLoS One 3(6):24–39
Article Google Scholar
Ortutay Y, Vihinen M (2009) Identification of candidate disease genes by integrating Gene Ontologies and protein-interaction networks: case study of primary immunodeficiencies. Nucleic Acids Res 37(2):622–628
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Software, Beijing Institute of Technology, Beijing, 100081, China
Yong Wang, Chenyang Jiang & Jinbiao Cheng
Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
Xiaoqun Wang

Authors

Yong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Chenyang Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Jinbiao Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoqun Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yong Wang .

Editor information

Editors and Affiliations

Department of Automation and Applied Informatics, Faculty of Engineering, Aurel Vlaicu University of Arad, Arad, Romania
Valentina Emilia Balas
University of South Australia, Bournemouth University, Poole, UK, and University of South Australia, Adelaide, Australia
Lakhmi C. Jain
School of Information Engineering, Chang'an University, Xi'an, China
Xiangmo Zhao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, Y., Jiang, C., Cheng, J., Wang, X. (2017). Disease Candidate Gene Identification and Gene Regulatory Network Building Through Medical Literature Mining. In: Balas, V., Jain, L., Zhao, X. (eds) Information Technology and Intelligent Transportation Systems. Advances in Intelligent Systems and Computing, vol 455. Springer, Cham. https://doi.org/10.1007/978-3-319-38771-0_44

Download citation

DOI: https://doi.org/10.1007/978-3-319-38771-0_44
Published: 06 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-38769-7
Online ISBN: 978-3-319-38771-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics