Abstract
Finding key genes associated with diseases is an essential problem of disease diagnosis and treatment, and drug design. Bioinformatics takes advantage of computer technology to analyze biomedical data to help finding the information about these genes. Biomedical literatures, which consists of original experimental data and results, are attracting more attention from bio-informatics researchers because literature mining technology can extract knowledge more efficiently. This paper designs an algorithm to estimate the association degree between genes according to their co-citations in biomedical literatures from PubMed database, and to further predict the causative genes associated with a disease. The paper also uses hierarchical clustering algorithm to build a specific genes regulation network. Experiments on uterine cancer shows that the proposed algorithm can identify pathogenic genes of uterine cancer accurately and rapidly.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Lander ES, Weinberg RA (2000) Genomics: journey to the center of biology. Science, U.S. 287, pp 1777–1782
Jensen LJ, Saric J, Bork P (2006) Literature mining for the biologist: from information retrieval to biological discovery. Nat Rev Gen Lond 7:119–129
AI-Mubaid H, Singh RK (2005) A new text mining approach for finding protein-to-disease associations. Am J Biochem Biotechnol 1(3):145–152
Chun HW, Tsuruoka Y, Kim JD et al (2006) Automatic recognition of topic-classified relations between prostate cancer and genes using MEDLINE abstracts. BMC Bioinform 7(1):1–8
Chen JY, Shen C, Sivachenko AY (2006) Mining Alzheimer disease relevant proteins from integrated protein interactome data. Pac Symp Biocomput 11:367–378
Human Protein Reference Database (2009). http://www.hprd.org/
Database of Interacting Proteins (2014). http://dip.doe-mbi.ucla.edu/dip/Main.cgi
Interologous Interaction Database (2015). http://ophid.utoronto.ca/ophidv2.204/
Ozgur A, Vu TG, Radev DR (2008) Identifying gene-disease associations using centrality on a literature mined gene-interaction network. Bioinformatics 24(13):i277–i285
Liu B, Jiang T, Ma S et al (2006) Exploring candidate genes for human brain diseases from a brain-specific gene network. Biochem Biophys Res Commun 349(4):1308C–1314
Radivojac P, Peng K, Clark WT, Peters BJ et al (2008) An integrated approach to inferring gene-disease associations in humans. Proteins 72(3):1030–1037
Wu X, Liu Q, Jiang R (2009) Align human interaetome with phenome to identify causative genes and networks underlying disease families. Bioinformatics 25(1):98–104
Miozzi L, Piro RM, Rosa F, Ala U, Silengo L et al (2008) Functionnl annotation and identification of candidate disease genes by computational analysis of normal tissue gene expression data. PLoS One 3(6):24–39
Ortutay Y, Vihinen M (2009) Identification of candidate disease genes by integrating Gene Ontologies and protein-interaction networks: case study of primary immunodeficiencies. Nucleic Acids Res 37(2):622–628
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing Switzerland
About this paper
Cite this paper
Wang, Y., Jiang, C., Cheng, J., Wang, X. (2017). Disease Candidate Gene Identification and Gene Regulatory Network Building Through Medical Literature Mining. In: Balas, V., Jain, L., Zhao, X. (eds) Information Technology and Intelligent Transportation Systems. Advances in Intelligent Systems and Computing, vol 455. Springer, Cham. https://doi.org/10.1007/978-3-319-38771-0_44
Download citation
DOI: https://doi.org/10.1007/978-3-319-38771-0_44
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-38769-7
Online ISBN: 978-3-319-38771-0
eBook Packages: EngineeringEngineering (R0)