Abstract
This paper presents a GA-based method to generate novel logical-based features, represented by parse trees, from DNA sequences enriched with H3K4me1 histone signatures. Current methods which mostly utilize k-mers content features are not able to represent the possible complex interaction of various DNA segments in H3K4me1 regions. We hypothesize that such complex interaction modeling is significant towards recognition of H3K4me1 marks. Our propose method employ the tree structure to model the logical relationship between k-mers from the marks. To benchmark our generated features, we compare it to the typically used k-mer content features using the mouse (mm9) genome dataset. Our results show that the logical rule features improve the performance in terms of f-measure for all the datasets tested.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Lettice, L.A.: A Long-range Shh Enhancer Regulates Expression in the Developing Limb and Fin and is Associated with Preaxial Polydactyly. Human Molecular Genetics 12, 1725–1735 (2003)
Wittkopp, P.J., Kalay, G.: Cis-regulatory elements: Molecular Mechanisms and Evolutionary Processes Underlying Divergence. Nature Review Genetics 13, 56–69 (2012)
Das, M.K., Dai, H.: A Survey of DNA Motif Finding Algorithms. BMC Bioinformatics 8, S21 (2007)
Barski, A., Cuddapah, S., Cui, K., Roh, T., Schones, D.E., Wang, Z., Wei, G., Chepelev, L., Zhao, K.: High-Resolution Profiling of Histone Methylations in the Human Genome. Cell 129, 823–837 (2007)
Heintzman, N.D., Stuart, R.K., Hon, G., Fu, Y., Ching, W.C., Barrera, L.O., Van Calcar, S., Qu, C., Ching, K., Wang, W., Weng, Z., Green, R.D., Crawford, G.E.: Distinct and Predictive Chromatin Signatures of Transcriptional Promoters and Enhancers in the Human Genome. Nature Genetics 39, 311–318 (2007)
Firpi, H.A., Ucar, D., Tan, K.: Discover Regulatory DNA Elements Using Chromatin Signatures and Artificial Neural Network. Bioinformatics 26, 1579–1586 (2010)
Gorkin, D.U., Lee, D., Reed, X., Fletez-Brant, C., Bessling, S.L., Loftus, S.K., Beer, M.A., Pavan, W.J., Mccallion, A.S.: Integration of ChIP-seq and Machine Learning Reveals Enhancers and a Predictive Regulatory Sequence Vocabulary in Melanocytes. Genome Research 22, 2290–2301 (2012)
Holland, J.: Adaptation in Natural and Artificial Systems. The University of Michigan Press, Ann Arbor (1975)
Pham, T.H., Ho, T.B., Tran, D.H., Satou, K.: Prediction of Histone Modifications in DNA Sequences. Bioinformatics and Bioengineering, 959–966 (2007)
Mitchell, M.: An Introduction to Genetic Algorithms. The MIT Press, London (2001)
Kamath, U., Compton, J., Islamaj-Dogan, R., De Jong, K.A., Shehu, A.: An Evolutionary Algorithm Approach for feature Generation from Sequence Data and Its Application to DNA Splice Site Prediction. IEEE/ACM Transactions on Computational Biology and Informatics 9, 1387–1397 (2012)
Chang, C., Lin, C.: A Library for Support Vector Machines. ACM Transactions on Intelligent Systems and Technology 27, 27 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Fong, P.K., Lee, N.K., Abdullah, M.T. (2014). Employing Genetic Algorithm to Construct Epigenetic Tree-Based Features for Enhancer Region Prediction. In: Loo, C.K., Yap, K.S., Wong, K.W., Beng Jin, A.T., Huang, K. (eds) Neural Information Processing. ICONIP 2014. Lecture Notes in Computer Science, vol 8836. Springer, Cham. https://doi.org/10.1007/978-3-319-12643-2_48
Download citation
DOI: https://doi.org/10.1007/978-3-319-12643-2_48
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12642-5
Online ISBN: 978-3-319-12643-2
eBook Packages: Computer ScienceComputer Science (R0)