Abstract
Recent work in molecular biology has revealed that transcription factors are biologically important in gene regulation. A transcription factor regulates the expression level of a gene by binding to the promoter region of the gene. A model that can accurately describe the binding sites of a transcription factor is thus crucial for understanding the biological mechanisms of gene regulation. In this paper, we develop a new feature extraction algorithm that can accurately obtain features of the binding sites of a transcription factor. The obtained features describe the pair-wise correlations of different positions in a binding site. Based on these features, pair-wise correlations can be integrated into a statistical model that describes the binding sites of a transcriptional factor. Our testing results show that, this approach is able to identify important features for transcription factor binding sites and statistical models based on these features can achieve prediction accuracy that is higher than or comparable with that of other feature extraction methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Barash, Y., Elidan, G., Friedman, N., Kaplan T.: Modeling dependencies in protein-DNA binding sites. In: Proceedings of the Seventh Annual International Conference on Computational Biology, pp. 28–37 (2003)
Brent, M.M., Anand, R., Marmorstein, R.: Structural basis for DNA recognition by Foxo1 and its regulation by posttranslational modification. Structure 16, 1407–1416 (2008)
Bulyk, M.: DNA microarray technologies for measuring protein-DNA interactions. Curr. Opin. Biotechnol. 17, 1–9 (2006)
Cormen, T., Leiserson, C., Rivest, R., Stein, C.: Introduction to Algorithms, 2nd edn. MIT Press, Cambridge (2001)
Che, D., Song, Y., Zhao, H.: MDGA: motif discovery using a genetic algorithm. In: Proceedings of the 2005 Conference on Genetic and Evolutionary Computation, pp. 447–452 (2005)
Elnitski, L., Jin, V.X., Farnham, P.J., Jones, S.J.M.: Locating mammalian transcription factor binding sites: a survey of computational and experimental techniques. Genome Res. 16(12), 1455–1464 (2006)
Gallas, D., Schmitz, A.: DNA footprinting: a simple method for the detection of protein-dna binding specificity. Nucleic Acids Res. 5(9), 3157–3170 (1978)
Garner, M., Revzin, A.: A gel electrophoresis method for quantifying the binding site of proteins to specific DNA regions: application to components of the escherichia coli lactose operon regulatory systems. Nucleic Acids Res. 9(13), 3047–3060 (1981)
Harbinson, C.T., et al.: Transcriptional regulatory code of a eukaryotic genome. Nature 431(7004), 99–104 (2004)
Hu, T.C., et al.: Snail associates with EGR-1 and SP-1 to upregulate transcriptional activation of P15ink14b. FEBS J. 227, 1202–1218 (2010)
Liu, C., Song, Y.: Parameterized complexity and inapproximability of dominating set problem in chordal and near chordal graphs. J. Comb. Optim. 22(4), 684–698 (2011)
Liu, J., Neuwald, A., Lawrence, C.: Bayesian models for local alignment and gibbs sampling strategies. J. Am. Stat. Assoc. 90(432), 1156–1170 (1995)
Liu, X., Brutlag, D., Liu, J.: Bioprospector: discovering conserved DNA motifs in upstream regulatory regions of coexpressed genes. In: Proceedings of 2001 Pacific Symposium on Biocomputing, pp. 127–138 (2001)
MacIssac, K., Wang, T., Gordon, D., Gifford, D., Stormo, G., Fraenkel, E.: An Improved map of conserved regulatory sites for saccharomyces cerevisiae. BMC Bioinf. 7, 113 (2006)
Maerkl, S., Quake, S.: A systems approach to measuring the binding energy landscapes of transcription factors. Science 315(5809), 233–236 (2007)
Quigley, M., et al.: Transcriptional analysis of HIV-specific CD8+ T cells shows that PD-1 inhibits T cell function by upregulating BATF. Nat. Med. 16, 1147–1151 (2010)
Rigbolt, K.T., et al.: System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation. Sci. Signal. 4, RS3 (2011)
Santolini, M., Mora, T., Hakim, V.: A general pair-wise interaction model provides an accurate description of in vivo transcription factor binding sites. PLoS One 9(6), e99015 (2014)
Sharon, E., Segal, E.: A feature-based approach to modeling protein-DNA interactions. In: Speed, T., Huang, H. (eds.) RECOMB 2007. LNCS (LNBI), vol. 4453, pp. 77–91. Springer, Heidelberg (2007)
Song, Y., Wang, C., Qu, J.: A parameterized algorithm for predicting transcription factor binding sites. In: Huang, D.-S., Han, K., Gromiha, M. (eds.) ICIC 2014. LNCS, vol. 8590, pp. 339–350. Springer, Heidelberg (2014)
Song, Y., Yu, M.: On finding the longest antisymmetric path in directed acyclic graphs. Inf. Process. Lett. 115(2), 377–381 (2015)
Song, Y., Chi, A.Y.: Peptide sequencing via graph path decomposition. Inf. Sci. 301, 262–270 (2015)
Song, Y., Chi, A.Y.: A new approach for parameter estimation in the sequence-structure alignment of non-coding RNAs. J. Inf. Sci. Eng. 31(2), 593–607 (2015)
Song, Y.: An improved parameterized algorithm for the independent feedback vertex set problem. Theoret. Comput. Sci. 535, 25–30 (2014)
Song, Y.: A new parameterized algorithm for rapid peptide sequencing. PLoS One 9(2), e87476 (2014)
Song, Y., Yu, M.: On the treewidths of graphs of bounded degree. PLoS One 10(4), e0120880 (2015)
Stormo, G.: Computer methods for analyzing sequence recognition of nucleic acids. Annu. Rev. Biochem. 17, 241–263 (1988)
Stormo, G., Hartzell, G.: Identifying protein-binding sites from unaligned DNA fragments. Proc. Natl. Acad. Sci. 86(4), 1183–1187 (1989)
Wang, L.-S., Jensen, S.T., Hannenhalli, S.: An interaction-dependent model for transcription factor binding. In: Eskin, E., Ideker, T., Raphael, B., Workman, C. (eds.) RECOMB 2005. LNCS (LNBI), vol. 4023, pp. 225–234. Springer, Heidelberg (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Song, Y., Chi, A.Y., Qu, J. (2015). A Graph Theoretic Approach for the Feature Extraction of Transcription Factor Binding Sites. In: Huang, DS., Jo, KH., Hussain, A. (eds) Intelligent Computing Theories and Methodologies. ICIC 2015. Lecture Notes in Computer Science(), vol 9226. Springer, Cham. https://doi.org/10.1007/978-3-319-22186-1_44
Download citation
DOI: https://doi.org/10.1007/978-3-319-22186-1_44
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22185-4
Online ISBN: 978-3-319-22186-1
eBook Packages: Computer ScienceComputer Science (R0)