Skip to main content

A Graph Theoretic Approach for the Feature Extraction of Transcription Factor Binding Sites

  • Conference paper
  • First Online:
Intelligent Computing Theories and Methodologies (ICIC 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9226))

Included in the following conference series:

  • 1506 Accesses

Abstract

Recent work in molecular biology has revealed that transcription factors are biologically important in gene regulation. A transcription factor regulates the expression level of a gene by binding to the promoter region of the gene. A model that can accurately describe the binding sites of a transcription factor is thus crucial for understanding the biological mechanisms of gene regulation. In this paper, we develop a new feature extraction algorithm that can accurately obtain features of the binding sites of a transcription factor. The obtained features describe the pair-wise correlations of different positions in a binding site. Based on these features, pair-wise correlations can be integrated into a statistical model that describes the binding sites of a transcriptional factor. Our testing results show that, this approach is able to identify important features for transcription factor binding sites and statistical models based on these features can achieve prediction accuracy that is higher than or comparable with that of other feature extraction methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Barash, Y., Elidan, G., Friedman, N., Kaplan T.: Modeling dependencies in protein-DNA binding sites. In: Proceedings of the Seventh Annual International Conference on Computational Biology, pp. 28–37 (2003)

    Google Scholar 

  2. Brent, M.M., Anand, R., Marmorstein, R.: Structural basis for DNA recognition by Foxo1 and its regulation by posttranslational modification. Structure 16, 1407–1416 (2008)

    Article  MATH  Google Scholar 

  3. Bulyk, M.: DNA microarray technologies for measuring protein-DNA interactions. Curr. Opin. Biotechnol. 17, 1–9 (2006)

    Article  Google Scholar 

  4. Cormen, T., Leiserson, C., Rivest, R., Stein, C.: Introduction to Algorithms, 2nd edn. MIT Press, Cambridge (2001)

    MATH  Google Scholar 

  5. Che, D., Song, Y., Zhao, H.: MDGA: motif discovery using a genetic algorithm. In: Proceedings of the 2005 Conference on Genetic and Evolutionary Computation, pp. 447–452 (2005)

    Google Scholar 

  6. Elnitski, L., Jin, V.X., Farnham, P.J., Jones, S.J.M.: Locating mammalian transcription factor binding sites: a survey of computational and experimental techniques. Genome Res. 16(12), 1455–1464 (2006)

    Article  Google Scholar 

  7. Gallas, D., Schmitz, A.: DNA footprinting: a simple method for the detection of protein-dna binding specificity. Nucleic Acids Res. 5(9), 3157–3170 (1978)

    Article  Google Scholar 

  8. Garner, M., Revzin, A.: A gel electrophoresis method for quantifying the binding site of proteins to specific DNA regions: application to components of the escherichia coli lactose operon regulatory systems. Nucleic Acids Res. 9(13), 3047–3060 (1981)

    Article  Google Scholar 

  9. Harbinson, C.T., et al.: Transcriptional regulatory code of a eukaryotic genome. Nature 431(7004), 99–104 (2004)

    Article  Google Scholar 

  10. Hu, T.C., et al.: Snail associates with EGR-1 and SP-1 to upregulate transcriptional activation of P15ink14b. FEBS J. 227, 1202–1218 (2010)

    Article  MATH  Google Scholar 

  11. Liu, C., Song, Y.: Parameterized complexity and inapproximability of dominating set problem in chordal and near chordal graphs. J. Comb. Optim. 22(4), 684–698 (2011)

    Article  MathSciNet  Google Scholar 

  12. Liu, J., Neuwald, A., Lawrence, C.: Bayesian models for local alignment and gibbs sampling strategies. J. Am. Stat. Assoc. 90(432), 1156–1170 (1995)

    Article  Google Scholar 

  13. Liu, X., Brutlag, D., Liu, J.: Bioprospector: discovering conserved DNA motifs in upstream regulatory regions of coexpressed genes. In: Proceedings of 2001 Pacific Symposium on Biocomputing, pp. 127–138 (2001)

    Google Scholar 

  14. MacIssac, K., Wang, T., Gordon, D., Gifford, D., Stormo, G., Fraenkel, E.: An Improved map of conserved regulatory sites for saccharomyces cerevisiae. BMC Bioinf. 7, 113 (2006)

    Article  MATH  Google Scholar 

  15. Maerkl, S., Quake, S.: A systems approach to measuring the binding energy landscapes of transcription factors. Science 315(5809), 233–236 (2007)

    Article  Google Scholar 

  16. Quigley, M., et al.: Transcriptional analysis of HIV-specific CD8+ T cells shows that PD-1 inhibits T cell function by upregulating BATF. Nat. Med. 16, 1147–1151 (2010)

    Article  Google Scholar 

  17. Rigbolt, K.T., et al.: System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation. Sci. Signal. 4, RS3 (2011)

    Article  MATH  Google Scholar 

  18. Santolini, M., Mora, T., Hakim, V.: A general pair-wise interaction model provides an accurate description of in vivo transcription factor binding sites. PLoS One 9(6), e99015 (2014)

    Article  Google Scholar 

  19. Sharon, E., Segal, E.: A feature-based approach to modeling protein-DNA interactions. In: Speed, T., Huang, H. (eds.) RECOMB 2007. LNCS (LNBI), vol. 4453, pp. 77–91. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  20. Song, Y., Wang, C., Qu, J.: A parameterized algorithm for predicting transcription factor binding sites. In: Huang, D.-S., Han, K., Gromiha, M. (eds.) ICIC 2014. LNCS, vol. 8590, pp. 339–350. Springer, Heidelberg (2014)

    Google Scholar 

  21. Song, Y., Yu, M.: On finding the longest antisymmetric path in directed acyclic graphs. Inf. Process. Lett. 115(2), 377–381 (2015)

    Article  MathSciNet  MATH  Google Scholar 

  22. Song, Y., Chi, A.Y.: Peptide sequencing via graph path decomposition. Inf. Sci. 301, 262–270 (2015)

    Article  MathSciNet  Google Scholar 

  23. Song, Y., Chi, A.Y.: A new approach for parameter estimation in the sequence-structure alignment of non-coding RNAs. J. Inf. Sci. Eng. 31(2), 593–607 (2015)

    Google Scholar 

  24. Song, Y.: An improved parameterized algorithm for the independent feedback vertex set problem. Theoret. Comput. Sci. 535, 25–30 (2014)

    Article  MathSciNet  MATH  Google Scholar 

  25. Song, Y.: A new parameterized algorithm for rapid peptide sequencing. PLoS One 9(2), e87476 (2014)

    Article  Google Scholar 

  26. Song, Y., Yu, M.: On the treewidths of graphs of bounded degree. PLoS One 10(4), e0120880 (2015)

    Article  Google Scholar 

  27. Stormo, G.: Computer methods for analyzing sequence recognition of nucleic acids. Annu. Rev. Biochem. 17, 241–263 (1988)

    Google Scholar 

  28. Stormo, G., Hartzell, G.: Identifying protein-binding sites from unaligned DNA fragments. Proc. Natl. Acad. Sci. 86(4), 1183–1187 (1989)

    Article  MATH  Google Scholar 

  29. Wang, L.-S., Jensen, S.T., Hannenhalli, S.: An interaction-dependent model for transcription factor binding. In: Eskin, E., Ideker, T., Raphael, B., Workman, C. (eds.) RECOMB 2005. LNCS (LNBI), vol. 4023, pp. 225–234. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yinglei Song .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Song, Y., Chi, A.Y., Qu, J. (2015). A Graph Theoretic Approach for the Feature Extraction of Transcription Factor Binding Sites. In: Huang, DS., Jo, KH., Hussain, A. (eds) Intelligent Computing Theories and Methodologies. ICIC 2015. Lecture Notes in Computer Science(), vol 9226. Springer, Cham. https://doi.org/10.1007/978-3-319-22186-1_44

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-22186-1_44

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-22185-4

  • Online ISBN: 978-3-319-22186-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics