GENERAL

Wavelet-based multifractal analysis of DNA sequences by using chaos-game representation

and

2010 Chinese Physical Society and IOP Publishing Ltd
, , Citation Han Jia-Jing and Fu Wei-Juan 2010 Chinese Phys. B 19 010205 DOI 10.1088/1674-1056/19/1/010205

1674-1056/19/1/010205

Abstract

Chaos game representation (CGR) is proposed as a scale-independent representation for DNA sequences and provides information about the statistical distribution of oligonucleotides in a DNA sequence. CGR images of DNA sequences represent some kinds of fractal patterns, but the common multifractal analysis based on the box counting method cannot deal with CGR images perfectly. Here, the wavelet transform modulus maxima (WTMM) method is applied to the multifractal analysis of CGR images. The results show that the scale-invariance range of CGR edge images can be extended to three orders of magnitude, and complete singularity spectra can be calculated. Spectrum parameters such as the singularity spectrum span are extracted to describe the statistical character of DNA sequences. Compared with the singularity spectrum span, exon sequences with a minimal spectrum span have the most uniform fractal structure. Also, the singularity spectrum parameters are related to oligonucleotide length, sequence component and species, thereby providing a method of studying the length polymorphism of repeat oligonucleotides.

Export citation and abstract BibTeX RIS

10.1088/1674-1056/19/1/010205