Genomic Sequence Analysis of Fugu rubripes CFTR and Flanking Genes in a 60 kb Region Conserving Synteny with 800 kb of Human Chromosome 7

  1. Heather Davidson1,3,
  2. Martin S. Taylor1,
  3. Ann Doherty1,
  4. A. Christopher Boyd1, and
  5. David J. Porteous1,2
  1. 1Medical Research Council Human Genetics Unit, Western General Hospital, Edinburgh EH4 2XU, UK; 2Medical Genetics Section, Department of Medical Sciences, University of Edinburgh, Molecular Medicine Centre, Western General Hospital, Edinburgh EH4 2XU, UK

Abstract

To define control elements that regulate tissue-specific expression of the cystic fibrosis transmembrane regulator (CFTR), we have sequenced 60 kb of genomic DNA from the puffer fish Fugu rubripes (Fugu) that includes the CFTR gene. This region of the Fugu genome shows conservation of synteny with 800-kb sequence of the human genome encompassing the WNT2, CFTR, Z43555, and CBP90 genes. Additionally, the genomic structure of each gene is conserved. In a multiple sequence alignment of human, mouse, andFugu, the putative WNT2 promoter sequence is shown to contain highly conserved elements that may be transcription factor or other regulatory binding sites. We have found two putative ankyrin repeat-containing genes that flank the CFTR gene. Overall sequence analysis suggests conservation of intron/exon boundaries betweenFugu and human CFTR and revealed extensive homology between functional protein domains. However, the immediate 5′ regions of human and Fugu CFTR are highly divergent with few conserved sequences apart from those resembling diminished cAMP response elements (CRE) and CAAT box elements. Interestingly, the polymorphic polyT tract located upstream of exon 9 is present in human and Fugu but absent in mouse. Similarly, an intron 1 and intron 9 element common to human and Fugu is absent in mouse. The euryhaline killifish CFTR coding sequence is highly homologous to the Fugusequence, suggesting that upregulation of CFTR in that species in response to salinity may be regulated transcriptionally.

[The sequence data described in this paper have been submitted to the GenBank data library under accession no. AJ271361, for the combined cosmids 159C9, 146H13, 6M15, and 145M20.]

Footnotes

  • 3 Corresponding author.

  • E-MAIL H.Davidson{at}ed.ac.uk; FAX 44 131 651 1059.

    • Received April 6, 2000.
    • Accepted June 2, 2000.
| Table of Contents

Preprint Server