INTRODUCTION

Loss of heterozygosity (LOH) at chromosomal loci associated with tumor suppressor genes has been implicated in the genesis of many types of human malignancies. On the basis of frequent LOH in tumors, coupled with linkage analysis in some hereditary cancer syndromes, a number of tumor suppressor genes, such as RB1, DCC2, NF23, VHL4, MTS15, DPH2L/OVCA16, and PTEN/MMAC17 have been successively isolated.

It has been reported that LOH occurred at 17p in various types of human cancers, including colorectal carcinoma8, breast cancer9, hepatocellular carcinoma (HCC)10, 11, 12, medulloblastoma13, primitive neuroectodermal tumor 14, astrocytoma15, and prostate carcinoma16. Previous studies in our laboratory implied that some new tumor suppressor(s) or cancer-related gene(s) might exist on chromosome 17p13.3 distinct from the p53 gene and LOH at 17p13.3 in HCC were not obligatorily accompanied by mutation or LOH of p5310. Since the same region has shown frequent LOH in many different human cancers, we anticipated that some novel genes residing on chromosome 17p13.3 might be involved in the development of many cancers. Through detailed deletion mapping analysis in a large number of HCC samples, we have defined a 500 kb deletion region at 17p13.3, flanked by loci D17S1840 and D17S654. Subsequently, the bacterial artificial chromosomes (BACs) and bacteriophage P1-derived artificial chromosomes (PACs) covering the 500 kb region have been isolated (published in a separate article, manuscript in preparation). In this paper, we report the cloning and preliminary characterization of C17orf25 isolated from the one of PACs, PAC579, containing the locus D17S926 which has the most frequent LOH in HCC in the deletion region on human chromosome 17p13.3.

MATERIALS AND METHODS

Isolation, labeling of PAC579 and screening of cDNA library

The PAC579 containing the locus D17S926 in the deletion region of chromosome 17p13.3 was obtained from the GenomeSystem Inc. cDNA clones were isolated by screening the human liver cDNA library (Clontech) using PAC579 DNA as a probe. A full-length cDNA, designated as C17orf25, was obtained by rapid amplification of cDNA end technique.

PAC579 DNA was isolated from the NS3516 bacteria17, 18. The insert of the human genomic DNA in PAC579 was prepared for being used as probes. 1 μg of PAC579 DNA was digested with the restriction endonuclease Not I to release the genomic DNA insert. DNA samples were loaded onto 1% agarose gels and fractionated by pulsed field gel electrophoresis (PFGE) in 0.25×TBE for 30 h using a CHEF-DR II DRIVE MODULE (Bio-Rad) at 200 V with pulse times of 5 to 10 s19, 20. The insert size of the PAC579 was also determined by pulsed field gel electrophoresis. The insert of the PAC579 was excised from the gel and DNA concentration estimated by fluorescence examination. Approximately 150 ng of purified PAC579 DNA was radioactively labeled using a random-priming kit (Amersham) to get a specific radioactivity of 1×109 cpm/mg. Labeled probes were then purified by spin column of Sephadex G-50. Probes were denatured and allowed to prehybridize in 5×SSC with 1 μg/μl denatured Cot1 DNA and 0.36 μg/μl denatured sheared pPAC vector DNA in a total volume of 1 ml for 20 min at 65°C.

An oligo(dT) plus random prime human liver cDNA library constructed in λgt11 phage vector system was used in this study (Clontech). It was plated at a density of no more than 40,000 plaques per 15 cm plate and lysis was allowed to continue until plaques were almost confluent during the first screening cycle. Plaque lift filters were prehybridized and hybridized in the solution with 6×SSC, 5×Denhardts, 0.5% SDS, 200 mg/ml denatured sheared salmon sperm DNA and 100 μg/ml denatured sheared human placental DNA. Filters were prehybridized for 12-16 h and hybridized for 18 h. Post-hybridization washes were 2×SSC/0.1% SDS for 30 min at 37°C, and 0.5×SSC/0.1% SDS until the radioactivity measured by a b-ray counter (Mini-Instruments series 900 mini-monitor) was less than 10 cps. Autoradiography was conducted at −70°C for 3-5 d. Duplicate plaque lifts were only used in the first screening cycle.

5′- RACE technique

A ∼ 1.6 kb fragment of C17orf25 cDNA was obtained by screening the human liver cDNA library using PAC579 DNA as a probe. The cDNA was sequenced and found to be lacking the 5′-untranslated region (5′-UTR) and ATG start codon. To isolate the 5′ sequence of the C17orf25 cDNA, primers for 5′-RACE, R1 and R2, were designed from the 3′-UTR region of the C17orf25 cDNA (Fig 1). The primary 5′-RACE PCR was performed by using primers AP1 (Adaptor Primer 1, 5′-CCATCCTAATACGAC TCACTATAGGGC-3′) and R1 (5′-GCGTGCAGCAACGTCACACACTC-3′) for 37 cycles of touchdown PCR with the Human Liver Marathon kit (Clontech) and the Advantage cDNA polymerase (Clontech). The cycling conditions that we used were as follows: 94°C 1 min, 1 cycle; 94°C 30 s, 72°C 4 min, 5 cycles; 94oC 30 s, 70°C 4 min, 5 cycles; 94°C 20 s, 68°C 4 min, 27 cycles. The nested RACE PCR was done by using primers AP2 (Adaptor Primer 2, 5′-ACTCACTATAGGGCTCGAGCGG C-3′) and R2 (5′-GTATCGTCAGGCGCTGGGAATGG-3′) under the same conditions as the primary 5′-RACE PCR. RACE amplification product was then purified and cloned into the vector pT-Adv (Clontech) and sequenced by M13 reverse and forward primers. After 5′-RACE PCR, a full-length cDNA was then obtained and designated as C17orf25.

Figure 1
figure 1

Nucleotide sequence of human C17orf25 cDNA and its deduced amino acid sequence. In the nucleotide sequence, the underlined triplet refers to the in-frame stop codon that lies upstream to the first ATG; 5′ RACE sequence is indicated by the dotted line (bp 1-170); two putative poly(A) signal (AATAAA) are identified by the letters highlighted in gray; the asterisk indicates the stop codon; the arrows indicate the sequence and orientation of 5′ RACE primers. In the deduced protein sequence, the boxed letters indicate a putative protein kinase C phosphorylation site; the double-boxed letters indicate a putative casein kinase II phosphorylation site; the boxed letters in dotted line indicate a N-glycosylation site; the boxed letters highlighted in gray indicate the four putative N-myristoylation sites.

PAC579 DNA sequencing and analysis

The genomic sequence of PAC579 was determined using the shotgun sequencing approach. This work was accomplished by Shanghai GeneCore Biotechnologies Inc. PAC579 DNA sequencing was performed by utilizing the ABI Prism Dye Terminator Cycle Sequencing kit (Perkin-Elmer), and the reactions were analyzed by an ABI Prism 377 DNA Sequencer (Perkin-Elmer). DNA and the predicted amino acid sequences were analyzed by the Genetics Computer Group (GCG) Sequence Analysis Software Package (Wisconsin Packageâ„¢ version).

Northern blot analysis

A multiple tissue northern (MTN) blot (Clontech) and a northern blot with 8 pairs of hepatocellular carcinoma (HCC) tissues and adjacent noncancerous liver tissues from the same patients were hybridized with the DNA insert (1.6 kb) of the C17orf25 fragment. The MTN blot contains poly (A)-selected RNAs from eight different human tissues. The C17orf25 cDNA probe was labeled with [α-32P]dCTP by random priming method. The RNA hybridization buffer contained 6×SSC, 5×Denhardt's solution, 50% formamide, 0.5% SDS and 100 μg/ml of denatured sheared salmon sperm DNA. The blots were hybridized at 42°C overnight and washed with 2×SSC/1% SDS twice at 37°C for 30 min each, followed by autoradiography21.

Colony formation assay

The C17orf25 cDNA containing the complete ORF (bp 1-1384) was cloned into a eukaryotic expression vector pCMV-Script with a strong CMV promoter (neo+, Stratagene). The plasmid DNA of pCMV-Script/C17orf25 and vector DNA of pCMV-Script were extracted and purified by using QIAGEN plasmid purification system. SMMC7721 cells (a gift of Shanghai Second Military Medical University) grown in 60 mm dishes were transfected with pCMV-Script control or pCMV-Script/C17orf25 (6 μg each per 60mm dish) by means of lipofectamine under conditions recommended by the manufacturer (Gibco BRL). The medium was replaced with fresh, complete medium at 8 hours following transfection. On the next day, the transfected cells were selected in the medium containing 800 μg/ml of G418. Cells were incubated for 14 d, fixed with 10% acetate/10% methanol for 15 min, and stained with 0.4% crystal violet in 20% ethanol for 15 min to examine the colonies23. The effect on colony formation of hepatoma cells was examined as compared with vector control.

RESULTS

cDNA isolation from PAC579 and cloning of C17orf25

To identity novel genes from the common deletion region on chromosome 17p13.3, a PAC clone designated as PAC579 was isolated from a human PAC library using STS-specific primers of D17S926 locus which has the most frequent LOH in human HCC (GenomeSystem Inc). PAC579 contained a 110 kb artificial chromosome recombinant DNA according to results of pulsed field gel electrophoresis (data not shown). The DNA of PAC579 was used as a probe to isolate specific cDNA clones from the human liver cDNA library in λgt11 vector (Clontech). Initially, 78 cDNA clones have been isolated after three rounds of hybridization screening. All these cDNA clones were partially sequenced at both ends and the sequencing data were analyzed and compared with genomic sequence of PAC579. Non-specific and duplicate or overlapping clones were discarded. Finally, 13 unique, and non-overlapping cDNA clones were isolated from the genomic DNA region of PAC579. In this report, we describe one of these clones, a novel gene, designated as C17orf25. This original cDNA of C17orf25 was sequenced and found to be missing the partial 5′ end sequence (bp 1 ∼ 170, Fig 1).

To isolate the 5′ sequences of C17orf25, primers (R1 and R2) were designed and used for 5′ RACE experiments. The reaction extended the sequence of C17orf25 to the 5-untranslated sequence (5′ UTR), as shown in Fig 1. The composite sequence is 1814 bp with a single ORF of 313 amino acids (MW=34791). The first ATG codon (bp 25 ∼ 27) with flanking sequences in consistent with Kozak consensus was designated as the methionine initiation codon. There were two polyadenylation signals (AATAAA) in the 3′-untranslated sequences.

Organization of the human C17orf25

The genomic organization of C17orf25 was determined by comparison and analysis of the cDNA sequence of C17orf25 (Genbank accession No. AF177342) and PAC579 genomic sequence (Genbank accession No. AF258545). C17orf25 comprised 10 exons and 9 introns, and spanning ∼ 23 kb (Tab 1).

Table 1 Exon/Intron organization of the human C17orf25

Homologies of C17orf25 with other proteins

C17orf25 sequence has been registered in Genbank database (Aug 12, 1999). Amino acid sequence comparison of C17orf25 using BLAST in SwissProt database revealed that the protein of C17orf25 is partially homologous to hypothetical 32kd protein C16C10.10 in chromosome III, a putative homologue of glyoxalase, in caenorhabditis elegans. Percentage of identity of amino acid sequences were found to be 42% between C17orf25 and the hypothetical glyoxalase in caenorhabditis elegans, and 27% between C17orf25 and glyoxalase in brassica olerecea (Fig 2). Interestingly, the protein of C17orf25 is not homologous to human glyoxalase I and II. Since only partial identity has been found at a low percentage of identity between C17orf25 and the hypothetical protein in caenorhabditis elegans or brassica oleracea, it seems that C17orf25 might be a gene encoding a human protein with other function than glyoxalase.

Figure 2
figure 2

Comparison of the amino acid sequence of the human C17orf25, putative glyoxalase of caenorhabditis elegans, and putative glyoxalase I of brassica oleracea. Identical amino acid residues were highlighted in black, and similar amino acid residues in gray. The amino acid sequence is represented by single-letter abbreviations. GXL Ce: Glyoxalase of C. elegans; GXL Bo: Glyoxalase of B. oleracea.

Expression of human C17orf25

A ∼ 1.6 kb fragment of C17orf25 cDNA (bp 170-1814) was hybridized to the human multiple-tissue northern blot. Result showed a single transcript of ∼ 1.8 kb in human heart, brain, liver, kidney, pancreas and placenta, but no expression in skeletal muscle and lung tissues (Fig 3). These data also indicated that the full-length cDNA sequence (1814 bp) of C17orf25 was correspondence with the size of C17orf25 mRNA transcript. Northern blot analysis in HCC tissues showed that the expression of C17orf25 was decreased in HCC samples as compared to adjacent noncancerous liver tissues in patient D24, G2, G11 and G16 (4/8) (Fig 4). To determine the possible occurrence of mutation present in C17orf25 from HCC, we sequenced the PCR products from the full coding region of C17orf25 from 20 pairs of genomic DNA extracted from HCC and adjacent noncancerous liver tissues. However, no mutations/alterations have so far been identified in these samples.

Figure 3
figure 3

Tissue expression pattern of C17orf25. Northern blot analysis was performed using human MTN blots, which contained 2 μg of poly(A) RNA from different human tissues in each lane. The human tissues from which mRNAs were isolated were shown at the top. The size of internal RNA standards (kb) was indicated on the left. A transcript of ∼1.8 kb has been observed in heart, brain, placenta, liver, kidney, pancreas. Lower panel: blot was reprobed with β actin.

Figure 4
figure 4

C17orf25 expression analysis in hepatocellular carcinoma. Northern blots containing RNAs from tumor (T) and adjacent noncancerous liver (N) tissues in patients with primary hepatocellular carcinoma from China were probed with 32P-labeled C17orf25 cDNA. The patient numbers are shown above the blots. Lower panel: blot was reprobed with GAPDH.

Growth suppression effect of C17orf25 to HCC SMMC7721 cell

We examined the growth suppressive effect of C17orf25 on the colony formation efficiency on human HCC SMMC7721 cells by colony formation assay. After 14 days of selection in G418, C17orf25 suppressed the colony formation of the transfected SMMC7721 cells. The amounts of colonies in SMMC7721 cells transfected with C17orf25 were decreased as compared with the result in cells transfected with pCMV-Script vector control (Fig 5).

Figure 5
figure 5

Growth assay of transfected SMMC7721 cells. Cells were transfected with 6 mg of the indicated plasmids and grown in the presence of G418 for ∼2 weeks. G418r colonies were stained with crystal violet and then scored.

DISCUSSION

In this study, we have isolated C17orf25 in the deletion region of chromosome 17p13.3 by a combination of cDNA library screening using PAC 579 as a probe and RACE method. The gene is distributed in a genomic area about 23 kb, comprising 10 exons in total. C17orf25 is a novel gene, since the same sequence has not been found with any known human genes at either nucleic acid or protein level in database. A putative full length human gene transcript, CGI-15024 , was also registered in Genbank database (Genbank accession No. AF151908; May 18, 20000). CGI-150 was obtained by assembling in silico cloning method through the comparative gene identification (CGI) with caenorhabditis elegans. After sequence comparison between CGI-150 and C17orf25, we found that the CGI-150 was an incorrect composite sequence misassembled by the full-length C17orf25 cDNA and the partial fragment of another gene. Sequence analysis between PAC579 genomic sequence and CGI-150 also revealed that the CGI-150 was the misconnected sequence at the 5′ end of C17orf25 cDNA with the other gene in PAC579. The expression pattern of C17orf25 and its size of mRNA transcript[Fig 3] also demonstrated that the size of CGI-150 (2580 bp) was incorrect as an artifact. Therefore, the CGI-150 registered in Genbank was an incorrect composite sequence misassembled by the entire C17orf25 cDNA with a fragment from another gene. Lai et al also admitted in their article that the comparative gene identification (CGI) method had some limitations inherent from dbEST. Gene transcripts are often fragmented and not easily assembled into one continuous contig by using nucleotide sequences as the primary alignment basis, such as UniGene and HGI database. It seems essential for the strategy of comparative gene identification to eliminate nucleotide sequence errors in database.

As the 17p13.3 region is heterozygously deleted with a high frequency in many types of human cancer tissues and their cell lines, we investigated the change of mRNA expression. The preliminary study of the expression of C17orf25 was reduced obviously as compared with the matched noncancerous liver tissues in HCC samples. The transfection of pCMV-Script plasmid containing C17orf25 into the human HCC SMMC7721 cells and overexpression of C17orf25 shown that C17orf25 was able to inhibit the colony formation of human hepatoma cell in vitro, which provided a clue to link this novel gene to the regulation of cell growth.

Although the C17orf25 protein does share considerable identities with glyoxalase from caenorhabditis elegans and brassica oleracea, but no obvious homology is found between it and human glyoxylase. The observation of partial homology of the C17orf25 protein sequence to C16C10.10 (a hypothetical glyoxalase in caenorhabditis elegans) seems unlikely to provide obvious clues to the function of this gene in vivo. Nevertheless, we could not exclude so far the possibility that C17orf25 would be a distant member of glyoxalse system or glyoxalase-like enzyme family till the function of C17orf25 has been extensively examined. Human glyoxalase I is a glutathione-binding protein involved in the detoxification of methylglyoxal, a by-product of glycolysis, and other a-oxoaldehydes25. This enzyme catalyzes the conversion of the methylglyoxal-glutathione conjugate to S-D lactoylglutathione, which in turn is hydrolyzed by glyoxalase II into D-lactate and glutathione (GSH). This cytosolic enzyme system is among the earliest expressed proteins during the embryogenesis and development and it would persist throughout all the course of maturation, adult life and senescence26. The main physiological substrates for the glyoxalase system are methylglyoxal formed from the Embden-Meyerhof pathway and glyxal formed from lipid peroxidation and glycation reactions27. Aberrations in the expression of human glyoxalase in cancer and diabetes have been reported. Concentrations of methylglyoxal, S-D lactoyl glutathione and D-lactate were found to be elevated in the blood samples of both insulin dependent and independent diabetic patients, compared to normal healthy controls28, 29, 30. Ayoub et al31 described that the glyoxalase metabolite, S-D-lactoyl glutathione, might mediate the anti-proliferative effects. Another study by Di Ilio et al32 measured glyoxalase I and glyoxalase II activities in urogenital tumor and non-tumorous tissues and found decreased glyoxalase I levels in 10 out of 15 kidney tumors when compared with corresponding normal kidney tissue. Elevated levels of glyoxalase I were also reported in human prostate cancer33. Studies from Ranganathan et al34 showed elevations in glyxalase I activities in 16 out of 21 colon tumor samples as compared to corresponding normal colon tissues. Considering about the expression of C17orf25 in multiple types of human tissues and the result of homology analysis between C17orf25 and homologue in caenorhabditis elegans or brassica olerecea, we postulated that C17orf25 was evolutionarily conserved. This study can be regarded as a contribution to the goal of the human genome project to identify and fully characterize all human genes. The detailed mechanism of C17orf25 in the development of cancer and the relationship between C17orf25 and human glyoxylase system remain to be elucidated in further investigation.