|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Journal of Lipid Research, Vol. 47, 944-952, May 2006 Zinc Finger Protein 202, genetic variation, and HDL cholesterol in the general population
* Department of Clinical Biochemistry, Rigshospitalet, Copenhagen University Hospital, Copenhagen, Denmark Published, JLR Papers in Press, February 7, 2006.
1 To whom correspondence should be addressed. e-mail: at-h{at}rh.dk
Zinc Finger Protein 202 (ZNF202) is a transcriptional repressor that binds elements found predominantly in genes involved in HDL metabolism. We tested the following hypotheses: 1) frequencies of single-nucleotide polymorphisms (SNPs) and haplotypes in ZNF202 differ between individuals with low and high HDL cholesterol; and 2) SNPs in ZNF202 affect HDL cholesterol levels in the general population. We screened the promoter and protein-coding exons of ZNF202 in individuals with the highest 1% (n = 95) and lowest 1% (n = 95) HDL cholesterol among 9,259 Danish adults. None of the 10 SNPs identified differed in frequency as single sites or as haplotypes between low and high HDL cholesterol groups. In accordance with this, seven mutations were equally frequent (45%) in individuals with low or high HDL cholesterol. Finally, for all five SNPs identified in the coding region, we determined the association of genotype with HDL cholesterol in 9,259 individuals from the general population. Four SNPs were not associated with variation in HDL cholesterol, although c.*2T>G homozygosity was associated with a discrete effect on HDL cholesterol in men. We show that genetic variation in ZNF202 is common in the general population. However, SNPs in the protein-coding region of ZNF202 do not make a major contribution to HDL cholesterol levels.
Supplementary key words apolipoproteins genetic epidemiology large-scale genotyping lipids lipoproteins molecular biology molecular medicine reverse cholesterol transport transcription factors high density lipoprotein cholesterol Abbreviations: apoA-I, apolipoprotein A-I; BMI, body mass index; KRAB, Krüppel-associated box; LD, linkage disequilibrium; SCAN, SRE-ZBP, CT-finS1, AW-1, Number 18; SNP, single-nucleotide polymorphism; UTR, untranslated region; ZNF202, Zinc Finger Protein 202
Levels of HDL cholesterol are inversely related to risk of ischemic heart disease in the general population (1, 2). The HDL particle is responsible for the delivery of cellular cholesterol from peripheral tissues to the liver and is thus a key component in reverse cholesterol transport (3). Twin and family studies suggest that approximately half of the variation in HDL cholesterol is genetically determined (47). A new susceptibility locus for familial hypoalphalipoproteinemia (Online Mendelian Inheritance in Man 604091) on chromosome 11q23 was identified in Utah pedigrees (8); this region contains the Zinc Finger Protein 202 (ZNF202) gene. ZNF202 is functionally characterized by a SRE-ZBP, CT-finS1, AW-1, Number 18 (SCAN) oligomerization domain, a Krüppel-associated box (KRAB) repression domain, and eight zinc finger (Cys2His2) DNA binding motifs, a typical domain architecture for transcription factors (9). ZNF202 target genes are mainly involved in lipid and, particularly, HDL cholesterol metabolism (1012), suggesting that this transcriptional repressor might be important in the determination of HDL cholesterol levels in the general population. However, it is largely unknown to what extent ZNF202 varies genetically in the general population and whether such genetic variation influences HDL cholesterol levels.
We tested the following hypotheses: 1) frequencies of single-nucleotide polymorphisms (SNPs) and haplotypes in ZNF202 differ between individuals with low and high HDL cholesterol levels; and 2) SNPs in the protein-coding region of ZNF202 affect HDL cholesterol levels in the general population. To increase the likelihood of identifying genetic variation with significant effects on HDL cholesterol levels, we screened the promoter and protein coding regions of ZNF202 (
Subjects The Copenhagen City Heart Study is a prospective cardiovascular population study of individuals selected based on the Central Population Register Code to reflect the adult Danish general population aged 20 to 80+ years. In 19911994, 9,259 participants (55% women) gave blood for DNA analyses (13, 14). More than 99% were white and of Danish descent. This study was approved by the local ethical committee: Nos. 100.2039/91 and 01-421/94, Copenhagen and Frederiksberg committee. All participants gave written informed consent.
For the genetic screening of ZNF202 (GenBank accession number NM_003455), we selected individuals from the Copenhagen City Heart Study with the 1% lowest (n = 95) and 1% highest (n = 95) HDL cholesterol levels for age and gender (in 10 year age groups). Consequently, the cutoff levels for HDL cholesterol depend upon the seven age groups for each gender (15). We previously showed that by screening these groups with extreme phenotypes, we increased the likelihood of identifying mutations and SNPs (rare allele frequency of <1% and
Gene screening
SNP genotyping in the general population
Biochemical assays
Statistical analyses
Characteristics of individuals with the lowest 1% and highest 1% HDL cholesterol levels and of the total general population sample are shown in Table 1 . Individuals in the low HDL cholesterol group had lower apoA-I levels, higher triglyceride levels, and were more obese than individuals in the high HDL cholesterol group or in the general population.
Genetic variation in ZNF202 Six SNPs [c.IVS4-240A>T, c.IVS4-223T>C, c.461C>T (p.A154V), c.731A>G (p.V244V), c.775A>G (p.K259E), and c.*2T>G] and two mutations [c.820G>C (p.V274L) and c.1813A>T (p.R605W)] were identified in or flanking the protein-coding regions of the gene and in the 3' untranslated region (3'UTR), four of which introduced amino acid substitutions (Table 2 , Fig. 1 ): c.461C>T (p.A154V) and c.820G>C (p.V274L) substitute similar amino acids, c.775A>G (p.K259E) introduces a shift between positively charged and negatively charged side chains, and c.1813A>T (p.R605W) introduces a shift between positively charged nonpolar and uncharged polar side chains. c.461C>T (p.A154V) is located in the SCAN domain, known to be important for protein-protein interaction, whereas both c.775A>G (p.K259E) and c.820G>C (p.V274L) are in the KRAB A domain, important for transcriptional repression. The amino acid residues affected by these two variants are situated in highly conserved areas of ZNF202 and are themselves completely conserved between species (human, mouse, rat) (Fig. 2 ). c.1813A>T (p.R605W) is located in the seventh zinc finger motif, in a highly conserved area, and p.R605 is also conserved between species (human, mouse, rat). c.*2T>G is located 2 bp downstream of the translation stop site (Table 2, Fig. 1). Of the four nonsynonymous variants identified, c.820G>C (p.V274L), c.1813A>T (p.R605W), and c.775A>G (p.K259E) have not been reported previously.
Four SNPs (g.685G>A, g.660A>G, g.118G>T, and g.+34G>A) and three mutations (g.447T>C, g.232C>T, and g.122C>T) were identified in the promoter region and 5'UTR of exon 1 (Table 2, Fig. 1): g.685G>A and g.660A>G are situated in a silencer sequence, g.447T>C in a putative Yin Yang 1 binding site, g.122C>T in a putative myeloid zinc finger 1 binding site (12), and g.+34G>A in a putative downstream promoter element. Four of these seven variants have not been reported previously. Four variants were identified in introns: two SNPs and two new mutations. Judging from their positions in relation to the exon-intron junctions and the nucleotide substitutions introduced, none of these would be predicted to affect splicing.
Individuals in the low and high HDL cholesterol groups Haplotype analysis of the five SNPs located in and around the coding region estimated that six haplotypes accounted for >98% of all haplotypes in the low HDL cholesterol group and for almost 100% in the high HDL cholesterol group (Table 3 ). However, none of these haplotypes differed in frequency between the low and high HDL groups, suggesting that they did not affect HDL cholesterol levels.
Individuals in the general population With the exception of a synonymous SNP [c.731A>G (p.V244V)], we genotyped the total general population sample (n = 9,259) for all SNPs located in and around the protein-coding region [c.IVS4-240A>T, c.IVS4-223T>C, c.461C>T (p.A154V), c.775A>G (p.K259E), and c.*2T>G] (Table 2). Allele frequencies of these five SNPs ranged from 1% to 39% in the total general population sample. Genotype frequencies did not differ from those predicted by Hardy-Weinberg equilibrium.
Overall associations (regardless of variation at the other four sites) for each of the five SNPs with HDL cholesterol and apoA-I levels are presented separately by gender in Fig. 3
. In men (n = 4,064), homozygosity for c.*2G in the 3'UTR was associated with an apparent decrease in HDL cholesterol of
To further explore these results, we tested the isolated single-site effect of the c.*2T>G SNP on HDL cholesterol and apoA-I levels in men by comparing five SNP genotypes differing only at this site (Fig. 4 ). Using this approach, we found an association between c.*2T>G genotype and HDL cholesterol in one of five tests and an association between c.*2T>G genotype and apoA-I levels in three of five tests. After Bonferroni correction, only one association between genotype and apoA-I levels remained significant (P = 0.01).
Pairwise LD was tested for the five SNPs located in and around the coding region and genotyped in 9,103 individuals from the general population (Table 4 ). Strong pairwise LD was present for all SNP pairs. However, because allele frequencies varied widely between SNPs (from 1% to 39%), these LDs should be interpreted with caution.
With the aim to identify genetic variation in ZNF202 affecting HDL cholesterol levels in the general population, we used a systematic approach in which we screened 700 bp of the promoter and all protein-coding exons of the ZNF202 gene in 190 individuals with extreme HDL cholesterol levels selected from a large sample of the general population (n = 9,259). We subsequently genotyped the entire general population sample for the five SNPs located in and around the protein coding region and determined the association with HDL cholesterol and apoA-I levels. Novel observations in this study include the following: 1) we identified nine new genetic variants (two SNPs and seven mutations), of which two were located in predicted transcription factor binding sites in the proximal promoter, and three were nonsynonymous variants; 2) common SNPs in and around the coding region and haplotypes harboring these SNPs did not segregate differently in individuals with low and high HDL cholesterol levels; 3) these SNPs also did not have a major effect on HDL cholesterol and apoA-I levels in the total general population, as single sites or as combined genotypes differing only at the relevant SNP; and 4) mutations in ZNF202 were equally frequent (5%) in individuals with low or high HDL cholesterol levels. This is the first systematic study to determine the genetic variation in the promoter and in and around the protein-coding region of ZNF202 in a substantial number of individuals with low or high HDL cholesterol levels and to investigate the role of ZNF202 in HDL metabolism in the general population. Because ZNF202 has been identified in a low hypoalphalipoproteinemia locus (8, 19), and because in vitro studies suggest that ZNF202 is a transcriptional repressor of several key genes in HDL cholesterol homeostasis (10), genetic variation in ZNF202 might affect HDL cholesterol levels in plasma and perhaps play a role in the development of atherosclerosis in the general population. However, the data summarized above suggest that genetic variation in and around the protein-coding region of ZNF202 is not a major determinant of HDL cholesterol or apoA-I levels in the general population. We detected all previously reported SNPs in the promoter and coding region of the gene as well as two new SNPs, one in the proximal promoter (g.118G>T) and a nonsynonymous SNP in exon 8 [c.775A>G (p.K259E)]. The c.775A>G (p.K259E) SNP was the least frequent of the six SNPs identified in and around the coding region, suggesting that we did not overlook any important SNPs. Three SNPs were identified in the protein-coding region, one synonymous [c.731A>G (p.V244V)] and two nonsynonymous [c.461C>T (p.A154V) and c.775A>G (p.K259E)], and one in the 3'UTR (c.*2T>G). The c.461C>T (p.A154V) SNP is located in the SCAN domain of ZNF202. The SCAN domain is important for protein-protein interaction and thus regulates ZNF202 activity (11, 20, 21). However, A154 was not conserved between species (human, mouse, rat), and alanine and valine are very similar amino acids. c.461C>T (p.A154V) was equally distributed between the two HDL cholesterol groups, suggesting that this SNP did not affect HDL cholesterol levels. This was confirmed when genotyping c.461C>T (p.A154V) in 9,103 individuals from the general population; no association between c.461C>T (p.A154V) genotype and HDL cholesterol levels was found. c.731A>G (p.V244V) and c.775A>G (p.K259E) are both located in the KRAB A domain, an important repressor domain. The KRAB A domain is highly conserved, and selected conserved amino acids have been shown to be essential for repression (22). K259 is highly conserved between species (human, mouse, rat), and the positive charge of the K259 residue is also conserved between paralogous human proteins with the same protein architecture, SCAN-KRAB-Cys2His2, as ZNF202. K259 is located close to residues proven to be essential for the repressor activity of the KRAB A domain (22); therefore, it is possible that a SNP in this position could affect interaction with other protein residues, leading to decreased repressor activity. However, c.775A>G (p.K259E) was identified in both high and low HDL cholesterol groups with equal frequencies and showed no association with HDL cholesterol levels in the general population. c.*2T>G is a very common SNP located in the 3'UTR, 2 bp downstream of the translation stop site. In men in the general population, c.*2G homozygosity appeared to be associated with marginally lower levels of HDL cholesterol and apoA-I compared with c.*2T heterozygotes and homozygotes. However, this association was not robust and may be attributable to a type I error: only the association with apoA-I levels remained significant after correction for multiple comparisons. Thus, iteration in a large, independent population to confirm or reject the association between c.*2G homozygosity and low HDL cholesterol and apoA-I levels would be desirable. The HapMap data for Caucasians uses nine SNPs [in introns and protein-coding exons, including dbSNP 1144507 and c.461C>T (p.A154V); Table 2] in 60 individuals (120 alleles) spanning both non-protein-coding and protein-coding exons of ZNF202 to define four haplotypes with frequencies of >1% and three with frequencies of >5% (www.hapmap.org). In our sample of individuals with low and high HDL cholesterol, we used five SNPs in 190 individuals (380 alleles) spanning the protein-coding exons only to define five haplotypes with frequencies of >1% and four with frequencies of >5%. Adding one (g.+34 G>A in exon 1; Table 2) of only two SNPs reported in non-protein-coding exons from the National Center for Biotechnology Information SNP database (http://www.ncbi.nlm.nih.gov/entrez/) to our haplotype data did not add additional haplotypes in our sample. Finally, had we used HapMap SNPs spanning only the protein-coding exons to determine haplotypes in our data (assuming that the same haplotypes are found in the two groups of Caucasians), we would have detected only two common haplotypes. Four new mutations were identified in and around the protein-coding region, two of which were nonsynonymous. The affected amino acid residues were located in functionally important areas of ZNF202 in the KRAB A domain [c.820G>C (p.V274L)] and in the zinc finger domain [c.1813A>T (p.R605W)]. V274 is conserved between species but does not introduce a change in amino acid charge. R605 is conserved between species but not between paralogous proteins. However, a change from a positively charged nonpolar amino acid (arginine) to an uncharged polar amino acid (tryptophan) might alter the conformation of the zinc finger motif and perhaps change the DNA binding specificity. The systematic approach by which we screened individuals with extreme HDL cholesterol levels has previously proven sensitive in the detection of both mutations with strong phenotypic effects and SNPs with more modest effects on HDL cholesterol levels, illustrated by differential segregation of the functional SNPs in groups with extreme phenotypes (15, 23). Whether one can detect such a frequency difference between extreme phenotype groups depends on the size of the study, the frequency of the SNP, the order of magnitude of the phenotype effect of the SNP (in this case, on HDL cholesterol or apoA-I), and whether this effect is equally strong in both genders. In this study, we used this approach to determine whether genetic variation in ZNF202 affected HDL cholesterol, because the majority of ZNF202 target genes play a role in HDL metabolism. However, because ZNF202 is a transcriptional repressor of the actual functional gene, genetic variation in ZNF202 may have a less distinct effect on the intermediate phenotype than genetic variation in the structural gene itself.
Limitations The high HDL cohort had extremely elevated HDL cholesterol concentrations (2.93.3 mmol/l), so we cannot rule out that a subset of these subjects may harbor functional variants in other genes, such as cholesteryl ester transfer protein, hepatic lipase, lipoprotein lipase, and endothelial lipase, that are also associated with increased HDL cholesterol levels. Finally, we screened a relatively small portion of the promoter (700 bp), all protein-coding exons (exons 510), and the corresponding exon/intron boundaries of ZNF202, raising the possibility that functional variants affecting gene expression, or variants in introns or non-protein-coding exons affecting gene regulation, may have been missed. In support of intronic variants being important for gene regulation, a functional variant in an intron was recently identified in the USF1 gene (24). In conclusion, we show that genetic variation in ZNF202 is common in the general population. However, SNPs in and around the protein-coding region of ZNF202 do not make a major contribution to HDL cholesterol levels in the general population.
The authors thank Mette Refstrup for expert technical assistance. The authors also thank the subjects who participated in the study. This work was supported by the Danish Heart Foundation, the Danish Medical Research Council, Ingeborg and Leo Dannin's Grant, and the Research Fund at Rigshospitalet, Copenhagen University Hospital. Manuscript received November 30, 2005 and in revised form January 27, 2006.
This article has been cited by other articles:
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||