|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Journal of Lipid Research, Vol. 46, 1914-1922, September 2005 Higher order lipase gene association with plasma triglycerides
* Cardiovascular Division and Institute for Translational Medicine and Therapeutics, University of Pennsylvania School of Medicine, Philadelphia, PA Published, JLR Papers in Press, June 16, 2005. DOI 10.1194/jlr.M500042-JLR200
1 To whom correspondence should be addressed. e-mail: muredach{at}spirit.gcrc.upenn.edu
Lipoprotein lipase, HL, and endothelial lipase (EL) are proteoglycan-bound enzymes that regulate plasma lipoprotein levels through coordinated triglyceride (TG) lipase and phospholipase activity. We hypothesized that single nucleotide polymorphisms (SNPs) in lipase genes would have higher order impact on plasma lipoproteins beyond the influence of individual SNPs. In a sample of asymptomatic Caucasian subjects (n = 738), we used a two-stage approach, first identifying groups of subjects with similar multilocus lipase genotypes and then characterizing the relationships between genotype groups and plasma lipids. Using complementary methods, including a permutation test procedure and a mixed-effects modeling approach, we found a higher order interaction between four SNPs in three lipase genes (EL 2,237 3' untranslated region, EL Thr111Ile, HL 514C/T, and LPL Hind III) and plasma TG levels. Subjects who were heterozygous for all four lipase SNPs had significantly higher plasma TG levels beyond the effect of individual lipase SNPs and environmental factors, even after correcting for multiple comparisons. In conclusion, lipase genes had synergistic association with plasma TG beyond individual gene effects. Higher order multilocus genotype contributions to dyslipidemia and atherosclerotic cardiovascular disease need to be considered a priori because they may have an important effect even in the absence of significant main effects of the individual genes.
Supplementary key words lipoproteins lipases higher order gene interactions
Plasma lipoproteins are major determinants of the risk for atherosclerotic cardiovascular disease (CVD) (1). Although a wealth of biochemical and genetic evidence implicates multiple individual genes in the regulation of plasma lipids (2), defining the genetic basis of these complex traits across community-based populations has proven difficult. This reflects, in part, the modest main effects of individual gene variations and complex higher order interactions (3). Identifying gene-gene and gene-environment interactions may be of particular importance because the molecular mechanisms that control lipid levels may provide greater insight into the risk of developing CVD than the absolute lipid values. Lipoprotein lipase (LPL by HUGO nomenclature; also called LPD), HL (LIPC by HUGO nomenclature), and endothelial lipase (EL; LIPG by HUGO nomenclature), members of the same lipase gene family, are extracellular proteoglycan-bound enzymes that influence plasma lipoprotein levels through coordinated hydrolysis of lipoprotein triglyceride (TG) and phospholipid (Fig. 1). LPL preferentially hydrolyzes TGs in chylomicrons and VLDLs; HL hydrolyzes lipoprotein TG and phospholipids, particularly in HDLs and LDLs; and EL preferentially hydrolyzes HDL phospholipids (4). Gene deletion and overexpression studies in animal models have confirmed the lipoprotein-specific effects of these lipases in vivo (4).
In humans, lipase gene variants are associated with changes in plasma lipoproteins, particularly TG and HDL-cholesterol (HDL-C). Deficiency of LPL in humans, even heterozygous forms, results in increased TGs and low HDL-C (5). The relatively common LPL variant Ser447Stop and variants that are in linkage disequilibrium (e.g., Hind III) (6) are associated with increased LPL levels, decreased TG, and increased HDL-C (7, 8). HL deficiency is associated with increased HDL levels and particle size (9). A common variant in the HL promoter (514), which is in linkage disequilibrium with three other common HL polymorphisms (10), is associated with reduced HL activity, increased HDL-C levels, and, in some studies, higher TGs (1012). Early data suggest an association between common EL variants [T111I and 2,237 3' untranslated region (UTR)] and HDL-C (1316). Given distinct, but complementary, tissue distribution and lipoprotein substrates, we hypothesized that common variants in lipase genes may have higher order effects on plasma lipoproteins, that is, that multilocus combinations of lipase single nucleotide polymorphisms (SNPs) would have effects above and beyond the influence of the individual SNPs. In a sample of asymptomatic Caucasian subjects (n = 738), we used a two-stage approach, first identifying groups of subjects with similar multilocus genotypes and then characterizing the relationships between these genotype groups and plasma lipids. Using complementary methods, including a permutation test procedure and mixed-effects modeling, we characterized higher order relationships, controlling for demographic and environmental factors, and tested genotype group-sex interactions that were motivated by the reported influence of sex (6, 10, 17) on lipase gene-lipoprotein associations.
Study subjects The Study of Inherited Risk of Coronary Atherosclerosis is a cross-sectional study of risk factors for coronary atherosclerosis in healthy volunteer subjects and families enriched for family history of premature coronary artery disease (CAD). The study design and preliminary findings have been published (1821). Inclusion criteria included 1) men aged 3065 years and women aged 3570 years and 2) at least one first-degree relative with premature CAD (males younger than 60 years and females younger than 70 years). Exclusion criteria included established clinical CAD and extremes of major established CAD risk factors (known diabetes, total cholesterol > 300 mg/dl, cigarette smoking of more than one pack per day, and serum creatinine > 3.0) to facilitate the discovery of novel genetic influences on CVD. This report focuses on 738 unrelated Caucasian subjects with complete genotype data for lipase SNPs. The University of Pennsylvania Institutional Review Board approved the study protocol. Written informed consent was given by all subjects.
Evaluated parameters For this study, we chose SNPs in lipase genes that have been associated previously with plasma lipids. These included EL 2,237 3'UTR, EL Thr111Ile, HL 514, and LPL Hind III. Genotyping of lipase SNPs was performed on an Orchid 25K SNPstreamTM platform (Orchid Bioscience, Princeton, NJ) using a single-base primer extension method. A set of three primers was chosen for each SNP: two PCR primers selected to amplify a 100200 bp product, and a 25 bp single-base primer extension primer. For EL 2,237 3'UTR, AGTGCAACCCAWGAGAWCCCCAACAGC, GTGTTCAATAGACATTTGCTCAATTAA, and GTACTCTGCCTGACGAGGAAC; for EL Thr111Ile, GGGGAGCCAGTCAACCAC, AACTACATTGGCGTCTTTCTCTCTT, and TGCAGATGAGCGGTATCTTTG; for HL 514C/T, GTGTGZTGCAGAAAACCCTTZACCCCC, CAAATTTCTGTTGGGTTCAGTGA, and GTCACTTGGCAAGGGCATC; for LPL Hind III, AACATTACCCAGZTT-GATCATGTA, AAAATGGATGTGAATATGCCATG, and ATTCTGATGTGGCCTGAGTGT. SNP-IT primers were extended by one base at the polymorphic site of interest using DNA polymerase I. The extension mixes contained two labeled terminating nucleotides (one fluorescein, one biotin). An ELISA technique was used for detection of the extension product: anti-fluorescein-alkaline phosphatase (Boehringer Mannheim, Indianapolis, IN) for fluoresceinated nucleotides (allele 1) and anti-biotin-horseradish peroxidase conjugate (Zymed, San Francisco, CA) for biotinylated nucleotides (allele 2). Raw optical density data were analyzed by GetGenos software, and genotypes were corroborated by visual inspection.
Statistical analyses Association of individual lipase SNPs with plasma lipids Multivariable linear regression models were fit to assess the association between plasma lipids and each of the four lipase SNPs and all two-way SNP-SNP and SNP-sex interactions to identify SNPs and two-way interactions for inclusion in subsequent analyses of multilocus genotypes. SNPs were treated as three-level categorical variables (homozygous wild type, heterozygous, homozygous rare). These analyses were adjusted for the effects of age, sex, cigarette smoking (current vs. former and never), exercise (none vs. any), alcohol intake (none vs. any), systolic blood pressure, and use of the following medications (statins, niacin, fibrates, and hormone replacement therapy in women). The likelihood-ratio Chi-square test (LRT) was used to test for sex interactions. We did not correct for multiple testing to be inclusive of all potentially important effects of individual SNPs in our subsequent multilocus modeling. Multilocus genotype groups We created genotype groups defined simply as groups of individuals with identical observed multilocus genotypes. In general, this technique is useful for combining human SNP information when the number of SNPs is relatively small, the prevalence of a single SNP is low, or SNPs are highly correlated, as occurs in linkage disequilibrium. In these settings, the number of observed groups may be substantially less than the number of possible multilocus genotypes. In our analysis of four SNPs, there are 81 possible multilocus genotype groups, but only 55 were observed.
Unadjusted analysis of multilocus genotype group effects on lipids
Wilcoxon rank sum statistics for each lipid variable (e.g., ln-TG) were generated for tests comparing each genotype group to the reference group (the most prevalent genotype group). Because of known sex differences in plasma lipids, these analyses were performed separately in men and women. Significance was determined for the association with each lipid outcome controlling for experiment-wide error rate using both Bonferroni adjusted Multivariable modeling of individual genotype groups Multivariable linear regression of individual lipid outcomes was performed on genotype groups that were determined to be statistically significant by Bonferroni tests. Unlike the Wilcoxon approach, the goal was to determine whether the association of a specific genotype group, compared with all other groups, with lipid outcome was 1) independent of environmental influences and individual SNPs, and 2) modified by sex, as determined by the LRT. Mixed-effects modeling of all genotype groups simultaneously We also applied a comprehensive mixed modeling approach, as described by Foulkes et al. (24, 25), that permits the characterization of interactions between demographic factors (e.g., sex) and any multilocus genotypes regardless of whether they have a significant main effect or not. Posterior means of random genotype group-specific effects and corresponding variance estimates were used to assess the significance of multilocus genotypes and genotype-demographic interactions. The observed genotype groups can be considered a random sample from the general population of genotypes and, therefore, treated as random effects in a linear model. The general form of this model has been given (26) and provides a flexible framework for adjusting for potential confounders (through the fixed effects) and allowing genotype effects to vary across values of these variables (through the random effects). The mixed modeling approach allows for two stages of testing: 1) an omnibus LRT test for overall variability in the multilocus genotype effects on the outcome, and 2) genotype-specific tests based on conditional means of random effects and corresponding prediction intervals. Note that the significance cutoff for the omnibus test is based on a 50:50 mixture of a Chi-square [1 degree of freedom (d.f.)] and a Chi-square (0 d.f.) distribution, because we are testing the variance parameter at a boundary (27). In the case of a single random effect (e.g., genotype), this equals 1.92. Calculation of empirical Bayes estimates of random effects and corresponding 95% prediction intervals are described in a similar latent variable approach to the analysis of gene expression profiles (28).
Table 1 shows demographic, clinical, and laboratory characteristics of the study sample. Levels of traditional risk factors and plasma lipoproteins were close to the population average in these asymptomatic subjects, reflecting a recruitment strategy that excluded major lipoprotein abnormalities. Lipase SNP alleles were found to be in Hardy-Weinberg proportions in our study sample. There was highly significant linkage disequilibrium (D' = 0.93, P < 0.001) between the EL 2,237 3'UTR and EL Thr111Ile SNPs.
Association of individual lipase SNPs with plasma lipoproteins Unadjusted lipid data by lipase genotypes are shown in Table 2. In multivariable linear regression, after controlling for confounders and specifically assessing gene-gender interaction, the strength of lipase genotype associations with plasma lipid levels was modest and often differed for men and women, as reported in previous studies (6, 7, 10, 11, 14, 16). Briefly, 1) for ln-TGs in stratified analysis, LPL Hind III GG was associated with lower TGs in men (P = 0.02) but not in women (P = 0.57) (sex interaction LRT P = 0.05); 2) for total cholesterol, HL 514 TT was associated with higher total cholesterol in men and women combined (P = 0.03); 3) for HDL-C, LPL GG was associated with higher HDL in combined analysis (P = 0.006), and EL Thr111Ile CT and TT were associated with lower HDL in women (P = 0.003 and P = 0.05, respectively) but not in men (P = 0.39 and P = 0.48, respectively) (sex interaction LRT P = 0.006). EL haplotypes were not associated with any lipid outcome beyond the effect of the individual EL SNPs (data not shown). Contrary to the report of Xin et al. (8), we did not find an interaction effect of LPL and HL variants on plasma TGs (P = 0.85). These analyses were not the primary focus of our study but a first step to identify individual SNP associations, without penalizing for multiple comparisons, to be included in subsequent modeling of higher order multilocus genotype effects.
Genotype groups Multilocus genotype groups were defined as groups of individuals with identical genotypes across the four lipase SNPs. To limit the number of tests being performed, our primary analyses included only genotype groups that contained at least 10 individuals, and a confirmatory analysis included genotype groups that contained at least 5 individuals. Of 55 groups observed in our data set, 23 groups contained 10 or more individuals (87% of the sample) (Table 3) and 31 groups contained 5 or more subjects (93% of the sample). Genotypes listed in parentheses below refer to genotypes for EL 2,237 3'UTR, EL Thr111Ile, HL 514, and LPL Hind III, respectively. The most prevalent group, group 15 (GA, CC, CC, TT), was defined as the referent and contained 57 individuals. Note that an assigned referent group is necessary for the Wilcoxon-based approach but not for the mixed modeling method (see below).
Univariate analysis of genotype groups and plasma lipids In analysis stratified by sex, we found a significant association (after adjusting for multiple comparisons) between genotype group 13 (GA, CT, CT, GT; representing the heterozygous state for all individual lipase SNPs) and ln-TG in women but not in men (Fig. 2, Table 4). Interestingly, the Wilcoxon test statistic for the (GA, CT, CT, GT) group compared with the referent group varied markedly between men and women. There was no significant association between genotype groups and any other lipid variable after adjusting for multiple comparisons. Results were similar in sensitivity analyses using genotype groups that contained at least five individuals (data not shown).
Multivariable linear regression of the (GA, CT, CT, GT) genotype group and TGs We used traditional linear regression to test specifically for the potential sex interaction with the (GA, CT, CT, GT) group while controlling for other factors. Notably, in age-adjusted analysis, the (GA, CT, CT, GT) group was associated (fold increase) with ln-TG in women [1.76 (1.332.33)] but not in men [1.18 (0.851.56)] (interaction LRT P = 0.05). In fully adjusted analysis, however, including all individual lipase SNPs, the (GA, CT, CT, GT) group remained significantly associated with ln-TG in women [1.60 (1.221.53)] but not in men [1.14 (0.861.53)]), but there was no longer a significant interaction with sex (LRT P = 0.22). Therefore, results of fully adjusted models are presented for men and women combined (Table 5), in which associations with ln-TG were found for the (GA, CT, CT, GT) group [1.34 (1.091.64); P = 0.005], fasting plasma glucose (P < 0.001), waist circumference (P < 0.001), statin therapy (P = 0.01), systolic blood pressure (P = 0.02), and sex (P = 0.04), with nonsignificant effects for LPL Hind III GG genotype (P = 0.06) and cigarette smoking (P = 0.07). Adjustment for the LPL-HL SNP interaction, reported to be a significant predictor of TG levels by Xin et al. (8), did not attenuate the association between the (GA, CT, CT, GT) group and TG (P = 0.005).
Mixed-effects modeling of all genotype groups with plasma lipids As a more comprehensive approach to the association of multilocus genotypes with plasma lipids, we applied linear mixed-effects modeling as described previously (2426). The unique advantage of the mixed modeling approach is that it allows us to draw from the totality of the genetic data; it differs from the approach described in the previous section because it allows for the identification of an interaction between sex and all multilocus genotype groups, even in the absence of a main effect of individual multilocus genotype groups. Similar to the findings described above, in age-adjusted models, a significant sex-genotype group interaction with ln-TG was observed [global Chi-square (2 d.f.) = 4.86 for test of variance of random genotype effect equal to 0], and again the heterozygote genotype group (GA, CT, CT, GT) was solely identified as having a sex interaction effect that differs from 0 [95% protease inhibitor = (0.415, 0.103)]. As before, however, in fully adjusted models, there was no longer a significant interaction of genotype groups with sex [global Chi-square (2 d.f.) = 2.45], but the heterozygous genotype was a significant predictor of ln-TG [global Chi-square (1 d.f.) = 2.87; 95% prediction interval = (0.025, 0.27)]. The conditional means and prediction intervals of all random genotype group effects with >10 observations are illustrated in Fig. 3. The results were similar after additionally adjusting for individual SNPs as main effects in the model. Neither genotype groups nor their interactions with gender were associated with HDL-C or LDL-C in age-adjusted or fully adjusted mixed-effects models.
We report, for the first time, a higher order interaction between genetic variants in LPL, HL, and EL in the association with plasma TG levels. These lipases, all members of a TG lipase gene family (4), are likely to act synergistically to regulate plasma levels of lipoproteins and modulate the risk of developing atherosclerosis. Although studies of combined and interaction effects of genetic variants in candidate lipoprotein genes are emerging, these have been mostly restricted to an examination of two-way gene-gene interactions (8, 17, 29) rather than true higher order interaction effects of multiple genes (30). Using a two-stage analysis, with a conservative approach to multiple testing as well as a more flexible mixed-effects modeling approach, we identified a combination of lipase genotypes, the all-heterozygous (GA, CT, CT, GT) genotype group, that was strongly predictive of higher plasma TG levels. This association was independent of all individual lipase SNPs and environmental influences and, notably, was identified in a study sample with average values of plasma lipids. This study provides one of a few examples of a complex genetic effect on CVD risk factor traits that cannot be detected without considering, a priori, the influence of combinations of candidate genes, even when there appears to be weak or no association of individual genes with the traits (30). It is likely that the lipase gene family (LPL, HL, and EL) has evolved to regulate plasma lipids in a complex yet coordinated and synergistic manner (4) (Fig. 1). LPL, mostly acting in muscle and fat tissue, hydrolyzes TGs in chylomicrons and VLDLs. HL, predominantly localized to hepatic sinusoids, hydrolyzes LDL- and HDL-TGs and phospholipids. EL is expressed throughout the endothelium and hydrolyzes HDL phospholipids and also has modest lipolytic activity toward apoB-containing lipoproteins (31). Importantly, several studies have demonstrated a role for lipase genes in the development of atherosclerosis in mouse models (4, 32) and an association between lipase gene variation and CVD in human samples (11, 33, 34). We selected the specific lipase gene SNPs in this study because they represented strong candidate SNPs based on published literature. Several studies have demonstrated the association of the LPL Hind III variant with lower TGs and higher HDL-C (68) and of the HL 514 variant with increased HDL-C, reduced small dense LDL-C, and in some reports increased TGs (1012). Recent papers have reported associations of the EL 2,237 3'UTR and EL Thr111Ile variants with reduced HDL-C levels (14, 16). However, the effects of lipase SNPs have been reported to vary by age (35), sex (6, 10, 17), race (12), body mass (36), and environmental influences (15, 17). Indeed, an interaction of LPL Ser447Stop and HL 514 variants in the association with plasma TG levels has been reported (8). Thus, the modest and inconsistent contribution of candidate genes, including lipases, to plasma lipid levels and CVD events may reflect, in part, the failure to consider higher order gene-gene and gene-environment interactions. Nelson et al. (30) examined the synergistic influence of six distinct loci on plasma TG levels and found that many combinations of loci predicted TG variability in a nonadditive manner, suggesting that traditional methods, which rely on statistically significant single-locus effects, even those considering multiple genes (37), may fail to identify combinations of genes that best predict lipid variability. In our sample, subjects who were heterozygous for all four lipase SNPs (GA, CT, CT, GT) had significantly higher plasma TG levels than would be expected from the effect of individual lipase SNPs. LPL Hind III heterozygosity results in greater chylomicron and VLDL-TG hydrolysis. In this setting, heterozygosity for HL 514 may further promote the accumulation of apoB remnant particles, and EL variants, associated with increased HDL-C levels, also may influence apoB lipoprotein hydrolysis (31). Consistent with these expected individual lipase variant effects, the (GA, CT, CT, GT) group was associated with modest increases in HDL-C and LDL-C as well as TG. However, our cross-sectional study cannot address the mechanistic basis and functional impact of the heterozygous (GA, CT, CT, GT) state on plasma lipids. Metabolic studies in humans and examination of particle size and density will be required to provide additional insights. Because of reported sex-dependent lipase SNP-lipoprotein associations (6, 10, 17) and the known influence of sex hormones on lipase expression (38), we hypothesized a priori that there would be a sex difference in the multilocus genotype association with lipids. Although not statistically significant in fully adjusted models (which included factors that may be intermediate in the association), the difference in the strength of association for men and women suggests that the influence of sex on lipase-lipid associations may extend to higher order effects. These analyses also may have limited power with the given sample size (our unpublished results). These apparent sex differences require mechanistic study as well as further assessment in independent large samples with adequate power. Our study is notable for the lack of a multilocus lipase association with HDL-C. This may reflect a lack of synergistic higher order effects, failure to consider other important genetic determinants of HDL (e.g., apoA-I, cholesteryl ester transfer protein), a lack of power in our study, or type II error because of a conservative approach. Characterizing the association between multiple gene polymorphisms and measures of disease is challenging because of problems associated with multiple testing (inflating type I error) and limitations in current analytic approaches (3). Several papers describe approaches to reduce the number of candidate genes and thus reduce the number of comparisons (23, 37, 39). However, many of these approaches are limited because they fail to consider the interactive effects of SNPs that have modest or no main effects (3). We used a two-stage approach that provides a flexible framework for the analysis of higher order genetic data. Our first stage involved a data dimension reduction step for the generation of genotype groups that does not require the presence of a main effect of individual candidate SNPs. We used a pattern-based approach to define multilocus genotype groups, although this may be useful only in certain settings when using a relatively small number of SNPs. In the second stage of analysis, we initially applied a conservative approach to identify genotype groups with significant association with lipids. Groups with a significant effect in unadjusted analysis, after correction for multiple testing, were selected for multivariable modeling and to test genotype group-sex interaction. Given the limitations of traditional regression approaches, we also applied mixed-effects modeling, a more flexible and robust approach that allows modeling of all multilocus genotypes simultaneously as well as their interaction with demographic factors. Alternative approaches to dimension reduction, including clustering algorithms, and methods for the selection of multilocus genotypes, such as permutation procedures and false discovery rate (40), can be integrated into our framework (24, 25). It is important to note that in analyses requiring multiple tests of association, there is a greater potential for spurious findings resulting from inflation of type I error. To address this concern, we implemented a rigorous, nonparametric permutation-based testing procedure that is particularly useful in the context of correlated tests and notably does not rely on large-sample asymptotic theory. The results of this test imply >90% confidence that our findings are not spurious. In conclusion, we found a higher order interaction of genetic variants in lipase family members (LPL, HL, and EL) in the association with plasma TG levels. Consistent with a true synergistic multilocus effect, the all-heterozygous lipase group (GA, CT, CT, GT) was predictive of higher TG levels, beyond the influence of individual lipase SNPs. Given the inability of traditional approaches to detect these synergistic, and clinically relevant, genetic effects, our statistical approach and findings have important implications for defining the role of common genetic variations in dyslipidemia and atherosclerotic CVD.
This study was funded in part by Grant M01 RR-00040 from the National Center for Research Resources/National Institutes of Health supporting the University of Pennsylvania General Clinical Research Center. M.P.R. is supported the National Center for Research Resources (Grant RR-15532-02), by Grant RO1 HL-73278-01 from the National Institutes of Health, and by the W. W. Smith Charitable Trust (Grant H0204). D.J.R. is supported by grants from the National Heart, Lung, and Blood Institute, the National Institute of Diabetes and Digestive and Kidney Diseases, and the National Center for Research Resources and is a recipient of the Burroughs Wellcome Fund Clinical Scientist Award in Translational Research and a recipient of a Doris Duke Distinguished Clinical Investigator Award. Manuscript received February 3, 2005 and in revised form March 25, 2005 and in re-revised form June 10, 2005.
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||