|
|
||||||||
Papers In Press, published online ahead of print December 1, 2007
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Journal of Lipid Research, Vol. 48, 2632-2639, December 2007
Copyright © 2007 by American Society for Biochemistry and Molecular Biology

Washington University School of Medicine, St. Louis, MO 63110
The online version of this article (available at http://www.jlr.org) contains supplementary data in the form of one table. ![]()
Published, JLR Papers in Press, September 23, 2007.
1 To whom correspondence should be addressed. e-mail: roz{at}psychiatry.wustl.edu
| ABSTRACT |
|---|
|
|
|---|
Supplementary key words genetics cardiovascular disease lipids genome-wide linkage analysis
Abbreviations: ACADSB, acyl-coenzyme A dehydrogenase short/branched chain; apoB, apolipoprotein B; BMI, body mass index; cM, centimorgan; FHBL, familial hypobetalipoproteinemia; FOXP1, forkhead box protein P1; h2, narrow sense heritabilities; LOD, log of the odds; PCSK9, proprotein convertase subtilisin/kexin type 9; PNLIP, pancreatic triacylglycerol lipase precursor; PNLIPRP, pancreatic lipase-related protein precursor; PPAPDC1A, phosphatidic acid phosphatase type 2 domain-containing 1A; SCD, stearoyl-coenzyme A desaturase; SLC25A26, solute carrier family 25 member 26; SNP, single nucleotide polymorphism; UTR, untranslated region
| INTRODUCTION |
|---|
|
|
|---|
2-fold increase in coronary heart disease risk over those in the lower tertiles after multivariate adjustment. Other studies suggest that increased apoB levels may be associated with an even greater risk of coronary heart disease than high LDL cholesterol concentrations (3, 4). Data from twin studies show that there is a strong heritability for apoB and LDL levels (5, 6), and numerous genome scans have attempted to identify loci linked to lipid levels. A recent summary of the results from various lipid genome scans lists significant or suggestive evidence for linkage on chromosomes 1, 2, 3, 6, and 11 for apoB and on chromosomes 1, 3, 4, 5, 6, 10, 11, 13, 15, 17, 19, and X for LDL (7). In addition, linkage scans for loci influencing both traits have identified regions on chromosomes 2, 11, and 18 (7).
In this study, we performed genome-wide linkage analysis in a cohort of Caucasian families ascertained for familial hypobetalipoproteinemia (FHBL). FHBL is a disorder in which subjects have extremely low levels of apoB and LDL, generally in the lower sex- and age-adjusted 5% of the population (8), a condition that may provide a possible protective advantage against coronary problems. Mutations in the apoB gene (APOB) on chromosome 2p23-24 have been shown to produce truncations in the apoB protein, resulting in low apoB levels, but only in a minority of FHBL families (9). Previously, we reported strong linkage signals to a region of chromosome 3p21 in several FHBL families without the truncated form of the apoB protein (10). We have also shown that these families differ significantly in fatty liver content from FHBL families, with truncation producing mutations in the apoB gene (11). The FHBL families analyzed here do not have truncated forms of the apoB protein, nor do they show linkage to chromosome 3p21. We followed up our linkage analyses by testing for an association between apoB and LDL levels and single nucleotide polymorphisms (SNPs) in potential susceptibility genes in the linked regions.
| METHODS |
|---|
|
|
|---|
Lipid analyses, DNA isolation, and genotyping
Methods for the determination of lipid levels and blood extraction have been described (13). Microsatellite genotyping was performed by the Mammalian Genotyping Service (Marshfield, WI) (14) using Screening Set 14 (http://research.marshfieldclinic.org/genetics/home/index.asp). SNP genotyping was conducted at the Genotyping Core in the Department of Human Genetics, Washington University School of Medicine, using the high-performance Sequenom MassARRAY system. Briefly, the technology involves PCR amplification of the region containing the SNP of interest, an optimized primer extension reaction to generate allele-specific DNA products, and chip-based mass spectrometry for the separation and analysis of the DNA. Primers of selected SNPs were designed to fit them into four multiplexes of 384-well format. The overall successful rate of this genotyping project was 96%. All pedigrees and genotypes were checked for errors with PEDCHECK (15), PREST (16), and MERLIN-ERROR (17), and unlikely genotypes were set to missing using the PEDWIPE feature in MERLIN.
Statistical methods
We used two fundamentally different methods of quantitative linkage analysis, a recently introduced regression model, MERLIN-REGRESS implemented in the MERLIN software package (17), and a variance components model implemented in SOLAR version 2.0.3 (18). The regression-based linkage analysis uses trait-squared sums and differences to predict identity by descent allele sharing between relative pairs in pedigrees. This reversal of the independent and dependent variables used in Haseman-Elston methodologies for analyzing quantitative traits in general pedigrees (19) has the benefits of robustness for nonnormality of quantitative traits and misspecification of the required input parameters, the population trait's mean, variance, and heritability (17). We estimated the means and variances of apoB and LDL from the founders in families in the current study plus additional FHBL families used in our previous analyses (10, 13, 20) (n = 58). Heritability for this regression method was estimated using SOLAR. Because variance components analyses require normal trait distributions, the phenotypes were defined to be the square root of the subject's serum apoB or LDL level. Age, gender, and body mass index (BMI) were included as covariates. Because LDL and apoB levels are highly correlated (21), a bivariate model was also specified in SOLAR to test for loci jointly contributing to the variance of apoB and LDL. Bivariate models estimate variance parameters and regression parameters for each individual trait as in univariate models but also include a parameter estimating the linkage correlations between two traits. This can increase the power to detect loci with pleiotropic effects (22). Five thousand replicates of our kindreds were simulated for the regions showing the most evidence for linkage under the hypothesis of no linkage between trait and marker. Empirical log of the odds (LOD) score distributions and P values were computed for both the regression-based and variance components genome scans using the simulated data.
To narrow the linkage regions, we chose one tag SNP for genotyping (using an r2 threshold of 0.8) in coding regions or untranslated regions (UTRs) of purported candidate genes located within a 1 LOD score interval of our peak linkage regions identified on chromosomes 3, 6, 10, and 13 (see Table 2 below). For a more even distribution, a few additional tag SNPs were selected on genes whose function was unknown. Although we preferred common SNPs, some SNPs with low minor allele frequencies (
0.05) were retained because of the possibility that rare variants may prove to be important for our quantitative traits (23). We tested for an association of these SNPs with apoB and LDL using linear regression models with covariance corrected for correlation among relatives (the GENMOD procedure in SAS version 9.1). To increase our power, three additional families in which linkage to the APOB gene had been ruled out were included in this analysis. To reduce the skewness in the distributions, measured apoB and LDL levels were log-transformed. Genetic effects were modeled as the additive effect of the number of minor alleles, and age, gender, and BMI were included as covariates. Additive coding has been shown to be robust for the detection of nonadditive effects when the minor allele frequency is >20% (24).
|
| RESULTS |
|---|
|
|
|---|
40% and 60% lower, respectively, than the average levels in those subjects considered normal (8). Nonparametric Kruskal-Wallis tests showed significant differences in the means of apoB (P < 0.0001), LDL (P < 0.0001), and BMI (P < 0.0001) between FHBL and normal individuals, whereas the difference in mean age between these groups was marginally statistically significant (P = 0.06).
|
|
|
|
The maximum LOD scores in the apoB-LDL bivariate analyses occurred on chromosomes 6q24.3 (LOD = 1.43; P = 0.01) and 10q25.1 (LOD = 1.74; P = 0.004) at the same locations as the signals observed for each individual trait. Table 2 summarizes the locations and significance of the highest LOD scores from regression and variance components analyses.
Candidate gene association analysis
We tested for genetic association between 49 SNPs on chromosomes 3, 6, 10, and 13 in potential candidate genes (see supplementary Table I) and the log of the individuals' measured serum apoB and LDL. Our sample size increased to 153 genotyped subjects when we added the three additional families. Table 3
displays the most significant (unadjusted for multiple testing) association findings by gene and chromosome, along with the direction of the effect on apoB and LDL and the location and type of the polymorphisms. Table 3 shows that the most significant association was between rs2277250 and LDL in the gene acyl-coenzyme A dehydrogenase short/branched chain (ACADSB) on chromosome 10 (Z = 5.33; P = 0.000001, unadjusted for multiple testing). This finding is still significant after using the Bonferroni correction for 49 tests. (Our SNPs are all in different genes, and apoB and LDL are very highly correlated traits.) There was an associated increase in apoB and LDL levels of 10% and 14%, respectively, for each minor allele carried at this locus. Overall, five SNPs were associated with LDL, and four were associated with apoB, at P < 0.05. Four of these were associated with both apoB and LDL.
|
Determining which SNPs influence linkage signals
In an attempt to determine whether the most significantly trait-associated SNPs were responsible for the initially observed linkage signals, we repeated the regression-based scans of chromosome 10 on apoB and LDL after regressing out the effects of the most significant SNPs in the region (rs2277250 and rs2463147). The LOD score for LDL decreased from 4.22 to 0.58 at 140 cM after adjustment for rs2277250 and to 0.88 after adjustment for rs2463147. Likewise, the LOD score for apoB in the region decreased from 3.5 to 0.63 after adjustment for rs2277250 and to 0.9 after adjusting for rs2463147. The LOD score on chromosome 3 in the variance components analysis of LDL (LOD = 1.52) also decreased after adjustment for the most significant SNPs in the region (to 0.95 after adjusting for rs9819895 and to 0.58 after adjusting for rs2311298), indicating that each of the SNPs we identified accounts for a significant proportion of the allele sharing among phenotypically similar relatives in the linkage regions.
| DISCUSSION |
|---|
|
|
|---|
17 Mb upstream from our signal. The chromosome 13q13.2 (32.9 Mb) peak for apoB is in the same region as two previous linkage findings for low levels of serum HDL cholesterol (29, 30), and our peak on chromosome 6 is <20 cM downstream from a previously reported HDL3 linkage region (31).
Candidate genes
We expanded our search for potential variants associated with FHBL or low LDL levels by identifying SNPs in genes within our most significant linkage regions. Our approach may be considered a "discovery" phase because the function of some of these genes with respect to these lipid measures is unknown. However, several genes located near the peak on chromosome 10 have potential links to lipid metabolism. For example, pancreatic triacylglycerol lipase precursor (PNLIP), a 56 kDa protein secreted by the pancreas that is essential for the hydrolysis and absorption of long-chain triglyceride fatty acids in the intestine is located
1 Mb upstream from the chromosome 10 peak (32). As mentioned previously, a SNP in PNLIP (1359 C/T) was significantly associated with decreased serum apoB, apoA-I, lipoprotein [a], and total cholesterol in 185 neonates of African descent (29). However, the polymorphism was not present in a sample of 100 Caucasians selected from the general population. Therefore, because pancreatic lipase-related protein 3 precursor (PNLIPRP3) and PNLIPRP1 interact with and flank PNLIP (33), we used our resources to sequence SNPs in those genes.
The fatty acid CoA ligase ACSL5, also located on 10q25.1, catalyzes the formation of acyl-CoA from fatty acid, ATP, and CoA, a reaction essential to mammalian fatty acid metabolism. ACSL5, in conjunction with fatty acid transporter protein, also mediates the cellular uptake of fatty acids (34). One SNP was typed in this gene but it was not significantly associated with apoB or LDL.
Another candidate on chromosome 10q24.31, SCD, catalyzes a rate-limiting step in the synthesis of unsaturated fatty acids. It has been shown in animal models to play an important role in cholesterol and lipoprotein homeostasis (35). It is thought to be an intermediary between leptin and obesity, as leptin represses RNA levels and the enzymatic activity of SCD. In another experiment, mice without Scd1 were lean and hypermetabolic, and in genetically obese (ob/ob) mice, those with mutations in Scd1 were significantly less obese than ob/ob mice without Scd1 mutations. The mice with mutations in Scd1 also had significantly reduced hepatic triglyceride storage and VLDL production (36).
Although no SNPs subsequently typed in the initially hypothesized candidate genes appear to be associated with apoB or LDL, several other SNPs in these and other regions may be. At the 0.05 level, three SNPs on chromosome 10 showed association with both traits: two located
2 Mb apart in the genes ACADSB and phosphatidic acid phosphatase type 2 domain-containing 1A (PPAPDC1A). ACADSB is a mitochondrial enzyme that oxidizes straight-chain or branched-chain acyl-CoAs in the metabolism of fatty acids or branched-chain amino acids (37). In this sample, minor alleles of rs2277250 are more common in those with higher apoB and LDL, and the association is stronger for LDL. The SNP is located in the gene's 5' UTR and is a known tag SNP. The second significant SNP on chromosome 10, rs2463147, is in the gene PPAPDC1A, which is likely a membrane-associated phosphatidic acid phosphatase, based on domain homology (38).
Another candidate, solute carrier family 25 member 26 (SLC25A26), located under the peak on 3p14.1, is involved in the mitochondrial transport of S-adenosylmethionine, the methyl group donor for almost all biological methylation reactions, and is necessary for the biosynthesis of lipoic acid and ubiquinone, which are in turn cofactors in enzyme complexes necessary for aerobic metabolism (39). Finally, SNPs were chosen in forkhead box protein P1 (FOXP1) based on its proximity to the linkage peak on chromosome 3 and the fact that it is a transcription factor with significant sequence homology with the DNA binding domains of the hepatocyte nuclear factor-3/forkhead protein family (40) and is expressed at high levels in gastrointestinal tissue (41).
One of our goals was to better characterize the pathways causing FHBL besides the common truncation mutations in the apoB gene. One possibility is that these families harbor one of the several apoB mutations not detectable by immunoblotting. This seems unlikely, as we have the ability to detect truncated forms as small as apoB-30 and extensive fine-mapping on chromosome 2 reveals no evidence for linkage to the apoB region in any of these families. Another possible source of variation in apoB or LDL levels is variation in the apoE gene, as we have shown previously that
4 alleles are associated with increased serum apoB and LDL (42). We did not type apoE variants in the current study, although it is unlikely that differences at this locus could fully account for the magnitude of the differences in apoB and LDL observed between FHBL-affected and unaffected family members. To adjust for apoE effects, we will likely genotype this locus in the future. Another gene of substantial interest for decreasing LDL levels is proprotein convertase subtilisin/kexin type 9 (PCSK9) on chromosome 1p32.3. Increasing interest in PCSK9 has produced several new human and animal studies of this gene. PCSK9 promotes the degradation of the LDL receptor in hepatocytes, apparently both intracellularly and by being a secreted protein that can bind the LDL receptor and be internalized. It has emerged as a promising therapeutic target (43).
The candidate genes with evidence for an association with apoB and LDL act in pathways related to the absorption of lipids in the gut (PNLIP), system-wide (PPAPDC1A) or mitochondrial (ACADSB and SLC25A26) lipid metabolism, or transcription factors active in the liver and gastrointestinal tract with unknown consequences (FOXP1). The observation that certain individuals with FHBL exhibit steatorrhea and diarrhea supports the hypothesis that a subset of FHBL cases are attributable to reduced intestinal lipid absorption. The lower BMI observed in FHBL cases also supports this hypothesis, as does the possibility that there is an overall increase in the rate at which these individuals metabolize lipids.
Sensitivity analysis
Because of the ascertainment of probands with FHBL, the mean apoB and LDL levels in this population were significantly lower than the means measured in the founders in the pedigrees and from previous population estimates. Although the LOD scores from regression-based methods are purported not to be sensitive to the input h2 and population means (19), we tested the effects of varying these estimates for both traits within a biologically plausible range. For both traits, reducing the h2 estimate by one-third (to 40% for apoB and 14% for LDL) decreased the maximum LOD score on chromosome 10. Increasing the h2 of apoB to 80% for apoB and 27% for LDL increased the magnitude of the chromosome 10 LOD score for LDL but decreased the LOD score for apoB. Increasing the input population mean LDL and apoB by 20 mg/dl reduced both peak LOD scores, and reducing the population mean by the same amount increased both LOD scores. Although the magnitude of the peak LOD scores on chromosome 10 changed after systematically changing the input population means, variances, or h2 estimates, the location of the peak LOD score on chromosome 10 did not change, indicating that the location of linkage peaks may be robust to misspecification of the model parameters.
Strengths and limitations
One limitation of this analysis is the lack of data on covariates known to affect apoB and LDL levels. Although some key factors were measured, potentially important lifestyle covariates such as physical activity, dietary composition, and smoking status were not. Adjustment for BMI may act as a surrogate for lifestyle factors and reduce the potential for confounding as a result of differences in diet and exercise.
The results presented are based on a relatively small sample size of nine three-generation pedigrees containing 91 genotyped subjects. Nevertheless, the smallest P values on chromosome 10 provide strong evidence for suggestive linkage after correction for multiple tests according to the guidelines of Lander and Kruglyak (44), increasing our confidence that the signal is real. In addition, the agreement between the results from linkage methods with totally different mathematical underpinnings also increases our confidence in the results. Furthermore, the fact that there was evidence for linkage to both apoB and LDL on chromosome 10 in both the regression and variance components methods, along with the corroborating results from the bivariate scan, increases the chance that we are observing a locus affecting both traits rather than an artifact of testing correlated traits. Perhaps the most intriguing aspect of the signal on chromosome 10 is its proximity to the SNP in PNLIP that showed an association with lower serum apoB in African neonates (28). If this SNP is present in these families, it may explain why they exhibit FHBL without the apoB truncation mutation.
Finally, the SNP genotyping conducted here was in no way comprehensive; instead, it was meant to narrow the search for candidate genes in linkage regions. Subsequent genotyping will provide better coverage of the linked regions and attempt to locate causal polymorphisms for FHBL in the genes we did identify.
| ACKNOWLEDGMENTS |
|---|
Manuscript received February 12, 2007 and in revised form May 30, 2007 and in re-revised form September 18, 2007.
| REFERENCES |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Journal of Biological Chemistry |
| Molecular and Cellular Proteomics | ASBMB Today |