The mouse QTL map helps interpret human genome-wide association studies for HDL cholesterol.

Genome-wide association (GWA) studies represent a powerful strategy for identifying susceptibility genes for complex diseases in human populations but results must be confirmed and replicated. Because of the close homology between mouse and human genomes, the mouse can be used to add evidence to genes suggested by human studies. We used the mouse quantitative trait loci (QTL) map to interpret results from a GWA study for genes associated with plasma HDL cholesterol levels. We first positioned single nucleotide polymorphisms (SNPs) from a human GWA study on the genomic map for mouse HDL QTL. We then used mouse bioinformatics, sequencing, and expression studies to add evidence for one well-known HDL gene (Abca1) and three newly identified genes (Galnt2, Wwox, and Cdh13), thus supporting the results of the human study. For GWA peaks that occur in human haplotype blocks with multiple genes, we examined the homologous regions in the mouse to prioritize the genes using expression, sequencing, and bioinformatics from the mouse model, showing that some genes were unlikely candidates and adding evidence for candidate genes Mvk and Mmab in one haplotype block and Fads1 and Fads2 in the second haplotype block. Our study highlights the value of mouse genetics for evaluating genes found in human GWA studies.

Coronary artery disease (CAD) is the leading cause of death in the United States. Clinical and mouse studies show that a high plasma cholesterol level is a major predictor of CAD, whereas a high level of HDL cholesterol protects against CAD ( 1 ). Complex traits such as HDL-cholesterol levels are the result of a combination of multiple genetic and environmental factors. Quantitative trait loci (QTL) analysis has been used to study the genetics of HDL levels in humans and mice. By 2005, approximately 40 HDLregulating QTL had been identifi ed in each species ( 2 ). In the past 2 years, numerous human genome-wide association (GWA) studies have confi rmed known genes and identifi ed new genomic loci associated with HDL levels (3)(4)(5)(6)(7)(8)(9)(10)(11). Human GWA studies typically identify only a few signifi cant genes because of the stringent statistical threshold, but they also highlight a number of genes with suggestive P values that are not investigated further due to the potential of identifying a high number of false positives. With the mouse, we can independently evaluate both signifi cant and suggestive loci from GWA studies. One method is to test in vivo mouse models such as knockout or transgenic mice. Another method is to use the mouse model for sequencing, expression studies, or breeding when a QTL has been detected in the mouse homologous region. Even when GWA peaks are replicated in additional human studies, an experimental model such as the mouse is useful for elucidating gene function.
This approach is based on the hypothesis that a QTL for a trait in the mouse that maps to the homologous location as a QTL or GWA peak for the same trait in humans is most likely caused by the same gene. Although it may not always be true, this is a reasonable hypothesis that has been Abstract Genome-wide association (GWA) studies represent a powerful strategy for identifying susceptibility genes for complex diseases in human populations but results must be confi rmed and replicated. Because of the close homology between mouse and human genomes, the mouse can be used to add evidence to genes suggested by human studies. We used the mouse quantitative trait loci (QTL) map to interpret results from a GWA study for genes associated with plasma HDL cholesterol levels. We fi rst positioned single nucleotide polymorphisms (SNPs) from a human GWA study on the genomic map for mouse HDL QTL. We then used mouse bioinformatics, sequencing, and expression studies to add evidence for one well-known HDL gene ( Abca1 ) and three newly identifi ed genes ( Galnt2 , Wwox , and Cdh13 ), thus supporting the results of the human study. For GWA peaks that occur in human haplotype blocks with multiple genes, we examined the homologous regions in the mouse to prioritize the genes using expression, sequencing, and bioinformatics from the mouse model, showing that some genes were unlikely candidates and adding evidence for candidate genes Mvk and Mmab in one haplotype block and Fads1 and Fads2 in the second haplotype block. Our study highlights the value of mouse genetics for evaluating genes found in human GWA studies. matics and molecular evidence including: 1 ) haplotype differences between the parental QTL strains, and 2 ) either a gene expression difference or the presence of a nonsynonymous coding polymorphism between the QTL parental strains. For haplotype analysis, we used the Center for Genome Dynamics (CGD) new mouse diversity array genotype data ( 21 ) and the CGD strain comparison tool, which allow the determination of genomic region identical between two or more strains. Genes located within these identical regions can be excluded as candidate genes for the QTL, as they are not likely to carry the causative polymorphism. Amino acid changes were assessed by examining the CGD imputed database available on their website ( 22 ), by the Mouse Phenome Database (MPD), or by resequencing of the parental strains. Additionally, we used the Sanger Institute database when the strains of a QTL were among the fully resequenced 17 strains. Expression differences were assessed using the results of the 12-strain microarray survey from Shockley et al. ( 23 ), available at the CGD website, or by performing real-time PCR in the parental strains using mice of the same sex and fed the same diet as used to detect the QTL. We assessed the potential functionality of the single nucleotide polymorphisms (SNPs) located in the promoter region by using Alibaba v2.1, which uses the Transfac database as a reference for transcription factor binding sites.

Expression analysis
For real-time PCR, we generated cDNA from liver RNA that was collected for male and female mice from 12 strains fed a chow or atherogenic diet as described previously for microarray survey experiments ( 23 ). We also collected tissues from additional strains [C57BL/6J (B6), CAST, NZB/BINJ (NZB), and NZW/LacJ (NZW)] in an identical manner. Gene-specifi c primers were designed by Primerdesign Ltd. (Southampton, UK) (supplementary Table III). Real-time PCR was performed using the ABI Prism 7000 sequence detection system (Applied Biosystems, Inc., Foster City, CA) using SYBR green. Beta-2 microglobulin or Gapdh were used as controls. Data were analyzed by LinRegPCR (v11.0) ( 24 ). Expression ratios between the two strains were calculated using the Relative Expression Software Tool (REST©) ( 25 ).

Sequencing of candidate genes
We obtained strain-specifi c genomic DNA from the Mouse DNA Resource at The Jackson Laboratory. Using custom primers, we resequenced the exons and splice sites of Abca1 (NM_013454. 3) in CAST, PERA, I, D2, and 129, and Wwox (NM_019573.3) and Galnt2 (NC_000074.5) in B6, CAST, and CASA/RkJ (CASA). We also resequenced 713 bp upstream of Abca1 and the 3 ′ UTR of Galnt2 in the relevant strains. Exon1 of Galnt2 was not resequenced due to technical diffi culties. For Cdh13 , we searched the CGD imputed SNP database for polymorphisms that contain CAST and B6 but not CASA. Direct sequencing was performed on the PCR products using Big Dye Terminator Cycle Sequencing Chemistry and the ABI 3700 Sequence Detection System (Applied Biosystems, Inc.). Results were analyzed using Sequencher software (version 4.2). All SNPs and genotypes have been reported to NCBI dbSNP. Each polymorphism was evaluated for functionality using the Match TM Algorithm (BIOBASE Biological Databases GmbH, Wolfenbüttel, Germany) for polymorphisms in the promoter region and PolyPhen and SIFT for nonsynonymous coding polymorphisms ( 26,27 ). We used the University of California Santa Cruz (UCSC) database to assess if the amino acid was conserved in mammals.

Abca1 promoter activity assay
We amplifi ed 713 bp upstream of the transcription start site of the mouse Abca1 gene from strains CAST and D2 and ligated used to fi nd the causative genes for many traits. In addition, if two genes are closely linked to each other and have the same or similar functions, then a mutation in one could affect HDL in humans and a mutation in the other could affect HDL in mice.
Mouse QTL are detected by crossing two strains [as reviewed in Peters et al. ( 12 )], phenotyping all offspring, and genotyping polymorphisms at regular intervals in the genome. We use the mouse "toolbox" to investigate candidate genes within a locus and ultimately to identify the QTL gene ( 13,14 ). This toolbox is based on genomic comparison of the parental strains of the QTL cross. The list of evidence includes databases and bioinformatic tools to compare haplotype blocks, differences in gene expression, and nonsynonymous polymorphisms predicted to change function.
In this study, we demonstrate how the published mouse QTL map helps interpret human GWA studies for HDL. For the genes identifi ed in human GWA studies, we use the mouse model to provide bioinformatics, sequencing, and expression studies to test whether the gene is likely to be an HDL candidate gene in the mouse, thus supporting the human GWA fi ndings. Then, for GWA peaks that point to multiple genes, we evaluate each gene, pinpoint the most credible HDL genes, and judge which genes are unlikely candidates.

Human dataset
The GWA meta-analysis ( 5, 6 ) comprised three studies: the Diabetes Genetics Initiative (DGI), the Finland-United States Investigation of NIDDM Genetics (FUSION) and SardiNIA Study of Aging ( 5, 6 ). The dataset included the results from the association analysis between the genotypes of approximately 2.5 million SNPs (genotyped and inferred) and the phenotype (HDL levels) for 8,816 individuals. SNPs with P < 10 Ϫ 3 were sorted into bins by linkage disequilibrium using LDSelect with a cut-off value of r 2 = 0.5 ( 15 ).

Mouse datasets
We used the published mouse HDL QTL map as a reference ( 2 ); we updated it with recent publications. The updated mouse HDL QTL list is provided in supplementary Tables I and II. The PERA/EiJxDBA/2J (PERAxD2), PERA/EiJxI/LnJ (PERAxI), CAST/EiJx129S1/SvImJ (CASTx129), and DBA/2JxCAST/EiJ (D2xCAST) crosses had previously been published by our group (16)(17)(18) and the raw data was available to us to perform combined QTL analysis as described in ( 18,19 ).

Mapping approach
We used the human genome map from NCBI (build 36) and the refSNP position to place the GWA SNPs onto the human genome. To identify the mouse orthologs of the human GWA genes, we downloaded the human-mouse orthology gene list from the Biomart database, which integrates genetic resources including the Ensembl homology ( 20 ).

Mouse bioinformatics toolbox
The mouse bioinformatic toolbox has previously been described in Burgess-Herbert et al. ( 14 ) and DiPetrillo et al. ( 13 ). In summary, a mouse HDL QTL gene should present bioinfor-( Abca1 ) as "proof of principle" and continuing with three unknown HDL genes ( Galnt2 , Wwox , and Cdh13 ).

Evidence for a known mouse HDL gene, Abca1
Abca1 , identifi ed in the GWA with P = 7. 36  , NZBxNZW, and B6xC3H) at the Abca1 location at 53 Mb ( Fig. 2A ) ( 16-18, 28, 29 ). The homologous region between mouse and human QTL contained 496 genes ( Fig. 2A ). We combined the data from four QTL crosses for which overlapping wild-derived strains were used: 129xCAST with CASTxD2 and PERAxI with PERAxD2 (supplementary Fig. I ). The combined QTL exhibits a reduced confi dence interval (CI) and a higher logarithm of the odds (LOD) score, two consistent characteristics indicating that the same gene causes a QTL in two crosses ( Fig. 2A ). The QTL peak is located at 28 cM (53 Mb) for the PERA crosses and 22-26 cM for the CAST crosses; Abca1 is at 28.6 cM. This combined approach reduced the region from 495 genes to 171 genes for the PERA crosses, and to 208 genes for the CAST crosses; 157 genes, including the known HDL gene Abca1 , were found in both PERA and CAST crosses (supplementary Table VIII).
We resequenced the exons and about 1Kb upstream of the gene in the parental strains (PERA, CAST, I, D2, and 129) to encompass the proximal promoter region of Abca1 (supplementary Table IX). Through this resequencing, we identifi ed three haplotypes within the gene region specifi c to CAST, PERA, and I/D2/129 ( Fig. 2B ). This fi nding is consistent with the results of the combined QTL cross; I and D2 are identical but different from PERA, D2 and 129 are identical but different from CAST ( Fig. 2B ).
We then examined Abca1 for a causal polymorphism: either a nonsynonymous coding polymorphism that differs between the parental strains and is likely to change function or a difference in expression between the parental strains, implying a regulatory polymorphism. For PERA crosses, we identifi ed eight polymorphisms that changed amino acids between PERA and the strains I and D2; one of these, E1550K, was damaging according to Polyphen ( Table 1 and supplementary Table IX). Therefore, we suggest that this may be the causal polymorphism that results in an HDL QTL between PERA and the strains I and D2.
Nevertheless, this polymorphism cannot explain the QTL in the CAST crosses because CAST, D2, and 129 did not have any polymorphisms that changed amino acids. However, the sequence of the promoter region differs by 10 polymorphisms in CAST ( Fig. 2B and supplementary  Table IX) compared with D2 and 129. Therefore, we hypothesized that a difference in Abca1 expression may explain the CAST crosses. Using our 12-strain microarray survey, we identifi ed an expression difference between CAST and (D2 and 129) males on a high-fat diet but no expression difference among PERA, I, and D2 females ( Table 1 and supplementary Table X). This result was confi rmed by real-time PCR ( Fig. 2C ). By luciferase assay, we each amplicon to the PGL3-enhancer vector (Promega, Madison, WI). Each promoter construct was cotransfected to HEK293 cells (American Type Culture Collection, Manassas, VA) with plasmids CMX-BGAL, CMX-mlXRa, and CMX-mRXRa. Transfection was accomplished using FuGENE 6 according to the manufacturer's instructions (Roche Diagnostics, Indianapolis, IN). Transfected cells were incubated for 24 h with vehicle alone (ethanol), 22OH (10 M), 9cRA (10 M), or with both ligands. After washes, cells were collected and lysed; supernatant was collected after centrifugation. Luciferase activity was measured using a luminescence counter (Victor 2, Perkin Elmer, Boston, MA) immediately after addition of luciferase assay reagent to an aliquot of cell lysate.

Mapping of GWA study genes onto the mouse QTL map for HDL levels
We mapped results of the human GWA study from Kathiresan et al. ( 5 ) onto homologous locations on the mouse genome, investigating the seven signifi cant loci (104 SNPs with P < 5 × 10 Ϫ 8 as well as suggestive SNPs, defi ned as P < 10 Ϫ 3 . These 3,882 SNPs were fi rst binned by linkage disequilibrium into 1,121 bins using r 2 = 0. 5. To reduce the number of potential false positive results from the human GWA study, we only selected the most significant bins carrying at least one SNP with P < 10 Ϫ 4 (171 bins or 1,324 SNPs in total indicated in supplementary Table  IV). Each bin contained between one and 116 SNPs (supplementary Table IV). The SNPs within each bin were mapped on the human genome (build 36). If the SNP fell within a gene, we assigned the gene to that bin; if the SNP fell in an intergenic region, we assigned it the closest gene. In the latter case, the SNPs were within 138 Kb of the closest gene on average. For 62 bins, we also pooled adjacent bins into a single locus (supplementary Table V) because they mapped to the same or adjacent neighboring genes, leaving 109 unique loci. For each human gene, we identifi ed the mouse homolog using the Biomart database. A total of 17 loci were removed from the list because the mouse does not have a homolog for the human gene or the locus did not map to a mouse location accurately; i.e., the human SNP fell into an intergenic region and its proximal and distal genes mapped to different mouse chromosomes (supplementary Table VI 1 . Seventy-two loci contained one gene, 18 loci contained between two and six genes, and two loci, both on mouse chromosome (Chr) 8, contained 18 and 30 genes, respectively. The largest locus was an intergenic region of 753 Kb on mouse Chr 10. Mapping these 92 loci onto the mouse map showed that 49 (53%) of the GWA loci were within 5 Mb of a mouse QTL peak. Eight human GWA loci mapped to genes known to be involved in HDL metabolism ( Abca1 , Ppara , Angptl4 , Cpe , Lcat , Lipc , Lipg , and Lpl) for which an in vivo mouse model (knockout, transgenic or spontaneous mutation) shows a difference in HDL level ( Fig. 1 ). We then used the mouse model to obtain supporting evidence for some of the GWA peaks starting with a known HDL gene Mouse evidence for novel genes found in the GWA study: Galnt2 , Wwox , and Cdh13 Three potential HDL genes, whose role in regulating HDL was previously unknown, were identifi ed by the GWA study. These candidates mapped to distal mouse  2D ); the CAST promoter construct displayed greater promoter activity compared with the D2 promoter. The reduced expression in the D2 strain, leading to a decrease in HDL level, is consistent with the Abca1 knockout mouse model that shows lower HDL ( 30 ). Therefore, in the four crosses involving a wild-derived mouse strain, we conclude that Abca1 is the causal QTL gene but that in the two CAST crosses, the causative polymorphism is located in the promoter region and causes an expression difference whereas in the two PERA crosses, an amino acid change causes a functional change in protein activity. Thus, the Abca1 gene has two different mutations, providing an allelic series and strengthening the evidence that Abca1 is the causal QTL gene.
Additionally, we investigated if any Abca1 polymorphisms could also explain the three additional QTL located in the region (B6xC3H, NZBxNZW, and NODxB6.129 apoE Ϫ / Ϫ ). A derived sub-strain of B6 (C57BL/6NJ), C3H, and NOD have been fully resequenced by the Sanger Institute, and the sequences of the B6, C3H, and NOD strains are identical at the Abca1 locus ( Abca1 gene and promoter). In addition, as indicated by the CDG database, NZB and NZW have identical haplotypes. These results indicates that Abca1 is not the QTL gene for the B6xC3H, NZBxNZW, and NODxB6.129 apoE Ϫ / Ϫ crosses and further investigation will determine which gene is responsible for these QTL.  ( 2 ). We updated this list with recent HDL QTL results. A QTL is defi ned as a main effect QTL with a suggestive or signifi cant LOD score as reported in the study. Each QTL peak positions, CIs, strains, sex, diet, and reference study are indicated in supplementary data (supplementary Tables I, II). Gray bars represent the 95% CI of each mouse QTL, black bars indicate QTL peaks. SNPs from the GWA study with a P < 10 Ϫ3 were selected and binned by linkage disequilibrium using r 2 = 0. 5. Horizontal gray lines to the right of the chromosomes represent the 92 selected GWA study hits (bins), lines with arrows represent genes falling within the 95% CI of the HDL QTL. Genes discussed in this report are also indicated in this map: Abca1 (1), Ppara (2) Table XII). The polymorphism segregated between B6 and CAST but not between B6 and CASA. We also identifi ed another nonsynonymous coding polymorphism (P365T) that segregated between B6 and CASA (supplementary Table XII). However, this polymorphism was specifi c to only one transcript and its functionality could not be assessed due to the lack of sequence alignment at that position within SIFT. A QTL gene may have an expression difference or an amino acid difference that changes function; because V186M may affect the function or structure of WWOX , we think that Wwox is a QTL gene for the B6xCAST cross. These results support this gene being associated with HDL metabolism.
For Cdh13 , no difference in expression was observed in our 12-strain microarray survey between B6 and CAST fed chow in males and females ( P > 0.05) ( Table 1 and supplementary Table X). However, use of SIFT showed that one out of the three amino acid changes reported in MPD was characterized as damaging (Y214F).
Therefore, based on the evidence outlined above for these three genes, the B6xCAST cross has polymorphisms in Wwox and Cdh13, and differs in expression at Galnt2 (con-tary Table XI). Because the GWA study genes were among the mouse HDL QTL candidate genes identifi ed by haplotype analysis, we searched for additional molecular evidence for all three genes in the mouse.
For Galnt2 , resequencing did not identify any nonsynonymous polymorphisms but did establish that the haplotypes within the gene are identical between CAST and CASA but different from B6 ( Fig. 3B and supplementary  Table XII). CASA was not in our 12-strain microarray survey, and CASA mice were not available for this study, but both CASA and CAST have the same haplotype in regions surrounding and within the gene ( Fig. 3B ). Therefore, we used CAST as a surrogate for CASA in our expression analysis. In our microarray study, Galnt2 was differentially expressed between B6 and CAST in liver of females and males on a chow diet (supplementary Table X). Real-time PCR verifi ed these results in females (1.27-fold decrease in expression in B6 compared with CAST, P = 0.01) ( Table  1 ). This result is consistent with recent in vitro results showing that inhibiting Galnt2 expression increases HDL ( 11 ); B6 is the high HDL strain at this locus and showed lower expression of Galnt2 compared with CAST/CASA.
For Wwox , we performed real-time PCR between B6 and CAST on chow in males and females but did not fi nd any difference in expression ( P > 0.05) ( Table 1 ). However, by  Table IX. E1550K was predicted to be damaging. Expression difference for Abca1 was evaluated between CAST, D2, and 129 by quantitative PCR (C). RNA was extracted from liver tissue from 16-week-old males on a high-fat diet. Promoter activity difference was evaluated in CAST and D2 (D). Both constructs (CAST and D2) were cotransfected with Beta-galactosidase (bGAL), liver X receptor alpha (LXRa), and retinoid X receptor alpha (RXRa) plasmids into HEK 293 cells. LXRa and RXRa ensured transcriptional activation of Abca1 and were activated with ligands 22-hydroxycholesterol (22OH, 10 µM) or 9-cis -retinoic acid (9cRA, 10 µM), respectively, or with both ligands. The results summarize four replicates for each construct. The CAST promoter exhibited higher transcriptional activity compared with that of D2. * P < 0.001, † P < 0.05 (C, D).
We fi rst examined the gene expression differences for these fi ve genes between the QTL parental strains. Myo1 h and Ube3b had no expression differences among any QTL strains in our 12-strain microarray survey ( P > 0.01) (supplementary Table X). Real-time PCR on Mvk, Mmab , and Kctd10 showed no signifi cant expression differences between the QTL strains ( Table 2 ) except for Mmab in a single cross, NZBxNZW. Mmab and Mvk are positioned in the opposite direction with only 280 bp between their respective 5 ′ UTR. By resequencing, we identifi ed a G/A polymorphism 62 bp upstream of Mvk that was specifi c to NZW ( Fig. 4B ). However, we could not identify a transcription factor binding site at this location. Because we found no difference in expression for four of these genes and the expression difference for Mmab could only explain one cross (NZBxNZW), we investigated the possibility of a nonsynonymous coding polymorphism that could explain the QTL.
We resequenced the coding regions of all fi ve genes in NZB, 129, B6, SM, and NZW (supplementary Table XIII). We did not identify any nonsynonymous coding polymorphisms in Myo1 h , Kctd10 , Ube3b , and Mmab . Mvk contained two nonsynonymous polymorphisms, I207V and F216Y ( Fig. 4C ). Neither polymorphism is predicted to change the function of the protein (Polyphen/SIFT) but both are located in a highly conserved region. F216Y changes the polarity of the amino acid and segregates between the parental strains of all crosses. Based on this evidence, we think Mvk is the most likely causal gene for the crosses, sistent with the broad QTL and large 95% CI). The B6xCASA cross differs in expression at Galnt2 , whereas the effect of P365T in Wwox is yet to be determined for this cross.

Using mouse genetics as a tool to resolve regions with multiple genes
MVK/MMAB locus. The human GWA identifi ed a locus on human chromosome 12 at 108 Mb (12q23-24) encompassing fi ve genes: MYO1H , KCTD10 , UBE3B , MVK , and MMAB ( P = 1.88 × 10 Ϫ 6 ). These genes cannot be dissociated through GWA studies or replication in additional human populations because of the underlying high linkage disequilibrium in the region. The issue is which gene is the actual HDL gene? Homologous genes in the mouse are located in the same sequential order on Chr 5 at 114 Mb. This region in the mouse has multiple QTL, some with multiple peaks. The most relevant crosses, which we selected based on peak location (within 5 Mb of the gene locus), are B6xNZB (peak, 113 Mb, LOD = 3.6), NZBxSM/J (SM) (peak, 113 Mb, LOD = 5.6), NZBxNZW (peak, 115 Mb, LOD = 12.7), and B6 × 129 (peak, 114 Mb, LOD = 3.3) ( 29,(33)(34)(35) ( Fig. 4A ). NZB and 129 carry the high HDL allele, strains B6, NZW, and SM carry the low HDL allele. Overlapping QTL could be due to different genes. For this particular locus, however, we assumed that the same gene was responsible for all four crosses because three of these crosses had one mouse strain in common (NZB), suggesting that a polymorphism specifi c to this strain was responsible for the QTL.  CASA. b Gene expression differences between the QTL parental strains. All quantitative PCR performed regarding both loci are reported here. Quantitative PCR was performed on cDNA from liver samples from at least three mice for the most relevant strain/diet/sex. Quantitative PCR was performed for Abca1 , Galnt2 , and Wwox using ␤ 2 microglobulin as the endogenous control. The microarray result is indicated for Cdh13 (probe 1431824_at). The fold change is indicated as the gene expression of the high allele strain (bold) compared with the low allele strain.
c For Cdh13 , amino acid substitutions were determined using the high-density SNP component of the MPD database and the imputed SNPs from the CGD with a minimum confi dence level of 0. 9. For Galnt2 and Wwox , amino acid substitutions were determined by resequencing of B6 and CAST. Functionality was determined by using Polyphen and SIFT. Conservation was determined by comparison with mammals at the UCSC website. For Abca1 , the amino acid substitutions are arranged in two groups: the crosses involving CAST and the crosses involving PERA. Additional information on the sequencing results is available in supplementary Table IX and XII. lished work from our laboratory has tentatively identifi ed the gene for the B6xA QTL and it is not located in this cluster; therefore, we focused on the B6xBALB cross. By haplotype analysis, we eliminated Fen1 as a likely candidate gene because the haplotypes within the gene were identical in both B6 and BALB. We judged Gm98 , 1810006K21Rik , and Fads3 to be unlikely candidates because none of these three genes had nonsynonymous polymorphisms between B6 and BALB (based on SNP databases) and none showed a difference in liver expression between B6 and BALB, based on our 12-strain microarray database (supplementary Table X). Therefore, we focused our attention on Fads1 and Fads2. We identifi ed a nonsynonymous SNP in Fads2 between B6 and BALB by resequencing, V163I (supplementary Table XIV). However, we do not think it is responsible for the QTL because the amino acid change does not change polarity or charge, it is not located in a conserved region, and it was considered nonfunctional (Polyphen/SIFT). Using the 12-strain microarray database, we found a difference in expression between B6 and BALB for Fads1 and Fads2 (supplementary Table XIV), which we confi rmed by real-time PCR ( Table 2 ). Expression of both Fads1 and Fads2 was higher in BALB compared with B6 and both genes are involved in the same biological pathway. Further work in humans and mice is required to determine whether Fads1 or Fads2 (or both) is the QTL gene, but using the mouse model did help to narrow the focus of future work from six to two genes.

DISCUSSION
Recent human GWA studies have identifi ed well-known HDL genes such as ABCA1 , LIPG , LPL , and LIPC ( 3-10 ), as well as new genes such as GALNT2 . However, GWA studies may not detect HDL genes because some GWA peaks are only suggestive and others map to haplotype blocks with multiple genes. Because 90% of the HDL QTL overlap between humans and mice ( 2 ), we suggest an innovative strategy based on mouse genetics to help decipher and add support to the results of human GWA studies. We fi rst showed a strong similarity between results of a human GWA study from Kathiresan et al. ( 5 ) and the mouse HDL QTL map ( 2 ). When a gene identifi ed by a human GWA study mapped within a mouse HDL QTL at a homologous location, we compared parental strains of the mouse QTL cross for additional bioinformatic and molecular evidence. We demonstrated this strategy as a proof of principle to a well-known HDL gene ( ABCA1 ) and a newly identifi ed HDL gene ( GALNT2 ) ( 11 ). We then applied this strategy to candidate genes for two other categories of GWA study results: 1 ) when the new gene reaches only a suggestive P -value ( WWOX and CDH13 ), and 2 ) when the identifi ed locus contains more than one gene ( MVK/MMAB and FADS loci).
In this report, we explain and provide examples of our strategy to add evidence to human loci identifi ed at the signifi cant and suggestive level by GWA studies by using a mouse bioinformatics toolbox ( 13,14 ). Our analyses are based on recommendations by the Complex Trait Consortium, which in 2003, published criteria for determining due to the presence of a conserved amino acid change (F216Y) that segregates between high and low HDL strains. Although we have no proof that this amino acid changes the protein function, we suggest that further testing may show a causal functional change. This result illustrates how merging crosses can help narrow QTL and pinpoint potential causal genes and polymorphisms.  ( 36,37 ). B6 was the high allele strain in the B6xA cross and the low allele strain in the B6xBALB cross, so the QTL in these two crosses are most likely not caused by the same gene. Other unpub- candidate gene. However, three of the four crosses that detected this QTL were found in mice fed a high-fat, atherogenic diet; the Mvk Ϫ / Ϫ mouse needs to be tested on such a diet. Mouse models for Cdh13 , Fads1 , and Fads2 have not been tested for HDL cholesterol levels, but based on the GWA study and our bioinformatics results in the mouse, the genes should be prioritized for further study. CDH13 encodes T-cadherin and binds to LDLs ( 41 ). The mechanism and the physiological consequences of this interaction are not completely understood but the binding of CDH13 to LDL could play a role in HDL metabolism. Fads1 encodes the delta 5 desaturase in fatty acid metabolism, Fads2 encodes the delta 6 desaturase. These two genes are part of the same pathway that produces arachidonic acid from linoleic acid. In the mouse, both genes have a similar expression difference between the strains that detected the QTL. As a result, no choice can be made between these two genes without further work. In fact, it may be that a single regulatory polymorphism causes the expression difference for both genes. Because there are two possible QTL genes and because they are so closely related in function, one or both could play a role in HDL metabolism. Molecular evidence in human populations that suggests cis -regulation of WWOX ( 42 ) and FADS1 ( 7 ) is confi rmed by our results in the mouse. The direction of the difference in expression matches what we found in the mouse for both genes. For instance, in humans, FADS1 was positively correlated with HDL levels whereas in the mouse, a higher expression of the gene was observed in the high when a gene is a QTL gene ( 38 ). They recommended using several independent lines of evidence for each gene. Within each mouse QTL, we use haplotype analysis between parental strains of QTL crosses. We then use the high-density SNP databases and expression profi le databases among multiple inbred mouse strains to add evidence to the genes located within each QTL ( 13,14 ). To consider an HDL gene as a viable candidate, we require it to have either a molecular difference, such as a nonsynonymous coding difference that affects either the structure/function of the protein ( Abca1 , Wwox , Cdh13 , and Mvk ), or a difference in gene expression level ( Abca1 , Galnt2 , Mmab ). A current limitation of our bioinformatics toolbox is the requirement to further investigate alternative splicing and posttranslational modifi cation differences between the parental strains in mice. Both of these are possible causes of QTL genes, but databases today do not have suffi cient information for a bioinformatic approach. Until such information is available, these possibilities must be examined with sequencing or with direct measurement of protein levels with antibodies.
Among the candidate genes investigated in this report, in vivo mouse models for some of the genes are available. For instance, the Wwox knockout mouse has previously been shown to exhibit hypercholesterolemia ( 39 ); HDL was not measured, but hypercholesterolemia in a chow-fed mouse would indicate increased HDL because 80% to 90% of cholesterol in a chow-fed mouse is HDL. The

Mvk
Ϫ / Ϫ mouse does not have elevated cholesterol when fed chow ( 40 ), a fact that does not support Mvk as the quencing before rejecting the gene as an HDL gene. In addition, we have evidence for Acads as a candidate gene for this locus in the mouse ( 48 ). Acads is located just 2 Mb away from Mvk/Mmab in the mouse but more than 10 Mb from these genes in humans. The human GWA study did not point to Acads. Therefore, it is possible that both genes ( Acads and Mvk ) are involved in HDL regulation in mice but that only the MVK/MMAB locus is involved in humans.
Although we expect the genes involved in HDL cholesterol to be the same among species, the molecular mutation that causes the QTL may differ within and among species. For instance, in this report, we found that a nonsynonymous coding polymorphism in Abca1 was responsible for the QTL from the CAST crosses on Chr 4 whereas a difference in expression of Abca1 was responsible for the QTL from the PERA crosses. In addition, Lipg is accepted as a known HDL gene, but the molecular basis of observed association and linkage with HDL vary from a nonsynonymous coding polymorphism in humans ( 49 ) to a difference in expression of the gene in mice and baboons ( 29,50 ).
A limitation of our strategy is the existence of important species differences in HDL metabolism, such as the absence of the CETP gene in the mouse. CETP produces the exchange of cholesteryl ester between HDL and VLDL, which participates in HDL degradation in humans. This difference could explain the fact that some human loci were found outside a mouse HDL QTL. However, we also HDL strain ( 7 ). Additionally, at the MVK/MMAB locus, cis -expression of MMAB has been observed in some ( 7,43,44 ) but not all (45)(46)(47) human populations, and thus has been implied as responsible for HDL differences in some of these studies. We did identify an Mmab expression difference consistent with observations in humans in one mouse cross (NZBxNZW) ( 7 ) but not in the other crosses. Instead, the mouse shows stronger evidence for Mvk (as opposed to Mmab ) based on a nonsynonymous coding polymorphism that may affect the function or structure of the protein. Perhaps both genes are involved in HDL regulation, but during evolution, one gene mutated in mice and the other mutated in humans. Our approach is based on the hypothesis that a QTL for a trait in the mouse that maps to the homologous location as a QTL or GWA peak for the same trait in humans is most likely caused by the same gene. This is a reasonable hypothesis and has been used to fi nd the causative genes for many traits. However, if two genes are closely linked to each other and have the same or similar functions, then a mutation in one could affect HDL in humans and a mutation in the other could affect HDL in mice. To our knowledge, neither MMAB not MVK has been screened for coding mutations in humans. Therefore, although there is expression evidence for MMAB in human populations, the possibility of deleterious mutations affecting the structure and function of the MVK protein should be investigated in humans by rese-  strain. b Gene expression differences between the QTL parental strains. All quantitative PCR performed regarding both loci are reported here. Experiments were performed in liver samples from at least three mice for the most relevant strain/sex/diet combination using ␤ 2 microglobulin as the endogenous control. The fold change is indicated as the gene expression of the high allele strain (bold) compared with the low allele strain. Only females were used to verify the microarray results provided in supplementary Table X. c Sequencing results are available in supplementary Tables XIII and XIV. Functionality was determined by using Polyphen and SIFT. Conservation was determined by comparison with mammals at the UCSC website. recently showed that the underlying genes for HDL QTL are likely to be the same between rats and mice ( 51 ). Therefore, we think that comparative genomics is a very powerful approach to identify candidate genes for complex traits such as HDL.
To conclude, by combining bioinformatics tools with limited laboratory experiments in the mouse, our humanmouse strategy can add support to human GWA results. This approach is advantageous with phenotypes for which mouse QTL results are publicly available, such as triglyceride level, bone mineral density, or other complex traits. We have provided here our current list of HDL QTL as supplementary material. When the mouse homologous gene from a GWA gene is located under a mouse QTL for the same phenotype, one can apply the bioinformatics toolbox and gather evidence before progressing to more targeted studies using animal models, such as knockout and transgenic mice, which are time consuming and expensive but offer great advantages for studying the direct effect of a candidate gene in vivo. Importantly, our humanmouse comparative strategy represents a fast pragmatic approach to help human geneticists extend and clarify GWA studies that identify candidate genes for other complex traits.