Identification of four novel genes contributing to familial elevated plasma HDL cholesterol in humans.

While genetic determinants strongly influence HDL cholesterol (HDLc) levels, most genetic causes underlying variation in HDLc remain unknown. We aimed to identify novel rare mutations with large effects in candidate genes contributing to extreme HDLc in humans, utilizing family-based Mendelian genetics. We performed next-generation sequencing of 456 candidate HDLc-regulating genes in 200 unrelated probands with extremely low (≤10th percentile) or high (≥90th percentile) HDLc. Probands were excluded if known mutations existed in the established HDLc-regulating genes ABCA1, APOA1, LCAT, cholesteryl ester transfer protein (CETP), endothelial lipase (LIPG), and UDP-N-acetyl-α-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 2 (GALNT2). We identified 93 novel coding or splice-site variants in 72 candidate genes. Each variant was genotyped in the proband’s family. Family-based association analyses were performed for variants with sufficient power to detect significance at P < 0.05 with a total of 627 family members being assessed. Mutations in the genes glucokinase regulatory protein (GCKR), RNase L (RNASEL), leukocyte immunoglobulin-like receptor 3 (LILRA3), and dynein axonemal heavy chain 10 (DNAH10) segregated with elevated HDLc levels in families, while no mutations associated with low HDLc. Taken together, we have identified mutations in four novel genes that may play a role in regulating HDLc levels in humans.

Other covariables such as age, sex, BMI, medical history, alcohol intake, and smoking history were available for all individuals.

DNA sequencing and data analysis
Genomic coordinates for the exons of the 456 genes were compiled by querying the Ensembl database (v56) using the perl application program interface. Proband DNA was enriched for each exon and at least 50 bp of adjacent intron sequence using SureSelect bead technology (Agilent, Santa Clara, CA). Captured sequences from a pool of fi ve proband DNA samples (24 pools for HHDL samples and 16 pools for LHDL samples) were then pooled in equimolar amounts and sequenced together (thus sequencing of 200 individuals consisted of 40 sequencing runs) using nextgeneration paired-end read sequencing (Illumina, San Diego, CA) ( 18 ). For each sequencing run of fi ve pooled DNA samples, we obtained an average 246 ± 37-fold sequencing coverage per sample pool, or 49 ± 7-fold coverage per proband DNA sample.
Sequence changes in sample pools were identifi ed by alignment of sequence data to the human genome (NCBI build 36.1) using the CASAVA v1.7 software (Illumina). Identifi ed sequence changes were characterized using the Ensembl v56 database and the perl application program interface as synonymous, missense, nonsense, or splice site variants.
Sequence changes of interest were confi rmed by standard fl uorescent dye terminator chemistry sequencing (Beckman Coulter Genomics, Danvers, MA and SeqWright, Houston, TX) and analyzed using Sequencher v4.7 (Gene Codes Corporation, Ann Arbor, MI). For each sequence change, standard Sanger sequencing was performed on each of the original proband DNAs comprising the fi ve pooled samples to identify the specifi c carrier individual. In all cases, data analyses were performed by scientists blinded to the phenotypes of the sequenced samples. Exonprobe and primer sequences are available upon request.

Segregation analysis of mutations in families and statistical analyses
A total of 685 family members of 59 probands with sequence changes of interest were genotyped using standard Sanger sequencing techniques described above for 93 SNPs in 72 genes. Only the variant found in a specifi c proband was genotyped in the proband's family. After genotyping, only variants where genotypes from six or more family members were available, and of these family members, at least three mutation carriers identifi ed were included in the statistical analyses for segregation.
Linear regression was performed on each variant for HDLc level, fi tting age, sex, and BMI as covariables. The ␤ -statistic indicating the effect-size and direction of the mutation on HDLc was used to recode alleles to H (for increasing HDL) and L (for decreasing HDL). Family-based analysis was performed on the recoded alleles using within-and between-family-based association tests for each gene, noting that most genes were represented by a single variant, using the family-based-associated test function of PLINK and permuting the data 1,000,000 times to obtain the empirical P values for significance ( 19 ). Each gene was considered an independent test, as different samples and families were used for the analysis (different variants were genotyped in different families), and thus not amenable to multiple-testing correction. An empirical P value of <0.05 was deemed signifi cant.
One signifi cant limitation of identifying novel mutations that result in large changes to HDLc is that they are frequently extremely rare (i.e., <0.1% in the general population), which precludes testing for associations. Thus assessing the segregation of genetic variations with apparent Mendelian forms of extreme HDLc in families is a useful and well-validated method to examine whether novel rare mutations associate with HDLc phenotypes ( 6,9,(12)(13)(14)(15).
Recent advances in sequencing technology allow parallel sequencing of many genes and even whole exomes or genomes across populations. However, a major challenge to this approach is identifying those few mutations that associate with large changes in HDLc, or any other complex trait, from the many additional identifi ed unrelated sequence variations. We propose here that novel mutations that underlie extreme HDLc levels can be distinguished from other sequence variations found by next-generation sequencing by initially assessing their segregation in families with extreme HDLc levels.
Here, we sequenced the exons and exon/intron boundaries of 456 genes that potentially infl uence HDLc levels in 200 unrelated individuals, including 80 with low HDLc (LHDL; р 10th percentile) and 120 with high HDLc (HHDL; у 90th percentile ). We then employed a series of data analysis fi lters to identify rare novel sequence variants that are enriched in either the extreme high or low HDLc phenotype and are expected to have large impacts on protein function. Finally, we performed segregation analyses of these sequence variations across family members of the probands, recruiting a total of 59 families (constituting 685 individuals), and identifi ed mutations in four novel genes that may play a role in elevated plasma HDLc levels.

Probands
We identifi ed 80 unrelated Dutch Caucasian probands with LHDL and 120 unrelated probands of Dutch Caucasian ancestry with HHDL, based on age-and sex-specifi c Lipid Research Clinic data and as described previously ( 6,16 ) with no other abnormal lipid measures. We also studied 685 family members of 59 probands with mutations (59 pedigrees). Study protocols were approved by the ethics committees of the Academic Medical Center, Amsterdam and the University of British Columbia, Vancouver. All subjects provided written informed consent. Lipoprotein measurements were performed on fresh plasma as described ( 17 ). Cholesterol and triglyceride levels were determined in total plasma and plasma at d < 1.006 g/ml obtained after preparative ultracentrifugation and before and after precipitation with dextran manganese. known coding or splicing mutations in ABCA1 , APOA1 , or LCAT were selected as part of low-HDLc screening cohort (LHDL ; Table 1 ). Separately, from an initial cohort of 171 unrelated Dutch probands ( 9 ), 120 probands with HDLc у 90th percentile with no known coding or splicing mutations in CETP , LIPG , or GALNT2 were selected as part of high-HDLc screening cohort (HHDL; Table 1 ). No other major lipid abnormalities or other confounding factors like severely elevated BMI, diabetes mellitus, hypertension, extensive medical history or medication use, or excessive alcohol, smoking, hormone replacement therapy, or other drug use were reported by these probands. An overview of the study design is shown in Fig. 1 .

Selection of genes for sequencing
We manually curated and prioritized 456 genes for sequencing from multiple datasets that included genes or sets of genes identified from, or implicated in: A ) genetic regions with suggestive linkage (Log of odds score у 2.0) with HHDL or LHDL in families (data not shown); B ) HDLc regulation reported in published genome-wide association studies ( 4,20 ); C ) significant expression changes in an in vitro APOA1 siRNA screen in HepG2 cells ( 21 ); D ) in silico liver centric Bayesian network analysis for five well-characterized genes related to HDL metabolism [ APOA1 , ABCA1 , CETP , scavenger receptor class B member 1 ( SCARB1 ), and LIPG ] ( 22 ); E ) significant expression changes in Apoa1 knockout mice ( 23 ); and F ) direct and indirect literature support for roles in HDL regulation or metabolism not reported in genome-wide assiciation studies (GWAS) , in addition to paralogs of select genes with literature support (supplementary Tables I, II).
In the union of these gene lists from A to F , we selected 450 genes for sequencing. We also included ABCA1 , APOA1 , LCAT , CETP , LIPG , and GALNT2 as internal controls to identify any additional mutations not detected by our previous sequencing efforts in the 200 selected probands ( 6,9 ). Supplementary Table III lists the total 456 genes sequenced in the 200 probands.

Validation of sequencing data
To validate that sequence variations were readily detected in sample pools, where each sample pool constituted fi ve proband DNA samples, we assessed the presence of known and presumably benign sequence variation from previous standard sequencing of ABCA1 , APOA1 , and LCAT in LHDL probands, and CETP , LIPG , and GALNT2 in HHDL probands ( 6,9 ). Of 97 SNPs that were either noncoding or synonymous sequence variations previously detected in these probands, all 97 were readily detected in the sample pools, including 29 of 97 SNPs (29.9%) where the minor allele was present in no more than one proband per sample pool, indicating that sequence variation present in single individuals is readily detected from pooled sample sequence data (supplementary Table IV).
We also identifi ed additional mutations not previously detected by standard sequencing, including six within the LHDL probands ( APOA1 L202P in one proband, LCAT V371M in one proband, L338H in two probands, and T147I in two probands) ( 6 ), and four within the HHDL probands ( LIPG N396S in three probands and G196R in one proband) ( 9 ). These mutations were inadvertently missed during our standard sequencing of control genes in the extreme HDLc probands, but were confi rmed by reviewing the initial standard sequencing chromatograms, demonstrating again that rare mutations are readily detected by our experimental approach.
No CETP , LIPG , or GALNT2 mutations were found in LHDL probands, and no ABCA1 , APOA1 , or LCAT mutations were found in HHDL probands. Our fi nal discovery cohorts therefore constituted 74 probands for LHDL and 116 for HHDL.

Identifi cation of novel sequence changes
As we could detect known SNPs and mutations in known HDLc-regulating genes in these probands, we next searched for novel mutations in the remaining 450 genes to identify novel sequence variants that potentially underlie extreme HDLc levels ( Fig. 2 ). To reduce the identifi cation of false variant calls, the following quality control fi lters were applied: A ) at least 75-fold sequence coverage per sample pool of fi ve probands (at least ‫ف‬ 15-fold per proband DNA); B ) a minor allele frequency >0.05 in sample pools; C ) detection in pooled high quality sequence data: major allele BaCON score >10, minor allele BaCON score >6, and BaCON score ratio <15:1 (the BaCON score is generated by Illumina and represents the discrepancy of variants. A score of р 6 indicates variants below the threshold of detection) ( 24 ); and D ) found only in LHDL or HHDL sample pools ( Fig. 2 ).
As we anticipated that mutations underlying large changes in HDLc levels would be extremely rare in the general population (i.e., <0.1% or unique to individual families), we next prioritized sequence changes that were absent from database of SNPs (dbSNPs) (build 130). We then prioritized those sequence changes that were predicted to result in nonsense mutations, splicing defects in the fi rst or last 2 bp of an intron (Ensembl essential splice), or nonsynonymous amino acid substitutions predicted to be damaging according to the sorting intolerant from tolerant (SIFT) algorithm ( 25 ). We next prioritized those genes where all remaining sequence variation in a given gene was found in LHDL or HHDL probands only. Finally, all remaining sequence variation was confi rmed to be "possibly damaging" or "probably damaging" by the Poly-Phen-2 algorithm ( 26 ), and sequence data in opposite phenotype sample pools was reviewed to confi rm that no potentially deleterious sequence variations were present (i.e., SNPs predicted to be deleterious by PolyPhen-2 but not SIFT). Using this approach, we identifi ed a total of 136 variants with a putative role in HDLc regulation, 20 sequence changes in 20 genes from LHDL probands, and 116 sequence changes in 59 genes from the HHDL probands ( Fig. 2 ).
Confi rmation of the presence of the 136 identifi ed variants was performed by standard Sanger sequencing in the original probands ( Fig. 2 ). Of the 136 variants identifi ed, 93 changes in 72 genes were confi rmed (68%). The changes and genes are shown in supplementary  Table V.

Segregation of novel sequence changes with HDLc phenotypes
To assess the likelihood of the 93 confi rmed variants underlying the extreme HDLc levels in families, we genotyped each mutation in all available family members of the originating proband. A total of 685 individuals from 59 families (representing 59 separate pedigrees) were genotyped for specifi c mutations found only in the respective proband. To provide suffi cient statistical power for further analyses, a given variant was required to be present in at least three family members, and absent in at least three additional family members in the same pedigree. Only 33 of the 93 variants passed this fi lter. For these 33 variants, the average size of each pedigree was 22 ± 16 and consisted of a total of 627 family members in 34 pedigrees. This indicates that some of the 60 remaining novel variants identifi ed cannot account for the extreme HDLc phenotype observed in a given family, as insuffi cient affected family members would be carriers, while others would be bona fi de HDLc modulating mutations for which insuffi cient statistical power was available to assess segregation.
Nevertheless, mutations in several genes were identifi ed to segregate signifi cantly with extreme HDLc levels, as determined by family-based association analysis. For LHDL, we did not identify any novel mutations that signifi cantly segregated with HDLc percentile. For HHDL, we identifi ed new mutations in glucokinase regulatory protein ( GCKR ), RNase L ( RNASEL ), leukocyte immunoglobulin-like receptor 3 ( LILRA3 ), and dynein axonemal heavy chain 10 ( DNAH10 ) ( Table 2 ; Fig. 3 ). For GCKR , the mutation R232Q was found in two unrelated probands, while R518W was found in a third; both mutations fall within sugar isomerase domains. For RNASEL , a G179R mutation was found in one proband while E265X was found in fi ve unrelated probands; these mutations occur in the fi fth and seventh ankyrin domains, respectively. For LILRA3 , an IVS1+0T>C mutation was found in two unrelated probands, which is predicted to disrupt splicing within the N-terminal coding sequence ( Fig. 3 ). Finally, for DNAH10 , one mutation was identifi ed near the C terminus, IVS63+0 G>A, an essential splice site mutation in one proband. Of these probands, only those with pedigrees with three or more carriers and three or more noncarriers underwent statistical analyses. In all genes, carriers of the mutations had signifi cantly elevated HDLc percentiles when compared with noncarrier family members. Table 2 summarizes the HDLc phenotypes of mutation carriers and non carriers in families.

DISCUSSION
Here we combined next-generation sequencing of 456 genes having known or putative roles in HDL metabolism with Mendelian family-based analyses of 59 different families and 685 total family members. As a result, we identifi ed mutations in four genes, GCKR, RNASEL , LILRA3 , and DNAH10 , that signifi cantly segregate with increased plasma HDLc in families. Our goal was to identify novel and extremely rare nonsynonymous, nonsense, and splice mutations with large functional impacts on HDLc, most of which are not captured by current genome-wide association strategies. Although extreme HDLc in the absence of other lipid abnormalities was the only phenotype considered here, clearly this strategy could be applicable to identifying mutations for other extreme lipid traits.
Overall, six mutations in seven probands (with 120 family members) in four novel genes that are in coding regions or exon/intron boundaries, not present in dbSNP (build 130), predicted damaging by in silico algorithms showed signifi cant segregation with elevated HDLc in families. We previously reported that in the initial cohort of 171 unrelated HHDL probands, we found 22 with LIPG mutations (12.9%, inclusive of those mutations described here), 1 with a CETP mutation (0.6%), 4 with GALNT2 mutations (2.3%), 2 with APOC3 mutations (2.3%) ( 27 ), and 2 with SCARB1 mutations (2.3%) ( 9,28,29 ). Taken together, we have identifi ed to date a total of 43 of 171 HHDL probands with mutations (25.1%), making this the highest frequency of mutations identifi ed to date in a HHDL cohort. Of note, the RNASEL E265X mutation was found in 5 of 171 HHDL probands (2.9%), raising the possibility of testing for associations with HDLc levels in large populations with this mutation.
Because our goal was to identify only those mutations that were novel, likely deleterious, and exclusive to one phenotype, nonsynonymous mutations predicted to be benign by at least one in silico analysis were discounted here. Indels were also excluded, as >75% of these sequence changes detected from next-generation sequencing data using algorithms available to us were not detected by Sanger sequencing . We also excluded variants present in dbSNP at low frequency, although this likely excluded a subset of bona fi de HDLc-modulating mutations. This step also importantly excluded common sequence variation that was unlikely to underlie large changes in HDLc levels. Mutations that were suppressed by strong mutations of the opposite phenotype were also excluded. For example, we previously showed that the HDLc-elevating phenotypes of  LIPG N396S and SCARB1 S112F mutations may be suppressed by ABCA1 mutations IVS24+1G>C and V2091I, respectively ( 9,28 ). Finally, we also did not assess the association of common polymorphisms in these 456 genes with HDLc phenotypes, although it is worth exploring in concert with additional populations. Clearly, false negatives and positives will inevitably arise from our initial approach, refl ecting the challenges of understanding the genetics of complex traits, and requiring subsequent analysis strategies for mutation discovery. Evolving bioinformatics tools and statistical methods will also continually afford identifi cation of additional mutations with potential functional consequences.
In addition to the novel mutations identifi ed here, we identifi ed both APOC3 and SCARB1 mutations ( 4,(27)(28)(29) that have previously been associated with both loss-of-function and elevated plasma HDLc in humans, thus offering validation for our approach. We also identifi ed additional mutations in APOA1 (L202P), LCAT (V371M, L338H, and T147I), and LIPG (N396S and G196R) that were not detected in the original Sanger sequencing, providing further validation. We note that only 93 of 136 (68.4%) prioritized sequence changes predicted by next-generation sequencing were confi rmed in the Sanger sequencing, possibly refl ecting that our initial quality fi lters were not stringent enough.
While GCKR, LILRA3 , and DNAH10 have been implicated by GWAS to modulate HDLc ( 4,20 ), nonsynonymous mutations segregating with an elevated HDLc trait have not been reported in these genes. Our observations support previous studies that implicate novel genes in HDLc regulation and validate the use of next-generation sequencing and family-based segregation approaches to identify novel mutations.
GCKR was included in our 456 gene list because a variant in GCKR showed genome-wide signifi cant association with HDL in published GWAS ( 20,30,31 ). We identifi ed two missense mutations, R232Q and R518W, in three unrelated probands (R232Q was found in two probands). GCKR is a regulatory protein that inhibits glucokinase (GK) in liver and pancreatic islet cells. It participates in the modulation of GK activity and location by binding free cytoplasmic GK. As glucose levels decline, GCKR moves GK into the nucleus, where it is held in reserve in an inactive form ( 32 ). As glucose and insulin levels rise, GK is released from GCKR and moves back to the cytoplasm. It is localized widely, with highest expression in the liver, testes, ovaries, and adipocytes (Illumina human body map).
The identifi cation of GCKR as an HDLc gene suggests a novel link between genes infl uencing HDLc levels and glucose metabolism.
LILRA3 was included in our 456 gene list because a SNP was previously identifi ed showing genome-wide signifi cant association with elevated plasma HDLc levels ( 4,33 ). We identifi ed one mutation, IVS1+0T>C, an essential splice site mutation, in two probands in this gene. Genotyping did not identify three or more carriers in the pedigree of one proband. However, the mutation segregated significantly with elevated HDLc in the pedigree of the second proband. LILRA3 , also known as CD85 antigen-like family member E (CD85E), is expressed predominantly on monocytes and B cells and at lower levels on dendritic cells and natural killer cells. It acts as a soluble receptor for class I major histocompatibility complex antigens, binding both classical and nonclassical human leukocyte antigen class I molecules ( 34 ). While its specifi c role in regulating plasma HDLc is not known, its presence is associated with chronic infl ammatory conditions such as rheumatoid arthritis, suggesting that it may affect HDLc levels indirectly through regulation of chronic infl ammation ( 35 ).
DNAH10 was included in our 456 gene list because it was supported by GWAS ( 20 ). We identifi ed one mutation in one proband with HHDL, IVS63+0G>A, an essential splice site mutation, that segregated with elevated HDLc. Dyneins are microtubule-associated motor protein complexes composed of several heavy, light, and intermediate chains. They are found in cilia and fl agella, where they facilitate ATP-driven movement ( 36 ). DNAH10 is an inner arm dynein heavy chain ( 37 ). Mutations in other dynein heavy chain genes cause primary ciliary dyskinesia ( 38 ). DNAH10 protein is expressed in platelets and the liver . However, its role in HDLc metabolism remains unknown.
In addition, we found a novel gene not previously implicated in the regulation of HDLc levels, RNASEL . The role of this gene in HDLc metabolism is unclear. We sequenced RNASEL because of its close proximity to a noncoding variant identifi ed by a GWAS for HDLc (data not shown). It is widely expressed, with high levels in white blood cells, lung, liver, kidney, and adipocytes in humans (Illumina human body map). It functions in anti-viral and antiproliferative roles of interferons through modulating RNA stability ( 39 ), and is part of the body's innate immune defense. RNASEL is likely to mediate its antiviral effects through a combination of direct cleavage of single-stranded viral RNAs, inhibition of protein synthesis through the degradation of rRNA, induction of apoptosis, and induction of other antiviral genes. Mutations in this gene have been associated with predisposition to prostate cancer ( 40,41 ).
We identifi ed two different mutations in the extremely high HDLc probands in our study, E265X, which results in a premature truncation of the protein, and G179R, a missense mutation. The E265X mutation (rs74315364) has been annotated recently in the 1000 Genomes Project, the National Heart, Lung, and Blood Institute (NHLBI) Exome Sequencing Project, and the Exome Chip Project. In the Exome Sequencing Project population, it was found at a minor allele frequency of 0.003 . Despite the low minor allele frequency observed in the Exome Sequencing Project population for this variant, of the 120 unrelated high HDLc probands sequenced here, a surprising 5 (4.2%) carried the E265X mutation, indicating a signifi cant enrichment of this variant in those with extremely high HDLc. No association with plasma lipids has previously been described for RNASEL . Our studies implicate new signaling pathways that may regulate HDL, which may provide a better understanding of both the biological functions of this complex particle.
Importantly, these discoveries would not have been readily possible without the use of Mendelian family-based resources, emphasizing the power of family data to supplement and interpret genome-wide population datasets. Indeed, a recent study used exome array screening of 56,000 individuals to identify a handful of novel mutations with signifi cant effects on plasma HDLc, including mutations in PAFAH1B1 , ANGPTL8 , COL18A1 , and PCSK7 ( 42 ). However, while this approach has clearly identifi ed new genetic factors that underlie HDLc levels in humans, it is not suffi cient to detect extremely rare variants (i.e., <0.1% in the general population) that may be unique to single families. In contrast, by restricting our study to probands with extreme HDLc in whom other confounding factors, such as elevated BMI, complicated medical history and medication use, excessive smoking, alcohol, drug use, and extremes in other lipid parameters are minimized, we have identifi ed four new but separate causes of elevated plasma HDLc levels by screening only 200 unrelated probands and 59 families consisting of 685 family members. Thus both population-and family-based strategies will likely continue to contribute to discovery of new factors underlying plasma HDLc.
The following limitations to our study should be considered. First, as mentioned above, our study refl ects an initial analysis of a very large dataset, and more mutations will likely be identifi ed using additional analysis strategies. For example, additional analyses of putative mutations with low frequency (i.e., 0.1-2% in the general population) in dbSNP or 1000 Genomes are in progress. Second, we lack suffi cient family members to confi rm or rule out statistically many of the SNPs identifi ed here. This limitation is likely to confound many independent researchers using next-generation sequencing approaches and argues for the need for researchers with family-based genetic resources to initiate broad collaborations, replicating the success of very large groups of individuals that identifi ed new genetic susceptibility loci through GWAS of large populations. To this end, the segregation of mutations reported here should be replicated in independent groups. Third, although family members were genotyped for mutations in ABCA1 , APOA1 , LCAT , CETP , LIPG , and GALNT2 that were found in the original probands, these genes were not sequenced in their entirety in nonproband family members. The presence, in family members, of additional mutations not present in the probands may in part explain the presence of phenocopy or nonpenetrant genotypes in the families, thereby lowering our statistical power during segregation analyses. A recent study on Dutch individuals showed that variants in more than one putative HDL gene can be present per individual ( 43 ), refl ecting the challenges of identifying mutations with direct Mendelian inheritance. Fourth, we limited our study to coding regions and adjacent sequence, so rare untranslated region, promoter, and other regulatory mutations are not considered here. In vitro functional studies, in addition to detailed phenotyping of HDL particles from individuals described here, are also desired in order to confi rm the deleterious effects of some mutations described here, particularly in genes not previously implicated in HDLc regulation. Finally, while our study provides strong support for a genetic role for GCKR , RNASEL , LILRA3 , and DNAH10 in HDLc regulation, the biological roles of these four genes in HDLc metabolism, in addition to their clinical signifi cance to cardiovascular and other metabolic diseases, remain targets for future investigation.
In summary, we have performed sequencing of 456 genes in 200 individuals with extreme HDLc levels to identify novel monogenic causes of altered HDLc in humans. Using a family-based Mendelian approach, a total of 59 different families were genotyped to assess segregation of mutations found in probands. Overall, we identifi ed four novel genes with mutations segregating in families with HHDL. We substantially increase the number of known monogenic susceptibility loci for elevated plasma HDLc levels and outline methods that help interpret and prioritize novel mutations associated with a phenotypic trait of interest from large next-generation sequencing datasets.