Novel common and rare genetic determinants of paraoxonase activity: FTO, SERPINA12, and ITGAL.

HDL-associated paraoxonase 1 (PON1) activity is associated with cardiovascular and other human diseases. As the role of genetic variants outside of the PON gene cluster on PON1 activity is unknown, we sought to identify common and rare variants in such loci. We typed 33,057 variants on the CVD chip in 1,362 subjects to test for their effects on adjusted-PON1 activity. Three novel genes (FTO, ITGAL, and SERPINA12) and the PON gene cluster had SNPs associated with PON1 arylesterase (AREase) activity. These loci were carried forward for rare-variant analysis using Exome chip genotypes in an overlapping subset of 1,051 subjects using sequence kernel association testing. PON1 (P = 2.24 × 10(-4)), PON3 (P = 0.022), FTO (P = 0.019), and SERPINA12 (P = 0.039) had both common and rare variants associated with PON1 AREase. ITGAL variants were associated with PON1 activity when using weighted sequence kernel association testing (SKAT) analysis (P = 2.63 × 10(-3)). When adjusting for the initial common variants, SERPINA12 became marginally significant (P = 0.09), whereas all other findings remained significant (P < 0.05), suggesting independent rare-variant effects. We present novel findings that common and rare variants in FTO, SERPINA12, and ITGAL predict PON1 activity. These results further link PON1 to diabetes and inflammation and may inform the role of HDL in human disease.

analyzed due to under-representation of minority samples in this cohort. Medications, including use of statins, were ascertained from pharmacy and medical records, as well as subject report. Current smoking status was by self-report and review of medical records. Descriptive statistics for the cohort are presented in Table 1 .

Genotyping
The four known functional PON1 SNPs, PON1 Q192R , PON1 L55M , PON1 -108C/T , and PON1 -162A/G , were genotyped using previously described methods ( 14,27 ). We genotyped 48,742 SNPs relevant to cardiovascular disease in 1,362 subjects using the Illumina Hu-manCVD BeadChip (CVD chip) ( 28 ). Duplicate genotyping for 34 individuals showed 99.7% consistency in calls for the CVD chip. Subjects were fi ltered out at an individual call rate of < 97%. SNPs were fi ltered with cutoffs of minor allele frequency < 1% (for power considerations), Hardy-Weinberg equilibrium P < 10 Ϫ 4 , or call-rate < 97%. After SNP fi ltering, 33,057 polymorphic SNPs remained for analyses. A subset of the CLEAR cohort enriched for carotid artery disease (CAAD) cases (n = 1,051 subjects, with 933 overlapping with the Illumina HumanCVD BeadChip data) underwent further genotyping of rare coding variants and common variants implicated in genome-wide association studies using the Illumina HumanExome BeadChip (Exome chip) ( 29 ). Of 251,336 Exome chip variants present, 92,253 were polymorphic in these data. All genotyping was performed blinded to phenotype.

PON1 phenotyping
The PON1 AREase activity was measured by a continuous spectrophotometric assay as previously described ( 27 ). AREase activity was measured in triplicate and averaged. AREase activity was utilized as the primary measured outcome of PON gene cluster variation due to its closer correlation with protein levels than hydrolysis rates of the toxic substrates paraoxon or diazoxon.

Common variant analysis
Regression analyses of common variants genotyped on the CVD chip were performed using the PLINK analysis package ( 30 ). PON1 AREase activity was fi rst adjusted for age, sex, current smoking status, and the four major functional PON1 polymorphisms ( PON1 Q192R , PON1 L55M , PON1 -108C/T , and PON1 -162A/G ) through linear regression. These adjusted PON1 activity residuals were further evaluated by linear regression under an additive genetic model.

Rare variant analysis
Methods that pool variants across loci have been utilized for rare SNV association testing due to the limited power of traditional functional PON1 mutations ( 13 ): two missense mutations [ PON1 Q192R (rs662) and PON1 L55M (rs854560)] and two 5 ′ regulatory [ PON1 -108C/T (rs705379) and PON1 -162A/G (rs705381)]. PON1 -108C/T has the largest effect on AREase activity, altering expression likely due to modifi cation of an Sp1 binding site ( 14,15 ). Recent fi ndings within this Carotid Lesion Epidemiology and Risk (CLEAR) cohort attribute approximately 21% of PON1 AREase activity to these four functional PON1 mutations and six additional common variants within the PON gene cluster (including those in PON2 and PON3 ) ( 16 ). Rare deleterious variants have also been identifi ed ( 17 ). In addition, we reported that dietary cholesterol, alcohol, and vitamin C were all positively predictive of PON1 activity, while dietary iron and folic acid predicted reduced PON1 activity in 1,402 subjects from the CLEAR cohort ( 18 ); these factors accounted for an additional 8.2% of PON1 activity when adjusting for age, sex, and current smoking status.
Despite these fi ndings, a large portion of PON1 activity variance remains unexplained. This may be due to genetic variation either at other loci or due to rare single nucleotide variants (SNV) in PON1 . While rare SNVs are well known as causative mutations in Mendelian traits, they are only beginning to be studied in large datasets considering complex diseases or in quantitative phenotypes. Rare SNVs are expected to account for some of the "missing heritability" of traits such as PON1 activity (19)(20)(21)(22). Moreover, rare coding SNVs often affect protein function, as compared with noncoding regulatory variants that affect the levels of protein. Accordingly, we used a two-step process to determine whether common or rare SNVs at other loci contribute to variation in paraoxonase activity. We fi rst considered the associations from the relatively common SNPs on the Illumina HumanCVD BeadChip, and then selected genes nominated by this analysis to consider the effects of rare SNVs typed utilizing the new Illumina HumanExome BeadChip. This was done to limit the number of hypotheses tested and maximize power. We also cataloged the 92 nonsynonymous and 5 ′ -and 3 ′ -noncoding SNVs present in the PON gene cluster in the Exome Sequencing Project (ESP) 6500 data to report their frequencies and evolutionary conservation.

Ethics statement
Institutional Review Boards at the University of Washington, Virginia Mason Medical Center, and Veterans Affairs Puget Sound Health Care approved the study. Written, informed consent was obtained from all participants.

Sample
The study cohort for these analyses consisted of 1,362 subjects from the previously described CLEAR study ( 16,18,(23)(24)(25) who had PON1 phenotype, covariate, and genetic data for the common variant analysis and 1,051 partially overlapping subjects for the rare variant analysis. Ancestry was confi rmed using STRUCTURE with three ancestral groups ( 26 ). Only Caucasian subjects were We explored and summarized the PON1 , PON2 , and PON3 variants in the exomic data of 6,503 ESP individuals, of which 4,300 and 2,203 were of European and African ancestry, respectively. Genomic positions (Hg19) and the reference and alternate alleles on the forward strand were determined for each variant. Conservation for single-base variants was assessed through phylogenetic analysis with space/time model (phastCons) ( 32 ) and genomic evolutionary rate profi ling (GERP) ( 33 ) scores using SeattleSeq SNP annotation (http://snp.washington.edu/ SeattleSeqAnnotation134/). Only variants that were nonsynonymous, 5 ′ -noncoding, or 3 ′ -noncoding are presented, stratifi ed by European ancestry (EA) and African ancestry (AA).

Common variant association with PON1 AREase activity
Demographic measures of the CLEAR cohort are presented in Table 1 . A total of 1,362 subjects had complete CVD chip genotype, covariate, and phenotype data for analyses of common variant association with PON1 AR-Ease activity. The average age was 67.8 years, the subjects were 86.7% male, 18.2% current smokers, and 39.4% statin users. Of the 1,362 subjects, 460 (33.8%) were CAAD cases with greater than 80% stenosis in one or both internal carotid arteries or were postcarotid endarterectomy, 687 (50.5%) were controls with less than 15% stenosis in both carotid arteries, 166 (12.2%) had moderate internal carotid obstruction with 50-79% stenosis, 27 (2.0%) had single-marker association analyses. Such a method, sequence kernel association testing (SKAT), was utilized for testing of SNVs genotyped by the Exome chip ( 31 ) utilizing an R plugin (http:// r-project.org). SKAT utilizes score-based variance-component tests to test for association between SNV sets within a region (with the ability to include both common and rare variants) and a phenotype, while adjusting for potentially confounding covariates in the model. The covariates adjusted for in the primary SKAT analyses of PON1 AREase activity were age, sex, current smoking status, and the four functional PON1 mutations ( PON1 Q192R , Because SKAT is a regional DNA window-based test of association and we wanted to limit our tests, we chose to perform analyses on candidate genes (described in Table 2 ) identifi ed by common variant analyses and with a false discovery rate (FDR) < 0.10 ( PON1 , FTO , ITGAL , and SERPINA12 ). For these gene regions found to not be signifi cant for association with PON1 AREase activity, a second SKAT analysis was performed using adjusted weights that give higher weights to rare variants ( 31 ). To determine whether the rare variant associations were independent of the common variant associations in the same genes, a separate sensitivity analysis was performed in 933 subjects with both Exome and CVD chip genotypes, with the SKAT analysis model adjusted for the common variants in addition to the other covariates.
In addition, we performed a secondary analysis considering the entire PON gene cluster ( PON1 , PON2 , and PON3 ), as variants in PON2 and PON3 have been previously identifi ed to be predictive of PON1 AREase activity in this cohort ( 16 ). Variants in the entire PON gene cluster were considered in a SKAT analysis, with the model also adjusting for six common variants previously reported to predict PON1 activity ( 16 ) (rs854567, rs2299257, rs223783, rs2375005, rs3917486, and rs11768074) in addition to the other covariates.

Exome Sequencing Project PON region variant data
The ESP6500 dataset is composed of individuals obtained from a number of large-scale National Heart, Lung, and Blood Institute (NHLBI) cohorts. It includes exome sequence data of individuals from the Women's Health Initiative, Framingham Heart Study, Jackson Heart Study, Multi-ethnic Study of Atherosclerosis, genotypes of SNPs from the common variant analysis with FDR < 0.10 (see Table 2 ) in addition to age, sex, current smoking status, and the four functional PON1 genotypes. From this analysis, adjusting for common variants, PON1 ( P = 3.42 × 10 Ϫ 3 ) and FTO ( P = 9.52 × 10 Ϫ 3 ) SNVs remained signifi cant for association with adjusted PON1 AREase activity. SERPINA12 became marginally signifi cant ( P = 0.09) when adjusting for common SERPINA12 variants previously identifi ed in this cohort. When adjusting for its common variants, ITGAL SNVs remained signifi cant for association with PON1 activity using adjusted weights ( P = 9.88 × 10 Ϫ 4 ).
Prior research in a male-only subset of this cohort identifi ed SNPs in PON2 and PON3 that predicted of PON1 activity ( 16 ). Thus, we performed a secondary SKAT analysis of the entire PON gene cluster ( PON1 , PON2 , PON3 ) on chromosome 7. In total, there were 13 polymorphic SNVs within the region represented in our cohort, including the seven PON1 SNVs previously analyzed by SKAT (see Table  3 ). When testing for association with PON1 AREase in a SKAT model adjusting for age, sex, current smoking status, and the four functional PON1 mutations, the PON gene cluster was highly associated ( P = 7.10 × 10 Ϫ 4 ), though modestly less signifi cant when compared with the PON1 gene region alone ( P = 2.24 × 10 Ϫ 4 ), suggesting that little predictive power was gained by considering PON2 and PON3 SNVs. To evaluate the role of PON2 and PON3 SNVs, we analyzed PON2 (two SNVs in the gene) and PON3 (four SNVs in the gene) separately and found that while PON3 SNVs were signifi cantly associated with PON1 AREase activity ( P = 0.022), PON2 SNVs were not ( P = 0.76). As the two SNVs present in PON2 (rs12026 and rs7493) in our cohort are in LD (r 2 = 1.0, MAF = 0.246 for both), we ran a separate association analysis considering only rs12026. PON2 SNV rs12026 remained not signifi cantly associated with PON1 activity ( P = 0.65).
Prior work in the CLEAR cohort identifi ed six additional SNPs in the PON gene cluster ( 16 ) (four in PON1 and one each in PON2 and PON3) , excluding the four commonly recognized functional SNPs, that were predictive of PON1 activity. We therefore performed a separate SKAT analysis of PON1 and the PON gene cluster SNV associations with PON1 AREase activity to insure that our results were not due to LD with these previously identifi ed common variants. In this SKAT model, the model included the covariates of the genotypes of these six previously identifi ed SNPs (rs854567, rs2299257, rs223783, rs2375005, rs3917486, and rs11768074), as well as the four functional PON1 mutations, age, sex, and current smoking status. We found that the P values for both PON1 (1.92 × 10 Ϫ 3 ) and the PON gene cluster (9.30 × 10 Ϫ 3 ) were less signifi cant when adjusting for these variants, although still highly predictive of PON1 activity. Only PON1 SNV rs3917503 from the Exome chip data was in LD of r 2 > 0.60 with one of the six previously identifi ed common variants rs2299257 (r 2 = 0.74). These results suggest that, except for rs2299257, the SNVs reported to be associated with PON1 activity here are independent of previously reported common variants. mild carotid obstruction with 15-49% stenosis, and 21 (1.5%) were ascertained for lower extremity peripheral vascular disease (ankle-brachial index less than 0.9).

Exome chip analysis
SKAT analyses of the Exome chip data considered an overlapping group of subjects (n = 933 with both CVD and Exome chip data). The Exome chip genotyping was performed preferentially on CAAD cases (70%, see Table 1 ). The average age was 69.6 years, and these subjects were 89.8% male, 14.9% current smokers, and 52.5% statin users. The mean PON1 AREase activity was 135.39 IU, nearly identical to that of the CVD chip subjects.
To test whether the rare variant association results were independent of the common SNP associations detected in the CVD chip analysis, we ran a secondary analysis in the overlapping subjects with both CVD and Exome chip genotypes (n = 933), adjusting the SKAT model for the Of the 33 SNVs in the ESP6500 dataset, 31 were nonsynonymous, one was 5 ′ noncoding, and one was 3 ′ noncoding (see supplemental Table III). Of the 31 nsSNVs, there was one stop mutation (at position 32), three missense near-splice, and 27 missense SNVs. Only one identifi ed SNV had a MAF greater than 0.01: rs17883013 (Ala179Asp; MAF = 0.018). Twelve and 14 rare SNVs are present only in AA and EA populations, respectively.
As expected, we did not observe all of the reported ESP SNVs in our dataset. Considering the EA ESP variants, we observed 12.2% (10/82) of the coding and none of the 3 ′ -and 5 ′ -noncoding variants identifi ed in the ESP6500 data. Thus, larger studies are needed to determine whether the further variation in the PON gene cluster identifi ed in the Exome Variant Server data is functional.

DISCUSSION
PON1 is a glycoprotein enzyme physically associated with HDL. It demonstrates wide substrate specifi city: from oxidized LDL and its resulting atheroprotective effects, to the metabolism of toxic organophosphates, statins ( 34 ),

Description of the PON gene cluster SNVs in the ESP6500 data
Selection of only nonsynonymous and 5 ′ -and 3 ′noncoding SNVs from the ESP6500 data identifi ed 35 SNVs within PON1 , which are presented in supplemental Table I. Of these 35 SNVs, two and three are 3 ′ -noncoding and 5 ′ -noncoding, respectively. Of the 30 remaining nonsynonymous SNVs (nsSNV), there are two nonsense mutations (at protein positions 21 and 194, respectively), one missense near-splice, and 27 missense variants. Included in the 27 nsSNVs are two of the four functional PON1 mutations, PON1 Q192R (rs662) and PON L55M (rs854560). These two nsSNPs are also the only nsSNPs with MAF > 1% in the ESP6500 data for PON1 . Ten and 16 rare SNVs are present only in AA and EA populations, respectively.
Of the 24 SNVs in PON2 , there were 21 nonsynonymous, three 3 ′ noncoding, and zero 5 ′ noncoding SNVs (see supplemental Table II). Of the 21 nsSNVs, there are two missense near-splice and 19 missense variants. Of these 24 variants, only two nsSNPs are common: rs7493 (Ser299Cys; MAF = 0.247) and rs12026 (Ala136Gly; MAF = 0.246). Seven and 14 rare SNVs are present only in AA and EA populations, respectively.  were positively associated with PON1 activity. Further functional studies will be required to determine both the relationships of FTO and SERPINA12 to paraoxonase biology; these results suggest that the relationship of PON1 activity to diabetes may be driven by loci other than PON1 and be more complex than the prior hypothesis of reduced PON1 in diabetes due to oxidative stress ( 54,62 ).
In contrast to FTO and SERPINA12 , ITGAL is not directly related to cardiovascular disease and instead encodes the integrin ␣ L chain, which combines with the ␤ -2 chain to form the integrin lymphocyte function-associated antigen (LFA-1), which is expressed on all leukocytes ( 64 ). LFA-1 is a receptor for intercellular adhesion molecules (ICAM) and plays a key role in leukocyte intercellular adhesion through interaction with ICAMs. Like HDL ( 65, 66 ), PON1 is an acute phase reactant and has been associated with the immune response ( 67,68 ) and infl ammation ( 69 ). Specifi cally, PON1 has been demonstrated to be a host modulator of Pseudomonas aeruginosa quorum sensing ( 35 ), with transgenic expression of PON1 being protective in Drosophila from infection lethality ( 36 ). Therefore, the fi nding of ITGAL SNP rs1557672 being associated with a decrease in PON1 activity ( ␤ coeffi cient = Ϫ 7.16) may indicate a further role of immune regulation of HDL, and specifically, PON1 activity.
Of the SNPs found to be associated with PON1 activity at a level of suggestive signifi cance, SNPs in two genes, ALOX12 and PDE4D , and nearby FAM178A did not meet our FDR < 0.10 cutoff for further consideration. Notably 12-lipoxygenase (12-LO), like paraoxonase, can oxidize esterifi ed fatty acids in lipoproteins and phospholipids and has been implicated in atherosclerosis in animal models ( 70,71 ). Paraoxonase, on the other hand, appears to metabolize oxidized lipids ( 2 ). Further, ALOX12 variation has been associated with infl ammatory markers CRP and ICAM ( 72 ) and poor glycemic control ( 73 ) in type 2 diabetes patients. Interestingly, 12-LO is a nonheme, yet ironcontaining enzyme, and we have recently shown that dietary iron is associated with reduced PON1 activity ( 18 ). One could speculate that the inhibitory effect of iron may be through increased 12-LO. We observe that the SNPs in ALOX12 and PDE4D and nearby FAM178A are among those with the largest effect size, but the smallest MAFs. Thus, it is possible that these represent true positives that we are underpowered to detect a signifi cant association for due to low MAF. Further investigation of these loci, especially of ALOX12 due to its biology, is suggested by our fi ndings.
Utilizing SKAT to analyze both the rare and common variants represented on the Illumina Exome chip, we were able to detect signifi cant associations for PON1 , PON3 , and the PON gene cluster as a whole, even when known functional SNPs were considered as covariates. PON3 is also an HDL-associated enzyme, but it is expressed in much lower levels than PON1 ( 74 ). In contrast to both PON1 and PON3 , PON2 is ubiquitously expressed ( 75 ). Given that SNPs in PON2 have been previously reported in this cohort to predict PON1 enzyme activity, the failure of additional PON2 SNPs to predict PON1 activity in SKAT analyses may and the quorum-sensing factor of pseudomonas aeruginosa ( 35,36 ). Perhaps due to this extensive substrate specifi city, PON1 also has been implicated in numerous human diseases in addition to vascular disease, such as Parkinson's disease (37)(38)(39)(40)(41), systemic lupus erythematosus (42)(43)(44), breast cancer ( 45,46 ), age-related macular degeneration (47)(48)(49), and diabetes ( 50 ), among other disorders ( 51 ). Given the recent failures of HDL-C to prevent atherosclerotic outcomes in recent trials and the varied effects of PON1 on numerous important human diseases, a fuller understanding of PON1 activity may shed light on the cardioprotective role of HDL. To this end, we have expanded the search for genetic variation that predicts PON1 activity to loci outside of the PON1 coding region and to rare variants.
In this study, we report novel fi ndings that common variants in FTO , ITGAL , and SERPINA12 predict PON1 activity. In addition, we fi nd that rare SNVs within the PON gene cluster are predictive of PON1 AREase activity. For FTO , ITGAL and PON1 , there were signifi cant effects of rare SNVs on PON1 activity that were independent of the effects of the common variants, while for SERPINA12 , the independent effects of the rare variants were marginal ( P = 0.09). Variation in FTO , ITGAL , and SERPINA12 has not previously been explored for effects on PON1 activity.
Of these three novel loci, both FTO and SERPINA12 are related to the cardiovascular risk factors, diabetes, and body mass index (BMI). FTO encodes a nuclear protein of the AlkB-related nonheme iron superfamily and has been associated with type-2 diabetes ( 52-54 ) as well as obesity and BMI in humans ( 55,56 ). Its mechanism is not understood. Interestingly, FTO variants have been reported to have gene-by-gene interaction with PON1 SNP rs854560 in the prediction of BMI ( 57 ). However, none of the 5 ′ PON1 activity-predicting FTO SNPs in our study are in signifi cant LD (r 2 < 0.60) with the FTO SNPs implicated in genomewide association studies for diabetes or BMI, nor for geneby-gene interactions with PON1. Moreover, PON1 activity and BMI are not signifi cantly associated ( P = 0.56) in our data. When we tested all FTO SNPs present on the CVD chip, we did fi nd signifi cant ( P < 0.05) associations with BMI in SNPs not predictive for PON1, but we found no associations in any FTO SNP with diabetes in our data (data not shown). FTO variants have not previously been tested for association with PON1 activity. SERPINA12 , or visceral adipose tissue-derived serpin (VASPIN), encodes an insulin-sensitizing adipocytokine ( 58 ). Like FTO , SERPINA12 variants have been implicated in human type-2 diabetes ( 59, 60 ) and obesity ( 58,61 ). However, when testing all SERPINA12 variants present in our data, we did not fi nd a signifi cant association with either BMI or diabetes (data not shown). Interestingly, PON1 activity has long been known to be lower in those with diabetes ( 50 ), and it has also been reported as an antidiabetic enzyme that stimulates insulin release from pancreatic ␤ cells ( 62 ). However, SNPs within PON1 have not been found to predict diabetes ( 63 ). The SERPINA12 SNP rs7152296 minor allele was associated with a decrease in PON1 AREase ( ␤ coeffi cient = Ϫ 4.8); in contrast, all FTO SNP minor alleles identifi ed involved in cardiovascular phenotypes. The CVD chip and thus this study could have excluded consideration of loci relevant to PON1 biology. Finally, while replication in an independent sample is ideal, given the need for larger datasets to identify rare variant effects, we considered all available data, rather than stratifying to provide a replication set. Cohorts in which PON1 activity can be measured are limited due to the fact that PON1 activity is calcium dependent and cannot be measured from plasma stored in EDTA tubes due to calcium chelation and irreversible inactivation of PON1. Therefore, replication data were not available. However, our fi nding of independent effects of both common and pooled rare variants in the same loci for the same trait provides some validation of the associations of variation at these loci and PON1 activity.
In conclusion, we describe the novel fi ndings that common and rare genetic variants in FTO , ITGAL , and SER-PINA12 are associated with PON1 enzyme activity. Additionally, an association with ALOX12 variants does not meet our experiment-wide cutoff, but it suggests the need for further study. These associations strengthen the tie of PON1 activity to diabetes and infl ammation. We also identify rare variants in these loci and the PON1 gene cluster that predict PON1 activity. In addition, we describe 92 nonsynonymous, 5 ′ -and 3 ′ -noncoding variants from the ESP6500 data for the PON gene cluster. Given the importance of PON1 to HDL biology and human disease and the potential for rare variants to explain a large portion of the "missing heritability" of complex genetic traits, future studies will be needed to further elucidate the genetic determinants of PON1 enzymatic activity and their relationship to cardiovascular disease risk, including BMI and infl ammation.
refl ect a lack of rare SNVs in this sample, as only two PON2 SNPs from the Exome chip were polymorphic in the CLEAR cohort; these two SNVs, rs7493 and rs12026, were common and in tight LD (MAF of 24.6% for both and r 2 = 1.0).
When adjusting for PON gene cluster variants that have been previously associated with PON1 activity ( 16 ), the P values for association of the previously unidentifi ed variants remained signifi cant, but not as strongly associated with PON1 AREase for both PON1 and the PON gene cluster. One possible explanation for the change is nonindependence of previously identifi ed SNPs and the SNVs tested from the Exome chip due to LD. Such LD was found only for PON1 SNP rs3917503 from the Exome chip data and the previously identifi ed SNP rs2299257 (r 2 = 0.74). Rs2299257 was reported to predict 0.85% of PON1 enzyme activity in the CLEAR study ( 29 ). The MAFs for rs3917503 and rs2299257 are 0.46 and 0.39, respectively, with both minor alleles being associated with an increase in PON1 AREase activity (data not shown). Both SNPs are intronic (rs3917503 is in the third and rs2299257 is in the fourth of eight introns of PON1 ). It is also notable that two of the seven SNVs present in PON1 (Leu90Pro, MAF = 0.0010, and Trp194STOP, MAF = 0.0005) were fi rst described as functional in the CLEAR study in 2003 by an early application of sequencing based on extreme or atypical phenotype ( 17 ). Overall, our results support the expectation of rare functional alleles.
The current catalog of PON gene cluster nonsynonymous and 5 ′ /3 ′ -noncoding SNVs in the ESP6500 data, both rare and common, are also summarized here. Of the 93 such SNVs presented in PON1 , PON2 , and PON3 in the ESP data, only 7, 2, and 4 variants, respectively, are present in these data. With regard to SNVs present in EA-stratifi ed populations, we had data on 4 missense, 2 intronic, and 1 nonsense SNVs within PON1 . To the best of the authors' knowledge, this marks the fi rst publication analyzing the effects of the PON1 nonsense SNV Trp194STOP. Of the 14 PON1 EA variants with a GERP conservation score > 3 present, 3 were present here. No rare PON2 variants were present in our data, while 10 PON2 SNVs with MAF < 0.01 and GERP conservation score > 3 were present in EA subjects in the ESP6500 data. These data had 4 PON3 SNVs: 1 intronic, 1 missense, 1 missense near-splice, and 1 nonsense. Of the 17 EA variants with a GERP conservation score > 3, only 2 are present in this study.
Some limitations of this study must be considered. First, this cohort is composed entirely of subjects of European ancestry, which limits the generalization to other ancestral groups. This is particularly of interest as there are many rare SNVs in the PON gene cluster that are only present in individuals of African ancestry. Similarly, although our study is large in size in the context of paraoxonase epidemiology, it remains small when considering rare genetic variants; thus, true positives may be underpowered or unsampled here. Additionally, our analysis strategy for rare variants utilized a candidate gene approach based on fi ndings from common variant analyses. The common variant analyses were performed with the Illumina HumanCVD BeadChip, which was selective for genes that were potentially