Novel association of TM6SF2 rs58542926 genotype with increased serum tyrosine levels and decreased apoB-100 particles in Finns[S]

A glutamate-to-lysine variant (rs58542926-T) in transmembrane 6 superfamily member 2 (TM6SF2) is associated with increased fatty liver disease and diabetes in conjunction with decreased cardiovascular disease risk. To identify mediators of the effects of TM6SF2, we tested for associations between rs58542926-T and serum lipoprotein/metabolite measures in cross-sectional data from nondiabetic statin-naïve participants. We identified independent associations between rs58542926-T and apoB-100 particles (β = −0.057 g/l, P = 1.99 × 10−14) and tyrosine levels (β = 0.0020 mmol/l, P = 1.10 × 10−8), controlling for potential confounders, in 6,929 Finnish men. The association between rs58542926-T and apoB-100 was confirmed in an independent sample of 2,196 Finnish individuals from the FINRISK study (βreplication = −0.029, Preplication = 0.029). Secondary analyses demonstrated an rs58542926-T dose-dependent decrease in particle concentration, cholesterol, and triglyceride (TG) content for VLDL and LDL particles (P < 0.001 for all). No significant associations between rs58542926-T and HDL measures were observed. TM6SF2 SNP rs58542926-T and tyrosine levels were associated with increased incident T2D risk in both METSIM and FINRISK. Decreased liver production/secretion of VLDL, decreased cholesterol and TGs in VLDL/LDL particles in serum, and increased tyrosine levels identify possible mechanisms by which rs58542926-T exerts its effects on increasing risk of fatty liver disease, decreasing cardiovascular disease, and increasing diabetes risk, respectively.

The online version of this article (available at http://www.jlr.org) contains a supplement.

patient-oriented and epidemiological research
Nonalcoholic fatty liver disease (NAFLD) is defined by the presence of more than 5% hepatic steatosis in the absence of other known causes of liver fat accumulation (e.g., alcohol use). NAFLD is common in developed countries, with an estimated prevalence of 30-60% in adults, with significant differences by genetic ancestry (1). Hepatic steatosis can predispose to development of advanced liver disease, including cirrhosis, liver failure, and hepatocellular carcinoma (2). NAFLD is associated with the metabolic syndrome; some consider it the hepatic manifestation of the metabolic syndrome (3). NAFLD has been shown to be predictive of incident T2D and cardiovascular disease, after controlling for obesity (4), suggesting that hepatic factors may contribute to the development of metabolic syndrome, possibly through influencing hepatic glucose and lipid metabolism (3).
In vitro experiments in human Huh7 and HepG2 cell lines demonstrated that TM6SF2 was localized to the endoplasmic reticulum and the endoplasmic reticulum-Golgi intermediate compartment (11). In addition, TM6SF2 silencing via RNA inhibition resulted in decreased TG and apoB secretion suggestive of decreased total VLDL secretion and increased intracellular lipid droplet content (11). Conversely, overexpression of TM6SF2 reduced liver cell steatosis, indicating a role for TM6SF2 in liver fat metabolism (11). Separate short hairpin RNA knockdown of Tm6sf2 in mice resulted in increased liver TG accumulation (10) in conjunction with decreased plasma total cholesterol and decreased TG content in VLDL fractions (8,10). Transient overexpression of Tm6sf2 in the liver of mice using adenoviruses from a separate group showed increased total and LDL cholesterol and increased TG, whereas knockdown decreased total serum cholesterol (8). These lipid perturbations from functional gene silencing experiments, increased hepatic steatosis from intracellular TG accumulation in conjunction with decreased serum total cholesterol and TG levels, are consistent with human population-level associations with rs58542926-T (6)(7)(8)10). These data together suggest a possible loss-of-function role for rs58542926-T with regard to TM6SF2 protein function in causing increased hepatic steatosis while decreasing total serum cholesterol (5).
Follow-up studies of tm6sf2 / KO mice, however, have led to different proposed molecular models of how TM6SF2 may affect lipid metabolism. The first study found that tm6sf2 / KO mice had altered gene expression and plasma lipid levels suggestive of a role of TM6SF2 in cholesterol biosynthesis (12). The second study using tm6sf2 / KO mice reported decreases in plasma cholesterol and TG levels and a decrease in VLDL particle size without a concurrent decrease in apoB levels (13). These results from mouse models were suggestive of either decreased cholesterol biosynthesis (12) or decreased TG loading of VLDL without altered secretion of apoB reflecting total VLDL particle secretion (13). This latter result in mice conflicts with the originally reported findings of reduced apoB secretion in two human cell lines when TM6SF2 function is reduced (11).
Further investigations into the metabolic effects of the TM6SF2 p.Glu167Lys mutation in humans are necessary to better understand the role of TM6SF2 in NAFLD pathogenesis and lipid metabolism. Here, we leveraged highthroughput NMR lipoprotein and metabolic data from nondiabetic and statin-naïve men from the Metabolic Syndrome in Men (METSIM) study and identified two metabolic variables (apoB and tyrosine levels) that are altered in carriers of rs58542926-T. We replicated the association of decreased serum apoB and rs58542926-T in the independent FINRISK study. In secondary analyses, we demonstrated an association of rs58542926-T with increased T2D risk, which may be mediated by tyrosine levels; both rs585429260T and tyrosine levels were significantly associated with increased hazard of developing diabetes. These results help to identify possible metabolic mechanisms by which the TM6SF2 p.Glu167Lys mutation promotes liver disease and T2D, while protecting against cardiovascular disease.

Ethics statement
Both the METSIM and FINRISK studies were performed in accordance with the Helsinki Declaration and were approved by the ethics committee of the University of Kuopio and Kuopio University and the National Public Health Institute of Finland, respectively. Informed, written consent was obtained for all METSIM participants.

METSIM study participants
The METSIM study is a population-based study, with participants between the ages of 45 and 70 years randomly selected from the population register of the town of Kuopio in Eastern Finland. Each participant had a 1 day outpatient visit, from which history of previous diseases and current medication list were obtained. Evaluation of metabolic syndrome and cardiovascular risk factors were also assessed at this outpatient visit. Fasting blood samples were drawn after 12 h of fasting, followed by an oral glucose tolerance test (OGTT) (14). Of the 10,197 METSIM participants, we examined 6,926 subjects chosen to exclude those with diabetes at baseline (n = 915) and those undergoing statin pharmacotherapy (n = 2,353), as both conditions are associated with altered metabolic and lipid characteristics (15,16). A review of the METSIM study and its major findings was recently reported in the Journal of Lipid Research (17).

Insulin sensitivity calculation
Results from the OGTT were used to calculate the Matsuda index of insulin sensitivity (ISI) as 10,000/sqrt (fasting insulin × fasting glucose × mean insulin during OGTT × mean glucose during OGTT) (18).

Serum NMR metabolomics
Fasting serum samples for METSIM were collected at enrollment, stored at 80°C, and thawed overnight in a refrigerator before sample preparation. A high-throughput serum NMR metabolomics platform was used to quantify the levels of individual metabolites and lipoprotein measures. The NMR-based metabolic profiling was previously used in multiple large-scale epidemiological and genetic studies (19) and the experimentation described elsewhere (20,21).
The NMR-based quantification of lipoprotein subclasses was calibrated using HPLC (21,22). The lipoprotein subclass sizes measured were as follows: six VLDL subclasses, ranging from extremely large (average particle diameter 64.0 nm) to very small (31.3 nm); three LDL subclasses, ranging from large (25.5 nm) to small (18.7 nm); and four HDL subclasses, ranging from very large (14.3 nm) to small (8.7 nm). For each lipoprotein subclass, particle concentration, TG content, and cholesterol content were quantified (see supplemental Table S1).

Genotyping, imputation, and quality control
METSIM participant samples were genotyped on the Hu-manOmniExpresss-12v1_C BeadChip (OmniExpress) and Infinium HumanExome-12 v1.0 BeadChip (Exome Chip) platforms. Quality controls in the METSIM study included sample-level controls for identifying sex and relatedness confirmation, sample duplication, and detection of sample genetic ancestry outliers using principal-components analysis. Based on these quality control measures, we removed 14 samples with sex chromosome anomalies, 18 with evidence of participant duplication, and 12 that were population outliers. Additionally, we removed one individual from seven monozygotic twin pairs. Variants in the METSIM study were filtered for low mapping quality of probes to genome build GRCh37, low genotype completeness (<95% and <98% for the OmniExpress and ExomeChip, respectively), or Hardy-Weinberg equilibrium P < 10 6 .
OmniExpress variants passing quality control were then phased with Shape-It v2 (23) and imputed using minimac v2 (24). Imputation used a reference panel of 26.7 M variants from the GoT2D study (including SNPs, indels, and large deletions) based on the whole genome sequence of 2,874 Europeans, including 1,004 Finnish individuals, the largest panel of Finnish genomes available (9). Following imputation, variants directly genotyped on the ExomeChip were added. In cases of discordance between imputed and genotyped variants, the directly genotyped call from the ExomeChip was used.

FINRISK replication study participants
We performed replication analyses in the FINRISK study, a cross-sectional general population study with data collected every 5 years to assess the health of the Finnish population between 25 and 74 years of age (25). The statin-naïve and nondiabetic 2,196 participants analyzed in this work were recruited in 1997. NMRbased metabolite and lipoprotein measurements have previously been described (26)(27)(28), and were analyzed by the same laboratory using the same methods that produced the aforementioned METSIM NMR data. Genotyping in FINRISK was performed on Illumina platforms (26,29) and imputation was performed similarly to METSIM (26).

Analysis
We performed all analyses using standard R (https://www.rproject.org/) packages. Due to the right skew of the lipoprotein variables, we Winsorized extreme observations of all lipoprotein variables to three SDs from the mean (30), to allow for natural variation in lipid traits while minimizing the effect of potentially influential outliers. Genotypes of genetic variants were coded using an additive model.
Association testing. We used the TM6SF2 SNP rs58542926 genotype as the outcome variable, as it allowed for use of stepwise linear regression to identify independent predictors of genotype (which would not be possible in the converse, with rs58542926 as the predictor). Due to the cross-sectional design of the study, the association (P value) did not change between two variables when dependent/independent roles were switched. Thus, for interpretability, we present the rescaled estimated  coefficients () and SEs for the effect of each minor allele of rs58542926 on each covariate.
As lipoprotein variables are highly correlated, we used a twostep approach to identify independent metabolic and lipoprotein predictors of TM6SF2 SNP rs58542926. First, we performed univariate tests of association for each metabolic and lipoprotein variable across genotype group, using the Pearson test for categorical variables and the Kruskal-Wallis test for continuous variables. Forty-three lipoprotein and amino acid variables that were nominally significantly different by TM6SF2 SNP rs58542926 genotype were carried forward to the second multivariable step.
We used stepwise linear regression to identify independent lipoprotein and amino acid predictors of TM6SF2 SNP rs58542926 genotype with the aforementioned 43 variables entering the model. Model comparison was performed using Akaike's information criterion to examine the fit of each model, beginning with a base model that included age, BMI, current smoking status, serum alanine aminotransferase (ALT) activity, and Matsuda ISI. Lipoprotein and amino acid variables that were included in the final stepwise linear regression model independently increased the ability of the model to predict TM6SF2 SNP rs58542926 genotype. Robust SE estimates were used to account for the lack of homoscedasticity in lipoprotein variables across genotype groups. We used a false discovery rate (FDR) threshold of <10 4 for all variables for inclusion in the final regression model to correct for multiple comparisons. A sensitivity analysis considering recessive genotype coding of rs58542926 was performed.
Replication analyses. We attempted to replicate our primary findings from METSIM in 2,196 FINRISK participants with demographic, clinical, NMR metabolite, and lipoprotein data, and TM6SF2 SNP rs58542926 genotype. Due to the lack of Matsuda ISI and serum ALT data, we adjusted our replication analyses for age, sex (FINRISK is composed of both men and women), BMI, current smoking status, and tyrosine (when apoB-100 was the outcome studied) and apoB-100 (when tyrosine was the outcome studied). We used linear regression with robust SEs, as described above. Reanalysis of the association of rs58542926-T with apoB-100 and tyrosine was performed in METSIM with the updated replication covariates.
Association with incident T2D in METSIM and FINRISK. We performed analyses of the association of both TM6SF2 SNP rs58542926-T and tyrosine levels with incident T2D in statin-naïve and nondiabetic participants in both METSIM and FINRISK. Longitudinal analyses used a Cox proportional hazards model with robust SEs, adjusting for baseline age, female gender (for FINRISK), BMI, and current smoking status. To visualize the association of tyrosine and incident T2D, we plotted covariateadjusted cumulative T2D incidence of the top 50th percentile as compared with the bottom 50th percentile for tyrosine levels.

RESULTS
Baseline clinical, lipid, and lipoprotein variables are presented in Table 1, stratified by TM6SF2 SNP rs58542926 genotype. Univariate associations of specific lipoprotein variables (e.g., particle concentration of extremely large VLDL) with rs58542926-T are presented in supplemental  Table S1. We observed a difference in apoB-100 levels (P < 0.001), but not in apoA-I levels (P = 0.53) in univariate analyses across genotype group. Accordingly, we observed nominally significant (P < 0.001) differences in apoB-100-associated particles: VLDL and LDL, both of which had genotype-dependent differences in their particle concentration (VLDL-P and LDL-P), TG content (VLDL-TG and LDL-TG), and cholesterol content (VLDL-C and LDL-C). We did identify differences by rs58542926 genotype for the mean diameter of VLDL (P = 0.027), but not LDL or HDL.
A total of 43 lipoprotein and metabolite variables were nominally significant (P < 0.05) for association with TM6SF2 SNP rs58542926-T (see Table 1, supplemental Table S1) and were carried forward to stepwise linear regression to identify which of the 43 variables represented the strongest independent signals in multivariate analysis. Beginning with a base regression model containing age, BMI, smoking status, ALT, and Matsuda ISI, we identified two additional variables that improved model prediction of rs58542926 with FDR stepwise <10 4 : apoB-100 ( = 0.057 g/l per minor T allele, P = 1.99 × 10 14 ) and tyrosine ( = 0.0020 mmol/l per minor T allele, P = 1.10 × 10 8 ). We noted that once apoB-100 was in the model, no other lipoprotein variable (including the specific subclass measures) significantly increased model prediction ( Table 2). A sensitivity analysis using recessive coding of TM6SF2 SNP rs58542926 found consistent direction of effects and nominally significant associations for apoB-100 [ = 0.13 g/l for those with the homozygous for the minor (NAFLD risk increasing) allele (T/T) genotype as compared with the heterozygous (C/T) and homozygous for the major allele (C/C) genotype groups, P = 0.0012] and tyrosine ( = 0.0089 mmol/l for those with the T/T genotype as compared with the C/T and C/C genotype group, P = 4.71 × 10 6 ) levels (see supplemental Table S2).
To determine the significance of our apoB-100 and tyrosine associations with TM6SF2 SNP rs58542926-T, we present replication analyses from 2,196 nondiabetic and statin-naïve participants of the independent FINRISK study in Table 3. The association of apoB-100 and rs58542926-T was significant in FINRISK ( = 0.029, P = 0.029). The association of tyrosine with rs58542926-T was not significant in FINRISK ( = 0.0010, P = 0.15); however, the direction of effect was consistent with METSIM.
We performed secondary multivariate regression analyses in METSIM to identify potential lipid pathway perturbations associated with the TM6SF2 SNP rs58542926-T and apoB-100 association (see supplemental Table S3). Visualization of the covariate-adjusted association of decreasing apoB-100 levels per T allele of rs58542926 (P = 1.99 × 10 14 ) is presented in Fig. 1A. We noted a lack of association between genotype and apoA-I (P = 0.67). Examination of covariate-adjusted serum lipid levels revealed a nominally significant dose-dependent decrease for both TG (P = 2.70 × 10 12 ) and cholesterol (P = 1.61 × 10 8 ) levels (see Fig. 1B). Figure 2 shows the association of the particle concentrations, TG levels, and cholesterol content of VLDL ( Fig. 2A), LDL (Fig. 2B), and HDL (Fig. 2C). Consistent with the univariate analyses, we saw no association across rs58542926 genotype for any HDL measure (P > 0.05). Particle concentration, TG levels, and cholesterol content of both VLDL and LDL all decreased with each T allele of TM6SF2 SNP rs58542926 (P < 0.001 for all observations; Fig. 2A, B). Examination of univariate genotype association with particle diameter revealed nominally significant differences in VLDL diameter (P = 0.027). Follow-up examination of covariate-adjusted VLDL particle diameter did not reveal differences by genotype (P = 0.13, see supplemental Fig. S6).
To elucidate the role of TM6SF2 in T2D pathogenesis, we analyzed the association of rs58542926-T and tyrosine levels with incident T2D in both METSIM and FINRISK. Using Cox proportional hazards modeling, we found that TM6SF2 SNP rs58542926-T was significantly associated with incident T2D in METSIM [hazard ratio (HR) = 1.24, P = 0.046] and FINRISK (HR = 1.42, P = 0.0059; see Fig. 3A, supplemental Table S4). Analyses of the association of tyrosine levels demonstrated a strong pro-diabetes effect in both METSIM and FINRISK (see supplemental Table S5): those with higher tyrosine levels (>50th percentile) had a 63% and 118% increased hazard of incident diabetes as compared with the lower tyrosine levels group in METSIM (HR = 1.63, P = 8.22 × 10 6 ) and FINRISK (HR = 2.18, P = 0.00014), respectively (see Fig. 3B).

DISCUSSION
Since its discovery, the TM6SF2 p.Glu167Lys missense variant has been associated with decreased total serum cholesterol, LDL and TG concentration, and decreased myocardial infarction risk (8). Separately, knockdown of tm6sf2 expression in mouse liver has been causally linked to increased hepatic steatosis (10) and rs58542926 has since been associated with a wide spectrum of liver disease (31)(32)(33). This paradoxical association, where rs58542926-T decreases serum lipid levels and therefore decreases cardiovascular disease risk, but increases liver disease risk through increased hepatic fat accumulation, has been confirmed in meta-analyses (34) and recently has been the topic of a commentary (35). This same variant has now also been associated with increased risk of T2D (9). Despite the importance of rs58542926-T in metabolic disease risk, how rs58542926-T exerts its effect to predispose to these metabolic changes is unknown (36).
To help elucidate its mechanism of action, we aimed to identify metabolites associated with the TM6SF2 p.Glu167Lys missense variant. Using high-throughput NMR lipoprotein and metabolite profiling of a large sample (n = 6,929) of genetically homogeneous, nondiabetic, and statin-naïve Finnish men, we identified two highly significant associations in our data: a novel genotype-dependent increase in tyrosine levels ( = 0.0020 mmol/l, P = 1.10 × 10 8 ) and a strong decrease in apoB-100 levels ( = 0.057 g/l, P = 1.99 × 10 14 ) per T allele of rs58542926. The association of apoB-100 levels with rs58542926-T was subsequently replicated in the independent FINRISK study ( = 0.029, P = 0.029). The association of tyrosine levels with rs58542926-T was not replicated in FINRISK ( = 0.0010, P = 0.15); however, given the same direction of effect in both cohorts and the smaller more diverse sample size of FINRISK (2,129 men and women in FINRISK vs. 6,929 men in METSIM), the failure of replication in this work may reflect a lack of statistical power to detect an association. Moreover, given the strong association of tyrosine levels with incident T2D in both METSIM and FINRISK, alterations of tyrosine levels by TM6SF2 SNP rs58542926 genotype may reflect a potential causal pathway through which TM6SF2 affects diabetes risk. Sensitivity analyses of associations with TM6SF2 SNP rs58542926-T showed decreases in total apoB-100-containing particles (VLDL-P and LDL-P) and their TG (VLDL-TG and LDL-TG) and cholesterol content (VLDL-C and LDL-C). We also observed a lack of effect of rs58542926 genotype with apoA-I levels and HDL measures, which are the primary carriers of apoA-I. Finally, the particle diameters of VLDL, LDL, and HDL were not affected by rs58542926 genotype in multivariate analyses.
These lipoprotein data are consistent with the potential mechanism of decreased production or secretion of VLDL particles from the liver in individuals carrying the T (minor) allele of rs58542926. This hypothesis of a decrease in VLDL particle concentration would also result in decreased LDL particle concentration and decreased TG/cholesterol levels of these particle classes in the serum [we note that in our data that apoB, VLDL-P, VLDL-TG, and VLDL-C are all highly correlated (supplemental Fig. S1)]. As with apoB mutations causing familial hypobetalipoproteinemia (37), the replicated finding that TM6SF2 p.Glu167Lys is associated with significantly decreased apoB-100 levels suggests that the variant may act to promote accumulation of lipids and cholesterol in liver to cause fatty liver disease and advanced liver disease, while decreasing the levels of these lipids in the serum. Although these data are consistent with the hypothesis of decreased VLDL secretion (11), we cannot exclude from our cross-sectional analyses the possibility that production of VLDL is affected.
Within the context of prior literature on TM6SF2, our data are in apparent conflict with recent work by Smagris et al. (13) using a tm6sf2 / KO mouse model from which the authors concluded that, while TM6SF2 was implicated in lipidation of VLDL particles in the liver, it was not involved in secretion of VLDL. Data from Smagris et al. (13) that supported this conclusion was the decreased hepatic secretion of VLDL TGs and the lack of decreased hepatic apoB-100 secretion in conjunction with decreased VLDL diameter, all of which suggests decreased TG content of VLDL, but no decrease in absolute VLDL particle concentration in serum. With regard to another recent report using a tm6sf2 / KO mouse, our data cannot refute the hypothesis that TM6SF2 is involved in cholesterol biosynthesis (12). Further work in large-scale human studies of gene expression with enough participants with the minor allele homozygous genotype at rs58542926 (of which we had 17) will be needed to determine whether TM6SF2 lossof-function results in altered expression in cholesterol biosynthesis genes (12).
The conflict between our results, which are consistent with prior studies in humans (8,11), and with the recent work using tm6sf2 / KO mice (12,13), may reflect differences between the species. First, the lipoproteome of VLDL in mice is poorly understood as compared with humans (38). It is possible that regulation and secretion of VLDL differs in mice compared with humans, similar to the example of mice carrying the majority of cholesterol on HDL due to a lack of cholesterol ester transport protein (39). Second, humans have significantly different genetic regulation as compared with mice, as reflected by data from the Mouse ENCODE project, which found that approximately half of the transcription factor binding sites in the mouse genome were not found in the orthologous human genetic regions, while a quarter of transcription sites migrated to different positions within the regulatory element (40). Third, the effect of metabolic stressors, particularly the effects of diet, may differ between species. A 10 week high-fat diet commonly used to induce fatty liver disease, for example, did not result in the development of excess hepatic steatosis in mice; an effect that may be observed by the significantly higher daily intake and liver biosynthesis of cholesterol in mice (41).
In addition to our lipoprotein findings, we report the novel discovery that tyrosine levels are associated with TM6SF2 SNP rs58542926-T in METSIM and that tyrosine levels are independently associated with increased incidence of T2D in both METSIM and FINRISK. We note that while the association of tyrosine and rs58542926-T did not replicate in FINRISK, this may be due to smaller sample size and cohort heterogeneity, as the effect size direction was consistent with what was observed in METSIM and the P value was suggestive of significance. Tyrosine, an aromatic amino acid, has previously been associated in a partially overlapping subset of METSIM (n = 9,369) to be predictive of incident diabetes at 4.7 year follow-up (15). We further confirmed this association in a smaller statinnaïve subset of METSIM (n = 6,929) with approximately 12 years of follow-up time, and found that individuals in the top 50th percentile of tyrosine levels have a 1.69 greater hazard of incident T2D as compared with individuals in the lower 50th percentile. We similarly report that tyrosine levels are strongly predictive of incident T2D in a statin-naïve subset of the FINRISK study, with individuals having higher tyrosine levels at 2.18 greater hazard of incident T2D. Furthermore, in an independent and nonoverlapping population-based study of Finnish individuals, tyrosine levels were associated with baseline measures of insulin resistance, and at 6 year follow-up in men only (42), and separately, with measures of oral glucose tolerance at baseline and 6.5 year follow-up (43).
These tyrosine association data in aggregate suggest a role of rs58542926-T in the development of diabetes via alterations to aromatic amino acid metabolism. This hypothesis is strengthened by recent genetic fine-mapping of low frequency variants by the GoT2D and T2D-GENES a Replication analyses in both METSIM and FINRISK excluded for those with known diabetes and on lipid-lowering medication at time of enrollment. Linear regression models adjusted for age, sex (in FINRISK), BMI, current smoking status, and tyrosine (when apoB-100 was the outcome) and apoB-100 (when tyrosine was the outcome).
consortia, from which they concluded that TM6SF2 SNP rs58542926 was the likely causal variant for diabetes in the 19p13.11 locus (9). In this work, each T minor allele of rs58542926 was associated with 21% increased odds of T2D (9). We validate this association: each T minor allele rs58542926 was significantly associated with 24% and 42% increased hazard of incident T2D in METSIM and FIN-RISK, respectively. As a result of our data and prior work, it is plausible to speculate that TM6SF2 rs58542926-T increases tyrosine levels, which in turn increase T2D risk. However, it remains to be determined whether the effect of rs58542926 on tyrosine is direct on its metabolism or indirect via effects on lipid levels (44).
Some limitations of this study should be considered. First, this analysis was of cross-sectional data; therefore, we could not prove proposed mechanisms of action from these data alone. Second, the METSIM population in which discovery analyses were performed consisted solely of men; therefore, generalizability to women will need further evaluation in similarly large cohorts. However, using just men resulted in greater homogeneity across important risk factors for chronic disease and likely increased sensitivity to detect true positives (see prior discussion on the lack of replication of the tyrosine association in FINRISK). Third, concerns with multiple collinearity are important with analyses of highly correlated lipoprotein data. In our analyses, we used stepwise linear regression to identify independent predictors from among the numerous correlated lipoprotein variables. We note that our two significant variables, apoB-100 and tyrosine levels, are not highly correlated (r = 0.20). Fourth, we lacked liver biopsy and NMR liver fat measurements; thus, we were unable to make inferences as to the effect of rs58542926 on hepatic histologic processes. Future work is needed to assess this genotype-dependent effect on hepatic pathology on a large population scale.
The strengths and weakness of NMR metabolomic and lipoprotein profiling should be reviewed as compared with MS methods [see (19) for a recent review]. The strengths of NMR include lower cost and higher throughput, as compared with MS, which allow NMR metabolic and lipoprotein profiling to be used in large-scale studies such as METSIM. Weaknesses of NMR as compared with MS include less specific data, as MS separates signals from individual molecular identities with mass differences and thus leads to rich and complicated spectrometric data. Moreover, there is a strong focus on lipid species with NMR metabolomic profiling. Our study was designed to identify changes in lipoprotein particles and their cholesterol/TG content to address recent hypotheses by Smagris et al. (13) and Fan et al. (12) that TM6SF2 affected VLDL lipidation, but not secretion and cholesterol biosynthesis, respectively. As a result, our study would not have been able to assess these potential mechanisms with MS metabolomic data, as lipoproteins are not captured by the method.
In summary, we performed a novel study associating detailed lipoprotein subclass data and multiple circulating metabolite measures with TM6SF2 SNP rs58542926-T. We identified genotype-dependent decreases in the concentrations of apoB-100 particles: VLDL and LDL, and their TG and cholesterol contents. These findings are most consistent with a defect in VLDL production or secretion from hepatocytes. We also report the novel association of rs58542926-T with tyrosine levels and validate prior reports of tyrosine levels increasing incident T2D risk. This finding that rs58542926-T affects tyrosine levels may reflect a pathway through which TM6SF2 is associated with insulin resistance and incident T2D through alteration of tyrosine levels. Due to the importance of TM6SF2 in lipid metabolism and disease risk, further molecular and largescale studies are warranted.
The authors thank the participants of the METSIM study.