A structural mean modeling Mendelian randomization approach to investigate the lifecourse effect of adiposity: applied and methodological considerations.
기술보고
1/5 보강
PICO 자동 추출 (휴리스틱, conf 2/4)
유사 논문P · Population 대상 환자/모집단
In this study, we applied g-estimation of SMMs within an MR framework to estimate the period effects of adiposity measured at two life stages, childhood and adulthood, on cardiovascular disease (CVD), type 2 diabetes (T2D), and breast canc…
I · Intervention 중재 / 시술
추출되지 않음
C · Comparison 대조 / 비교
추출되지 않음
O · Outcome 결과 / 결론
We found persistent period effects of higher adulthood adiposity on increased risk of CVD and T2D.
Mendelian randomization (MR) is a technique that uses genetic variation to address causal questions about how modifiable exposures influence health.
APA
Power GM, Palmer T, et al. (2026). A structural mean modeling Mendelian randomization approach to investigate the lifecourse effect of adiposity: applied and methodological considerations.. American journal of epidemiology, 195(1), 21-31. https://doi.org/10.1093/aje/kwaf029
MLA
Power GM, et al.. "A structural mean modeling Mendelian randomization approach to investigate the lifecourse effect of adiposity: applied and methodological considerations.." American journal of epidemiology, vol. 195, no. 1, 2026, pp. 21-31.
PMID
39968723 ↗
Abstract 한글 요약
Mendelian randomization (MR) is a technique that uses genetic variation to address causal questions about how modifiable exposures influence health. For some time-varying phenotypes, genetic effects may have differential importance at different periods in the lifecourse. MR studies often employ conventional instrumental variable (IV) methods designed to estimate average lifetime effects. Recently, several extensions of MR have been proposed to investigate time-varying effects, including structural mean models (SMMs). SMMs exploit IVs through g-estimation and circumvent some of the parametric assumptions required by other MR methods. In this study, we applied g-estimation of SMMs within an MR framework to estimate the period effects of adiposity measured at two life stages, childhood and adulthood, on cardiovascular disease (CVD), type 2 diabetes (T2D), and breast cancer. We found persistent period effects of higher adulthood adiposity on increased risk of CVD and T2D. Higher childhood adiposity had a protective period effect on breast cancer risk. We compared this approach with an inverse variance weighted multivariable MR method, which also uses multiple IVs to assess time-varying effects but relies on a different set of assumptions. We highlight the strengths and limitations of each approach and conclude by emphasizing the importance of underlying methodological assumptions in the application of MR to lifecourse research.
🏷️ 키워드 / MeSH 📖 같은 키워드 OA만
같은 제1저자의 인용 많은 논문 (1)
📖 전문 본문 읽기 PMC JATS · ~67 KB · 영문
Introduction
Introduction
Mendelian randomization (MR) exploits the random assortment of genetic variants, independent of other traits, to mitigate confounding bias and reverse causality.1 Within a causal inference framework, MR builds on instrumental variable (IV) analysis, whereby specific assumptions are required to hold.2 The IV must (1) be associated with the exposure of interest (“relevance”), (2) not share common causes with the outcome (“independence”), and (3) not affect the outcome other than through the exposure (“exclusion-restriction”).3,4 MR studies typically use genetic instruments associated with a phenotype, that are not specific to any particular time period in the lifecourse, to estimate the effect of an exposure on an outcome. The effect estimate obtained is therefore defined as a “lifetime” effect. However, this approach cannot disentangle whether early life exposures directly influence the outcome, whether their effect is mediated through later exposures, or if the observed effect arises from genetic variants acting on the exposure in later periods. Equally, it is unclear whether the exposure experienced in later periods affects the outcome directly, or if the effect is confounded by the exposure in an earlier period affecting the later period and outcome.
A lifecourse approach sets out to investigate the contribution of early and later life exposures to identify risk and protective mechanisms across the lifespan.5-7 It has been shown that genetic effects may have differential importance in the development of an exposure at different periods in the lifecourse.8-12 Therefore, “standard” MR designs may lead to incorrect interpretations.13 MR methods that rely on having multiple instruments with different effects on an exposure have been developed, or adapted, to address lifecourse research questions.8,9,13-24 These include inverse variance weighted (IVW) multivariable MR (IVW-MVMR) and MR using g-estimation of structural mean models (SMM-MR).9,23
SMM-MR exploits IVs through g-estimation.25,26 Structural mean models are semi-parametric, circumventing some parametric assumptions made by other MR methods such as the assumption of a linear relationship with a continuous outcome when using two-stage least-squares (2SLS).27 They comprise a wide array of models, including those with binary, categorical, and time-to-event outcomes. In a lifecourse context, three distinct types of causal parameters may be estimated using this approach if combined with IVs that satisfy specific assumptions: the point effect, period effect, and lifetime effect.23 Each of these effects represents the difference between expected outcomes in a population when an exposure is increased by one unit under different scenarios. Definitions and assumptions of the causal parameters targeted by SMM-MR approaches are summarized in Table 1.
IVW-MVMR offers an alternative approach to conduct analysis in a lifecourse setting.9 IVW-MVMR estimates the controlled effects of correlated exposures on an outcome, where each effect is controlled for other exposures included in the model.9,16 The method relies on three assumptions, which expand on the core MR assumptions highlighted above: (1) the “relevance” assumption is modified to indicate that the liability (an underlying normally distributed latent (unmeasured) variable) to each exposure is robustly predicted by the genetic variants controlled for other exposures included in the estimation; (2) the “independence” assumption remains unchanged; and (3) the “exclusion-restriction” assumption proposes genetic variants are not associated with the outcome other than via liabilities to exposures included in the estimation.9,28 In a lifecourse setting, the correlated exposures in IVW-MVMR correspond to repeated measures of the same exposure over time.
Different assumptions about the nature of the association of the genetic variants used as instruments and the exposure at time periods not included in the model have been proposed for SMM-MR and IVW-MVMR. To satisfy the third instrumental condition when employing a SMM-MR design each component of the time-varying exposure outside of the period considered must not be directly associated with the instrument or the outcome. Under this assumption, using an example with an exposure measured at one time period (X2), there should be no direct pathway from genetic variants (Z) to X1 and X3 or no pathway from X1 or X3 directly to Y (see Figure 1). If multiple time periods had been measured and included in this model, the same assumptions would apply to those time periods included. This could be checked empirically.
Conversely, IVW-MVMR proposes that there may be a path from the instruments to the outcome via exposures at other periods, which have not been included in the estimation.9,29 This path will form part of the targeted causal effect. Consequently, the estimand being identified is the effect of an increase in one unit of liability to the exposure. With reference to Figure 2, the instruments associated with the liability (L) in at least one period of the lifecourse may be associated with L in different periods. The degree of this association may vary across periods.
IVW-MVMR has previously been used to evaluate whether childhood adiposity has an effect on disease outcomes, after controlling for adiposity in adulthood.16,30-33 If the main exposure (ie, childhood adiposity) has a causal influence on disease risk, IVW-MVMR allows us to decipher whether this effect is either partly mediated by, or is not acting through, adulthood adiposity. In this application, while increased childhood adiposity was a risk factor for coronary heart disease (CHD) and type 2 diabetes (T2D), the controlled period effects were negligible.16 This suggests that the influence of childhood adiposity on these diseases is mediated through adult adiposity, emphasizing the public health message that obesity prevention before adulthood could mitigate adverse effects.34 The study also found that increased childhood adiposity had a protective effect on breast cancer risk, independent of adult adiposity, and showed very little evidence of a causal effect on prostate cancer.16
In this paper we apply SMM-MR to estimate the period effects of childhood and adult adiposity on cardiovascular disease (CVD), T2D, and breast cancer in later life. We use this application to demonstrate this approach and provide a comprehensive interpretation of results. We compare results using the same data in an IVW-MVMR framework. We then discuss plausible next steps for MR methods used to address lifecourse epidemiology questions.
Mendelian randomization (MR) exploits the random assortment of genetic variants, independent of other traits, to mitigate confounding bias and reverse causality.1 Within a causal inference framework, MR builds on instrumental variable (IV) analysis, whereby specific assumptions are required to hold.2 The IV must (1) be associated with the exposure of interest (“relevance”), (2) not share common causes with the outcome (“independence”), and (3) not affect the outcome other than through the exposure (“exclusion-restriction”).3,4 MR studies typically use genetic instruments associated with a phenotype, that are not specific to any particular time period in the lifecourse, to estimate the effect of an exposure on an outcome. The effect estimate obtained is therefore defined as a “lifetime” effect. However, this approach cannot disentangle whether early life exposures directly influence the outcome, whether their effect is mediated through later exposures, or if the observed effect arises from genetic variants acting on the exposure in later periods. Equally, it is unclear whether the exposure experienced in later periods affects the outcome directly, or if the effect is confounded by the exposure in an earlier period affecting the later period and outcome.
A lifecourse approach sets out to investigate the contribution of early and later life exposures to identify risk and protective mechanisms across the lifespan.5-7 It has been shown that genetic effects may have differential importance in the development of an exposure at different periods in the lifecourse.8-12 Therefore, “standard” MR designs may lead to incorrect interpretations.13 MR methods that rely on having multiple instruments with different effects on an exposure have been developed, or adapted, to address lifecourse research questions.8,9,13-24 These include inverse variance weighted (IVW) multivariable MR (IVW-MVMR) and MR using g-estimation of structural mean models (SMM-MR).9,23
SMM-MR exploits IVs through g-estimation.25,26 Structural mean models are semi-parametric, circumventing some parametric assumptions made by other MR methods such as the assumption of a linear relationship with a continuous outcome when using two-stage least-squares (2SLS).27 They comprise a wide array of models, including those with binary, categorical, and time-to-event outcomes. In a lifecourse context, three distinct types of causal parameters may be estimated using this approach if combined with IVs that satisfy specific assumptions: the point effect, period effect, and lifetime effect.23 Each of these effects represents the difference between expected outcomes in a population when an exposure is increased by one unit under different scenarios. Definitions and assumptions of the causal parameters targeted by SMM-MR approaches are summarized in Table 1.
IVW-MVMR offers an alternative approach to conduct analysis in a lifecourse setting.9 IVW-MVMR estimates the controlled effects of correlated exposures on an outcome, where each effect is controlled for other exposures included in the model.9,16 The method relies on three assumptions, which expand on the core MR assumptions highlighted above: (1) the “relevance” assumption is modified to indicate that the liability (an underlying normally distributed latent (unmeasured) variable) to each exposure is robustly predicted by the genetic variants controlled for other exposures included in the estimation; (2) the “independence” assumption remains unchanged; and (3) the “exclusion-restriction” assumption proposes genetic variants are not associated with the outcome other than via liabilities to exposures included in the estimation.9,28 In a lifecourse setting, the correlated exposures in IVW-MVMR correspond to repeated measures of the same exposure over time.
Different assumptions about the nature of the association of the genetic variants used as instruments and the exposure at time periods not included in the model have been proposed for SMM-MR and IVW-MVMR. To satisfy the third instrumental condition when employing a SMM-MR design each component of the time-varying exposure outside of the period considered must not be directly associated with the instrument or the outcome. Under this assumption, using an example with an exposure measured at one time period (X2), there should be no direct pathway from genetic variants (Z) to X1 and X3 or no pathway from X1 or X3 directly to Y (see Figure 1). If multiple time periods had been measured and included in this model, the same assumptions would apply to those time periods included. This could be checked empirically.
Conversely, IVW-MVMR proposes that there may be a path from the instruments to the outcome via exposures at other periods, which have not been included in the estimation.9,29 This path will form part of the targeted causal effect. Consequently, the estimand being identified is the effect of an increase in one unit of liability to the exposure. With reference to Figure 2, the instruments associated with the liability (L) in at least one period of the lifecourse may be associated with L in different periods. The degree of this association may vary across periods.
IVW-MVMR has previously been used to evaluate whether childhood adiposity has an effect on disease outcomes, after controlling for adiposity in adulthood.16,30-33 If the main exposure (ie, childhood adiposity) has a causal influence on disease risk, IVW-MVMR allows us to decipher whether this effect is either partly mediated by, or is not acting through, adulthood adiposity. In this application, while increased childhood adiposity was a risk factor for coronary heart disease (CHD) and type 2 diabetes (T2D), the controlled period effects were negligible.16 This suggests that the influence of childhood adiposity on these diseases is mediated through adult adiposity, emphasizing the public health message that obesity prevention before adulthood could mitigate adverse effects.34 The study also found that increased childhood adiposity had a protective effect on breast cancer risk, independent of adult adiposity, and showed very little evidence of a causal effect on prostate cancer.16
In this paper we apply SMM-MR to estimate the period effects of childhood and adult adiposity on cardiovascular disease (CVD), T2D, and breast cancer in later life. We use this application to demonstrate this approach and provide a comprehensive interpretation of results. We compare results using the same data in an IVW-MVMR framework. We then discuss plausible next steps for MR methods used to address lifecourse epidemiology questions.
Methods
Methods
Data sources
UK Biobank data were collected between 2006 and 2010 on individuals aged between 40 and 69 years old at baseline, from clinical examinations, assays of biological samples, detailed information on self-reported health characteristics, and genome-wide genotyping, using a prospective cohort study design.35 The childhood adiposity measure applied in this study used recall questionnaire data, involving responses from adult participants who were asked whether, compared to the average, they were “thinner,” “about average,” or “plumper,” when they were aged 10 years old. The adult adiposity variable was derived using clinically measured body mass index (BMI) data (mean age, 56.5 years). It was then separated into a 3-tier variable based on the same proportions as the early life adiposity variable (ie, thinner, plumper, and about average).36,37 This was to ensure that derived effect estimates from subsequent analyses were as comparable as possible. Individuals that did not have data for both childhood and adult adiposity were excluded from analyses. Genetic variants strongly associated with childhood and adult adiposity (using P < 5 × 10−8 and r2 < 0.001) were identified in a large-scale genome-wide association study (GWAS), previously undertaken on 463 005 individuals in the UK Biobank study, adjusting for age, sex, and genotyping chip.35,38 These instruments have been independently validated in three distinct cohorts, validating their reliability in measuring childhood and adult adiposity.16,36,37 Phenotypic data for the outcomes under investigation, CVD, T2D, and breast cancer, were obtained from the UK Biobank. These outcomes were classified using the International Classification of Diseases, Tenth Revision (ICD-10) codes, described in Table S1. The UK Biobank study have obtained ethics approval from the Research Ethics Committee (REC; approval number: 11/NW/0382) and informed consent from all participants enrolled in UK Biobank. Estimates were derived using data from the UK Biobank (app #76538).
Statistical analysis
First, we employed a SMM technique in a univariable framework using polygenic risk scores (PRSs) generated from selected genetic variants to estimate period effects of childhood and adulthood adiposity on CVD, T2D, and breast cancer. We then applied SMMs in a multivariable framework to compare with an IVW-MVMR approach used to estimate controlled period effects. For comparative purposes, we employed the IVW method to conduct MR analyses using summary statistics generated with the same sets of genetic variants used in the SMM analyses. These steps are described in further detail below.
Polygenic risk scores
We calculated PRSs for each individual in the study using PLINK 2.00.39 Polygenic risk scores were generated with varying thresholds of association stringency between single nucleotide polymorphisms (SNPs) and the exposure phenotypes, childhood and adulthood adiposity, based on SNPs identified in the GWAS described above. The specific criteria for each PRS, including the SNP association thresholds, are detailed in Table 2. Complete lists of SNPs included in each PRS are available in Tables S2A-S2N.
Genetic IVs in a MR setting are conventionally selected from an independent dataset where the sample does not overlap with the dataset being analyzed to avoid overfitting bias.40 Conducting MR using overlapping sets of participant samples may bias in the direction of the conventional observational results between the exposure and outcome.41 In our investigation it was not possible to avoid this. However, we compare our IVW-MVMR results to the same analysis using non-overlapping outcome data, replicating analyses previously conducted,16 to assess how using these data may have affected our results. For this analysis GWAS data for CHD, T2D, and breast cancer were obtained from large scale consortia, which did not include data from the UK Biobank.42-44 Details on these datasets have been described previously.16
MR using structural mean modeling
We used individual-level data to perform SMM-MR. To highlight results obtained in most “standard” MR analyses we first used genetic variants strongly associated with adiposity around age 10 years (P ≤ 5 × 10−8), regardless of their association with adulthood adiposity and genetic variants strongly associated with adiposity around 57 years, regardless of their association with childhood adiposity. These results are interpreted as period effects as we have assumed that the genetic variants are only directly associated with the exposure over the periods of childhood or adulthood. However, under an alternative assumption in which associations between genetic variants and adiposity remain constant throughout the lifecourse, they could be interpreted as lifetime effects. Next, we used the genetic variants strongly associated with adiposity around age 10 years (P ≤ 5 × 10−8), excluding SNPs strongly associated with adiposity around 57 years at P ≤ 5 10−8, to estimate the period effect of childhood adiposity for a period that started in childhood and ended at some point between 10 and 57 years. We used the genetic variants strongly associated with adiposity around age 57 years (P ≤ 5 × 10−8), excluding SNPs associated with childhood adiposity at P ≤ 5 10−8, to estimate the period effect of adulthood adiposity for a period that started at some point between 10 and 57 years up to the point the outcome is measured. While definition of these periods is imprecise, the interpretation of this causal parameter is supported by previous validation studies.16,36,37 These have demonstrated that the genetic variants strongly associated with adiposity at age 10 years were better predictors of childhood BMI than those predicting adulthood BMI. In late adolescence, neither set of variants optimally predicted BMI, but in adulthood, variants associated with adiposity at age 57 years were stronger predictors of this timeframe than early life variants.16,36,37 We recommend conducting comparable validation analyses for novel variants utilized in subsequent studies. For instance, if data permit, examining the phenotypic associations of these novel variants across various age groups could help clarify which assumptions are most plausible.
To further explore the impact of instrument selection when estimating period effects, we applied the SNP association thresholds outlined in Table 2 and ran models with the PRS generated from each of these, in turn. We additionally applied SMMs in a multivariable framework to calculate the effects of childhood adiposity and adulthood adiposity, simultaneously, on CVD, T2D, and breast cancer. This approach estimates the controlled period effect of an exposure at a specific period by controlling for the exposure at a period not considered the main exposure period. The MR analyses conducted and their interpretation, along with the closest equivalent IVW estimator, are summarized in Table 3.
Importantly, childhood adiposity strongly influences adulthood adiposity.45 Genetic variants with varying levels of association with childhood and adult adiposity may be targeting different causal pathways underlying the relationship between adiposity and complex outcomes that vary in importance at different life stages.46 For instance, some variants linked to adult adiposity might affect aspects of early-life adiposity that childhood-associated variants do not capture.46 This can lead to deviations from the gene–environment equivalence assumption in MR,47 which asserts that genetic perturbations should mirror the effects of modifiable exposures on outcomes.46 This is important to consider when selecting genetic variants to instrument period effects.
Genome-wide association studies
We conducted GWAS to assess associations between genetic variants and the three outcomes of interest: CVD, T2D, and breast cancer. These analyses were performed using BOLT-LMM, a Bayesian linear mixed model that accounts for relatedness and population stratification when estimating associations between genetic variants and a trait.48,49 Age at baseline, sex, and genotyping array type were included as covariates in the model. The resulting GWAS were subsequently used in the IVW-MR analyses.
MR using inverse variance weighting
We employed the IVW method on outcomes from two data sources.50 The inverse variance weighted approach takes SNP-outcome estimates and regresses them on the SNP-exposure associations, weighting each SNP by the inverse of the variance of its SNP-outcome association.51 First, we used the GWAS generated from UK Biobank data to compare against the MR analyses using SMMs. Second, we used the GWAS obtained from large scale consortia, which did not include data from the UK Biobank,42-44 to assess how using overlapping outcome data may have affected our results. For each of the sets of IVW analyses, we estimated the “total” effects in a univariable framework. We then ran MVMR analyses to estimate the controlled period effect.
The IVW-MR analyses were conducted using the TwoSampleMR R package.52 Plots in this paper were generated using the R package “ggplot2.”53 These analyses were undertaken using R (version 3.5.1).
Data sources
UK Biobank data were collected between 2006 and 2010 on individuals aged between 40 and 69 years old at baseline, from clinical examinations, assays of biological samples, detailed information on self-reported health characteristics, and genome-wide genotyping, using a prospective cohort study design.35 The childhood adiposity measure applied in this study used recall questionnaire data, involving responses from adult participants who were asked whether, compared to the average, they were “thinner,” “about average,” or “plumper,” when they were aged 10 years old. The adult adiposity variable was derived using clinically measured body mass index (BMI) data (mean age, 56.5 years). It was then separated into a 3-tier variable based on the same proportions as the early life adiposity variable (ie, thinner, plumper, and about average).36,37 This was to ensure that derived effect estimates from subsequent analyses were as comparable as possible. Individuals that did not have data for both childhood and adult adiposity were excluded from analyses. Genetic variants strongly associated with childhood and adult adiposity (using P < 5 × 10−8 and r2 < 0.001) were identified in a large-scale genome-wide association study (GWAS), previously undertaken on 463 005 individuals in the UK Biobank study, adjusting for age, sex, and genotyping chip.35,38 These instruments have been independently validated in three distinct cohorts, validating their reliability in measuring childhood and adult adiposity.16,36,37 Phenotypic data for the outcomes under investigation, CVD, T2D, and breast cancer, were obtained from the UK Biobank. These outcomes were classified using the International Classification of Diseases, Tenth Revision (ICD-10) codes, described in Table S1. The UK Biobank study have obtained ethics approval from the Research Ethics Committee (REC; approval number: 11/NW/0382) and informed consent from all participants enrolled in UK Biobank. Estimates were derived using data from the UK Biobank (app #76538).
Statistical analysis
First, we employed a SMM technique in a univariable framework using polygenic risk scores (PRSs) generated from selected genetic variants to estimate period effects of childhood and adulthood adiposity on CVD, T2D, and breast cancer. We then applied SMMs in a multivariable framework to compare with an IVW-MVMR approach used to estimate controlled period effects. For comparative purposes, we employed the IVW method to conduct MR analyses using summary statistics generated with the same sets of genetic variants used in the SMM analyses. These steps are described in further detail below.
Polygenic risk scores
We calculated PRSs for each individual in the study using PLINK 2.00.39 Polygenic risk scores were generated with varying thresholds of association stringency between single nucleotide polymorphisms (SNPs) and the exposure phenotypes, childhood and adulthood adiposity, based on SNPs identified in the GWAS described above. The specific criteria for each PRS, including the SNP association thresholds, are detailed in Table 2. Complete lists of SNPs included in each PRS are available in Tables S2A-S2N.
Genetic IVs in a MR setting are conventionally selected from an independent dataset where the sample does not overlap with the dataset being analyzed to avoid overfitting bias.40 Conducting MR using overlapping sets of participant samples may bias in the direction of the conventional observational results between the exposure and outcome.41 In our investigation it was not possible to avoid this. However, we compare our IVW-MVMR results to the same analysis using non-overlapping outcome data, replicating analyses previously conducted,16 to assess how using these data may have affected our results. For this analysis GWAS data for CHD, T2D, and breast cancer were obtained from large scale consortia, which did not include data from the UK Biobank.42-44 Details on these datasets have been described previously.16
MR using structural mean modeling
We used individual-level data to perform SMM-MR. To highlight results obtained in most “standard” MR analyses we first used genetic variants strongly associated with adiposity around age 10 years (P ≤ 5 × 10−8), regardless of their association with adulthood adiposity and genetic variants strongly associated with adiposity around 57 years, regardless of their association with childhood adiposity. These results are interpreted as period effects as we have assumed that the genetic variants are only directly associated with the exposure over the periods of childhood or adulthood. However, under an alternative assumption in which associations between genetic variants and adiposity remain constant throughout the lifecourse, they could be interpreted as lifetime effects. Next, we used the genetic variants strongly associated with adiposity around age 10 years (P ≤ 5 × 10−8), excluding SNPs strongly associated with adiposity around 57 years at P ≤ 5 10−8, to estimate the period effect of childhood adiposity for a period that started in childhood and ended at some point between 10 and 57 years. We used the genetic variants strongly associated with adiposity around age 57 years (P ≤ 5 × 10−8), excluding SNPs associated with childhood adiposity at P ≤ 5 10−8, to estimate the period effect of adulthood adiposity for a period that started at some point between 10 and 57 years up to the point the outcome is measured. While definition of these periods is imprecise, the interpretation of this causal parameter is supported by previous validation studies.16,36,37 These have demonstrated that the genetic variants strongly associated with adiposity at age 10 years were better predictors of childhood BMI than those predicting adulthood BMI. In late adolescence, neither set of variants optimally predicted BMI, but in adulthood, variants associated with adiposity at age 57 years were stronger predictors of this timeframe than early life variants.16,36,37 We recommend conducting comparable validation analyses for novel variants utilized in subsequent studies. For instance, if data permit, examining the phenotypic associations of these novel variants across various age groups could help clarify which assumptions are most plausible.
To further explore the impact of instrument selection when estimating period effects, we applied the SNP association thresholds outlined in Table 2 and ran models with the PRS generated from each of these, in turn. We additionally applied SMMs in a multivariable framework to calculate the effects of childhood adiposity and adulthood adiposity, simultaneously, on CVD, T2D, and breast cancer. This approach estimates the controlled period effect of an exposure at a specific period by controlling for the exposure at a period not considered the main exposure period. The MR analyses conducted and their interpretation, along with the closest equivalent IVW estimator, are summarized in Table 3.
Importantly, childhood adiposity strongly influences adulthood adiposity.45 Genetic variants with varying levels of association with childhood and adult adiposity may be targeting different causal pathways underlying the relationship between adiposity and complex outcomes that vary in importance at different life stages.46 For instance, some variants linked to adult adiposity might affect aspects of early-life adiposity that childhood-associated variants do not capture.46 This can lead to deviations from the gene–environment equivalence assumption in MR,47 which asserts that genetic perturbations should mirror the effects of modifiable exposures on outcomes.46 This is important to consider when selecting genetic variants to instrument period effects.
Genome-wide association studies
We conducted GWAS to assess associations between genetic variants and the three outcomes of interest: CVD, T2D, and breast cancer. These analyses were performed using BOLT-LMM, a Bayesian linear mixed model that accounts for relatedness and population stratification when estimating associations between genetic variants and a trait.48,49 Age at baseline, sex, and genotyping array type were included as covariates in the model. The resulting GWAS were subsequently used in the IVW-MR analyses.
MR using inverse variance weighting
We employed the IVW method on outcomes from two data sources.50 The inverse variance weighted approach takes SNP-outcome estimates and regresses them on the SNP-exposure associations, weighting each SNP by the inverse of the variance of its SNP-outcome association.51 First, we used the GWAS generated from UK Biobank data to compare against the MR analyses using SMMs. Second, we used the GWAS obtained from large scale consortia, which did not include data from the UK Biobank,42-44 to assess how using overlapping outcome data may have affected our results. For each of the sets of IVW analyses, we estimated the “total” effects in a univariable framework. We then ran MVMR analyses to estimate the controlled period effect.
The IVW-MR analyses were conducted using the TwoSampleMR R package.52 Plots in this paper were generated using the R package “ggplot2.”53 These analyses were undertaken using R (version 3.5.1).
Results
Results
Univariable SMM-MR analyses, representing a period effect, indicated that higher childhood adiposity (regardless of SNP association with adulthood adiposity) increased CVD and T2D risk (risk difference (RD), 95% CI: 0.021, 0.013 to 0.029 and 0.049, 0.043 to 0.054, respectively) as did higher adult adiposity (RD, 95% CI: 0.062, 0.056 to 0.068 and 0.107, 0.101 to 0.113, respectively; Figure 3, Tables S3A and S3B). There was minimal evidence for period effects of higher childhood adiposity (excluding SNPs associated with adulthood adiposity at P ≤ 5 × 10−8) on increased CVD risk and some evidence on T2D risk (RD, 95% CI: 0.004, −0.008 to 0.016 and 0.018, 0.008 to 0.028, respectively). There was strong evidence of a period effect of higher adult adiposity (excluding SNPs associated with childhood adiposity at P ≤ 5 × 10−8) on increased CVD and T2D risk (RD, 95% CI: 0.067, 0.059 to 0.075 and 0.109, 0.103 to 0.115, respectively). Under the alternative assumption that the associations between genetic variants and adiposity remain constant throughout the lifecourse these estimates could be interpreted as lifetime effects.
SMM-MVMR analyses showed a controlled period effect of higher childhood adiposity on reduced CVD and T2D risk (RD, 95% CI: −0.024, −0.034 to −0.014, and −0.024, −0.032 to −0.016, respectively). There was a controlled period effect of higher adult adiposity on the CVD and T2D risk in later life (RD, 95% CI: 0.074, 0.066 to 0.082 and 0.118, 0.110 to 0.126, respectively).
Univariable SMM-MR analyses indicated that higher childhood adiposity (regardless of SNP association with adulthood adiposity) reduced breast cancer risk (RD, 95% CI: −0.032, −0.042 to −0.022; Figure 3; Table S3C). There was evidence of a period effect of higher childhood adiposity (excluding SNPs associated with adulthood adiposity at P ≤ 5 × 10−8) on reduced breast cancer risk (RD, 95% CI: −0.029, −0.043 to −0.015). There was minimal evidence of a period effect of higher adult adiposity (excluding SNPs associated with childhood adiposity at P ≤ 5 × 10−8) on breast cancer (RD, 95% CI: 0.006, −0.004 to 0.016). SMM-MVMR analyses also showed evidence that higher childhood adiposity reduced the risk of breast cancer, after controlling for adult adiposity (RD, 95% CI: −0.040, −0.054 to −0.026), while higher adult adiposity slightly increased breast cancer risk, controlling for childhood adiposity (RD, 95% CI: 0.013, 0.001 to 0.025).
When using different SNP association thresholds to estimate period effects, univariable SMM-MR consistently showed that that higher adulthood adiposity increased risk of CVD (Figure 4A; Table S3A) and T2D (Figure 4B; Table S3B). As the stringency of SNP selection increased, excluding increasing number of SNPs associated with adult adiposity, evidence of the effect of childhood adiposity on CVD diminished. Some evidence of an effect on T2D persisted using SNPs with the lowest stringency (exclusion threshold of adulthood adiposity SNPs at P ≤ 5 × 10−8).
Additionally, analyses consistently showed that higher childhood adiposity reduced breast cancer risk (Figure 4C; Table S3C). A protective effect of adulthood adiposity was observed when including all SNPs, but this effect diminished or reversed direction when excluding SNPs associated with childhood adiposity.
IVW univariable and MVMR analyses using UK Biobank data, showed similar trends to the SMM approach, though generally weaker (Figure 4A-C; Tables S4A-S4C). Inverse variance weighted univariable and MVMR analyses using the outcomes generated from the large-scale consortia data indicated effects in the same directions as the SMM approach (Tables S5A-S5C; Figure S1A-C), though with larger magnitude and broader confidence intervals, likely due to Winner’s curse from sample overlap between the discovery sample and dataset used in core analyses.54 Specifically, UK Biobank data indicated that increased adulthood adiposity increased breast cancer risk (Figure 4C; Table S4C). In contrast, there was minimal evidence of an effect of adulthood adiposity on breast cancer when using the large scale meta-analyzed outcome consortia data (Table S5C; Figure S1C).
Univariable SMM-MR analyses, representing a period effect, indicated that higher childhood adiposity (regardless of SNP association with adulthood adiposity) increased CVD and T2D risk (risk difference (RD), 95% CI: 0.021, 0.013 to 0.029 and 0.049, 0.043 to 0.054, respectively) as did higher adult adiposity (RD, 95% CI: 0.062, 0.056 to 0.068 and 0.107, 0.101 to 0.113, respectively; Figure 3, Tables S3A and S3B). There was minimal evidence for period effects of higher childhood adiposity (excluding SNPs associated with adulthood adiposity at P ≤ 5 × 10−8) on increased CVD risk and some evidence on T2D risk (RD, 95% CI: 0.004, −0.008 to 0.016 and 0.018, 0.008 to 0.028, respectively). There was strong evidence of a period effect of higher adult adiposity (excluding SNPs associated with childhood adiposity at P ≤ 5 × 10−8) on increased CVD and T2D risk (RD, 95% CI: 0.067, 0.059 to 0.075 and 0.109, 0.103 to 0.115, respectively). Under the alternative assumption that the associations between genetic variants and adiposity remain constant throughout the lifecourse these estimates could be interpreted as lifetime effects.
SMM-MVMR analyses showed a controlled period effect of higher childhood adiposity on reduced CVD and T2D risk (RD, 95% CI: −0.024, −0.034 to −0.014, and −0.024, −0.032 to −0.016, respectively). There was a controlled period effect of higher adult adiposity on the CVD and T2D risk in later life (RD, 95% CI: 0.074, 0.066 to 0.082 and 0.118, 0.110 to 0.126, respectively).
Univariable SMM-MR analyses indicated that higher childhood adiposity (regardless of SNP association with adulthood adiposity) reduced breast cancer risk (RD, 95% CI: −0.032, −0.042 to −0.022; Figure 3; Table S3C). There was evidence of a period effect of higher childhood adiposity (excluding SNPs associated with adulthood adiposity at P ≤ 5 × 10−8) on reduced breast cancer risk (RD, 95% CI: −0.029, −0.043 to −0.015). There was minimal evidence of a period effect of higher adult adiposity (excluding SNPs associated with childhood adiposity at P ≤ 5 × 10−8) on breast cancer (RD, 95% CI: 0.006, −0.004 to 0.016). SMM-MVMR analyses also showed evidence that higher childhood adiposity reduced the risk of breast cancer, after controlling for adult adiposity (RD, 95% CI: −0.040, −0.054 to −0.026), while higher adult adiposity slightly increased breast cancer risk, controlling for childhood adiposity (RD, 95% CI: 0.013, 0.001 to 0.025).
When using different SNP association thresholds to estimate period effects, univariable SMM-MR consistently showed that that higher adulthood adiposity increased risk of CVD (Figure 4A; Table S3A) and T2D (Figure 4B; Table S3B). As the stringency of SNP selection increased, excluding increasing number of SNPs associated with adult adiposity, evidence of the effect of childhood adiposity on CVD diminished. Some evidence of an effect on T2D persisted using SNPs with the lowest stringency (exclusion threshold of adulthood adiposity SNPs at P ≤ 5 × 10−8).
Additionally, analyses consistently showed that higher childhood adiposity reduced breast cancer risk (Figure 4C; Table S3C). A protective effect of adulthood adiposity was observed when including all SNPs, but this effect diminished or reversed direction when excluding SNPs associated with childhood adiposity.
IVW univariable and MVMR analyses using UK Biobank data, showed similar trends to the SMM approach, though generally weaker (Figure 4A-C; Tables S4A-S4C). Inverse variance weighted univariable and MVMR analyses using the outcomes generated from the large-scale consortia data indicated effects in the same directions as the SMM approach (Tables S5A-S5C; Figure S1A-C), though with larger magnitude and broader confidence intervals, likely due to Winner’s curse from sample overlap between the discovery sample and dataset used in core analyses.54 Specifically, UK Biobank data indicated that increased adulthood adiposity increased breast cancer risk (Figure 4C; Table S4C). In contrast, there was minimal evidence of an effect of adulthood adiposity on breast cancer when using the large scale meta-analyzed outcome consortia data (Table S5C; Figure S1C).
Discussion
Discussion
In this investigation we apply MR within a lifecourse epidemiological framework, using SMM and IVW approaches with a time-varying exposure. SMM-MR and IVW-MVMR target different causal estimands that rely on distinct assumptions. SMM-MR computed the effect of increasing adiposity by one unit of measurement and IVW-MVMR, the effect of an increase in one unit of liability to adiposity. For SMM-MR, each component of the time-varying exposure outside of the period considered is assumed to be unaffected by the instrument or affect the outcome only by affecting exposures during the time period under consideration. For IVW-MVMR, the genetically predicted effect of increasing the exposure by a unit of liability at one time period is assumed to include genetic effects via the exposure at other time periods if the exposure in those periods has an effect on the outcome.
We estimated the effects of adiposity on CVD, T2D, and breast cancer. First, we evaluated the period effects of childhood and adult adiposity using genetic variants strongly associated with adiposity at each period, regardless of their association with adiposity at the alternate period. We then evaluated stringently defined period effects and finally controlled period effects.
Our findings indicated that the period effects of higher childhood and adult adiposity, when using less stringent SNP-association criteria, increased the risk of CVD and T2D, while reducing the risk of breast cancer. However, when applying stricter SNP-association criteria, the effect of higher childhood adiposity on CVD and T2D was attenuated, while controlled period effects indicated a reduced risk associated with higher childhood adiposity for these outcomes. Conversely, more stringently instrumented period effects, as well as controlled period effects, indicated that higher adulthood adiposity may elevate breast cancer risk. Our findings suggest that higher childhood adiposity was protective against breast cancer across both period and controlled period estimates, aligning with previous research.33 Interpretation of these results as period effects depends on the assumption that the genetic variants are associated with adiposity only for the period in which they were identified. Under the alternative assumption that the associations between genetic variants and adiposity remain constant throughout the lifecourse these could be interpreted as lifetime effects.
Our results highlight the importance of considering potential associations of the genetic variants with traits at other time periods when estimating period effects. For example, we see evidence of increased period effects on T2D in the set of SNPs strongly associated with childhood adiposity, irrespective of their association with adult adiposity, as well as in SNPs not associated with adulthood adiposity at P ≤ 5 10−8. However, increasing the stringency for SNP inclusion in childhood instruments showed little evidence of an effect of childhood adiposity on T2D. Furthermore, SNPs used to instrument adiposity at one period might still be associated with adiposity at another period, albeit with a lower level of association. Since our analysis includes only two time points, we cannot determine SNP associations with other time periods beyond those measured.
When researchers use genetic instruments that are not specific to a time period, which is commonplace in MR analyses, they are implicitly targeting a lifetime effect of liability to a unit higher level of the exposure at the time it was measured.29 This may result in incorrect interpretations as highlighted in our applied examples. To allow estimation of period-specific effects, we can also use a SMM- or IVW-MVMR approach, to estimate the controlled period effect of each exposure on the outcome. Results may still reflect the effects of other periods that are not included in the estimation if they are influenced by the genetic variants used.
We have assumed throughout that there is no time-varying confounding; however, the existence of time-varying confounding would not affect the interpretation of our results. A detailed explanation for this is provided in Table S6. We have also assumed all models are linear and that there are no interactions between the exposure at different time periods.
The IVW-MVMR results generated from largescale consortia indicated little evidence of a controlled effect of higher adult adiposity on breast cancer.16 The observed risk increase using the UK Biobank data could be due to higher adulthood adiposity influencing onset and survival from breast cancer differently in this cohort study compared to the case–control studies used in producing the largescale outcome data. Differences may also be partially due to a “healthy volunteer” selection bias. The UK Biobank is not representative of the sampling population—participants are a sample of volunteers with healthier lifestyles, higher levels of education and better health than the general UK population.55,56 If participation in the UK Biobank is a consequence of either our exposure or outcome, then a key MR condition (exchangeability) is violated, inducing an association between genetic instruments and unmeasured confounders of the exposure–outcome relationship.56 Another reason for inconsistencies could be weak instrument bias. This may push a null effect to be non-null, as seen in the effect between adulthood adiposity and breast cancer risk.57 In addition, when exposure and outcome effects are estimated in a single (ie, fully overlapping) sample, as they were for our analyses, bias will be in the direction of the (confounded) observational association.58
There is no meaningful difference between the estimation results obtained using SMM and IVW-MR in relatively simple models such as the one we have used; however, the importance of this work lies in both the clear explanation of the interpretations of the estimates from each method, and detailed assumptions being made when employing each method. Comprehensively understanding the interpretation of results after employing these methods is of key public health importance. For instance, when considering the potential differences in mediating pathways from one period (ie, childhood adiposity) and not another (ie, adulthood adiposity) we might hypothesize that prepubertal childhood and adult adiposity could be distinct phenotypes; an insight gained through conducting these approaches. This helps us to further unpick pathways associated with disease risk. For example, there are long-run effects of childhood adiposity on physical development shown though mammographic density being a strong mediator for the protective effects of childhood adiposity on breast cancer.59 In addition, situations in which SMMs may prove more useful, specifically when addressing lifecourse questions, should be explored in future work.
This study provides important detail on novel MR methods available to perform lifecourse research; however, an important limitation lies the limited availability of period-specific instruments. This may be attributed to the genetic associations of many phenotypes remaining constant over time as well as the lack of research investigating potential temporal variations in these associations. Compounding this is the absence of adequately large datasets that include age-specific information paired with corresponding genetic data across a wide range of phenotypes. Such data have the potential to greatly enhance our understanding of this research area. In cases where genetic associations do not vary over time, researchers seeking to estimate period effects might instead identify examples of period-specific non-genetic instruments and use alternative methods such as time-defined natural experiments.
This study underscores the importance of carefully considering underlying methodological assumptions when applying SMM-MR and IVW-MVMR approaches in a lifecourse setting and offers guidance on accurately interpreting the results generated by these methods. It also highlights the critical role of instrument selection in these analyses. A clear and informed understanding of the assumptions and interpretations is essential for conducting robust and meaningful MR analyses in lifecourse research—considerations that have yet to be fully explored in the current literature. We encourage practitioners to thoroughly think through their research question, IV selection, model conditions, and data availability before pursuing a particular MR approach within a lifecourse setting.
In this investigation we apply MR within a lifecourse epidemiological framework, using SMM and IVW approaches with a time-varying exposure. SMM-MR and IVW-MVMR target different causal estimands that rely on distinct assumptions. SMM-MR computed the effect of increasing adiposity by one unit of measurement and IVW-MVMR, the effect of an increase in one unit of liability to adiposity. For SMM-MR, each component of the time-varying exposure outside of the period considered is assumed to be unaffected by the instrument or affect the outcome only by affecting exposures during the time period under consideration. For IVW-MVMR, the genetically predicted effect of increasing the exposure by a unit of liability at one time period is assumed to include genetic effects via the exposure at other time periods if the exposure in those periods has an effect on the outcome.
We estimated the effects of adiposity on CVD, T2D, and breast cancer. First, we evaluated the period effects of childhood and adult adiposity using genetic variants strongly associated with adiposity at each period, regardless of their association with adiposity at the alternate period. We then evaluated stringently defined period effects and finally controlled period effects.
Our findings indicated that the period effects of higher childhood and adult adiposity, when using less stringent SNP-association criteria, increased the risk of CVD and T2D, while reducing the risk of breast cancer. However, when applying stricter SNP-association criteria, the effect of higher childhood adiposity on CVD and T2D was attenuated, while controlled period effects indicated a reduced risk associated with higher childhood adiposity for these outcomes. Conversely, more stringently instrumented period effects, as well as controlled period effects, indicated that higher adulthood adiposity may elevate breast cancer risk. Our findings suggest that higher childhood adiposity was protective against breast cancer across both period and controlled period estimates, aligning with previous research.33 Interpretation of these results as period effects depends on the assumption that the genetic variants are associated with adiposity only for the period in which they were identified. Under the alternative assumption that the associations between genetic variants and adiposity remain constant throughout the lifecourse these could be interpreted as lifetime effects.
Our results highlight the importance of considering potential associations of the genetic variants with traits at other time periods when estimating period effects. For example, we see evidence of increased period effects on T2D in the set of SNPs strongly associated with childhood adiposity, irrespective of their association with adult adiposity, as well as in SNPs not associated with adulthood adiposity at P ≤ 5 10−8. However, increasing the stringency for SNP inclusion in childhood instruments showed little evidence of an effect of childhood adiposity on T2D. Furthermore, SNPs used to instrument adiposity at one period might still be associated with adiposity at another period, albeit with a lower level of association. Since our analysis includes only two time points, we cannot determine SNP associations with other time periods beyond those measured.
When researchers use genetic instruments that are not specific to a time period, which is commonplace in MR analyses, they are implicitly targeting a lifetime effect of liability to a unit higher level of the exposure at the time it was measured.29 This may result in incorrect interpretations as highlighted in our applied examples. To allow estimation of period-specific effects, we can also use a SMM- or IVW-MVMR approach, to estimate the controlled period effect of each exposure on the outcome. Results may still reflect the effects of other periods that are not included in the estimation if they are influenced by the genetic variants used.
We have assumed throughout that there is no time-varying confounding; however, the existence of time-varying confounding would not affect the interpretation of our results. A detailed explanation for this is provided in Table S6. We have also assumed all models are linear and that there are no interactions between the exposure at different time periods.
The IVW-MVMR results generated from largescale consortia indicated little evidence of a controlled effect of higher adult adiposity on breast cancer.16 The observed risk increase using the UK Biobank data could be due to higher adulthood adiposity influencing onset and survival from breast cancer differently in this cohort study compared to the case–control studies used in producing the largescale outcome data. Differences may also be partially due to a “healthy volunteer” selection bias. The UK Biobank is not representative of the sampling population—participants are a sample of volunteers with healthier lifestyles, higher levels of education and better health than the general UK population.55,56 If participation in the UK Biobank is a consequence of either our exposure or outcome, then a key MR condition (exchangeability) is violated, inducing an association between genetic instruments and unmeasured confounders of the exposure–outcome relationship.56 Another reason for inconsistencies could be weak instrument bias. This may push a null effect to be non-null, as seen in the effect between adulthood adiposity and breast cancer risk.57 In addition, when exposure and outcome effects are estimated in a single (ie, fully overlapping) sample, as they were for our analyses, bias will be in the direction of the (confounded) observational association.58
There is no meaningful difference between the estimation results obtained using SMM and IVW-MR in relatively simple models such as the one we have used; however, the importance of this work lies in both the clear explanation of the interpretations of the estimates from each method, and detailed assumptions being made when employing each method. Comprehensively understanding the interpretation of results after employing these methods is of key public health importance. For instance, when considering the potential differences in mediating pathways from one period (ie, childhood adiposity) and not another (ie, adulthood adiposity) we might hypothesize that prepubertal childhood and adult adiposity could be distinct phenotypes; an insight gained through conducting these approaches. This helps us to further unpick pathways associated with disease risk. For example, there are long-run effects of childhood adiposity on physical development shown though mammographic density being a strong mediator for the protective effects of childhood adiposity on breast cancer.59 In addition, situations in which SMMs may prove more useful, specifically when addressing lifecourse questions, should be explored in future work.
This study provides important detail on novel MR methods available to perform lifecourse research; however, an important limitation lies the limited availability of period-specific instruments. This may be attributed to the genetic associations of many phenotypes remaining constant over time as well as the lack of research investigating potential temporal variations in these associations. Compounding this is the absence of adequately large datasets that include age-specific information paired with corresponding genetic data across a wide range of phenotypes. Such data have the potential to greatly enhance our understanding of this research area. In cases where genetic associations do not vary over time, researchers seeking to estimate period effects might instead identify examples of period-specific non-genetic instruments and use alternative methods such as time-defined natural experiments.
This study underscores the importance of carefully considering underlying methodological assumptions when applying SMM-MR and IVW-MVMR approaches in a lifecourse setting and offers guidance on accurately interpreting the results generated by these methods. It also highlights the critical role of instrument selection in these analyses. A clear and informed understanding of the assumptions and interpretations is essential for conducting robust and meaningful MR analyses in lifecourse research—considerations that have yet to be fully explored in the current literature. We encourage practitioners to thoroughly think through their research question, IV selection, model conditions, and data availability before pursuing a particular MR approach within a lifecourse setting.
Research ethics and informed consent
Research ethics and informed consent
The UK Biobank study have obtained ethics approval from the Research Ethics Committee (REC; approval number: 11/NW/0382). The UK Biobank study have obtained informed consent from all participants enrolled in UK Biobank. Estimates were derived using data from the UK Biobank (app #76538).
The UK Biobank study have obtained ethics approval from the Research Ethics Committee (REC; approval number: 11/NW/0382). The UK Biobank study have obtained informed consent from all participants enrolled in UK Biobank. Estimates were derived using data from the UK Biobank (app #76538).
Supplementary Material
Supplementary Material
Web_Material_kwaf029
Web_Material_kwaf029
출처: PubMed Central (JATS). 라이선스는 원 publisher 정책을 따릅니다 — 인용 시 원문을 표기해 주세요.
🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반
- A Phase I Study of Hydroxychloroquine and Suba-Itraconazole in Men with Biochemical Relapse of Prostate Cancer (HITMAN-PC): Dose Escalation Results.
- Self-management of male urinary symptoms: qualitative findings from a primary care trial.
- Clinical and Liquid Biomarkers of 20-Year Prostate Cancer Risk in Men Aged 45 to 70 Years.
- Diagnostic accuracy of Ga-PSMA PET/CT versus multiparametric MRI for preoperative pelvic invasion in the patients with prostate cancer.
- Comprehensive analysis of androgen receptor splice variant target gene expression in prostate cancer.
- Clinical Presentation and Outcomes of Patients Undergoing Surgery for Thyroid Cancer.