TY - JOUR
T1 - Development and validation of DNA methylation scores in two European cohorts augment 10-year risk prediction of type 2 diabetes
AU - Cheng, Yipeng
AU - Gadd, Danni A.
AU - Gieger, Christian
AU - Monterrubio-Gómez, Karla
AU - Zhang, Yufei
AU - Berta, Imrich
AU - Stam, Michael J.
AU - Szlachetka, Natalia
AU - Lobzaev, Evgenii
AU - Wrobel, Nicola
AU - Murphy, Lee
AU - Campbell, Archie
AU - Nangle, Cliff
AU - Walker, Rosie M.
AU - Fawns-Ritchie, Chloe
AU - Peters, Annette
AU - Rathmann, Wolfgang
AU - Porteous, David J.
AU - Evans, Kathryn L.
AU - McIntosh, Andrew M.
AU - Cannings, Timothy I.
AU - Waldenberger, Melanie
AU - Ganna, Andrea
AU - McCartney, Daniel L.
AU - Vallejos, Catalina A.
AU - Marioni, Riccardo E.
N1 - Funding Information:
This research was funded in whole, or in part, by the Wellcome Trust (nos. 104036/Z/14/Z, 108890/Z/15/Z and 216767/Z/19/Z). For the purpose of open access, we have applied a CC BY public copyright licence to any author accepted manuscript version arising from this submission. Generation Scotland received core support from the Chief Scientist Office of the Scottish Government Health Directorates (no. CZD/16/6) and the Scottish Funding Council (no. HR03006) and is currently supported by the Wellcome Trust (no. 216767/Z/19/Z). DNAm profiling of the Generation Scotland samples was carried out by the Genetics Core Laboratory at the Edinburgh Clinical Research Facility and was funded by the Medical Research Council UK and the Wellcome Trust (Wellcome Trust Strategic Award ‘STratifying Resilience and Depression Longitudinally’ (ref. no. 104036/Z/14/Z)). The DNAm data assayed for Generation Scotland was partially funded by a 2018 NARSAD Young Investigator Grant from the Brain & Behavior Research Foundation (ref. no. 27404; awardee: D. M. Howard) and by a JMAS SIM fellowship from the Royal College of Physicians of Edinburgh (awardee: H. C. Whalley). Y.C. is supported by the University of Edinburgh and University of Helsinki joint PhD program in Human Genomics. D.A.G. is supported by funding from the Wellcome Trust 4-year PhD in Translational Neuroscience—training the next generation of basic neuroscientists to embrace clinical research (no. 108890/Z/15/Z). C.A.V. is a Chancellor’s Fellow funded by the University of Edinburgh. D.L.M. and R.E.M. are supported by an Alzheimer’s Research UK major project grant no. ARUK-PG2017B-10. R.E.M. is supported by an Alzheimer’s Society major project grant no. AS-PG-19b-010. M.J.S., N.S. and E.L. are supported by the United Kingdom Research and Innovation (grant EP/S02431X/1), UKRI Centre for Doctoral Training in Biomedical AI at the University of Edinburgh, School of Informatics. Recruitment to the CovidLife study was facilitated by the Scottish Health Research Register (SHARE) and Biobank. SHARE is supported by NHS Research Scotland, the universities of Scotland and the Chief Scientist Office of the Scottish Government. The KORA S4 study was initiated and financed by the Helmholtz Zentrum München—German Research Center for Environmental Health, which is funded by the German Federal Ministry of Education and Research and by the State of Bavaria. Furthermore, the KORA research has been supported by the Munich Center of Health Sciences, Ludwig-Maximilians-Universität München as part of LMUinnovativ and is supported by the German Centre for Cardiovascular Research. The KORA S4 study is funded by the Bavarian State Ministry of Health and Care through the research project DigiMed Bayern ( www.digimed-bayern.de ).
Copyright:
© 2023, The Author(s), under exclusive licence to Springer Nature America, Inc.
PY - 2023/4/6
Y1 - 2023/4/6
N2 - Type 2 diabetes mellitus (T2D) presents a major health and economic burden that could be alleviated with improved early prediction and intervention. While standard risk factors have shown good predictive performance, we show that the use of blood-based DNA methylation information leads to a significant improvement in the prediction of 10-year T2D incidence risk. Previous studies have been largely constrained by linear assumptions, the use of cytosine–guanine pairs one-at-a-time and binary outcomes. We present a flexible approach (via an R package, MethylPipeR) based on a range of linear and tree-ensemble models that incorporate time-to-event data for prediction. Using the Generation Scotland cohort (training set ncases = 374, ncontrols = 9,461; test set ncases = 252, ncontrols = 4,526) our best-performing model (area under the receiver operating characteristic curve (AUC) = 0.872, area under the precision-recall curve (PRAUC) = 0.302) showed notable improvement in 10-year onset prediction beyond standard risk factors (AUC = 0.839, precision–recall AUC = 0.227). Replication was observed in the German-based KORA study (n = 1,451, ncases = 142, P = 1.6 × 10−5).
AB - Type 2 diabetes mellitus (T2D) presents a major health and economic burden that could be alleviated with improved early prediction and intervention. While standard risk factors have shown good predictive performance, we show that the use of blood-based DNA methylation information leads to a significant improvement in the prediction of 10-year T2D incidence risk. Previous studies have been largely constrained by linear assumptions, the use of cytosine–guanine pairs one-at-a-time and binary outcomes. We present a flexible approach (via an R package, MethylPipeR) based on a range of linear and tree-ensemble models that incorporate time-to-event data for prediction. Using the Generation Scotland cohort (training set ncases = 374, ncontrols = 9,461; test set ncases = 252, ncontrols = 4,526) our best-performing model (area under the receiver operating characteristic curve (AUC) = 0.872, area under the precision-recall curve (PRAUC) = 0.302) showed notable improvement in 10-year onset prediction beyond standard risk factors (AUC = 0.839, precision–recall AUC = 0.227). Replication was observed in the German-based KORA study (n = 1,451, ncases = 142, P = 1.6 × 10−5).
KW - Ageing
KW - DNA methylation
KW - Machine learning
KW - Predictive markers
UR - http://www.scopus.com/inward/record.url?scp=85152023970&partnerID=8YFLogxK
U2 - 10.1038/s43587-023-00391-4
DO - 10.1038/s43587-023-00391-4
M3 - Article
C2 - 37117793
AN - SCOPUS:85152023970
VL - 3
SP - 450
EP - 458
JO - Nature Aging
JF - Nature Aging
IS - 4
ER -