TY - JOUR
T1 - Analysing trajectories of a longitudinal exposure
T2 - a causal perspective on common methods in lifecourse research
AU - Gadd, Sarah C.
AU - Tennant, Peter W. G.
AU - Heppenstall, Alison J.
AU - Boehnke, Jan
AU - Gilthorpe, Mark S.
N1 - This work was supported by the Economic and Social Research Council (esrc.ukri.org) [ES/P000746/1 to S.C.G.]; and the Alan Turing Institute (turing.ac.uk) [EP/N510129/1 to P.W.G.T. and M.S.G., ES/R007918/1 to A.H.]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript
PY - 2019/12/4
Y1 - 2019/12/4
N2 - Longitudinal data is commonly analysed to inform prevention policies for diseases that may develop throughout life. Commonly methods interpret the longitudinal data as a series of discrete measurements or as continuous patterns. Some of the latter methods condition on the outcome, aiming to capture ‘average’ patterns within outcome groups, while others capture individual-level pattern features before relating these to the outcome. Conditioning on the outcome may prevent meaningful interpretation. Repeated measurements of a longitudinal exposure (weight) and later outcome (glycated haemoglobin levels) were simulated to match three scenarios: one with no causal relationship between growth rate and glycated haemoglobin; two with a positive causal effect of growth rate on glycated haemoglobin. Two methods that condition on the outcome and one that did not were applied to the data in 1000 simulations. The interpretation of the two-step method matched the simulation in all causal scenarios, but that of the methods conditioning on the outcome did not. Methods that condition on the outcome do not accurately represent a causal relationship between a longitudinal pattern and outcome. Researchers considering longitudinal data should carefully determine if they wish to analyse longitudinal data as a series of discrete time points or by extracting pattern features.
AB - Longitudinal data is commonly analysed to inform prevention policies for diseases that may develop throughout life. Commonly methods interpret the longitudinal data as a series of discrete measurements or as continuous patterns. Some of the latter methods condition on the outcome, aiming to capture ‘average’ patterns within outcome groups, while others capture individual-level pattern features before relating these to the outcome. Conditioning on the outcome may prevent meaningful interpretation. Repeated measurements of a longitudinal exposure (weight) and later outcome (glycated haemoglobin levels) were simulated to match three scenarios: one with no causal relationship between growth rate and glycated haemoglobin; two with a positive causal effect of growth rate on glycated haemoglobin. Two methods that condition on the outcome and one that did not were applied to the data in 1000 simulations. The interpretation of the two-step method matched the simulation in all causal scenarios, but that of the methods conditioning on the outcome did not. Methods that condition on the outcome do not accurately represent a causal relationship between a longitudinal pattern and outcome. Researchers considering longitudinal data should carefully determine if they wish to analyse longitudinal data as a series of discrete time points or by extracting pattern features.
UR - http://www.scopus.com/inward/record.url?scp=85075979872&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0225217
DO - 10.1371/journal.pone.0225217
M3 - Article
C2 - 31800576
SN - 1932-6203
VL - 14
SP - 1
EP - 12
JO - PLoS ONE
JF - PLoS ONE
IS - 12
M1 - e0225217
ER -