A Federated Database for Obesity Research: An IMI-SOPHIA Study

Carl Delfin, Iulian Dragan, Dmitry Kuznetsov, Juan Fernandez Tajes, Femke Smit, Daniel E. Coral, Ali Farzaneh, André Haugg, Andreas Hungele, Anne Niknejad, Christopher Hall, Daan Jacobs, Diana Marek, Diane P. Fraser, Dorothee Thuillier (Lead / Corresponding author), Fariba Ahmadizar, Florence Mehl, Francois Pattou, Frederic Burdet, Gareth HawkesIlja C. W. Arts, Jordi Blanch, Johan Van Soest, José-Manuel Fernández-Real, Juergen Boehl, Katharina Fink, Marleen M. J. van Greevenbroek, Maryam Kavousi, Michiel Minten, Nicole Prinz, Niels Ipsen, Paul W. Franks, Rafael Ramos, Reinhard W. Holl, Scott Horban, Talita Duarte-Salles, Van Du T. Tran, Violeta Raverdy, Yenny Leal, Adam Lenart, Ewan Pearson, Thomas Sparsø, Giuseppe N. Giordano, Vassilios Ioannidis, Keng Soh, Timothy M. Frayling, Carel W. Le Roux, Mark Ibberson (Lead / Corresponding author)

Research output: Contribution to journalArticlepeer-review

63 Downloads (Pure)

Abstract

Obesity is considered by many as a lifestyle choice rather than a chronic progressive disease. The Innovative Medicines Initiative (IMI) SOPHIA (Stratification of Obesity Phenotypes to Optimize Future Obesity Therapy) project is part of a momentum shift aiming to provide better tools for the stratification of people with obesity according to disease risk and treatment response. One of the challenges to achieving these goals is that many clinical cohorts are siloed, limiting the potential of combined data for biomarker discovery. In SOPHIA, we have addressed this challenge by setting up a federated database building on open-source DataSHIELD technology. The database currently federates 16 cohorts that are accessible via a central gateway. The database is multi-modal, including research studies, clinical trials, and routine health data, and is accessed using the R statistical programming environment where statistical and machine learning analyses can be performed at a distance without any disclosure of patient-level data. We demonstrate the use of the database by providing a proof-of-concept analysis, performing a federated linear model of BMI and systolic blood pressure, pooling all data from 16 studies virtually without any analyst seeing individual patient-level data. This analysis provided similar point estimates compared to a meta-analysis of the 16 individual studies. Our approach provides a benchmark for reproducible, safe federated analyses across multiple study types provided by multiple stakeholders.

Original languageEnglish
Article number262
Number of pages14
JournalLife (Basel, Switzerland)
Volume14
Issue number2
DOIs
Publication statusPublished - 16 Feb 2024

Keywords

  • bioinformatics
  • federated database system
  • obesity
  • remote statistical analysis
  • risk prediction

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • General Biochemistry,Genetics and Molecular Biology
  • Space and Planetary Science
  • Palaeontology

Fingerprint

Dive into the research topics of 'A Federated Database for Obesity Research: An IMI-SOPHIA Study'. Together they form a unique fingerprint.

Cite this