Abstract
Background: Genetic studies for complex diseases have predominantly discoveredmain effects at individual loci, but have not focused on genomic and environmentalcontexts important for a phenotype. Gene Set Enrichment Analysis (GSEA) aims toaddress this by identifying sets of genes or biological pathways contributing to aphenotype, through gene-gene interactions or other mechanisms, which are not thefocus of conventional association methods.
Results: Approaches that utilize GSEA can now take input from array chips, eithergene-centric or genome-wide, but are highly sensitive to study design, SNP selectionand pruning strategies, SNP-to-gene mapping, and pathway definitions. Here, wepresent lessons learned from our experience with GSEA of heart failure, a particularlychallenging phenotype due to its underlying heterogeneous etiology.
Conclusions: This case study shows that proper data handling is essential to avoidfalse-positive results. Well-defined pipelines for quality control are needed to avoidreporting spurious results using GSEA.
Results: Approaches that utilize GSEA can now take input from array chips, eithergene-centric or genome-wide, but are highly sensitive to study design, SNP selectionand pruning strategies, SNP-to-gene mapping, and pathway definitions. Here, wepresent lessons learned from our experience with GSEA of heart failure, a particularlychallenging phenotype due to its underlying heterogeneous etiology.
Conclusions: This case study shows that proper data handling is essential to avoidfalse-positive results. Well-defined pipelines for quality control are needed to avoidreporting spurious results using GSEA.
Original language | English |
---|---|
Article number | 18 |
Pages (from-to) | 1-11 |
Number of pages | 11 |
Journal | BioData Mining |
Volume | 10 |
DOIs | |
Publication status | Published - 26 May 2017 |
Keywords
- Gene set enrichment analyses
- Heart failure
- Coronary artery disease