TY - JOUR
T1 - Machine learning and data mining frameworks for predicting drug response in cancer
T2 - An overview and a novel in silico screening process based on association rule mining
AU - Vougas, Konstantinos
AU - Sakelaropoulos, Theodore
AU - Kotsinas, Athanassios
AU - Foukas, George-Romanos P.
AU - Ntargaras, Andreas
AU - Koinis, Filippos
AU - Polyzos, Alexander
AU - Myrianthopoulos, Vassilis
AU - Zhou, Hua
AU - Narang, Sonali
AU - Georgoulias, Vassilis
AU - Alexopoulos, Leonidas
AU - Aifantis, Iannis
AU - Townsend, Paul A.
AU - Sfikakis, Petros
AU - Fitzgerald, Rebecca
AU - Thanos, Dimitris
AU - Bartek, Jiri
AU - Petty, Russell
AU - Tsirigos, Aristotelis
AU - Gorgoulis, Vassilis G.
N1 - Financial support was from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grants agreement No. 722729 (SYNTRAIN); the Welfare Foundation for Social & Cultural Sciences (KIKPE), Greece; Pentagon Biotechnology Ltd, UK; DeepMed IO Ltd, UK and NKUA-SARG grants No 70/3/9816, 70/3/12128. Dr. Tsirigos and the NYU Applied Bioinformatics Laboratories (ABL) are partially supported by the Cancer Center Support Grant P30CA016087 at the Laura and Isaac Perlmutter Cancer Center (A.T.).
PY - 2019/11
Y1 - 2019/11
N2 - A major challenge in cancer treatment is predicting the clinical response to anti-cancer drugs on a personalized basis. The success of such a task largely depends on the ability to develop computational resources that integrate big "omic" data into effective drug-response models. Machine learning is both an expanding and an evolving computational field that holds promise to cover such needs. Here we provide a focused overview of: 1) the various supervised and unsupervised algorithms used specifically in drug response prediction applications, 2) the strategies employed to develop these algorithms into applicable models, 3) data resources that are fed into these frameworks and 4) pitfalls and challenges to maximize model performance. In this context we also describe a novel in silico screening process, based on Association Rule Mining, for identifying genes as candidate drivers of drug response and compare it with relevant data mining frameworks, for which we generated a web application freely available at: https://compbio.nyumc.org/drugs/. This pipeline explores with high efficiency large sample-spaces, while is able to detect low frequency events and evaluate statistical significance even in the multidimensional space, presenting the results in the form of easily interpretable rules. We conclude with future prospects and challenges of applying machine learning based drug response prediction in precision medicine.
AB - A major challenge in cancer treatment is predicting the clinical response to anti-cancer drugs on a personalized basis. The success of such a task largely depends on the ability to develop computational resources that integrate big "omic" data into effective drug-response models. Machine learning is both an expanding and an evolving computational field that holds promise to cover such needs. Here we provide a focused overview of: 1) the various supervised and unsupervised algorithms used specifically in drug response prediction applications, 2) the strategies employed to develop these algorithms into applicable models, 3) data resources that are fed into these frameworks and 4) pitfalls and challenges to maximize model performance. In this context we also describe a novel in silico screening process, based on Association Rule Mining, for identifying genes as candidate drivers of drug response and compare it with relevant data mining frameworks, for which we generated a web application freely available at: https://compbio.nyumc.org/drugs/. This pipeline explores with high efficiency large sample-spaces, while is able to detect low frequency events and evaluate statistical significance even in the multidimensional space, presenting the results in the form of easily interpretable rules. We conclude with future prospects and challenges of applying machine learning based drug response prediction in precision medicine.
KW - Drug Response Prediction
KW - Precision Medicine
KW - Data mining
KW - Machine Learning Association Rule Mining
KW - Association Rule Mining
KW - Machine Learning
UR - http://www.scopus.com/inward/record.url?scp=85070755907&partnerID=8YFLogxK
U2 - 10.1016/j.pharmthera.2019.107395
DO - 10.1016/j.pharmthera.2019.107395
M3 - Article
C2 - 31374225
SN - 0163-7258
VL - 203
JO - Pharmacology & Therapeutics
JF - Pharmacology & Therapeutics
M1 - 107395
ER -