A weight function method for selection of proteins to predict an outcome using protein expression data

Gajendra K. Vishwakarma, Abin Thomas, Atanu Bhattacharjee

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

There are multiple feature selection methods available in the literature for removing unwanted features from modelling. The existing techniques have drawbacks of reproducibility due to random selection of training and validation datasets. In this study, we propose a new resampling approach for feature selection, which helps resolve this drawback. The method will allocate a weight value for all the features in the dataset, and candidate features are selected by placing a cut-off value for the feature weight. The illustrated example shows that the method could select ten features from a set of 254. Results are used to develop a predictive model with a predictive accuracy of 92.3% represented in terms of area under the ROC curve. The results show that the method can successfully select the relevant features which result in an excellent predictive model building compared to commonly used L1, L2, and elastic net regularisation.

Original languageEnglish
Article number113465
Number of pages10
JournalJournal of Computational and Applied Mathematics
Volume391
Early online date3 Feb 2021
DOIs
Publication statusPublished - 1 Aug 2021

Keywords

  • Filter method
  • Regularisation
  • Resampling
  • Wrapper method

ASJC Scopus subject areas

  • Computational Mathematics
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'A weight function method for selection of proteins to predict an outcome using protein expression data'. Together they form a unique fingerprint.

Cite this