Abstract
There are multiple feature selection methods available in the literature for removing unwanted features from modelling. The existing techniques have drawbacks of reproducibility due to random selection of training and validation datasets. In this study, we propose a new resampling approach for feature selection, which helps resolve this drawback. The method will allocate a weight value for all the features in the dataset, and candidate features are selected by placing a cut-off value for the feature weight. The illustrated example shows that the method could select ten features from a set of 254. Results are used to develop a predictive model with a predictive accuracy of 92.3% represented in terms of area under the ROC curve. The results show that the method can successfully select the relevant features which result in an excellent predictive model building compared to commonly used L1, L2, and elastic net regularisation.
Original language | English |
---|---|
Article number | 113465 |
Number of pages | 10 |
Journal | Journal of Computational and Applied Mathematics |
Volume | 391 |
Early online date | 3 Feb 2021 |
DOIs | |
Publication status | Published - 1 Aug 2021 |
Keywords
- Filter method
- Regularisation
- Resampling
- Wrapper method
ASJC Scopus subject areas
- Computational Mathematics
- Applied Mathematics