Two-phase incremental kernel PCA for learning massive or online datasets

Feng Zhao, Islem Rekik, Seong-Whan Lee, Jing Liu, Junying Zhang, Dinggang Shen (Lead / Corresponding author)

Research output: Contribution to journal › Article › peer-review

10 Citations (Scopus)
190 Downloads (Pure)

Abstract

As a powerful non-linear feature extractor, kernel principal component analysis (KPCA) has been widely adopted in many machine learning applications. However, KPCA is usually performed in batch mode, which leads to potential problems when handling massive or online datasets. To overcome this drawback, we propose a two-phase incremental KPCA (TP-IKPCA) algorithm that incorporates data into KPCA incrementally. In the first phase, an incremental algorithm explicitly expresses the data in the kernel space. In the second phase, we extend incremental principal component analysis (IPCA) to estimate the kernel principal components. Extensive experiments on both synthesized and real datasets show that TP-IKPCA produces principal components similar to those of conventional batch KPCA while being computationally faster than KPCA and several of its incremental variants. The algorithm can therefore be applied to massive or online datasets for which the batch method is not feasible.
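
The abstract does not give the update rules, so the following is only a minimal sketch of the two-phase idea under stated assumptions, not the authors' TP-IKPCA. It assumes a Gaussian kernel, uses an incremental Cholesky factorization (equivalent to Gram-Schmidt in the kernel space) to obtain explicit coordinates in phase one, and substitutes plain running-moment accumulation for the extended IPCA of phase two. The names `TwoPhaseSketch`, `rbf`, `gamma`, and `tol` are hypothetical.

```python
import numpy as np


def rbf(a, b, gamma=0.5):
    """Gaussian kernel between two 1-D sample vectors (assumed kernel)."""
    d = a - b
    return np.exp(-gamma * np.dot(d, d))


class TwoPhaseSketch:
    """Illustrative two-phase incremental kernel PCA (not the paper's method)."""

    def __init__(self, kernel=rbf, tol=1e-8):
        self.kernel, self.tol = kernel, tol
        self.basis = []              # samples spanning the kernel subspace
        self.L = np.zeros((0, 0))    # Cholesky factor of the basis Gram matrix
        self.n = 0                   # number of samples seen
        self.s = np.zeros(0)         # running sum of coordinates
        self.S = np.zeros((0, 0))    # running sum of coordinate outer products

    def _coords(self, x):
        """Phase 1: coordinates of phi(x) in an orthonormal kernel-space basis."""
        k = np.array([self.kernel(b, x) for b in self.basis])
        c = np.linalg.solve(self.L, k) if self.basis else np.zeros(0)
        resid = self.kernel(x, x) - c @ c   # squared norm outside the span
        if resid > self.tol:                # grow the basis by one vector
            m = len(self.basis)
            L = np.zeros((m + 1, m + 1))
            L[:m, :m] = self.L
            L[m, :m] = c
            L[m, m] = np.sqrt(resid)
            self.L = L
            self.basis.append(x)
            c = np.append(c, np.sqrt(resid))
        return c

    def partial_fit(self, x):
        """Phase 2 (simplified): accumulate moments of the explicit coordinates."""
        c = self._coords(np.asarray(x, dtype=float))
        m, old = len(c), len(self.s)
        if m > old:
            # Earlier samples have zero weight on new basis vectors: zero-pad.
            s = np.zeros(m)
            s[:old] = self.s
            S = np.zeros((m, m))
            S[:old, :old] = self.S
            self.s, self.S = s, S
        self.n += 1
        self.s += c
        self.S += np.outer(c, c)
        return self

    def components(self, k):
        """Top-k eigenvalues and eigenvectors of the coordinate covariance."""
        mu = self.s / self.n
        cov = self.S / self.n - np.outer(mu, mu)
        w, V = np.linalg.eigh(cov)
        order = np.argsort(w)[::-1][:k]
        return w[order], V[:, order]
```

Samples are fed one at a time with `partial_fit`, and `components(k)` can be called at any point to read off the current estimate. Because stored coordinates never change when the basis grows, no past sample has to be revisited, which is the property that makes the approach suitable for online data.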
Original language: English
Article number: 5937274
Number of pages: 17
Journal: Complexity
Volume: 2019
Publication status: Published - 11 Feb 2019

Keywords

  • Kernel principal component analysis (KPCA)
  • Incremental learning
  • Big data
  • Orthonormal basis
