Construction of the Kolmogorov-Arnold networks using the Newton-Kaczmarz method

Michael Poluektov (Lead / Corresponding author), Andrew Polar

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)
20 Downloads (Pure)

Abstract

It is known that any continuous multivariate function can be represented exactly by a composition functions of a single variable—the so-called Kolmogorov-Arnold representation. It can be a convenient tool for tasks where it is required to obtain a predictive model that maps some vector input of a black box system into a scalar output. In this case, the representation may not be exact, and it is more correct to refer to such structure as the Kolmogorov-Arnold model (or, as more recently popularised, ‘network’). Construction of such model based on the recorded input–output data is a challenging task. In the present paper, it is suggested to decompose the underlying functions of the representation into continuous basis functions and parameters. It is then proposed to find the parameters using the Newton-Kaczmarz method for solving systems of non-linear equations. The algorithm is then modified to support parallelisation. The paper demonstrates that such approach is also an excellent tool for data-driven solution of partial differential equations. Numerical examples show that for the considered model, the Newton-Kaczmarz method for parameter estimation is efficient and more robust with respect to the section of the initial guess than the straightforward application of the Gauss-Newton method. Finally, the Kolmogorov-Arnold model is compared to the MATLAB’s built-in neural networks on a relatively large-scale problem (25 inputs, datasets of 10 million records), significantly outperforming the multilayer perceptrons in this particular problem (4–10 min vs. 4–8 h of training time, as well as higher accuracy, lower CPU usage, and smaller memory footprint).

Original languageEnglish
Article number185
Number of pages36
JournalMachine Learning
Volume114
Issue number8
Early online date11 Jul 2025
DOIs
Publication statusPublished - Aug 2025

Keywords

  • Discrete Urysohn operator
  • Generalised additive model
  • Kaczmarz method
  • Kolmogorov-Arnold networks
  • Kolmogorov-Arnold representation
  • Newton-Kaczmarz method
  • Ridge function

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Construction of the Kolmogorov-Arnold networks using the Newton-Kaczmarz method'. Together they form a unique fingerprint.

Cite this