[2305.02657] On the Eigenvalue Decay Rates of a Class of Neural-Network Related Kernel Functions Defined on General Domains
About this article
Abstract page for arXiv paper 2305.02657: On the Eigenvalue Decay Rates of a Class of Neural-Network Related Kernel Functions Defined on General Domains
Statistics > Machine Learning arXiv:2305.02657 (stat) [Submitted on 4 May 2023 (v1), last revised 7 Apr 2026 (this version, v5)] Title:On the Eigenvalue Decay Rates of a Class of Neural-Network Related Kernel Functions Defined on General Domains Authors:Yicheng Li, Zixiong Yu, Guhan Chen, Qian Lin View a PDF of the paper titled On the Eigenvalue Decay Rates of a Class of Neural-Network Related Kernel Functions Defined on General Domains, by Yicheng Li and 3 other authors View PDF HTML (experimental) Abstract:In this paper, we provide a strategy to determine the eigenvalue decay rate (EDR) of a large class of kernel functions defined on a general domain rather than $\mathbb S^{d}$. This class of kernel functions include but are not limited to the neural tangent kernel associated with neural networks with different depths and various activation functions. After proving that the dynamics of training the wide neural networks uniformly approximated that of the neural tangent kernel regression on general domains, we can further illustrate the minimax optimality of the wide neural network provided that the underground truth function $f\in [\mathcal H_{\mathrm{NTK}}]^{s}$, an interpolation space associated with the RKHS $\mathcal{H}_{\mathrm{NTK}}$ of NTK. We also showed that the overfitted neural network can not generalize well. We believe our approach for determining the EDR of kernels might be also of independent interests. Subjects: Machine Learning (stat.ML); Machine Learning...