[2603.25024] Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method
Statistics > Machine Learning
arXiv:2603.25024 (stat)
[Submitted on 26 Mar 2026]

Title: Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method
Authors: Chenxu Yu, Wenqi Fang

Abstract: As a representative continuous-depth neural network approach, stochastic differential equation (SDE)-based Bayesian neural networks (BNNs) have attracted considerable attention due to their solid theoretical foundations and strong potential for real-world applications. However, their reliance on numerical SDE solvers inevitably incurs a large number of function evaluations (NFEs), resulting in high computational cost and occasional convergence instability. To address these challenges, we propose a Nesterov accelerated gradient (NAG)-enhanced SDE-BNN model. By integrating NAG into the SDE-BNN framework along with an NFE-dependent residual skip connection, our method accelerates convergence and substantially reduces NFEs during both training and testing. Extensive empirical results show that our model consistently outperforms conventional SDE-BNNs across various tasks, including image classification and sequence modeling, achieving lower NFEs and improved predictive accuracy.

Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:2603.25024 [stat.ML]
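The abstract does not spell out the update rule, but for context, here is a minimal sketch of the classic Nesterov accelerated gradient step the method builds on. In continuous time, NAG is known to correspond to a second-order ODE (Su, Boyd, and Candès, 2016), which is what makes a momentum-style acceleration natural in a continuous-depth model. All names below (nag_step, grad_f, lr, momentum) are generic placeholders for illustration, not identifiers from the paper.

import numpy as np

def nag_step(x, v, grad_f, lr=0.1, momentum=0.9):
    """One NAG update: evaluate the gradient at a look-ahead point."""
    lookahead = x + momentum * v              # peek ahead along the momentum direction
    v_new = momentum * v - lr * grad_f(lookahead)
    return x + v_new, v_new

# Usage on a toy quadratic f(x) = ||x||^2 / 2, whose gradient is x itself.
x, v = np.array([5.0, -3.0]), np.zeros(2)
for _ in range(50):
    x, v = nag_step(x, v, grad_f=lambda z: z)
print(x)  # spirals in toward the minimizer at the origin

The look-ahead gradient evaluation is what distinguishes NAG from plain momentum and is the source of its accelerated convergence rate on smooth convex problems.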