[2603.23860] Why the Maximum Second Derivative of Activations Matters for Adversarial Robustness

arXiv - Machine Learning March 26, 2026 3 min read

Computer Science > Machine Learning

arXiv:2603.23860 (cs) [Submitted on 25 Mar 2026]

Title: Why the Maximum Second Derivative of Activations Matters for Adversarial Robustness

Authors: Yunrui Yu, Hang Su, Jun Zhu

Abstract: This work investigates the critical role of activation function curvature -- quantified by the maximum second derivative $\max|\sigma''|$ -- in adversarial robustness. Using the Recursive Curvature-Tunable Activation Family (RCT-AF), which enables precise control over curvature through parameters $\alpha$ and $\beta$, we systematically analyze this relationship. Our study reveals a fundamental trade-off: insufficient curvature limits model expressivity, while excessive curvature amplifies the normalized Hessian diagonal norm of the loss, leading to sharper minima that hinder robust generalization. This results in a non-monotonic relationship where optimal adversarial robustness consistently occurs when $\max|\sigma''|$ falls within 4 to 10, a finding that holds across diverse network architectures, datasets, and adversarial training methods. We provide theoretical insights into how activation curvature affects the diagonal elements of the Hessian matrix of the loss, and experimentally demonstrate that the normalized Hessian diagonal norm exhibits a U-shaped dependence on $\max|\sigm...
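To make the quantity $\max|\sigma''|$ concrete, here is a minimal sketch that estimates it numerically for a single activation. The abstract does not define RCT-AF, so this example uses the standard softplus with a sharpness parameter as a stand-in; it is not the paper's activation family, and this `beta` is the softplus sharpness, not the paper's $\beta$. The helper names (`second_derivative`, `max_abs_second_derivative`, `softplus_curvature`) and the grid settings are assumptions chosen for illustration. For softplus, $\sigma''(x) = \beta\, s(\beta x)(1 - s(\beta x))$ with $s$ the logistic sigmoid, which peaks at $x = 0$ with value $\beta/4$.

```python
import math


def softplus(x, beta=1.0):
    """Softplus with sharpness beta: (1/beta) * log(1 + exp(beta * x))."""
    z = beta * x
    # log(1 + e^z) = max(z, 0) + log1p(e^{-|z|}) avoids overflow for large |z|.
    return (max(z, 0.0) + math.log1p(math.exp(-abs(z)))) / beta


def second_derivative(f, x, h=1e-3):
    """Central finite-difference estimate of f''(x)."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)


def max_abs_second_derivative(f, lo=-5.0, hi=5.0, n=2001, h=1e-3):
    """Estimate max |f''| by scanning an evenly spaced grid on [lo, hi]."""
    step = (hi - lo) / (n - 1)
    return max(abs(second_derivative(f, lo + i * step, h)) for i in range(n))


def softplus_curvature(beta):
    """Estimated max |sigma''| for softplus; analytically this is beta / 4."""
    return max_abs_second_derivative(lambda x: softplus(x, beta))


if __name__ == "__main__":
    # Stand-in activation only: shows which sharpness values put softplus
    # inside the 4-to-10 curvature range reported in the abstract.
    for beta in (1.0, 16.0, 40.0, 80.0):
        print(f"beta={beta:5.1f}  max|sigma''| ~ {softplus_curvature(beta):.3f}")
```

Under this stand-in, the 4 to 10 range from the abstract corresponds to softplus sharpness between roughly 16 and 40. The same scan can be pointed at any scalar activation to check where its curvature falls, provided the grid is fine enough to resolve the peak.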

Originally published on March 26, 2026. Curated by AI News.
