[2603.21991] λ-GELU: Learning Gating Hardness for Controlled ReLU-ization in Deep Networks

[2603.21991] λ-GELU: Learning Gating Hardness for Controlled ReLU-ization in Deep Networks

arXiv - Machine Learning 4 min read

About this article

Abstract page for arXiv paper 2603.21991: λ-GELU: Learning Gating Hardness for Controlled ReLU-ization in Deep Networks

Computer Science > Machine Learning arXiv:2603.21991 (cs) [Submitted on 23 Mar 2026] Title:λ-GELU: Learning Gating Hardness for Controlled ReLU-ization in Deep Networks Authors:Cristian Pérez-Corral, Alberto Fernández-Hernández, Jose I. Mestre, Manuel F. Dolz, Enrique S. Quintana-Ortí View a PDF of the paper titled {\lambda}-GELU: Learning Gating Hardness for Controlled ReLU-ization in Deep Networks, by Cristian P\'erez-Corral and 4 other authors View PDF HTML (experimental) Abstract:Gaussian Error Linear Unit (GELU) is a widely used smooth alternative to Rectifier Linear Unit (ReLU), yet many deployment, compression, and analysis toolchains are most naturally expressed for piecewise-linear (ReLU-type) networks. We study a hardness-parameterized formulation of GELU, f(x;{\lambda})=x{\Phi}({\lambda} x), where {\Phi} is the Gaussian CDF and {\lambda} \in [1, infty) controls gate sharpness, with the goal of turning smooth gated training into a controlled path toward ReLU-compatible models. Learning {\lambda} is non-trivial: naive updates yield unstable dynamics and effective gradient attenuation, so we introduce a constrained reparameterization and an optimizer-aware update scheme. Empirically, across a diverse set of model--dataset pairs spanning MLPs, CNNs, and Transformers, we observe structured layerwise hardness profiles and assess their robustness under different initializations. We further study a deterministic ReLU-ization strategy in which the learned gates are progr...

Originally published on March 24, 2026. Curated by AI News.

Related Articles

All the latest in AI ‘music’ | The Verge
Ai Infrastructure

All the latest in AI ‘music’ | The Verge

The Verge is about technology and how it makes us feel. Founded in 2011, we offer our audience everything from breaking news to reviews t...

The Verge - AI · 18 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Llms

If AI is really making us more productive... why does it feel like we are working more, not less...?

The promise of AI was the ultimate system optimisation: Efficiency. On paper, the tools are delivering something similar to what they pro...

Reddit - Artificial Intelligence · 1 min ·
Ai Infrastructure

[P] Built an open source tool to find the location of any street picture

Hey guys, Thank you so much for your love and support regarding Netryx Astra V2 last time. Many people are not that technically savvy to ...

Reddit - Machine Learning · 1 min ·
More in Ai Infrastructure: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime