[2603.21991] λ-GELU: Learning Gating Hardness for Controlled ReLU-ization in Deep Networks
Computer Science > Machine Learning
arXiv:2603.21991 (cs)
[Submitted on 23 Mar 2026]

Title: λ-GELU: Learning Gating Hardness for Controlled ReLU-ization in Deep Networks
Authors: Cristian Pérez-Corral, Alberto Fernández-Hernández, Jose I. Mestre, Manuel F. Dolz, Enrique S. Quintana-Ortí

Abstract: The Gaussian Error Linear Unit (GELU) is a widely used smooth alternative to the Rectified Linear Unit (ReLU), yet many deployment, compression, and analysis toolchains are most naturally expressed for piecewise-linear (ReLU-type) networks. We study a hardness-parameterized formulation of GELU, f(x; λ) = x Φ(λx), where Φ is the Gaussian CDF and λ ∈ [1, ∞) controls gate sharpness, with the goal of turning smooth gated training into a controlled path toward ReLU-compatible models. Learning λ is non-trivial: naive updates yield unstable dynamics and effective gradient attenuation, so we introduce a constrained reparameterization and an optimizer-aware update scheme. Empirically, across a diverse set of model--dataset pairs spanning MLPs, CNNs, and Transformers, we observe structured layerwise hardness profiles and assess their robustness under different initializations. We further study a deterministic ReLU-ization strategy in which the learned gates are progr...
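The activation defined in the abstract, f(x; λ) = x Φ(λx), can be sketched numerically with the standard-library error function; this is an illustrative implementation, not the authors' code. The function name `lambda_gelu` and the sample inputs are assumptions for the example. Setting λ = 1 recovers plain GELU, and as λ grows the Gaussian gate Φ(λx) sharpens toward a step function, so the activation approaches ReLU:

```python
import math

def lambda_gelu(x: float, lam: float = 1.0) -> float:
    """Hypothetical sketch of λ-GELU: f(x; λ) = x * Φ(λx),
    where Φ is the standard Gaussian CDF.

    λ = 1 recovers plain GELU; λ → ∞ approaches ReLU,
    since Φ(λx) → 1 for x > 0 and Φ(λx) → 0 for x < 0.
    """
    # Gaussian CDF via the error function: Φ(z) = (1 + erf(z / √2)) / 2
    phi = 0.5 * (1.0 + math.erf(lam * x / math.sqrt(2.0)))
    return x * phi

# λ = 1: the usual GELU, e.g. f(1; 1) = Φ(1) ≈ 0.8413.
# Large λ: the gate is nearly a hard threshold, so f(x; λ) ≈ max(0, x).
```

This limiting behavior is what makes the hardness parameter a "controlled path toward ReLU-compatible models": pushing the learned λ upward during or after training moves each gate continuously from smooth GELU toward exact ReLU.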