[2603.18492] AIMER: Calibration-Free Task-Agnostic MoE Pruning
Computer Science > Machine Learning
arXiv:2603.18492 (cs)
[Submitted on 19 Mar 2026 (v1), last revised 13 Apr 2026 (this version, v2)]
Title: AIMER: Calibration-Free Task-Agnostic MoE Pruning
Authors: Zongfang Liu, Shengkun Tang, Yifan Shen, Huan Wang, Xin Yuan
Abstract: Mixture-of-Experts (MoE) language models increase parameter capacity without proportional per-token compute, but deployment still requires storing all experts, making expert pruning important for reducing memory and serving overhead. Existing task-agnostic expert pruning methods are typically calibration-dependent: they estimate expert importance from routing or activation statistics on a calibration set, which makes pruning outcomes sensitive to the choice of calibration set and adds substantial preprocessing cost. We introduce AIMER (\textbf{A}bsolute mean over root mean square \textbf{IM}portance for \textbf{E}xpert \textbf{R}anking), a simple calibration-free criterion that yields clear within-layer score separation and distinct expert stratification. Across 7B to 30B MoE language models at 25\% and 50\% pruning ratios over 16 benchmarks, AIMER consistently delivers competitive or stronger overall performance than state-of-the-art calibration-based expert pruning baselines, requiring only 0.22--1.27 seconds to score the experts.
Subjects: Machine Learning (...
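The abstract only names the criterion (absolute mean over root mean square of expert parameters) and does not spell out which tensors are pooled or how scores are aggregated across layers; the sketch below is one plausible, calibration-free reading of that ratio in PyTorch, with the scoring helpers and the keep-highest-score convention being assumptions rather than the paper's exact recipe.

```python
import torch

def aimer_score(expert_weight: torch.Tensor) -> float:
    """Hypothetical sketch: score one expert as |mean| / RMS of its weights.

    Which weight tensors are used and how multiple matrices per expert are
    combined is an assumption here, not taken from the paper.
    """
    w = expert_weight.float().flatten()
    abs_mean = w.abs().mean()
    rms = w.pow(2).mean().sqrt()
    return (abs_mean / (rms + 1e-12)).item()

def rank_experts(expert_weights: list[torch.Tensor], keep_ratio: float = 0.75) -> list[int]:
    """Rank experts within a layer by the sketch score and keep the top fraction.

    Keeping the highest-scoring experts is assumed; no calibration data is
    needed, since the score depends only on the weights.
    """
    scores = [aimer_score(w) for w in expert_weights]
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    n_keep = max(1, int(round(keep_ratio * len(scores))))
    return sorted(order[:n_keep])

# Example: 8 experts with random weights, 25% pruning ratio (keep 6 of 8).
experts = [torch.randn(1024, 4096) for _ in range(8)]
print(rank_experts(experts, keep_ratio=0.75))
```

Because the score is computed from weights alone, it is consistent with the reported sub-second (0.22--1.27 s) expert-scoring cost and with the claim that no calibration set is required.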