[2604.06465] Multi-objective Evolutionary Merging Enables Efficient Reasoning Models

arXiv - AI April 09, 2026 3 min read

About this article

Abstract page for arXiv paper 2604.06465: Multi-objective Evolutionary Merging Enables Efficient Reasoning Models

Computer Science > Computation and Language

arXiv:2604.06465 (cs)

[Submitted on 7 Apr 2026]

Title: Multi-objective Evolutionary Merging Enables Efficient Reasoning Models

Authors: Mario Iacobelli, Adrian Robert Minut, Tommaso Mencattini, Donato Crisostomi, Andrea Santilli, Iacopo Masi, Emanuele Rodolà

Abstract: Reasoning models have demonstrated remarkable capabilities in solving complex problems by leveraging long chains of thought. However, this more deliberate reasoning comes with substantial computational overhead at inference time. The Long-to-Short (L2S) reasoning problem seeks to maintain high accuracy using fewer tokens, but current training-free model merging approaches rely on scalarized, fixed-hyperparameter arithmetic methods that are highly brittle and force suboptimal compromises. To address this gap, we introduce Evo-L2S, a novel framework that formulates L2S reasoning as a multi-objective optimization challenge. By leveraging evolutionary model merging, Evo-L2S explicitly optimizes the trade-off between accuracy and output length to produce a robust Pareto front of merged models. To make this search computationally tractable for large language models, we propose an entropy-based subset sampling technique that drastically reduces the overhead of fitness estimation. Comprehensive experiments a...
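The abstract describes the output of the search as a Pareto front over two objectives: accuracy (to maximize) and output length (to minimize). The sketch below illustrates only that general notion of non-dominated filtering; it is not the authors' Evo-L2S implementation, and the names `Candidate`, `dominates`, `pareto_front`, and every number in `candidates` are hypothetical values chosen for illustration.

```python
from typing import List, Tuple

# Hypothetical candidate: (accuracy, mean output length in tokens).
# Accuracy is maximized, output length is minimized.
Candidate = Tuple[float, float]


def dominates(a: Candidate, b: Candidate) -> bool:
    """Return True if `a` is no worse than `b` on both objectives
    and strictly better on at least one."""
    acc_a, len_a = a
    acc_b, len_b = b
    no_worse = acc_a >= acc_b and len_a <= len_b
    strictly_better = acc_a > acc_b or len_a < len_b
    return no_worse and strictly_better


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep every candidate that no other candidate dominates (input order preserved)."""
    return [
        c for c in candidates
        if not any(dominates(other, c) for other in candidates)
    ]


# Made-up evaluation results for five merged models (illustrative only).
candidates: List[Candidate] = [
    (0.80, 4000.0),
    (0.78, 2500.0),
    (0.70, 2600.0),  # dominated by (0.78, 2500.0)
    (0.65, 1200.0),
    (0.79, 4100.0),  # dominated by (0.80, 4000.0)
]

front = pareto_front(candidates)
```

Here `front` retains three of the five candidates, each representing a different accuracy/length trade-off. A scalarized method with a fixed weight would instead commit to a single point on this curve, which is the limitation the abstract attributes to existing merging approaches.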

Originally published on April 09, 2026. Curated by AI News.

