[2604.17188] Beyond Overlap Metrics: Rewarding Reasoning and

[2604.17188] Beyond Overlap Metrics: Rewarding Reasoning and Preferences for Faithful Multi-Role Dialogue Summarization

arXiv - AI April 29, 2026 4 min read

About this article

Abstract page for arXiv paper 2604.17188: Beyond Overlap Metrics: Rewarding Reasoning and Preferences for Faithful Multi-Role Dialogue Summarization

Computer Science > Computation and Language arXiv:2604.17188 (cs) [Submitted on 19 Apr 2026 (v1), last revised 28 Apr 2026 (this version, v2)] Title:Beyond Overlap Metrics: Rewarding Reasoning and Preferences for Faithful Multi-Role Dialogue Summarization Authors:Xiaoyong Mei, Tingting Zuo, Da Chen, Guangyu Hu, Xiangyu Wen, Chao Duan, Mingyan Zhang, Fudan Zheng View a PDF of the paper titled Beyond Overlap Metrics: Rewarding Reasoning and Preferences for Faithful Multi-Role Dialogue Summarization, by Xiaoyong Mei and 6 other authors View PDF HTML (experimental) Abstract:Multi-role dialogue summarization requires modeling complex interactions among multiple speakers while preserving role-specific information and factual consistency. However, most existing methods optimize for automatic metrics such as ROUGE and BERTScore, which favor surface-level imitation of references rather than genuine gains in faithfulness or alignment with human preferences. We propose a novel framework that couples explicit cognitive-style reasoning with reward-based optimization for multi-role dialogue summarization. Our method first distills structured reasoning traces (e.g., step-by-step inferences and intermediate reflections) from a large teacher model and uses them as auxiliary supervision to initialize a reasoning-aware summarizer via staged supervised fine-tuning. It then applies GRPO with a dual-principle reward that blends metric-based signals with human-aligned criteria targeting key info...

Originally published on April 29, 2026. Curated by AI News.

Machine Learning

Google Translate Adds AI Pronunciation Training as It Expands into Language Learning

Google Translate is moving beyond simple translation, introducing a new artificial intelligence feature designed to help users improve th...

AI News - General · 3 min · 22 minutes ago

Machine Learning

Solving the “Whac-a-mole dilemma”: A smarter way to debias AI vision models

A new debiasing approach called WRING resolves the "Whac-a-Mole dilemma" of existing debiasing approaches that can create or am...

AI News - General · 7 min · 22 minutes ago

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · 23 minutes ago

Machine Learning

Improving AI models’ ability to explain their predictions

AI News - General · 9 min · 23 minutes ago

[2604.17188] Beyond Overlap Metrics: Rewarding Reasoning and Preferences for Faithful Multi-Role Dialogue Summarization

About this article

Related Articles

Google Translate Adds AI Pronunciation Training as It Expands into Language Learning

Solving the “Whac-a-mole dilemma”: A smarter way to debias AI vision models

UMKC Announces New Master of Science in Artificial Intelligence

Improving AI models’ ability to explain their predictions

No comments

Stay updated with AI News