[2604.17188] Beyond Overlap Metrics: Rewarding Reasoning and Preferences for Faithful Multi-Role Dialogue Summarization

[2604.17188] Beyond Overlap Metrics: Rewarding Reasoning and Preferences for Faithful Multi-Role Dialogue Summarization

arXiv - AI 4 min read

About this article

Abstract page for arXiv paper 2604.17188: Beyond Overlap Metrics: Rewarding Reasoning and Preferences for Faithful Multi-Role Dialogue Summarization

Computer Science > Computation and Language arXiv:2604.17188 (cs) [Submitted on 19 Apr 2026 (v1), last revised 28 Apr 2026 (this version, v2)] Title:Beyond Overlap Metrics: Rewarding Reasoning and Preferences for Faithful Multi-Role Dialogue Summarization Authors:Xiaoyong Mei, Tingting Zuo, Da Chen, Guangyu Hu, Xiangyu Wen, Chao Duan, Mingyan Zhang, Fudan Zheng View a PDF of the paper titled Beyond Overlap Metrics: Rewarding Reasoning and Preferences for Faithful Multi-Role Dialogue Summarization, by Xiaoyong Mei and 6 other authors View PDF HTML (experimental) Abstract:Multi-role dialogue summarization requires modeling complex interactions among multiple speakers while preserving role-specific information and factual consistency. However, most existing methods optimize for automatic metrics such as ROUGE and BERTScore, which favor surface-level imitation of references rather than genuine gains in faithfulness or alignment with human preferences. We propose a novel framework that couples explicit cognitive-style reasoning with reward-based optimization for multi-role dialogue summarization. Our method first distills structured reasoning traces (e.g., step-by-step inferences and intermediate reflections) from a large teacher model and uses them as auxiliary supervision to initialize a reasoning-aware summarizer via staged supervised fine-tuning. It then applies GRPO with a dual-principle reward that blends metric-based signals with human-aligned criteria targeting key info...

Originally published on April 29, 2026. Curated by AI News.

Related Articles

Google Translate Adds AI Pronunciation Training as It Expands into Language Learning
Machine Learning

Google Translate Adds AI Pronunciation Training as It Expands into Language Learning

Google Translate is moving beyond simple translation, introducing a new artificial intelligence feature designed to help users improve th...

AI News - General · 3 min ·
Solving the “Whac-a-mole dilemma”: A smarter way to debias AI vision models
Machine Learning

Solving the “Whac-a-mole dilemma”: A smarter way to debias AI vision models

A new debiasing approach called WRING resolves the "Whac-a-Mole dilemma" of existing debiasing approaches that can create or am...

AI News - General · 7 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Improving AI models’ ability to explain their predictions
Machine Learning

Improving AI models’ ability to explain their predictions

AI News - General · 9 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime