[2511.08409] Faithful-First Reasoning, Planning, and Acting for Multimodal LLMs
Computer Science > Artificial Intelligence
arXiv:2511.08409 (cs)
[Submitted on 11 Nov 2025 (v1), last revised 8 Apr 2026 (this version, v4)]

Title: Faithful-First Reasoning, Planning, and Acting for Multimodal LLMs
Authors: Junxian Li, Xinyue Xu, Sai Ma, Di Zhang, Sichao Li

Abstract: Multimodal Large Language Models (MLLMs) frequently suffer from unfaithfulness, generating reasoning chains that drift from visual evidence or contradict final predictions. We propose the Faithful-First Reasoning, Planning, and Acting (RPA) framework, in which FaithEvi provides step-wise and chain-level supervision by evaluating the faithfulness of intermediate reasoning, and FaithAct uses these signals to plan and execute faithfulness-aware actions during inference. Experiments across multiple multimodal reasoning benchmarks show that faithful-first RPA improves perceptual faithfulness by up to 24% over prompt-based and tool-augmented reasoning frameworks, without degrading task accuracy. Our analysis shows that treating faithfulness as a guiding principle yields perceptually faithful reasoning trajectories and mitigates hallucination behavior. This work thereby establishes a unified framework for both evaluating and enforcing faithfulness in multimodal reasoning. Code is at this https URL.

Subjects: Artificial Intelligence (cs.AI)
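The abstract's control flow can be pictured as a loop that scores each intermediate reasoning step for faithfulness and triggers a corrective action when the score falls below a threshold. The sketch below is purely illustrative and is not from the paper: `score_step` stands in for FaithEvi-style step-wise scoring and `revise_step` for a FaithAct-style corrective action; both signatures, the `Step` dataclass, and the threshold/retry logic are assumptions.

```python
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class Step:
    """One reasoning step with its (hypothetical) faithfulness score."""
    text: str
    score: float = 0.0

def faithful_first_loop(
    steps: List[str],
    score_step: Callable[[str], float],   # stand-in for FaithEvi scoring
    revise_step: Callable[[str], str],    # stand-in for a FaithAct action
    threshold: float = 0.5,
    max_retries: int = 2,
) -> List[Step]:
    """Score each step; revise low-faithfulness steps before keeping them."""
    chain: List[Step] = []
    for text in steps:
        score = score_step(text)
        retries = 0
        while score < threshold and retries < max_retries:
            text = revise_step(text)      # e.g., re-ground on visual evidence
            score = score_step(text)
            retries += 1
        chain.append(Step(text, score))
    return chain
```

With a toy scorer that penalizes ungrounded language (e.g., steps containing "guess") and a reviser that rewrites them, the loop returns a chain whose steps all clear the threshold or have exhausted their retry budget.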