[2604.16909] PRISM: Probing Reasoning, Instruction, and Source Memory in LLM Hallucinations
Computer Science > Computation and Language

arXiv:2604.16909 (cs)

[Submitted on 18 Apr 2026 (v1), last revised 26 Apr 2026 (this version, v2)]

Title: PRISM: Probing Reasoning, Instruction, and Source Memory in LLM Hallucinations

Authors: Yuhe Wu, Guangyu Wang, Yuran Chen, Jiatong Zhang, Yutong Zhang, Yujie Chen, Jiaming Shang, Guang Zhang, Zhuang Liu

Abstract: As large language models (LLMs) evolve from conversational assistants into agents capable of handling complex tasks, they are increasingly deployed in high-risk domains. However, existing benchmarks largely rely on mixed queries and posterior, output-level scoring, which quantifies hallucination severity but offers limited insight into where and why hallucinations arise in the generation pipeline. We therefore reformulate hallucination evaluation as a diagnostic problem and propose PRISM, a controlled benchmark that disentangles hallucinations into four dimensions: knowledge missing, knowledge errors, reasoning errors, and instruction-following errors, grounded in three stages of generation (memory, instruction, and reasoning). PRISM contains 9,448 instances across 65 tasks and supports fine-grained, stage-aware diagnostic evaluation. Evaluating 24 mainstream open-source and proprietary LLMs, we uncover consistent trade-offs across instructi...