[2510.14377] PluriHopRAG: Exhaustive, Recall-Sensitive QA Through

[2510.14377] PluriHopRAG: Exhaustive, Recall-Sensitive QA Through Corpus-Specific Document Structure Learning

arXiv - Machine Learning April 02, 2026 4 min read

About this article

Abstract page for arXiv paper 2510.14377: PluriHopRAG: Exhaustive, Recall-Sensitive QA Through Corpus-Specific Document Structure Learning

Computer Science > Computation and Language arXiv:2510.14377 (cs) [Submitted on 16 Oct 2025 (v1), last revised 1 Apr 2026 (this version, v2)] Title:PluriHopRAG: Exhaustive, Recall-Sensitive QA Through Corpus-Specific Document Structure Learning Authors:Mykolas Sveistrys, Richard Kunert View a PDF of the paper titled PluriHopRAG: Exhaustive, Recall-Sensitive QA Through Corpus-Specific Document Structure Learning, by Mykolas Sveistrys and 1 other authors View PDF HTML (experimental) Abstract:Retrieval-Augmented Generation (RAG) has been used in question answering (QA) systems to improve performance when relevant information is in one (single-hop) or multiple (multi-hop) passages. However, many real life scenarios (e.g. dealing with financial, legal, medical reports) require checking all documents for relevant information without a clear stopping condition. We term these pluri-hop questions, and formalize them by 3 conditions - recall sensitivity, exhaustiveness, and exactness. To study this setting, we introduce PluriHopWIND, a multilingual diagnostic benchmark of 48 pluri-hop questions over 191 real wind-industry reports, with high repetitiveness to reflect the challenge of distractors in real-world datasets. Naive, graph-based, and multimodal RAG methods only reach up to 40% statement-wise F1 on PluriHopWIND. Motivated by this, we propose PluriHopRAG, which learns from synthetic examples to decompose queries according to corpus-specific document structure, and employs a cr...

Originally published on April 02, 2026. Curated by AI News.

Llms

[2602.00750] Bypassing Prompt Injection Detectors through Evasive Injections

Abstract page for arXiv paper 2602.00750: Bypassing Prompt Injection Detectors through Evasive Injections

arXiv - AI · 4 min · about 1 hour ago

Nlp

[2512.18640] Geometric-Photometric Event-based 3D Gaussian Ray Tracing

Abstract page for arXiv paper 2512.18640: Geometric-Photometric Event-based 3D Gaussian Ray Tracing

arXiv - AI · 4 min · about 1 hour ago

Llms

[2511.08225] Benchmarking Educational LLMs with Analytics: A Case Study on Gender Bias in Feedback

Abstract page for arXiv paper 2511.08225: Benchmarking Educational LLMs with Analytics: A Case Study on Gender Bias in Feedback

arXiv - AI · 4 min · about 1 hour ago

Llms

[2511.20224] DuoTok: Source-Aware Dual-Track Tokenization for Multi-Track Music Language Modeling

Abstract page for arXiv paper 2511.20224: DuoTok: Source-Aware Dual-Track Tokenization for Multi-Track Music Language Modeling

arXiv - AI · 3 min · about 1 hour ago

[2510.14377] PluriHopRAG: Exhaustive, Recall-Sensitive QA Through Corpus-Specific Document Structure Learning

About this article

Related Articles

[2602.00750] Bypassing Prompt Injection Detectors through Evasive Injections

[2512.18640] Geometric-Photometric Event-based 3D Gaussian Ray Tracing

[2511.08225] Benchmarking Educational LLMs with Analytics: A Case Study on Gender Bias in Feedback

[2511.20224] DuoTok: Source-Aware Dual-Track Tokenization for Multi-Track Music Language Modeling

No comments

Stay updated with AI News