[2601.13227] Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?
Abstract page for arXiv paper 2601.13227: Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?
Text understanding and language tasks
Abstract page for arXiv paper 2601.22440: AI and My Values: User Perceptions of LLMs' Ability to Extract, Embody, and Explain Human Value...
Abstract page for arXiv paper 2601.13222: Incorporating Q&A Nuggets into Retrieval-Augmented Generation
Abstract page for arXiv paper 2603.21925: Guideline-grounded retrieval-augmented generation for ophthalmic clinical decision support
Abstract page for arXiv paper 2603.21698: A Blueprint for Self-Evolving Coding Agents in Vehicle Aerodynamic Drag Prediction
Abstract page for arXiv paper 2603.21687: Mirage The Illusion of Visual Understanding
Abstract page for arXiv paper 2603.21636: Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confide...
Abstract page for arXiv paper 2603.21630: EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises
Abstract page for arXiv paper 2603.21607: INTRYGUE: Induction-Aware Entropy Gating for Reliable RAG Uncertainty Estimation
Abstract page for arXiv paper 2603.21563: Counterfactual Credit Policy Optimization for Multi-Agent Collaboration
Abstract page for arXiv paper 2603.21558: Stabilizing Iterative Self-Training with Verified Reasoning via Symbolic Recursive Self-Alignment
Abstract page for arXiv paper 2603.21473: Beyond Correlation: Refutation-Validated Aspect-Based Sentiment Analysis for Explainable Energy...
Abstract page for arXiv paper 2603.21448: Safety as Computation: Certified Answer Reuse via Capability Closure in Task-Oriented Dialogue
Abstract page for arXiv paper 2603.21430: DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation
Abstract page for arXiv paper 2603.21344: The AI Scientific Community: Agentic Virtual Lab Swarms
Abstract page for arXiv paper 2603.21272: The Library Theorem: How External Organization Governs Agentic Reasoning Capacity
Abstract page for arXiv paper 2603.21155: Can LLMs Fool Graph Learning? Exploring Universal Adversarial Attacks on Text-Attributed Graphs
Abstract page for arXiv paper 2603.21013: A Framework for Low-Latency, LLM-driven Multimodal Interaction on the Pepper Robot
Abstract page for arXiv paper 2603.20815: GMPilot: An Expert AI Agent For FDA cGMP Compliance
Abstract page for arXiv paper 2603.20650: From 50% to Mastery in 3 Days: A Low-Resource SOP for Localizing Graduate-Level AI Tutors via S...
Abstract page for arXiv paper 2603.20724: Multi-RF Fusion with Multi-GNN Blending for Molecular Property Prediction
Abstract page for arXiv paper 2603.20670: Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework p...
Abstract page for arXiv paper 2603.20510: Grounded Chess Reasoning in Language Models via Master Distillation