Natural Language Processing

Text understanding and language tasks

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Llms

[2601.13227] Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?

Abstract page for arXiv paper 2601.13227: Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?

arXiv - AI · 3 min · about 2 hours ago

Llms

[2601.22440] AI and My Values: User Perceptions of LLMs' Ability to Extract, Embody, and Explain Human Values from Casual Conversations

Abstract page for arXiv paper 2601.22440: AI and My Values: User Perceptions of LLMs' Ability to Extract, Embody, and Explain Human Value...

arXiv - AI · 4 min · about 2 hours ago

Nlp

[2601.13222] Incorporating Q&A Nuggets into Retrieval-Augmented Generation

Abstract page for arXiv paper 2601.13222: Incorporating Q&A Nuggets into Retrieval-Augmented Generation

arXiv - AI · 3 min · about 2 hours ago

All Content

Nlp

[2603.21925] Guideline-grounded retrieval-augmented generation for ophthalmic clinical decision support

Abstract page for arXiv paper 2603.21925: Guideline-grounded retrieval-augmented generation for ophthalmic clinical decision support

arXiv - AI · 4 min · 6 days ago

Nlp

[2603.21698] A Blueprint for Self-Evolving Coding Agents in Vehicle Aerodynamic Drag Prediction

Abstract page for arXiv paper 2603.21698: A Blueprint for Self-Evolving Coding Agents in Vehicle Aerodynamic Drag Prediction

arXiv - AI · 4 min · 6 days ago

Machine Learning

[2603.21687] Mirage The Illusion of Visual Understanding

Abstract page for arXiv paper 2603.21687: Mirage The Illusion of Visual Understanding

arXiv - AI · 4 min · 6 days ago

Llms

[2603.21636] Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks

Abstract page for arXiv paper 2603.21636: Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confide...

arXiv - AI · 4 min · 6 days ago

Llms

[2603.21630] EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises

Abstract page for arXiv paper 2603.21630: EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises

arXiv - AI · 3 min · 6 days ago

Llms

[2603.21607] INTRYGUE: Induction-Aware Entropy Gating for Reliable RAG Uncertainty Estimation

Abstract page for arXiv paper 2603.21607: INTRYGUE: Induction-Aware Entropy Gating for Reliable RAG Uncertainty Estimation

arXiv - AI · 3 min · 6 days ago

Llms

[2603.21563] Counterfactual Credit Policy Optimization for Multi-Agent Collaboration

Abstract page for arXiv paper 2603.21563: Counterfactual Credit Policy Optimization for Multi-Agent Collaboration

arXiv - AI · 3 min · 6 days ago

Machine Learning

[2603.21558] Stabilizing Iterative Self-Training with Verified Reasoning via Symbolic Recursive Self-Alignment

Abstract page for arXiv paper 2603.21558: Stabilizing Iterative Self-Training with Verified Reasoning via Symbolic Recursive Self-Alignment

arXiv - AI · 4 min · 6 days ago

Nlp

[2603.21473] Beyond Correlation: Refutation-Validated Aspect-Based Sentiment Analysis for Explainable Energy Market Returns

Abstract page for arXiv paper 2603.21473: Beyond Correlation: Refutation-Validated Aspect-Based Sentiment Analysis for Explainable Energy...

arXiv - Machine Learning · 3 min · 6 days ago

Nlp

[2603.21448] Safety as Computation: Certified Answer Reuse via Capability Closure in Task-Oriented Dialogue

Abstract page for arXiv paper 2603.21448: Safety as Computation: Certified Answer Reuse via Capability Closure in Task-Oriented Dialogue

arXiv - AI · 3 min · 6 days ago

Llms

[2603.21430] DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation

Abstract page for arXiv paper 2603.21430: DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation

arXiv - AI · 4 min · 6 days ago

Machine Learning

[2603.21344] The AI Scientific Community: Agentic Virtual Lab Swarms

Abstract page for arXiv paper 2603.21344: The AI Scientific Community: Agentic Virtual Lab Swarms

arXiv - AI · 3 min · 6 days ago

Machine Learning

[2603.21272] The Library Theorem: How External Organization Governs Agentic Reasoning Capacity

Abstract page for arXiv paper 2603.21272: The Library Theorem: How External Organization Governs Agentic Reasoning Capacity

arXiv - Machine Learning · 4 min · 6 days ago

Llms

[2603.21155] Can LLMs Fool Graph Learning? Exploring Universal Adversarial Attacks on Text-Attributed Graphs

Abstract page for arXiv paper 2603.21155: Can LLMs Fool Graph Learning? Exploring Universal Adversarial Attacks on Text-Attributed Graphs

arXiv - AI · 4 min · 6 days ago

Llms

[2603.21013] A Framework for Low-Latency, LLM-driven Multimodal Interaction on the Pepper Robot

Abstract page for arXiv paper 2603.21013: A Framework for Low-Latency, LLM-driven Multimodal Interaction on the Pepper Robot

arXiv - Machine Learning · 4 min · 6 days ago

Nlp

[2603.20815] GMPilot: An Expert AI Agent For FDA cGMP Compliance

Abstract page for arXiv paper 2603.20815: GMPilot: An Expert AI Agent For FDA cGMP Compliance

arXiv - AI · 3 min · 6 days ago

Llms

[2603.20650] From 50% to Mastery in 3 Days: A Low-Resource SOP for Localizing Graduate-Level AI Tutors via Shadow-RAG

Abstract page for arXiv paper 2603.20650: From 50% to Mastery in 3 Days: A Low-Resource SOP for Localizing Graduate-Level AI Tutors via S...

arXiv - AI · 4 min · 6 days ago

Machine Learning

[2603.20724] Multi-RF Fusion with Multi-GNN Blending for Molecular Property Prediction

Abstract page for arXiv paper 2603.20724: Multi-RF Fusion with Multi-GNN Blending for Molecular Property Prediction

arXiv - Machine Learning · 3 min · 6 days ago

Llms

[2603.20670] Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework powered by large language models

Abstract page for arXiv paper 2603.20670: Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework p...

arXiv - AI · 4 min · 6 days ago

Llms

[2603.20510] Grounded Chess Reasoning in Language Models via Master Distillation

Abstract page for arXiv paper 2603.20510: Grounded Chess Reasoning in Language Models via Master Distillation

arXiv - AI · 4 min · 6 days ago

Previous Page 20 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Natural Language Processing

Top This Week

[2601.13227] Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?

[2601.22440] AI and My Values: User Perceptions of LLMs' Ability to Extract, Embody, and Explain Human Values from Casual Conversations

[2601.13222] Incorporating Q&A Nuggets into Retrieval-Augmented Generation

All Content

[2603.21925] Guideline-grounded retrieval-augmented generation for ophthalmic clinical decision support

[2603.21698] A Blueprint for Self-Evolving Coding Agents in Vehicle Aerodynamic Drag Prediction

[2603.21687] Mirage The Illusion of Visual Understanding

[2603.21636] Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks

[2603.21630] EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises

[2603.21607] INTRYGUE: Induction-Aware Entropy Gating for Reliable RAG Uncertainty Estimation

[2603.21563] Counterfactual Credit Policy Optimization for Multi-Agent Collaboration

[2603.21558] Stabilizing Iterative Self-Training with Verified Reasoning via Symbolic Recursive Self-Alignment

[2603.21473] Beyond Correlation: Refutation-Validated Aspect-Based Sentiment Analysis for Explainable Energy Market Returns

[2603.21448] Safety as Computation: Certified Answer Reuse via Capability Closure in Task-Oriented Dialogue

[2603.21430] DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation

[2603.21344] The AI Scientific Community: Agentic Virtual Lab Swarms

[2603.21272] The Library Theorem: How External Organization Governs Agentic Reasoning Capacity

[2603.21155] Can LLMs Fool Graph Learning? Exploring Universal Adversarial Attacks on Text-Attributed Graphs

[2603.21013] A Framework for Low-Latency, LLM-driven Multimodal Interaction on the Pepper Robot

[2603.20815] GMPilot: An Expert AI Agent For FDA cGMP Compliance

[2603.20650] From 50% to Mastery in 3 Days: A Low-Resource SOP for Localizing Graduate-Level AI Tutors via Shadow-RAG

[2603.20724] Multi-RF Fusion with Multi-GNN Blending for Molecular Property Prediction

[2603.20670] Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework powered by large language models

[2603.20510] Grounded Chess Reasoning in Language Models via Master Distillation

Related Topics

Stay updated with AI News