Natural Language Processing

Text understanding and language tasks

Top This Week

LLMs

[2602.00750] Bypassing Prompt Injection Detectors through Evasive Injections

arXiv - AI · 4 min · about 1 hour ago

NLP

[2512.18640] Geometric-Photometric Event-based 3D Gaussian Ray Tracing

arXiv - AI · 4 min · about 1 hour ago

LLMs

[2511.08225] Benchmarking Educational LLMs with Analytics: A Case Study on Gender Bias in Feedback

arXiv - AI · 4 min · about 1 hour ago

All Content

Machine Learning

[2412.20816] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval

The paper presents MomentMix, a novel augmentation technique using Length-Aware DETR to enhance video moment retrieval, particularly for ...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2411.08254] Toward Automated Validation of Language Model Synthesized Test Cases using Semantic Entropy

The paper presents VALTEST, a framework for validating test cases generated by large language models (LLMs) using semantic entropy, impro...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2509.24276] G-reasoner: Foundation Models for Unified Reasoning over Graph-structured Knowledge

The G-reasoner paper introduces a unified framework that enhances reasoning over graph-structured knowledge using a new graph foundation ...

arXiv - AI · 4 min · about 1 month ago

NLP

[2602.10195] Versor: A Geometric Sequence Architecture

The paper introduces Versor, a novel geometric sequence architecture that leverages Conformal Geometric Algebra for enhanced performance ...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.09789] When Less is More: The LLM Scaling Paradox in Context Compression

The paper explores the paradox of scaling large language models (LLMs) in context compression, revealing that larger models may reduce th...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2508.01780] LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?

The paper presents LiveMCPBench, a benchmark designed to evaluate the capabilities of agents using Model Context Protocol (MCP) tools in ...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2509.07706] FHIR-RAG-MEDS: Integrating HL7 FHIR with Retrieval-Augmented Large Language Models for Enhanced Medical Decision Support

The paper presents FHIR-RAG-MEDS, a system that integrates HL7 FHIR with Retrieval-Augmented Generation models to enhance personalized me...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2512.05865] Sparse Attention Post-Training for Mechanistic Interpretability

The paper presents a novel post-training method that enhances transformer attention sparsity while maintaining performance, revealing ins...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2510.27480] Simplex-to-Euclidean Bijections for Categorical Flow Matching

The paper presents a novel method for learning and sampling from probability distributions on the simplex, utilizing smooth bijections to...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2510.15464] Learning to Answer from Correct Demonstrations

The paper explores a novel approach to learning answer generation from correct demonstrations, formalizing it as imitation learning withi...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2510.05725] Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies

This article presents a novel approach to improving masked diffusion models (MDMs) for language modeling by introducing a learned schedul...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2509.21936] Statistical Advantage of Softmax Attention: Insights from Single-Location Regression

This article explores the statistical advantages of softmax attention mechanisms in large language models, particularly in single-locatio...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.23286] SPARTA: Scalable and Principled Benchmark of Tree-Structured Multi-hop QA over Text and Tables

The paper presents SPARTA, a novel framework for generating scalable benchmarks for tree-structured multi-hop question answering (QA) ove...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2509.21013] Predicting LLM Reasoning Performance with Small Proxy Model

This article presents rBridge, a small proxy model that predicts reasoning performance in large language models (LLMs), demonstrating sig...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.23228] MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction

The paper presents MovieTeller, a novel framework for generating movie synopses using tool-augmented progressive abstraction to enhance c...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.23225] Why Diffusion Language Models Struggle with Truly Parallel (Non-Autoregressive) Decoding?

This paper investigates why Diffusion Language Models (DLMs) often default to autoregressive decoding instead of utilizing their potentia...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2506.14261] RL-Obfuscation: Can Language Models Learn to Evade Latent-Space Monitors?

This article explores RL-Obfuscation, a method for training language models to evade latent-space monitors that detect undesirable behavi...

arXiv - Machine Learning · 4 min · about 1 month ago

NLP

[2602.23071] Quantity Convergence, Quality Divergence: Disentangling Fluency and Accuracy in L2 Mandarin Prosody

This study examines the relationship between fluency and accuracy in L2 Mandarin prosody, revealing that while learners may achieve quant...

arXiv - AI · 3 min · about 1 month ago

AI Safety

[2602.23070] Make It Hard to Hear, Easy to Learn: Long-Form Bengali ASR and Speaker Diarization via Extreme Augmentation and Perfect Alignment

This paper presents a novel approach to long-form Bengali Automatic Speech Recognition (ASR) and speaker diarization, introducing a compr...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.23057] Affine-Scaled Attention: Towards Flexible and Stable Transformer Attention

The paper introduces Affine-Scaled Attention, a novel approach to Transformer attention that enhances flexibility and stability by modify...

arXiv - AI · 4 min · about 1 month ago
