Natural Language Processing

Text understanding and language tasks

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

How I cut ~$220/month from redundant AI tools, the exact quarterly audit process I use

A few months ago I finally sat down and audited every AI subscription my team was paying for. Turns out we were quietly burning roughly $...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Machine Learning

2026 Advanced Deep Learning Projects

As a hiring manager who’s been deep in the 2026 market, I wanted to share some real insights + a video I found that the community might f...

Reddit - ML Jobs · 1 min · about 4 hours ago

Llms

[D] Production gaps in context-window compression for AI agent memory

've been working on AI memory infrastructure and recently spent a few weeks reading through the source code of an open-source context-win...

Reddit - Machine Learning · 1 min · about 4 hours ago

All Content

Nlp

[2602.10195] Versor: A Geometric Sequence Architecture

The paper introduces Versor, a novel geometric sequence architecture that leverages Conformal Geometric Algebra for enhanced performance ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.09789] When Less is More: The LLM Scaling Paradox in Context Compression

The paper explores the paradox of scaling large language models (LLMs) in context compression, revealing that larger models may reduce th...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2508.01780] LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?

The paper presents LiveMCPBench, a benchmark designed to evaluate the capabilities of agents using Model Context Protocol (MCP) tools in ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2509.07706] FHIR-RAG-MEDS: Integrating HL7 FHIR with Retrieval-Augmented Large Language Models for Enhanced Medical Decision Support

The paper presents FHIR-RAG-MEDS, a system that integrates HL7 FHIR with Retrieval-Augmented Generation models to enhance personalized me...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2512.05865] Sparse Attention Post-Training for Mechanistic Interpretability

The paper presents a novel post-training method that enhances transformer attention sparsity while maintaining performance, revealing ins...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2510.27480] Simplex-to-Euclidean Bijections for Categorical Flow Matching

The paper presents a novel method for learning and sampling from probability distributions on the simplex, utilizing smooth bijections to...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2510.15464] Learning to Answer from Correct Demonstrations

The paper explores a novel approach to learning answer generation from correct demonstrations, formalizing it as imitation learning withi...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.05725] Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies

This article presents a novel approach to improving masked diffusion models (MDMs) for language modeling by introducing a learned schedul...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2509.21936] Statistical Advantage of Softmax Attention: Insights from Single-Location Regression

This article explores the statistical advantages of softmax attention mechanisms in large language models, particularly in single-locatio...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.23286] SPARTA: Scalable and Principled Benchmark of Tree-Structured Multi-hop QA over Text and Tables

The paper presents SPARTA, a novel framework for generating scalable benchmarks for tree-structured multi-hop question answering (QA) ove...

arXiv - AI · 4 min · about 1 month ago

Llms

[2509.21013] Predicting LLM Reasoning Performance with Small Proxy Model

This article presents rBridge, a small proxy model that predicts reasoning performance in large language models (LLMs), demonstrating sig...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.23228] MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction

The paper presents MovieTeller, a novel framework for generating movie synopses using tool-augmented progressive abstraction to enhance c...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.23225] Why Diffusion Language Models Struggle with Truly Parallel (Non-Autoregressive) Decoding?

This paper investigates why Diffusion Language Models (DLMs) often default to autoregressive decoding instead of utilizing their potentia...

arXiv - AI · 4 min · about 1 month ago

Llms

[2506.14261] RL-Obfuscation: Can Language Models Learn to Evade Latent-Space Monitors?

This article explores RL-Obfuscation, a method for training language models to evade latent-space monitors that detect undesirable behavi...

arXiv - Machine Learning · 4 min · about 1 month ago

Nlp

[2602.23071] Quantity Convergence, Quality Divergence: Disentangling Fluency and Accuracy in L2 Mandarin Prosody

This study examines the relationship between fluency and accuracy in L2 Mandarin prosody, revealing that while learners may achieve quant...

arXiv - AI · 3 min · about 1 month ago

Ai Safety

[2602.23070] Make It Hard to Hear, Easy to Learn: Long-Form Bengali ASR and Speaker Diarization via Extreme Augmentation and Perfect Alignment

This paper presents a novel approach to long-form Bengali Automatic Speech Recognition (ASR) and speaker diarization, introducing a compr...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.23057] Affine-Scaled Attention: Towards Flexible and Stable Transformer Attention

The paper introduces Affine-Scaled Attention, a novel approach to Transformer attention that enhances flexibility and stability by modify...

arXiv - AI · 4 min · about 1 month ago

Nlp

[2502.06051] Towards a Sharp Analysis of Offline Policy Learning for $f$-Divergence-Regularized Contextual Bandits

This paper presents a detailed analysis of offline policy learning in contextual bandits, focusing on $f$-divergence regularization and i...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.22967] Discovery of Interpretable Physical Laws in Materials via Language-Model-Guided Symbolic Regression

This paper presents a novel framework that utilizes language models to guide symbolic regression in discovering interpretable physical la...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.22935] A Holistic Framework for Robust Bangla ASR and Speaker Diarization with Optimized VAD and CTC Alignment

This paper presents a robust framework for Bangla Automatic Speech Recognition (ASR) and Speaker Diarization, addressing challenges in pr...

arXiv - AI · 3 min · about 1 month ago

Previous Page 63 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Natural Language Processing

Top This Week

How I cut ~$220/month from redundant AI tools, the exact quarterly audit process I use

2026 Advanced Deep Learning Projects

[D] Production gaps in context-window compression for AI agent memory

All Content

[2602.10195] Versor: A Geometric Sequence Architecture

[2602.09789] When Less is More: The LLM Scaling Paradox in Context Compression

[2508.01780] LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?

[2509.07706] FHIR-RAG-MEDS: Integrating HL7 FHIR with Retrieval-Augmented Large Language Models for Enhanced Medical Decision Support

[2512.05865] Sparse Attention Post-Training for Mechanistic Interpretability

[2510.27480] Simplex-to-Euclidean Bijections for Categorical Flow Matching

[2510.15464] Learning to Answer from Correct Demonstrations

[2510.05725] Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies

[2509.21936] Statistical Advantage of Softmax Attention: Insights from Single-Location Regression

[2602.23286] SPARTA: Scalable and Principled Benchmark of Tree-Structured Multi-hop QA over Text and Tables

[2509.21013] Predicting LLM Reasoning Performance with Small Proxy Model

[2602.23228] MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction

[2602.23225] Why Diffusion Language Models Struggle with Truly Parallel (Non-Autoregressive) Decoding?

[2506.14261] RL-Obfuscation: Can Language Models Learn to Evade Latent-Space Monitors?

[2602.23071] Quantity Convergence, Quality Divergence: Disentangling Fluency and Accuracy in L2 Mandarin Prosody

[2602.23070] Make It Hard to Hear, Easy to Learn: Long-Form Bengali ASR and Speaker Diarization via Extreme Augmentation and Perfect Alignment

[2602.23057] Affine-Scaled Attention: Towards Flexible and Stable Transformer Attention

[2502.06051] Towards a Sharp Analysis of Offline Policy Learning for $f$-Divergence-Regularized Contextual Bandits

[2602.22967] Discovery of Interpretable Physical Laws in Materials via Language-Model-Guided Symbolic Regression

[2602.22935] A Holistic Framework for Robust Bangla ASR and Speaker Diarization with Optimized VAD and CTC Alignment

Related Topics

Stay updated with AI News