Natural Language Processing

Text understanding and language tasks

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[P] Remote sensing foundation models made easy to use.

This project enables the idea of tasking remote sensing models to acquire embeddings like we task satellites to acquire data! https://git...

Reddit - Machine Learning · 1 min · about 1 hour ago

Nlp

Anyone else feel like AI security is being figured out in production right now?

I’ve been digging into AI security incident data from 2025 into this year, and it feels like something isn’t being talked about enough ou...

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

Machine Learning

[D] ICML 2026 Average Score

Hi all, I’m curious about the current review dynamics for ICML 2026, especially after the rebuttal phase. For those who are reviewers (or...

Reddit - Machine Learning · 1 min · about 5 hours ago

All Content

Machine Learning

[2602.19143] Incremental Learning of Sparse Attention Patterns in Transformers

This paper explores how transformers learn through incremental acquisition of sparse attention patterns, revealing shifts in learning dyn...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.18455] Impact of AI Search Summaries on Website Traffic: Evidence from Google AI Overviews and Wikipedia

This article examines the impact of AI-generated search summaries on website traffic, specifically analyzing how Google's AI Overviews af...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.18449] Prompt Optimization Via Diffusion Language Models

The paper presents a novel diffusion-based framework for optimizing prompts in language models, enhancing performance through iterative r...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.18447] ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification

The paper presents ConfSpec, a novel framework for efficient step-level speculative reasoning in large language models, achieving signifi...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.18437] FineRef: Fine-Grained Error Reflection and Correction for Long-Form Generation with Citations

The paper presents FineRef, a novel framework for improving citation accuracy in long-form generation by addressing citation mismatch and...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.19066] IDLM: Inverse-distilled Diffusion Language Models

The paper presents Inverse-distilled Diffusion Language Models (IDLM), a method that significantly accelerates inference in text generati...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.20117] ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models

The paper presents ReSyn, a novel pipeline for autonomously generating diverse synthetic environments for training reasoning language mod...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.18955] Incremental Transformer Neural Processes

The paper introduces Incremental Transformer Neural Processes (incTNP), a model designed for efficient sequential data processing, achiev...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.20059] Interaction Theater: A case of LLM Agents Interacting at Scale

The paper explores the interactions of autonomous LLM agents on a social platform, revealing that while agents produce varied text, meani...

arXiv - AI · 4 min · about 1 month ago

Nlp

[2602.20048] CodeCompass: Navigating the Navigation Paradox in Agentic Code Intelligence

The paper presents CodeCompass, a solution to the Navigation Paradox in code intelligence, highlighting the distinction between navigatio...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18946] Exponential Convergence of (Stochastic) Gradient Descent for Separable Logistic Regression

This paper presents a novel approach to gradient descent and stochastic gradient descent, demonstrating exponential convergence for separ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.18948] Toward Manifest Relationality in Transformers via Symmetry Reduction

This paper discusses a novel approach to enhance Transformer models by addressing internal redundancy through symmetry reduction, proposi...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19633] TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents

The paper presents TAPE, a novel framework for enhancing language model agents' planning and execution capabilities, addressing vulnerabi...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18897] HEHRGNN: A Unified Embedding Model for Knowledge Graphs with Hyperedges and Hyper-Relational Edges

The paper presents HEHRGNN, a unified embedding model for knowledge graphs that incorporates hyperedges and hyper-relational edges, enhan...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.18858] Hyperbolic Busemann Neural Networks

The paper introduces Hyperbolic Busemann Neural Networks, which enhance neural network components by adapting them to hyperbolic space, i...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.19562] A Multimodal Framework for Aligning Human Linguistic Descriptions with Visual Perceptual Data

This paper presents a computational framework that aligns human linguistic descriptions with visual perceptual data, enhancing understand...

arXiv - AI · 4 min · about 1 month ago

Nlp

[2602.18856] Issues with Measuring Task Complexity via Random Policies in Robotic Tasks

This paper evaluates the effectiveness of measuring task complexity in robotic tasks using random policies, revealing contradictions in e...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.18849] Exact Attention Sensitivity and the Geometry of Transformer Stability

This article presents a stability theory for transformers, explaining key training dynamics and architectural considerations that affect ...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.19396] Hiding in Plain Text: Detecting Concealed Jailbreaks via Activation Disentanglement

This paper presents a novel framework for detecting concealed jailbreaks in large language models (LLMs) by disentangling semantic factor...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19367] Time Series, Vision, and Language: Exploring the Limits of Alignment in Contrastive Representation Spaces

This paper investigates the alignment of representations from time series, vision, and language modalities, revealing insights into their...

arXiv - AI · 4 min · about 1 month ago

Previous Page 91 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Natural Language Processing

Top This Week

[P] Remote sensing foundation models made easy to use.

Anyone else feel like AI security is being figured out in production right now?

[D] ICML 2026 Average Score

All Content

[2602.19143] Incremental Learning of Sparse Attention Patterns in Transformers

[2602.18455] Impact of AI Search Summaries on Website Traffic: Evidence from Google AI Overviews and Wikipedia

[2602.18449] Prompt Optimization Via Diffusion Language Models

[2602.18447] ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification

[2602.18437] FineRef: Fine-Grained Error Reflection and Correction for Long-Form Generation with Citations

[2602.19066] IDLM: Inverse-distilled Diffusion Language Models

[2602.20117] ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models

[2602.18955] Incremental Transformer Neural Processes

[2602.20059] Interaction Theater: A case of LLM Agents Interacting at Scale

[2602.20048] CodeCompass: Navigating the Navigation Paradox in Agentic Code Intelligence

[2602.18946] Exponential Convergence of (Stochastic) Gradient Descent for Separable Logistic Regression

[2602.18948] Toward Manifest Relationality in Transformers via Symmetry Reduction

[2602.19633] TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents

[2602.18897] HEHRGNN: A Unified Embedding Model for Knowledge Graphs with Hyperedges and Hyper-Relational Edges

[2602.18858] Hyperbolic Busemann Neural Networks

[2602.19562] A Multimodal Framework for Aligning Human Linguistic Descriptions with Visual Perceptual Data

[2602.18856] Issues with Measuring Task Complexity via Random Policies in Robotic Tasks

[2602.18849] Exact Attention Sensitivity and the Geometry of Transformer Stability

[2602.19396] Hiding in Plain Text: Detecting Concealed Jailbreaks via Activation Disentanglement

[2602.19367] Time Series, Vision, and Language: Exploring the Limits of Alignment in Contrastive Representation Spaces

Related Topics

Stay updated with AI News