Natural Language Processing

Text understanding and language tasks

Top This Week

Llms

[P] Remote sensing foundation models made easy to use.

This project enables the idea of tasking remote sensing models to acquire embeddings like we task satellites to acquire data! https://git...

Reddit - Machine Learning · 1 min ·
Nlp

Anyone else feel like AI security is being figured out in production right now?

I’ve been digging into AI security incident data from 2025 into this year, and it feels like something isn’t being talked about enough ou...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[D] ICML 2026 Average Score

Hi all, I’m curious about the current review dynamics for ICML 2026, especially after the rebuttal phase. For those who are reviewers (or...

Reddit - Machine Learning · 1 min ·

All Content

[2602.19143] Incremental Learning of Sparse Attention Patterns in Transformers
Machine Learning

[2602.19143] Incremental Learning of Sparse Attention Patterns in Transformers

This paper explores how transformers learn through incremental acquisition of sparse attention patterns, revealing shifts in learning dyn...

arXiv - Machine Learning · 3 min ·
[2602.18455] Impact of AI Search Summaries on Website Traffic: Evidence from Google AI Overviews and Wikipedia
Llms

[2602.18455] Impact of AI Search Summaries on Website Traffic: Evidence from Google AI Overviews and Wikipedia

This article examines the impact of AI-generated search summaries on website traffic, specifically analyzing how Google's AI Overviews af...

arXiv - AI · 4 min ·
[2602.18449] Prompt Optimization Via Diffusion Language Models
Llms

[2602.18449] Prompt Optimization Via Diffusion Language Models

The paper presents a novel diffusion-based framework for optimizing prompts in language models, enhancing performance through iterative r...

arXiv - Machine Learning · 3 min ·
[2602.18447] ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification
Llms

[2602.18447] ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification

The paper presents ConfSpec, a novel framework for efficient step-level speculative reasoning in large language models, achieving signifi...

arXiv - AI · 3 min ·
[2602.18437] FineRef: Fine-Grained Error Reflection and Correction for Long-Form Generation with Citations
Llms

[2602.18437] FineRef: Fine-Grained Error Reflection and Correction for Long-Form Generation with Citations

The paper presents FineRef, a novel framework for improving citation accuracy in long-form generation by addressing citation mismatch and...

arXiv - AI · 4 min ·
[2602.19066] IDLM: Inverse-distilled Diffusion Language Models
Llms

[2602.19066] IDLM: Inverse-distilled Diffusion Language Models

The paper presents Inverse-distilled Diffusion Language Models (IDLM), a method that significantly accelerates inference in text generati...

arXiv - AI · 3 min ·
[2602.20117] ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models
Llms

[2602.20117] ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models

The paper presents ReSyn, a novel pipeline for autonomously generating diverse synthetic environments for training reasoning language mod...

arXiv - Machine Learning · 3 min ·
[2602.18955] Incremental Transformer Neural Processes
Machine Learning

[2602.18955] Incremental Transformer Neural Processes

The paper introduces Incremental Transformer Neural Processes (incTNP), a model designed for efficient sequential data processing, achiev...

arXiv - Machine Learning · 4 min ·
[2602.20059] Interaction Theater: A case of LLM Agents Interacting at Scale
Llms

[2602.20059] Interaction Theater: A case of LLM Agents Interacting at Scale

The paper explores the interactions of autonomous LLM agents on a social platform, revealing that while agents produce varied text, meani...

arXiv - AI · 4 min ·
[2602.20048] CodeCompass: Navigating the Navigation Paradox in Agentic Code Intelligence
Nlp

[2602.20048] CodeCompass: Navigating the Navigation Paradox in Agentic Code Intelligence

The paper presents CodeCompass, a solution to the Navigation Paradox in code intelligence, highlighting the distinction between navigatio...

arXiv - AI · 3 min ·
[2602.18946] Exponential Convergence of (Stochastic) Gradient Descent for Separable Logistic Regression
Machine Learning

[2602.18946] Exponential Convergence of (Stochastic) Gradient Descent for Separable Logistic Regression

This paper presents a novel approach to gradient descent and stochastic gradient descent, demonstrating exponential convergence for separ...

arXiv - Machine Learning · 4 min ·
[2602.18948] Toward Manifest Relationality in Transformers via Symmetry Reduction
Machine Learning

[2602.18948] Toward Manifest Relationality in Transformers via Symmetry Reduction

This paper discusses a novel approach to enhance Transformer models by addressing internal redundancy through symmetry reduction, proposi...

arXiv - Machine Learning · 3 min ·
[2602.19633] TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents
Llms

[2602.19633] TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents

The paper presents TAPE, a novel framework for enhancing language model agents' planning and execution capabilities, addressing vulnerabi...

arXiv - AI · 3 min ·
[2602.18897] HEHRGNN: A Unified Embedding Model for Knowledge Graphs with Hyperedges and Hyper-Relational Edges
Machine Learning

[2602.18897] HEHRGNN: A Unified Embedding Model for Knowledge Graphs with Hyperedges and Hyper-Relational Edges

The paper presents HEHRGNN, a unified embedding model for knowledge graphs that incorporates hyperedges and hyper-relational edges, enhan...

arXiv - AI · 4 min ·
[2602.18858] Hyperbolic Busemann Neural Networks
Machine Learning

[2602.18858] Hyperbolic Busemann Neural Networks

The paper introduces Hyperbolic Busemann Neural Networks, which enhance neural network components by adapting them to hyperbolic space, i...

arXiv - Machine Learning · 3 min ·
[2602.19562] A Multimodal Framework for Aligning Human Linguistic Descriptions with Visual Perceptual Data
Machine Learning

[2602.19562] A Multimodal Framework for Aligning Human Linguistic Descriptions with Visual Perceptual Data

This paper presents a computational framework that aligns human linguistic descriptions with visual perceptual data, enhancing understand...

arXiv - AI · 4 min ·
[2602.18856] Issues with Measuring Task Complexity via Random Policies in Robotic Tasks
Nlp

[2602.18856] Issues with Measuring Task Complexity via Random Policies in Robotic Tasks

This paper evaluates the effectiveness of measuring task complexity in robotic tasks using random policies, revealing contradictions in e...

arXiv - Machine Learning · 4 min ·
[2602.18849] Exact Attention Sensitivity and the Geometry of Transformer Stability
Machine Learning

[2602.18849] Exact Attention Sensitivity and the Geometry of Transformer Stability

This article presents a stability theory for transformers, explaining key training dynamics and architectural considerations that affect ...

arXiv - AI · 3 min ·
[2602.19396] Hiding in Plain Text: Detecting Concealed Jailbreaks via Activation Disentanglement
Llms

[2602.19396] Hiding in Plain Text: Detecting Concealed Jailbreaks via Activation Disentanglement

This paper presents a novel framework for detecting concealed jailbreaks in large language models (LLMs) by disentangling semantic factor...

arXiv - AI · 4 min ·
[2602.19367] Time Series, Vision, and Language: Exploring the Limits of Alignment in Contrastive Representation Spaces
Machine Learning

[2602.19367] Time Series, Vision, and Language: Exploring the Limits of Alignment in Contrastive Representation Spaces

This paper investigates the alignment of representations from time series, vision, and language modalities, revealing insights into their...

arXiv - AI · 4 min ·
Previous Page 91 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime