Natural Language Processing

Text understanding and language tasks

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Nlp

[D] Is ACL more about the benchmarks now?

I am not a NLP guy, but afaik ACL is one of the premium venues of NLP. And given that the results were announced recently, my LinkedIn an...

Reddit - Machine Learning · 1 min · 43 minutes ago

Llms

[2604.01676] GPA: Learning GUI Process Automation from Demonstrations

Abstract page for arXiv paper 2604.01676: GPA: Learning GUI Process Automation from Demonstrations

arXiv - AI · 3 min · about 2 hours ago

Llms

[2604.01413] Adaptive Stopping for Multi-Turn LLM Reasoning

Abstract page for arXiv paper 2604.01413: Adaptive Stopping for Multi-Turn LLM Reasoning

arXiv - AI · 4 min · about 2 hours ago

All Content

Machine Learning

[2602.13690] Physics Aware Neural Networks: Denoising for Magnetic Navigation

This paper presents a novel framework for denoising magnetic navigation data using physics-aware neural networks, addressing challenges i...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.13980] Cognitive Chunking for Soft Prompts: Accelerating Compressor Learning via Block-wise Causal Masking

This article presents a novel method called Parallelized Iterative Compression (PIC) for enhancing soft prompt compression in Large Langu...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.13967] Neuromem: A Granular Decomposition of the Streaming Lifecycle in External Memory for LLMs

The paper presents Neuromem, a framework for evaluating external memory modules in large language models (LLMs) under a dynamic streaming...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13935] Statistical Early Stopping for Reasoning Models

The paper presents statistical early stopping methods for reasoning models, addressing inefficiencies in large language models (LLMs) tha...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.13933] HyMem: Hybrid Memory Architecture with Dynamic Retrieval Scheduling

The paper presents HyMem, a hybrid memory architecture designed to enhance the performance of large language models (LLMs) in extended di...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13659] Zero-Order Optimization for LLM Fine-Tuning via Learnable Direction Sampling

This article presents a novel zero-order optimization framework for fine-tuning large language models (LLMs) using learnable direction sa...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.13880] VSAL: A Vision Solver with Adaptive Layouts for Graph Property Detection

The paper presents VSAL, a vision-based framework for graph property detection that utilizes adaptive layouts to enhance the detection of...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.13634] Optimization-Free Graph Embedding via Distributional Kernel for Community Detection

This article presents a novel method for graph embedding that addresses over-smoothing in Neighborhood Aggregation Strategy (NAS) methods...

arXiv - Machine Learning · 3 min · about 2 months ago

Nlp

[2602.13852] Experimentation Accelerator: Interpretable Insights and Creative Recommendations for A/B Testing with Content-Aware ranking

The paper presents the Experimentation Accelerator, a framework that enhances A/B testing by providing interpretable insights and creativ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13524] Singular Vectors of Attention Heads Align with Features

This paper explores the alignment of singular vectors of attention heads with feature representations in language models, providing theor...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.13498] TrasMuon: Trust-Region Adaptive Scaling for Orthogonalized Momentum Optimizers

TrasMuon introduces a novel optimization technique that enhances the stability and efficiency of orthogonalized momentum optimizers, outp...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.13483] Finding Highly Interpretable Prompt-Specific Circuits in Language Models

This article presents a novel approach to understanding prompt-specific circuits in language models, demonstrating that circuits vary by ...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.13418] Text Has Curvature

The paper 'Text Has Curvature' explores the concept of intrinsic curvature in language, proposing a new measurement called Texture to ana...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.13639] Guided Collaboration in Heterogeneous LLM-Based Multi-Agent Systems via Entropy-Based Understanding Assessment and Experience Retrieval

The paper discusses a novel Entropy-Based Adaptive Guidance Framework for enhancing collaboration in heterogeneous multi-agent systems us...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13594] Hippocampus: An Efficient and Scalable Memory Module for Agentic AI

The paper introduces Hippocampus, a scalable memory module designed for agentic AI, enhancing retrieval speed and storage efficiency comp...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.13583] Differentiable Rule Induction from Raw Sequence Inputs

This paper presents a novel approach to differentiable rule induction from raw sequence inputs, enhancing interpretability in machine lea...

arXiv - Machine Learning · 3 min · about 2 months ago

Nlp

[2602.13345] BLUEPRINT Rebuilding a Legacy: Multimodal Retrieval for Complex Engineering Drawings and Documents

The paper presents Blueprint, a multimodal retrieval system designed to enhance the accessibility of complex engineering drawings and doc...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.13264] Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models

The paper introduces Directional Concentration Uncertainty (DCU), a flexible framework for uncertainty quantification in generative model...

arXiv - AI · 4 min · about 2 months ago

Ai Agents

[2602.13530] REMem: Reasoning with Episodic Memory in Language Agent

The paper presents REMem, a novel framework for enhancing language agents' episodic memory, enabling better recollection and reasoning ov...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.13321] Detecting Jailbreak Attempts in Clinical Training LLMs Through Automated Linguistic Feature Extraction

This study explores automated detection of jailbreak attempts in clinical training large language models (LLMs) using linguistic feature ...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 132 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Natural Language Processing

Top This Week

[D] Is ACL more about the benchmarks now?

[2604.01676] GPA: Learning GUI Process Automation from Demonstrations

[2604.01413] Adaptive Stopping for Multi-Turn LLM Reasoning

All Content

[2602.13690] Physics Aware Neural Networks: Denoising for Magnetic Navigation

[2602.13980] Cognitive Chunking for Soft Prompts: Accelerating Compressor Learning via Block-wise Causal Masking

[2602.13967] Neuromem: A Granular Decomposition of the Streaming Lifecycle in External Memory for LLMs

[2602.13935] Statistical Early Stopping for Reasoning Models

[2602.13933] HyMem: Hybrid Memory Architecture with Dynamic Retrieval Scheduling

[2602.13659] Zero-Order Optimization for LLM Fine-Tuning via Learnable Direction Sampling

[2602.13880] VSAL: A Vision Solver with Adaptive Layouts for Graph Property Detection

[2602.13634] Optimization-Free Graph Embedding via Distributional Kernel for Community Detection

[2602.13852] Experimentation Accelerator: Interpretable Insights and Creative Recommendations for A/B Testing with Content-Aware ranking

[2602.13524] Singular Vectors of Attention Heads Align with Features

[2602.13498] TrasMuon: Trust-Region Adaptive Scaling for Orthogonalized Momentum Optimizers

[2602.13483] Finding Highly Interpretable Prompt-Specific Circuits in Language Models

[2602.13418] Text Has Curvature

[2602.13639] Guided Collaboration in Heterogeneous LLM-Based Multi-Agent Systems via Entropy-Based Understanding Assessment and Experience Retrieval

[2602.13594] Hippocampus: An Efficient and Scalable Memory Module for Agentic AI

[2602.13583] Differentiable Rule Induction from Raw Sequence Inputs

[2602.13345] BLUEPRINT Rebuilding a Legacy: Multimodal Retrieval for Complex Engineering Drawings and Documents

[2602.13264] Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models

[2602.13530] REMem: Reasoning with Episodic Memory in Language Agent

[2602.13321] Detecting Jailbreak Attempts in Clinical Training LLMs Through Automated Linguistic Feature Extraction

Related Topics

Stay updated with AI News