Natural Language Processing

Text understanding and language tasks

Top This Week

Nlp

[D] Is ACL more about the benchmarks now?

I am not a NLP guy, but afaik ACL is one of the premium venues of NLP. And given that the results were announced recently, my LinkedIn an...

Reddit - Machine Learning · 1 min ·
[2604.01676] GPA: Learning GUI Process Automation from Demonstrations
Llms

[2604.01676] GPA: Learning GUI Process Automation from Demonstrations

Abstract page for arXiv paper 2604.01676: GPA: Learning GUI Process Automation from Demonstrations

arXiv - AI · 3 min ·
[2604.01413] Adaptive Stopping for Multi-Turn LLM Reasoning
Llms

[2604.01413] Adaptive Stopping for Multi-Turn LLM Reasoning

Abstract page for arXiv paper 2604.01413: Adaptive Stopping for Multi-Turn LLM Reasoning

arXiv - AI · 4 min ·

All Content

[2602.13690] Physics Aware Neural Networks: Denoising for Magnetic Navigation
Machine Learning

[2602.13690] Physics Aware Neural Networks: Denoising for Magnetic Navigation

This paper presents a novel framework for denoising magnetic navigation data using physics-aware neural networks, addressing challenges i...

arXiv - Machine Learning · 4 min ·
[2602.13980] Cognitive Chunking for Soft Prompts: Accelerating Compressor Learning via Block-wise Causal Masking
Llms

[2602.13980] Cognitive Chunking for Soft Prompts: Accelerating Compressor Learning via Block-wise Causal Masking

This article presents a novel method called Parallelized Iterative Compression (PIC) for enhancing soft prompt compression in Large Langu...

arXiv - Machine Learning · 4 min ·
[2602.13967] Neuromem: A Granular Decomposition of the Streaming Lifecycle in External Memory for LLMs
Llms

[2602.13967] Neuromem: A Granular Decomposition of the Streaming Lifecycle in External Memory for LLMs

The paper presents Neuromem, a framework for evaluating external memory modules in large language models (LLMs) under a dynamic streaming...

arXiv - AI · 4 min ·
[2602.13935] Statistical Early Stopping for Reasoning Models
Llms

[2602.13935] Statistical Early Stopping for Reasoning Models

The paper presents statistical early stopping methods for reasoning models, addressing inefficiencies in large language models (LLMs) tha...

arXiv - Machine Learning · 3 min ·
[2602.13933] HyMem: Hybrid Memory Architecture with Dynamic Retrieval Scheduling
Llms

[2602.13933] HyMem: Hybrid Memory Architecture with Dynamic Retrieval Scheduling

The paper presents HyMem, a hybrid memory architecture designed to enhance the performance of large language models (LLMs) in extended di...

arXiv - AI · 4 min ·
[2602.13659] Zero-Order Optimization for LLM Fine-Tuning via Learnable Direction Sampling
Llms

[2602.13659] Zero-Order Optimization for LLM Fine-Tuning via Learnable Direction Sampling

This article presents a novel zero-order optimization framework for fine-tuning large language models (LLMs) using learnable direction sa...

arXiv - Machine Learning · 4 min ·
[2602.13880] VSAL: A Vision Solver with Adaptive Layouts for Graph Property Detection
Machine Learning

[2602.13880] VSAL: A Vision Solver with Adaptive Layouts for Graph Property Detection

The paper presents VSAL, a vision-based framework for graph property detection that utilizes adaptive layouts to enhance the detection of...

arXiv - AI · 3 min ·
[2602.13634] Optimization-Free Graph Embedding via Distributional Kernel for Community Detection
Machine Learning

[2602.13634] Optimization-Free Graph Embedding via Distributional Kernel for Community Detection

This article presents a novel method for graph embedding that addresses over-smoothing in Neighborhood Aggregation Strategy (NAS) methods...

arXiv - Machine Learning · 3 min ·
[2602.13852] Experimentation Accelerator: Interpretable Insights and Creative Recommendations for A/B Testing with Content-Aware ranking
Nlp

[2602.13852] Experimentation Accelerator: Interpretable Insights and Creative Recommendations for A/B Testing with Content-Aware ranking

The paper presents the Experimentation Accelerator, a framework that enhances A/B testing by providing interpretable insights and creativ...

arXiv - AI · 4 min ·
[2602.13524] Singular Vectors of Attention Heads Align with Features
Llms

[2602.13524] Singular Vectors of Attention Heads Align with Features

This paper explores the alignment of singular vectors of attention heads with feature representations in language models, providing theor...

arXiv - AI · 3 min ·
[2602.13498] TrasMuon: Trust-Region Adaptive Scaling for Orthogonalized Momentum Optimizers
Machine Learning

[2602.13498] TrasMuon: Trust-Region Adaptive Scaling for Orthogonalized Momentum Optimizers

TrasMuon introduces a novel optimization technique that enhances the stability and efficiency of orthogonalized momentum optimizers, outp...

arXiv - AI · 3 min ·
[2602.13483] Finding Highly Interpretable Prompt-Specific Circuits in Language Models
Llms

[2602.13483] Finding Highly Interpretable Prompt-Specific Circuits in Language Models

This article presents a novel approach to understanding prompt-specific circuits in language models, demonstrating that circuits vary by ...

arXiv - AI · 4 min ·
[2602.13418] Text Has Curvature
Machine Learning

[2602.13418] Text Has Curvature

The paper 'Text Has Curvature' explores the concept of intrinsic curvature in language, proposing a new measurement called Texture to ana...

arXiv - Machine Learning · 4 min ·
[2602.13639] Guided Collaboration in Heterogeneous LLM-Based Multi-Agent Systems via Entropy-Based Understanding Assessment and Experience Retrieval
Llms

[2602.13639] Guided Collaboration in Heterogeneous LLM-Based Multi-Agent Systems via Entropy-Based Understanding Assessment and Experience Retrieval

The paper discusses a novel Entropy-Based Adaptive Guidance Framework for enhancing collaboration in heterogeneous multi-agent systems us...

arXiv - AI · 4 min ·
[2602.13594] Hippocampus: An Efficient and Scalable Memory Module for Agentic AI
Llms

[2602.13594] Hippocampus: An Efficient and Scalable Memory Module for Agentic AI

The paper introduces Hippocampus, a scalable memory module designed for agentic AI, enhancing retrieval speed and storage efficiency comp...

arXiv - AI · 3 min ·
[2602.13583] Differentiable Rule Induction from Raw Sequence Inputs
Machine Learning

[2602.13583] Differentiable Rule Induction from Raw Sequence Inputs

This paper presents a novel approach to differentiable rule induction from raw sequence inputs, enhancing interpretability in machine lea...

arXiv - Machine Learning · 3 min ·
[2602.13345] BLUEPRINT Rebuilding a Legacy: Multimodal Retrieval for Complex Engineering Drawings and Documents
Nlp

[2602.13345] BLUEPRINT Rebuilding a Legacy: Multimodal Retrieval for Complex Engineering Drawings and Documents

The paper presents Blueprint, a multimodal retrieval system designed to enhance the accessibility of complex engineering drawings and doc...

arXiv - Machine Learning · 3 min ·
[2602.13264] Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models
Machine Learning

[2602.13264] Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models

The paper introduces Directional Concentration Uncertainty (DCU), a flexible framework for uncertainty quantification in generative model...

arXiv - AI · 4 min ·
[2602.13530] REMem: Reasoning with Episodic Memory in Language Agent
Ai Agents

[2602.13530] REMem: Reasoning with Episodic Memory in Language Agent

The paper presents REMem, a novel framework for enhancing language agents' episodic memory, enabling better recollection and reasoning ov...

arXiv - AI · 3 min ·
[2602.13321] Detecting Jailbreak Attempts in Clinical Training LLMs Through Automated Linguistic Feature Extraction
Llms

[2602.13321] Detecting Jailbreak Attempts in Clinical Training LLMs Through Automated Linguistic Feature Extraction

This study explores automated detection of jailbreak attempts in clinical training large language models (LLMs) using linguistic feature ...

arXiv - Machine Learning · 4 min ·
Previous Page 132 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime