Natural Language Processing

Text understanding and language tasks

Top This Week

Nlp

Built a Hybrid NAS tool for RNN architectures (HyNAS-R) – Looking for feedback for my final year evaluation [R]

Hi everyone, I'm currently in the evaluation phase of my Final Year Project and am looking for feedback on the system I've built. It's ca...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] ICML 26 - What to do with the zero follow-up questions

Hello everyone. I submitted my work to ICML 26 this year, and it got somewhat above average reviews. Now, in the rebuttal acknowledgment,...

Reddit - Machine Learning · 1 min ·
Startup Battlefield 200 applications open until May 27 | TechCrunch
Nlp

Startup Battlefield 200 applications open until May 27 | TechCrunch

Nominate your startup, or one you know, and apply for a chance at VC access, TechCrunch coverage, and $100K for Startup Battlefield 200.

TechCrunch - AI · 4 min ·

All Content

[2602.13933] HyMem: Hybrid Memory Architecture with Dynamic Retrieval Scheduling
Llms

[2602.13933] HyMem: Hybrid Memory Architecture with Dynamic Retrieval Scheduling

The paper presents HyMem, a hybrid memory architecture designed to enhance the performance of large language models (LLMs) in extended di...

arXiv - AI · 4 min ·
[2602.13659] Zero-Order Optimization for LLM Fine-Tuning via Learnable Direction Sampling
Llms

[2602.13659] Zero-Order Optimization for LLM Fine-Tuning via Learnable Direction Sampling

This article presents a novel zero-order optimization framework for fine-tuning large language models (LLMs) using learnable direction sa...

arXiv - Machine Learning · 4 min ·
[2602.13880] VSAL: A Vision Solver with Adaptive Layouts for Graph Property Detection
Machine Learning

[2602.13880] VSAL: A Vision Solver with Adaptive Layouts for Graph Property Detection

The paper presents VSAL, a vision-based framework for graph property detection that utilizes adaptive layouts to enhance the detection of...

arXiv - AI · 3 min ·
[2602.13634] Optimization-Free Graph Embedding via Distributional Kernel for Community Detection
Machine Learning

[2602.13634] Optimization-Free Graph Embedding via Distributional Kernel for Community Detection

This article presents a novel method for graph embedding that addresses over-smoothing in Neighborhood Aggregation Strategy (NAS) methods...

arXiv - Machine Learning · 3 min ·
[2602.13852] Experimentation Accelerator: Interpretable Insights and Creative Recommendations for A/B Testing with Content-Aware ranking
Nlp

[2602.13852] Experimentation Accelerator: Interpretable Insights and Creative Recommendations for A/B Testing with Content-Aware ranking

The paper presents the Experimentation Accelerator, a framework that enhances A/B testing by providing interpretable insights and creativ...

arXiv - AI · 4 min ·
[2602.13524] Singular Vectors of Attention Heads Align with Features
Llms

[2602.13524] Singular Vectors of Attention Heads Align with Features

This paper explores the alignment of singular vectors of attention heads with feature representations in language models, providing theor...

arXiv - AI · 3 min ·
[2602.13498] TrasMuon: Trust-Region Adaptive Scaling for Orthogonalized Momentum Optimizers
Machine Learning

[2602.13498] TrasMuon: Trust-Region Adaptive Scaling for Orthogonalized Momentum Optimizers

TrasMuon introduces a novel optimization technique that enhances the stability and efficiency of orthogonalized momentum optimizers, outp...

arXiv - AI · 3 min ·
[2602.13483] Finding Highly Interpretable Prompt-Specific Circuits in Language Models
Llms

[2602.13483] Finding Highly Interpretable Prompt-Specific Circuits in Language Models

This article presents a novel approach to understanding prompt-specific circuits in language models, demonstrating that circuits vary by ...

arXiv - AI · 4 min ·
[2602.13418] Text Has Curvature
Machine Learning

[2602.13418] Text Has Curvature

The paper 'Text Has Curvature' explores the concept of intrinsic curvature in language, proposing a new measurement called Texture to ana...

arXiv - Machine Learning · 4 min ·
[2602.13639] Guided Collaboration in Heterogeneous LLM-Based Multi-Agent Systems via Entropy-Based Understanding Assessment and Experience Retrieval
Llms

[2602.13639] Guided Collaboration in Heterogeneous LLM-Based Multi-Agent Systems via Entropy-Based Understanding Assessment and Experience Retrieval

The paper discusses a novel Entropy-Based Adaptive Guidance Framework for enhancing collaboration in heterogeneous multi-agent systems us...

arXiv - AI · 4 min ·
[2602.13594] Hippocampus: An Efficient and Scalable Memory Module for Agentic AI
Llms

[2602.13594] Hippocampus: An Efficient and Scalable Memory Module for Agentic AI

The paper introduces Hippocampus, a scalable memory module designed for agentic AI, enhancing retrieval speed and storage efficiency comp...

arXiv - AI · 3 min ·
[2602.13583] Differentiable Rule Induction from Raw Sequence Inputs
Machine Learning

[2602.13583] Differentiable Rule Induction from Raw Sequence Inputs

This paper presents a novel approach to differentiable rule induction from raw sequence inputs, enhancing interpretability in machine lea...

arXiv - Machine Learning · 3 min ·
[2602.13345] BLUEPRINT Rebuilding a Legacy: Multimodal Retrieval for Complex Engineering Drawings and Documents
Nlp

[2602.13345] BLUEPRINT Rebuilding a Legacy: Multimodal Retrieval for Complex Engineering Drawings and Documents

The paper presents Blueprint, a multimodal retrieval system designed to enhance the accessibility of complex engineering drawings and doc...

arXiv - Machine Learning · 3 min ·
[2602.13264] Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models
Machine Learning

[2602.13264] Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models

The paper introduces Directional Concentration Uncertainty (DCU), a flexible framework for uncertainty quantification in generative model...

arXiv - AI · 4 min ·
[2602.13530] REMem: Reasoning with Episodic Memory in Language Agent
Ai Agents

[2602.13530] REMem: Reasoning with Episodic Memory in Language Agent

The paper presents REMem, a novel framework for enhancing language agents' episodic memory, enabling better recollection and reasoning ov...

arXiv - AI · 3 min ·
[2602.13321] Detecting Jailbreak Attempts in Clinical Training LLMs Through Automated Linguistic Feature Extraction
Llms

[2602.13321] Detecting Jailbreak Attempts in Clinical Training LLMs Through Automated Linguistic Feature Extraction

This study explores automated detection of jailbreak attempts in clinical training large language models (LLMs) using linguistic feature ...

arXiv - Machine Learning · 4 min ·
[2602.13274] ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs
Llms

[2602.13274] ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs

The paper introduces ProMoral-Bench, a benchmark for evaluating prompting strategies in large language models (LLMs) focused on moral rea...

arXiv - AI · 3 min ·
[2602.13248] X-Blocks: Linguistic Building Blocks of Natural Language Explanations for Automated Vehicles
Nlp

[2602.13248] X-Blocks: Linguistic Building Blocks of Natural Language Explanations for Automated Vehicles

The paper introduces X-Blocks, a framework for analyzing natural language explanations in automated vehicles, enhancing user trust and un...

arXiv - AI · 4 min ·
[2602.13235] Lang2Act: Fine-Grained Visual Reasoning through Self-Emergent Linguistic Toolchains
Llms

[2602.13235] Lang2Act: Fine-Grained Visual Reasoning through Self-Emergent Linguistic Toolchains

The paper introduces Lang2Act, a novel framework for enhancing visual reasoning in Vision-Language Models (VLMs) through self-emergent li...

arXiv - AI · 4 min ·
[2602.13237] NL2LOGIC: AST-Guided Translation of Natural Language into First-Order Logic with Large Language Models
Llms

[2602.13237] NL2LOGIC: AST-Guided Translation of Natural Language into First-Order Logic with Large Language Models

NL2LOGIC presents a novel framework for translating natural language into first-order logic using large language models, enhancing accura...

arXiv - AI · 4 min ·
Previous Page 126 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime