Natural Language Processing

Text understanding and language tasks

Top This Week

Nlp

[D] Is lossy compression acceptable for conversational agent memory? Every system today uses knowledge graph triples — here's why I think that's wrong.

Been thinking about this and want to know if others have hit the same issue. The dominant approach for agent memory (Mem0, Zep, most RAG ...

Reddit - Machine Learning · 1 min ·
[2601.11016] Contextual Distributionally Robust Optimization with Causal and Continuous Structure: An Interpretable and Tractable Approach
Nlp

[2601.11016] Contextual Distributionally Robust Optimization with Causal and Continuous Structure: An Interpretable and Tractable Approach

Abstract page for arXiv paper 2601.11016: Contextual Distributionally Robust Optimization with Causal and Continuous Structure: An Interp...

arXiv - Machine Learning · 4 min ·
[2511.22294] Structure is Supervision: Multiview Masked Autoencoders for Radiology
Machine Learning

[2511.22294] Structure is Supervision: Multiview Masked Autoencoders for Radiology

Abstract page for arXiv paper 2511.22294: Structure is Supervision: Multiview Masked Autoencoders for Radiology

arXiv - Machine Learning · 4 min ·

All Content

[2504.10507] PinRec: Unified Generative Retrieval for Pinterest Recommender Systems
Machine Learning

[2504.10507] PinRec: Unified Generative Retrieval for Pinterest Recommender Systems

The paper introduces PinRec, a unified generative retrieval model for Pinterest's recommendation systems, enhancing performance across va...

arXiv - Machine Learning · 4 min ·
[2411.07102] Effectively Leveraging Momentum Terms in Stochastic Line Search Frameworks for Fast Optimization of Finite-Sum Problems
Machine Learning

[2411.07102] Effectively Leveraging Momentum Terms in Stochastic Line Search Frameworks for Fast Optimization of Finite-Sum Problems

This paper presents a novel algorithmic framework that integrates momentum terms with stochastic line search methods to optimize finite-s...

arXiv - Machine Learning · 4 min ·
[2410.08958] The MAPS Algorithm: Fast model-agnostic and distribution-free prediction intervals for supervised learning
Machine Learning

[2410.08958] The MAPS Algorithm: Fast model-agnostic and distribution-free prediction intervals for supervised learning

The MAPS algorithm offers a novel approach to generating model-agnostic, distribution-free prediction intervals in supervised learning, a...

arXiv - Machine Learning · 4 min ·
[2507.23465] Role-Aware Language Models for Secure and Contextualized Access Control in Organizations
Llms

[2507.23465] Role-Aware Language Models for Secure and Contextualized Access Control in Organizations

This article explores the development of role-aware language models designed to enhance access control in organizational settings, focusi...

arXiv - AI · 3 min ·
[2506.07751] AbstRaL: Augmenting LLMs' Reasoning by Reinforcing Abstract Thinking
Llms

[2506.07751] AbstRaL: Augmenting LLMs' Reasoning by Reinforcing Abstract Thinking

The paper presents AbstRaL, a method to enhance large language models' reasoning capabilities by reinforcing abstract thinking, particula...

arXiv - AI · 4 min ·
[2506.05850] Cross-lingual Collapse: How Language-Centric Foundation Models Shape Reasoning in Large Language Models
Llms

[2506.05850] Cross-lingual Collapse: How Language-Centric Foundation Models Shape Reasoning in Large Language Models

The paper investigates 'Cross-lingual Collapse' in large language models (LLMs), revealing how reasoning capabilities can revert to a dom...

arXiv - AI · 4 min ·
[2602.03098] TextME: Bridging Unseen Modalities Through Text Descriptions
Llms

[2602.03098] TextME: Bridging Unseen Modalities Through Text Descriptions

The paper introduces TextME, a framework that enables zero-shot cross-modal transfer using only text descriptions, addressing the limitat...

arXiv - AI · 3 min ·
[2505.16670] BitHydra: Towards Bit-flip Inference Cost Attack against Large Language Models
Llms

[2505.16670] BitHydra: Towards Bit-flip Inference Cost Attack against Large Language Models

The paper presents BitHydra, a framework for executing bit-flip inference cost attacks on large language models (LLMs), demonstrating how...

arXiv - AI · 4 min ·
[2601.21315] Distributionally Robust Classification for Multi-source Unsupervised Domain Adaptation
Machine Learning

[2601.21315] Distributionally Robust Classification for Multi-source Unsupervised Domain Adaptation

This paper presents a novel distributionally robust learning framework for multi-source unsupervised domain adaptation, addressing challe...

arXiv - AI · 4 min ·
[2504.04717] Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models
Llms

[2504.04717] Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models

This article surveys advancements in multi-turn interactions with large language models (LLMs), focusing on evaluation methods, challenge...

arXiv - AI · 4 min ·
[2601.03612] Mathematical Foundations of Polyphonic Music Generation via Structural Inductive Bias
Nlp

[2601.03612] Mathematical Foundations of Polyphonic Music Generation via Structural Inductive Bias

This article presents a novel approach to polyphonic music generation using structural inductive bias, focusing on Beethoven's piano sona...

arXiv - Machine Learning · 3 min ·
[2503.04940] VQEL: Enabling Self-Play in Emergent Language Games via Agent-Internal Vector Quantization
Ai Agents

[2503.04940] VQEL: Enabling Self-Play in Emergent Language Games via Agent-Internal Vector Quantization

The paper presents VQEL, a novel architecture that enhances self-play in emergent language games through agent-internal vector quantizati...

arXiv - AI · 4 min ·
[2411.11707] Federated Co-tuning Framework for Large and Small Language Models
Llms

[2411.11707] Federated Co-tuning Framework for Large and Small Language Models

The paper presents FedCoLLM, a federated co-tuning framework that enhances the performance of both Large Language Models (LLMs) and Small...

arXiv - AI · 4 min ·
[2412.04272] PoTable: Towards Systematic Thinking via Plan-then-Execute Stage Reasoning on Tables
Llms

[2412.04272] PoTable: Towards Systematic Thinking via Plan-then-Execute Stage Reasoning on Tables

The paper presents PoTable, a novel approach to table reasoning that integrates systematic thinking through a plan-then-execute mechanism...

arXiv - AI · 4 min ·
[2511.20564] E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems
Machine Learning

[2511.20564] E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems

The paper presents E2E-GRec, a novel end-to-end framework that integrates Graph Neural Networks (GNNs) with recommender systems, addressi...

arXiv - Machine Learning · 4 min ·
[2511.17628] Rectifying Distribution Shift in Cascaded Precipitation Nowcasting
Machine Learning

[2511.17628] Rectifying Distribution Shift in Cascaded Precipitation Nowcasting

This article presents RectiCast, a novel framework for improving precipitation nowcasting by addressing distribution shifts in deep learn...

arXiv - Machine Learning · 4 min ·
[2410.15173] Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event Representation
Llms

[2410.15173] Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event Representation

This paper explores how autoregressive large language models (LLMs) assess thematic fit in event representation, achieving state-of-the-a...

arXiv - AI · 3 min ·
[2407.17412] (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork
Machine Learning

[2407.17412] (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork

The paper presents PASS, a novel algorithmic framework that utilizes visual prompts to enhance structural sparsity in neural networks, im...

arXiv - AI · 4 min ·
[2511.07730] Multistep Quasimetric Learning for Scalable Goal-conditioned Reinforcement Learning
Nlp

[2511.07730] Multistep Quasimetric Learning for Scalable Goal-conditioned Reinforcement Learning

This paper presents a novel approach to goal-conditioned reinforcement learning (GCRL) using multistep quasimetric learning, demonstratin...

arXiv - Machine Learning · 4 min ·
[2510.15940] Lean Finder: Semantic Search for Mathlib That Understands User Intents
Nlp

[2510.15940] Lean Finder: Semantic Search for Mathlib That Understands User Intents

Lean Finder is a semantic search engine designed for the Lean programming language and mathlib, improving theorem retrieval by understand...

arXiv - AI · 4 min ·
Previous Page 85 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime