Top Natural Language Processing This Month

1

[2508.04492] Learning Robust Intervention Representations with Delta Embeddings

Abstract page for arXiv paper 2508.04492: Learning Robust Intervention Representations with Delta Embeddings

arXiv - AI · 24 days ago

2

[2510.04676] Counterfactual Credit Guided Bayesian Optimization

Abstract page for arXiv paper 2510.04676: Counterfactual Credit Guided Bayesian Optimization

arXiv - Machine Learning · 24 days ago

3

[D] On-device Game AI: would you try AI characters, and what should we build next? Discussion

The discussion focuses on developing on-device Game AI capable of real-time conversations and context-aware interactions, exploring potential applications and user interest.

Reddit - Machine Learning · 28 days ago

4

AI music generator Suno hits 2M paid subscribers and $300M in annual recurring revenue | TechCrunch

AI music generator Suno has reached 2 million paid subscribers and $300 million in annual recurring revenue, showcasing rapid growth and innovation in the music industry.

TechCrunch - AI · 28 days ago

5

[2503.11832] Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated by Machine Unlearning

Abstract page for arXiv paper 2503.11832: Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated by Machine Unlearning

arXiv - Machine Learning · 24 days ago

6

[2602.10195] Versor: A Geometric Sequence Architecture

The paper introduces Versor, a novel geometric sequence architecture that leverages Conformal Geometric Algebra for enhanced performance and interpretability in machine learning tasks.

arXiv - Machine Learning · 28 days ago

7

[P] Micro Diffusion — Discrete text diffusion in ~150 lines of pure Python

This article presents a minimal implementation of discrete text diffusion in Python, inspired by Karpathy's MicroGPT, showcasing the core algorithm with simplicity.

Reddit - Machine Learning · 27 days ago

8

[R] Causal self-attention as a probabilistic model over embeddings

We’ve been working on a probabilistic interpretation of causal self-attention where token embeddings are treated as latent variables. In that view, the attention map induces a change-of-variables t...

Reddit - Machine Learning · 3 days ago

9

[2504.00037] ViT-Linearizer: Distilling Quadratic Knowledge into Linear-Time Vision Models

The paper introduces ViT-Linearizer, a framework that distills knowledge from Vision Transformers (ViTs) into efficient linear-time models, addressing the challenges of quadratic complexity in high...

arXiv - AI · 28 days ago

10

Societal level AI Tragedy of the Commons. Someone please prove me wrong.

The article discusses concerns about AI-induced layoffs of white-collar workers, emphasizing the potential economic impact due to reduced consumer spending.

Reddit - Artificial Intelligence · 28 days ago

11

[D] Self-Promotion Thread

Please post your personal projects, startups, product placements, collaboration needs, blogs etc. Please mention the payment and pricing requirements for products and services. Please do not post l...

Reddit - Machine Learning · 25 days ago

12

[2602.22220] What Makes an Ideal Quote? Recommending "Unexpected yet Rational" Quotations via Novelty

This article presents a novel framework for recommending quotations that are both unexpected and rational, enhancing the writing experience by focusing on deeper semantic properties rather than jus...

arXiv - AI · 28 days ago

13

[2602.22219] Comparative Analysis of Neural Retriever-Reranker Pipelines for Retrieval-Augmented Generation over Knowledge Graphs in E-commerce Applications

This article presents a comparative analysis of neural retriever-reranker pipelines for retrieval-augmented generation (RAG) in e-commerce applications, highlighting advancements in integrating kno...

arXiv - AI · 28 days ago

14

[2602.23280] Physics Informed Viscous Value Representations

This paper presents a novel approach to offline goal-conditioned reinforcement learning by introducing a physics-informed regularization derived from the viscosity solution of the Hamilton-Jacobi-B...

arXiv - Machine Learning · 28 days ago

15

[2602.22351] Decoder-based Sense Knowledge Distillation

This paper introduces Decoder-based Sense Knowledge Distillation (DSKD), a novel framework that enhances knowledge distillation in decoder-based large language models (LLMs) by integrating lexical ...

arXiv - AI · 28 days ago

16

[2602.22424] Causality $\neq$ Invariance: Function and Concept Vectors in LLMs

This paper investigates the representation of concepts in large language models (LLMs), revealing that Function Vectors (FVs) are not fully invariant across input formats, while Concept Vectors (CV...

arXiv - Machine Learning · 28 days ago

17

[2602.22456] Automating the Detection of Requirement Dependencies Using Large Language Models

This article presents LEREDD, a novel approach utilizing Large Language Models to automate the detection of requirement dependencies in software engineering, achieving high accuracy in classificati...

arXiv - AI · 28 days ago

18

[2602.22678] ViCLIP-OT: The First Foundation Vision-Language Model for Vietnamese Image-Text Retrieval with Optimal Transport

ViCLIP-OT introduces a novel vision-language model tailored for Vietnamese image-text retrieval, outperforming existing models in low-resource settings.

arXiv - AI · 28 days ago

19

[2602.22895] SPD Learn: A Geometric Deep Learning Python Library for Neural Decoding Through Trivialization

SPD Learn is a new Python library designed for geometric deep learning, specifically for neural decoding using symmetric positive definite matrices, enhancing reproducibility and integration in mac...

arXiv - Machine Learning · 28 days ago

20

[2602.22697] Reinforcing Real-world Service Agents: Balancing Utility and Cost in Task-oriented Dialogue

The paper presents InteractCS-RL, a novel framework for enhancing task-oriented dialogue systems by balancing empathetic communication and cost-effectiveness through reinforcement learning.

arXiv - AI · 28 days ago

21

[2602.22698] Tokenization, Fusion and Decoupling: Bridging the Granularity Mismatch Between Large Language Models and Knowledge Graphs

This paper presents KGT, a novel framework addressing the granularity mismatch between large language models (LLMs) and knowledge graphs (KGs) by introducing dedicated entity tokens for improved kn...

arXiv - AI · 28 days ago

22

[2602.22716] SoPE: Spherical Coordinate-Based Positional Embedding for Enhancing Spatial Perception of 3D LVLMs

The paper presents SoPE, a novel Spherical Coordinate-Based Positional Embedding method aimed at improving the spatial perception capabilities of 3D Large Vision-Language Models (3D LVLMs) by addre...

arXiv - AI · 28 days ago

23

[2602.22724] AgentSentry: Mitigating Indirect Prompt Injection in LLM Agents via Temporal Causal Diagnostics and Context Purification

AgentSentry introduces a novel framework to mitigate indirect prompt injection (IPI) in LLM agents, enhancing their security while maintaining task performance.

arXiv - AI · 28 days ago

24

[2602.22740] AMLRIS: Alignment-aware Masked Learning for Referring Image Segmentation

The paper presents AMLRIS, a novel training strategy for Referring Image Segmentation (RIS) that enhances object segmentation through alignment-aware masked learning, achieving state-of-the-art res...

arXiv - AI · 28 days ago

25

[2602.22735] Simulation-based Optimization for Augmented Reading

This article presents a novel approach to augmented reading systems, proposing a simulation-based optimization framework that enhances text presentation for better comprehension and performance.

arXiv - AI · 28 days ago

26

[2602.22775] TherapyProbe: Generating Design Knowledge for Relational Safety in Mental Health Chatbots Through Adversarial Simulation

The paper introduces TherapyProbe, a methodology for enhancing relational safety in mental health chatbots through adversarial simulations, identifying failure patterns and design recommendations.

arXiv - AI · 28 days ago

27

[2602.22787] Probing for Knowledge Attribution in Large Language Models

This article explores knowledge attribution in large language models (LLMs), focusing on how to identify the source of information that leads to model outputs, addressing issues of accuracy and rel...

arXiv - AI · 28 days ago

28

[2602.22790] Natural Language Declarative Prompting (NLD-P): A Modular Governance Method for Prompt Design Under Model Drift

The paper introduces Natural Language Declarative Prompting (NLD-P), a governance method for prompt design that addresses challenges posed by model drift in large language models (LLMs).

arXiv - AI · 28 days ago

29

[2602.23197] Fine-Tuning Without Forgetting In-Context Learning: A Theoretical Analysis of Linear Attention Models

This paper explores the impact of fine-tuning on in-context learning in linear attention models, revealing conditions that can enhance or degrade performance on downstream tasks.

arXiv - Machine Learning · 28 days ago

30

[2603.01986] Accurate, private, secure, federated U-statistics with higher degree

Abstract page for arXiv paper 2603.01986: Accurate, private, secure, federated U-statistics with higher degree

arXiv - Machine Learning · 24 days ago

Top Natural Language Processing This Month

Stay updated with AI News