Natural Language Processing

Text understanding and language tasks

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Nlp

Has anyone here switched to TeraBox recently? Is it actually worth it?

I’ve been seeing more people talk about TeraBox lately, especially around storage for AI-related workflows. Curious if anyone here has us...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Nlp

Enabling agent-first process redesign | MIT Technology Review

Unlike static, rules-based systems, AI agents can learn, adapt, and optimize processes dynamically. As they interact with data, systems, ...

MIT Technology Review - AI · 4 min · about 7 hours ago

Llms

Stop Overcomplicating AI Workflows. This Is the Simple Framework

I’ve been working on building an agentic AI workflow system for business use cases and one thing became very clear very quickly. This is ...

Reddit - Artificial Intelligence · 1 min · about 8 hours ago

All Content

Nlp

[2602.12517] Bench-MFG: A Benchmark Suite for Learning in Stationary Mean Field Games

The paper presents Bench-MFG, a benchmark suite designed to standardize evaluations in learning for stationary Mean Field Games, addressi...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.12444] Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models

This paper presents a novel recovery-based shielding framework for safe reinforcement learning (RL) using Gaussian process dynamics model...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.12424] RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty

The paper introduces RankLLM, a framework for evaluating large language models (LLMs) by quantifying question difficulty, enhancing model...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.12422] CacheMind: From Miss Rates to Why -- Natural-Language, Trace-Grounded Reasoning for Cache Replacement

CacheMind introduces a novel tool for cache replacement, leveraging natural language processing and trace-grounded reasoning to enhance C...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.12393] Reproducing DragDiffusion: Interactive Point-Based Editing with Diffusion Models

This article presents a reproducibility study of DragDiffusion, a method for interactive point-based image editing using diffusion models...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.12342] Intrinsic Credit Assignment for Long Horizon Interaction

This article presents a novel approach, {}Belief-RL, for training agents to navigate uncertainty over long horizons by utilizing intrins...

arXiv - Machine Learning · 3 min · about 2 months ago

Nlp

[2602.12315] AgenticShop: Benchmarking Agentic Product Curation for Personalized Web Shopping

The paper presents AgenticShop, a benchmark for evaluating agentic systems in personalized web shopping, addressing gaps in current evalu...

arXiv - AI · 4 min · about 2 months ago

Nlp

[2602.12311] Perceptual Self-Reflection in Agentic Physics Simulation Code Generation

This article presents a multi-agent framework for generating physics simulation code from natural language descriptions, introducing a no...

arXiv - AI · 4 min · about 2 months ago

Nlp

[2602.12296] Adaptive traffic signal control optimization using a novel road partition and multi-channel state representation method

This article presents a novel adaptive traffic signal control method utilizing Deep Q-Networks and Proximal Policy Optimization to enhanc...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.12287] Retrieval-Augmented Self-Taught Reasoning Model with Adaptive Chain-of-Thought for ASR Named Entity Correction

This article presents a novel retrieval-augmented reasoning model designed to enhance named entity correction in automatic speech recogni...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.11247] Peak + Accumulation: A Proxy-Level Scoring Formula for Multi-Turn LLM Attack Detection

The paper presents a novel scoring formula, Peak + Accumulation, for detecting multi-turn LLM attack patterns, addressing limitations in ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.12284] A Lightweight LLM Framework for Disaster Humanitarian Information Classification

This paper presents a lightweight framework for classifying humanitarian information from social media, enhancing disaster response effic...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2511.13494] Language-Guided Invariance Probing of Vision-Language Models

This article introduces Language-Guided Invariance Probing (LGIP), a benchmark for evaluating the robustness of vision-language models (V...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.12665] Evaluating Robustness of Reasoning Models on Parameterized Logical Problems

This paper introduces a diagnostic benchmark for evaluating the robustness of reasoning models on parameterized logical problems, specifi...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.12586] Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language Models

This paper introduces McDiffuSE, a Monte Carlo Tree Search framework aimed at optimizing slot filling orders in Masked Diffusion Models, ...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.12544] Scaling Web Agent Training through Automatic Data Generation and Fine-grained Evaluation

This paper presents a scalable pipeline for generating high-quality training data for web agents, introducing a novel evaluation framewor...

arXiv - AI · 3 min · about 2 months ago

Nlp

[D] ACL ARR Jan 2026 Reviews

The article discusses three official reviews of ACL ARR Jan 2026, presenting average scores for Overall Assessment and Confidence, prompt...

Reddit - Machine Learning · 1 min · about 2 months ago

Llms

Leveraging Pre-trained Language Model Checkpoints for Encoder-Decoder Models

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 58 min · about 2 months ago

Open Source Ai

Retrieval Augmented Generation with Huggingface Transformers and Ray

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 6 min · about 2 months ago

Open Source Ai

Train a Sentence Embedding Model with 1B Training Pairs

We had to rate limit your IP (159.65.251.98). To continue using our service, create a HF account or login to your existing account, and m...

Hugging Face Blog · 1 min · about 2 months ago

Previous Page 138 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Natural Language Processing

Top This Week

Has anyone here switched to TeraBox recently? Is it actually worth it?

Enabling agent-first process redesign | MIT Technology Review

Stop Overcomplicating AI Workflows. This Is the Simple Framework

All Content

[2602.12517] Bench-MFG: A Benchmark Suite for Learning in Stationary Mean Field Games

[2602.12444] Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models

[2602.12424] RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty

[2602.12422] CacheMind: From Miss Rates to Why -- Natural-Language, Trace-Grounded Reasoning for Cache Replacement

[2602.12393] Reproducing DragDiffusion: Interactive Point-Based Editing with Diffusion Models

[2602.12342] Intrinsic Credit Assignment for Long Horizon Interaction

[2602.12315] AgenticShop: Benchmarking Agentic Product Curation for Personalized Web Shopping

[2602.12311] Perceptual Self-Reflection in Agentic Physics Simulation Code Generation

[2602.12296] Adaptive traffic signal control optimization using a novel road partition and multi-channel state representation method

[2602.12287] Retrieval-Augmented Self-Taught Reasoning Model with Adaptive Chain-of-Thought for ASR Named Entity Correction

[2602.11247] Peak + Accumulation: A Proxy-Level Scoring Formula for Multi-Turn LLM Attack Detection

[2602.12284] A Lightweight LLM Framework for Disaster Humanitarian Information Classification

[2511.13494] Language-Guided Invariance Probing of Vision-Language Models

[2602.12665] Evaluating Robustness of Reasoning Models on Parameterized Logical Problems

[2602.12586] Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language Models

[2602.12544] Scaling Web Agent Training through Automatic Data Generation and Fine-grained Evaluation

[D] ACL ARR Jan 2026 Reviews

Leveraging Pre-trained Language Model Checkpoints for Encoder-Decoder Models

Retrieval Augmented Generation with Huggingface Transformers and Ray

Train a Sentence Embedding Model with 1B Training Pairs

Related Topics

Stay updated with AI News