Natural Language Processing

Text understanding and language tasks

Top This Week

NLP

Has anyone here switched to TeraBox recently? Is it actually worth it?

I’ve been seeing more people talk about TeraBox lately, especially around storage for AI-related workflows. Curious if anyone here has us...

Reddit - Artificial Intelligence · 1 min ·
NLP

Enabling agent-first process redesign | MIT Technology Review

Unlike static, rules-based systems, AI agents can learn, adapt, and optimize processes dynamically. As they interact with data, systems, ...

MIT Technology Review - AI · 4 min ·
LLMs

Stop Overcomplicating AI Workflows. This Is the Simple Framework

I’ve been working on building an agentic AI workflow system for business use cases and one thing became very clear very quickly. This is ...

Reddit - Artificial Intelligence · 1 min ·

All Content

NLP

[2602.12517] Bench-MFG: A Benchmark Suite for Learning in Stationary Mean Field Games

The paper presents Bench-MFG, a benchmark suite designed to standardize evaluations in learning for stationary Mean Field Games, addressi...

arXiv - Machine Learning · 4 min ·
Machine Learning

[2602.12444] Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models

This paper presents a novel recovery-based shielding framework for safe reinforcement learning (RL) using Gaussian process dynamics model...

arXiv - AI · 3 min ·
LLMs

[2602.12424] RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty

The paper introduces RankLLM, a framework for evaluating large language models (LLMs) by quantifying question difficulty, enhancing model...

arXiv - AI · 4 min ·
LLMs

[2602.12422] CacheMind: From Miss Rates to Why -- Natural-Language, Trace-Grounded Reasoning for Cache Replacement

CacheMind introduces a novel tool for cache replacement, leveraging natural language processing and trace-grounded reasoning to enhance C...

arXiv - Machine Learning · 4 min ·
Machine Learning

[2602.12393] Reproducing DragDiffusion: Interactive Point-Based Editing with Diffusion Models

This article presents a reproducibility study of DragDiffusion, a method for interactive point-based image editing using diffusion models...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.12342] Intrinsic Credit Assignment for Long Horizon Interaction

This article presents a novel approach, Belief-RL, for training agents to navigate uncertainty over long horizons by utilizing intrins...

arXiv - Machine Learning · 3 min ·
NLP

[2602.12315] AgenticShop: Benchmarking Agentic Product Curation for Personalized Web Shopping

The paper presents AgenticShop, a benchmark for evaluating agentic systems in personalized web shopping, addressing gaps in current evalu...

arXiv - AI · 4 min ·
NLP

[2602.12311] Perceptual Self-Reflection in Agentic Physics Simulation Code Generation

This article presents a multi-agent framework for generating physics simulation code from natural language descriptions, introducing a no...

arXiv - AI · 4 min ·
NLP

[2602.12296] Adaptive traffic signal control optimization using a novel road partition and multi-channel state representation method

This article presents a novel adaptive traffic signal control method utilizing Deep Q-Networks and Proximal Policy Optimization to enhanc...

arXiv - AI · 4 min ·
LLMs

[2602.12287] Retrieval-Augmented Self-Taught Reasoning Model with Adaptive Chain-of-Thought for ASR Named Entity Correction

This article presents a novel retrieval-augmented reasoning model designed to enhance named entity correction in automatic speech recogni...

arXiv - AI · 3 min ·
LLMs

[2602.11247] Peak + Accumulation: A Proxy-Level Scoring Formula for Multi-Turn LLM Attack Detection

The paper presents a novel scoring formula, Peak + Accumulation, for detecting multi-turn LLM attack patterns, addressing limitations in ...

arXiv - AI · 4 min ·
LLMs

[2602.12284] A Lightweight LLM Framework for Disaster Humanitarian Information Classification

This paper presents a lightweight framework for classifying humanitarian information from social media, enhancing disaster response effic...

arXiv - Machine Learning · 3 min ·
LLMs

[2511.13494] Language-Guided Invariance Probing of Vision-Language Models

This article introduces Language-Guided Invariance Probing (LGIP), a benchmark for evaluating the robustness of vision-language models (V...

arXiv - AI · 3 min ·
LLMs

[2602.12665] Evaluating Robustness of Reasoning Models on Parameterized Logical Problems

This paper introduces a diagnostic benchmark for evaluating the robustness of reasoning models on parameterized logical problems, specifi...

arXiv - AI · 3 min ·
LLMs

[2602.12586] Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language Models

This paper introduces McDiffuSE, a Monte Carlo Tree Search framework aimed at optimizing slot filling orders in Masked Diffusion Models, ...

arXiv - AI · 3 min ·
Machine Learning

[2602.12544] Scaling Web Agent Training through Automatic Data Generation and Fine-grained Evaluation

This paper presents a scalable pipeline for generating high-quality training data for web agents, introducing a novel evaluation framewor...

arXiv - AI · 3 min ·
NLP

[D] ACL ARR Jan 2026 Reviews

The article discusses three official reviews of ACL ARR Jan 2026, presenting average scores for Overall Assessment and Confidence, prompt...

Reddit - Machine Learning · 1 min ·
LLMs

Leveraging Pre-trained Language Model Checkpoints for Encoder-Decoder Models

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 58 min ·
Open Source AI

Retrieval Augmented Generation with Huggingface Transformers and Ray

Hugging Face Blog · 6 min ·
Open Source AI

Train a Sentence Embedding Model with 1B Training Pairs

Hugging Face Blog · 1 min ·