Natural Language Processing

Text understanding and language tasks

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Nlp

Has anyone here switched to TeraBox recently? Is it actually worth it?

I’ve been seeing more people talk about TeraBox lately, especially around storage for AI-related workflows. Curious if anyone here has us...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Nlp

Enabling agent-first process redesign | MIT Technology Review

Unlike static, rules-based systems, AI agents can learn, adapt, and optimize processes dynamically. As they interact with data, systems, ...

MIT Technology Review · 4 min · about 5 hours ago

Llms

Stop Overcomplicating AI Workflows. This Is the Simple Framework

I’ve been working on building an agentic AI workflow system for business use cases and one thing became very clear very quickly. This is ...

Reddit - Artificial Intelligence · 1 min · about 6 hours ago

All Content

Llms

[2602.13165] Asynchronous Verified Semantic Caching for Tiered LLM Architectures

The paper introduces Krites, an asynchronous caching policy for large language models (LLMs) that enhances semantic caching efficiency wh...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.13047] Can we trust AI to detect healthy multilingual English speakers among the cognitively impaired cohort in the UK? An investigation using real-world conversational speech

This study investigates the reliability of AI in detecting cognitive impairment among multilingual English speakers in the UK, revealing ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.12996] Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models

This article presents a novel meta-cognitive framework aimed at enhancing knowledge augmentation in Large Language Models (LLMs), address...

arXiv - AI · 3 min · about 2 months ago

Robotics

[2602.12924] Never say never: Exploring the effects of available knowledge on agent persuasiveness in controlled physiotherapy motivation dialogues

This article examines how the availability of knowledge influences the persuasiveness of generative social agents (GSAs) in physiotherapy...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.12892] RADAR: Revealing Asymmetric Development of Abilities in MLLM Pre-training

The paper presents RADAR, a novel evaluation framework for Multi-modal Large Language Models (MLLMs) that addresses performance bottlenec...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.12833] TRACE: Temporal Reasoning via Agentic Context Evolution for Streaming Electronic Health Records (EHRs)

TRACE introduces a novel framework for temporal reasoning in electronic health records, enhancing prediction accuracy and clinical safety...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.12828] GRAIL: Geometry-Aware Retrieval-Augmented Inference with LLMs over Hyperbolic Representations of Patient Trajectories

The GRAIL framework enhances next-visit event prediction in healthcare by utilizing geometry-aware retrieval and hyperbolic representatio...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.12811] Left-right asymmetry in predicting brain activity from LLMs' representations emerges with their formal linguistic competence

This article explores how left-right asymmetry in predicting brain activity from large language models (LLMs) correlates with their forma...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.12806] RAT-Bench: A Comprehensive Benchmark for Text Anonymization

RAT-Bench introduces a comprehensive benchmark for evaluating text anonymization tools based on their effectiveness in preventing re-iden...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.12798] Can Neural Networks Provide Latent Embeddings for Telemetry-Aware Greedy Routing?

The paper explores a novel algorithm, Placer, which utilizes Message Passing Networks to create latent embeddings for telemetry-aware gre...

arXiv - Machine Learning · 3 min · about 2 months ago

Nlp

[2602.12783] SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

The paper introduces SQuTR, a new benchmark for evaluating the robustness of spoken query retrieval systems under various acoustic noise ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.12705] MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs

MedXIAOHE is a medical vision-language foundation model that enhances medical understanding and reasoning in clinical applications, achie...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.12691] ALOE: Action-Level Off-Policy Evaluation for Vision-Language-Action Model Post-Training

The paper presents ALOE, an action-level off-policy evaluation framework aimed at enhancing vision-language-action models through reinfor...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.12643] Unifying Model-Free Efficiency and Model-Based Representations via Latent Dynamics

The paper presents Unified Latent Dynamics (ULD), a novel reinforcement learning algorithm that combines the efficiency of model-free met...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.12642] Beyond Normalization: Rethinking the Partition Function as a Difficulty Scheduler for RLVR

This article presents a novel approach to reinforcement learning by reinterpreting the partition function as a difficulty scheduler, enha...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.12609] QuEPT: Quantized Elastic Precision Transformers with One-Shot Calibration for Multi-Bit Switching

The paper presents QuEPT, a novel quantization method for Transformers that enables efficient multi-bit switching with one-shot calibrati...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.12601] HyperMLP: An Integrated Perspective for Sequence Modeling

The paper presents HyperMLP, a novel approach to sequence modeling that reinterprets autoregressive attention as a dynamic two-layer MLP,...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.12593] RQ-GMM: Residual Quantized Gaussian Mixture Model for Multimodal Semantic Discretization in CTR Prediction

The paper introduces RQ-GMM, a novel model for improving click-through rate (CTR) prediction by effectively discretizing multimodal embed...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.12574] Monte Carlo Tree Search with Reasoning Path Refinement for Small Language Models in Conversational Text-to-NoSQL

This paper presents a novel framework, Stage-MCTS, which enhances small language models' ability to generate NoSQL queries through conver...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.12546] Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR

The paper presents a decoder-only Conformer model for automatic speech recognition (ASR) that integrates speech and text processing witho...

arXiv - AI · 3 min · about 2 months ago

Previous Page 137 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Natural Language Processing

Top This Week

Has anyone here switched to TeraBox recently? Is it actually worth it?

Enabling agent-first process redesign | MIT Technology Review

Stop Overcomplicating AI Workflows. This Is the Simple Framework

All Content

[2602.13165] Asynchronous Verified Semantic Caching for Tiered LLM Architectures

[2602.13047] Can we trust AI to detect healthy multilingual English speakers among the cognitively impaired cohort in the UK? An investigation using real-world conversational speech

[2602.12996] Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models

[2602.12924] Never say never: Exploring the effects of available knowledge on agent persuasiveness in controlled physiotherapy motivation dialogues

[2602.12892] RADAR: Revealing Asymmetric Development of Abilities in MLLM Pre-training

[2602.12833] TRACE: Temporal Reasoning via Agentic Context Evolution for Streaming Electronic Health Records (EHRs)

[2602.12828] GRAIL: Geometry-Aware Retrieval-Augmented Inference with LLMs over Hyperbolic Representations of Patient Trajectories

[2602.12811] Left-right asymmetry in predicting brain activity from LLMs' representations emerges with their formal linguistic competence

[2602.12806] RAT-Bench: A Comprehensive Benchmark for Text Anonymization

[2602.12798] Can Neural Networks Provide Latent Embeddings for Telemetry-Aware Greedy Routing?

[2602.12783] SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

[2602.12705] MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs

[2602.12691] ALOE: Action-Level Off-Policy Evaluation for Vision-Language-Action Model Post-Training

[2602.12643] Unifying Model-Free Efficiency and Model-Based Representations via Latent Dynamics

[2602.12642] Beyond Normalization: Rethinking the Partition Function as a Difficulty Scheduler for RLVR

[2602.12609] QuEPT: Quantized Elastic Precision Transformers with One-Shot Calibration for Multi-Bit Switching

[2602.12601] HyperMLP: An Integrated Perspective for Sequence Modeling

[2602.12593] RQ-GMM: Residual Quantized Gaussian Mixture Model for Multimodal Semantic Discretization in CTR Prediction

[2602.12574] Monte Carlo Tree Search with Reasoning Path Refinement for Small Language Models in Conversational Text-to-NoSQL

[2602.12546] Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR

Related Topics

Stay updated with AI News