Natural Language Processing

Text understanding and language tasks

Top This Week

Nlp

Has anyone here switched to TeraBox recently? Is it actually worth it?

I’ve been seeing more people talk about TeraBox lately, especially around storage for AI-related workflows. Curious if anyone here has us...

Reddit - Artificial Intelligence · 1 min ·
Enabling agent-first process redesign | MIT Technology Review
Nlp

Enabling agent-first process redesign | MIT Technology Review

Unlike static, rules-based systems, AI agents can learn, adapt, and optimize processes dynamically. As they interact with data, systems, ...

MIT Technology Review · 4 min ·
Llms

Stop Overcomplicating AI Workflows. This Is the Simple Framework

I’ve been working on building an agentic AI workflow system for business use cases and one thing became very clear very quickly. This is ...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.13165] Asynchronous Verified Semantic Caching for Tiered LLM Architectures
Llms

[2602.13165] Asynchronous Verified Semantic Caching for Tiered LLM Architectures

The paper introduces Krites, an asynchronous caching policy for large language models (LLMs) that enhances semantic caching efficiency wh...

arXiv - AI · 4 min ·
[2602.13047] Can we trust AI to detect healthy multilingual English speakers among the cognitively impaired cohort in the UK? An investigation using real-world conversational speech
Machine Learning

[2602.13047] Can we trust AI to detect healthy multilingual English speakers among the cognitively impaired cohort in the UK? An investigation using real-world conversational speech

This study investigates the reliability of AI in detecting cognitive impairment among multilingual English speakers in the UK, revealing ...

arXiv - AI · 4 min ·
[2602.12996] Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models
Llms

[2602.12996] Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models

This article presents a novel meta-cognitive framework aimed at enhancing knowledge augmentation in Large Language Models (LLMs), address...

arXiv - AI · 3 min ·
[2602.12924] Never say never: Exploring the effects of available knowledge on agent persuasiveness in controlled physiotherapy motivation dialogues
Robotics

[2602.12924] Never say never: Exploring the effects of available knowledge on agent persuasiveness in controlled physiotherapy motivation dialogues

This article examines how the availability of knowledge influences the persuasiveness of generative social agents (GSAs) in physiotherapy...

arXiv - AI · 4 min ·
[2602.12892] RADAR: Revealing Asymmetric Development of Abilities in MLLM Pre-training
Llms

[2602.12892] RADAR: Revealing Asymmetric Development of Abilities in MLLM Pre-training

The paper presents RADAR, a novel evaluation framework for Multi-modal Large Language Models (MLLMs) that addresses performance bottlenec...

arXiv - AI · 4 min ·
[2602.12833] TRACE: Temporal Reasoning via Agentic Context Evolution for Streaming Electronic Health Records (EHRs)
Llms

[2602.12833] TRACE: Temporal Reasoning via Agentic Context Evolution for Streaming Electronic Health Records (EHRs)

TRACE introduces a novel framework for temporal reasoning in electronic health records, enhancing prediction accuracy and clinical safety...

arXiv - Machine Learning · 4 min ·
[2602.12828] GRAIL: Geometry-Aware Retrieval-Augmented Inference with LLMs over Hyperbolic Representations of Patient Trajectories
Llms

[2602.12828] GRAIL: Geometry-Aware Retrieval-Augmented Inference with LLMs over Hyperbolic Representations of Patient Trajectories

The GRAIL framework enhances next-visit event prediction in healthcare by utilizing geometry-aware retrieval and hyperbolic representatio...

arXiv - Machine Learning · 3 min ·
[2602.12811] Left-right asymmetry in predicting brain activity from LLMs' representations emerges with their formal linguistic competence
Llms

[2602.12811] Left-right asymmetry in predicting brain activity from LLMs' representations emerges with their formal linguistic competence

This article explores how left-right asymmetry in predicting brain activity from large language models (LLMs) correlates with their forma...

arXiv - AI · 4 min ·
[2602.12806] RAT-Bench: A Comprehensive Benchmark for Text Anonymization
Llms

[2602.12806] RAT-Bench: A Comprehensive Benchmark for Text Anonymization

RAT-Bench introduces a comprehensive benchmark for evaluating text anonymization tools based on their effectiveness in preventing re-iden...

arXiv - Machine Learning · 4 min ·
[2602.12798] Can Neural Networks Provide Latent Embeddings for Telemetry-Aware Greedy Routing?
Machine Learning

[2602.12798] Can Neural Networks Provide Latent Embeddings for Telemetry-Aware Greedy Routing?

The paper explores a novel algorithm, Placer, which utilizes Message Passing Networks to create latent embeddings for telemetry-aware gre...

arXiv - Machine Learning · 3 min ·
[2602.12783] SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise
Nlp

[2602.12783] SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

The paper introduces SQuTR, a new benchmark for evaluating the robustness of spoken query retrieval systems under various acoustic noise ...

arXiv - AI · 4 min ·
[2602.12705] MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs
Llms

[2602.12705] MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs

MedXIAOHE is a medical vision-language foundation model that enhances medical understanding and reasoning in clinical applications, achie...

arXiv - AI · 3 min ·
[2602.12691] ALOE: Action-Level Off-Policy Evaluation for Vision-Language-Action Model Post-Training
Machine Learning

[2602.12691] ALOE: Action-Level Off-Policy Evaluation for Vision-Language-Action Model Post-Training

The paper presents ALOE, an action-level off-policy evaluation framework aimed at enhancing vision-language-action models through reinfor...

arXiv - AI · 4 min ·
[2602.12643] Unifying Model-Free Efficiency and Model-Based Representations via Latent Dynamics
Machine Learning

[2602.12643] Unifying Model-Free Efficiency and Model-Based Representations via Latent Dynamics

The paper presents Unified Latent Dynamics (ULD), a novel reinforcement learning algorithm that combines the efficiency of model-free met...

arXiv - Machine Learning · 4 min ·
[2602.12642] Beyond Normalization: Rethinking the Partition Function as a Difficulty Scheduler for RLVR
Llms

[2602.12642] Beyond Normalization: Rethinking the Partition Function as a Difficulty Scheduler for RLVR

This article presents a novel approach to reinforcement learning by reinterpreting the partition function as a difficulty scheduler, enha...

arXiv - AI · 4 min ·
[2602.12609] QuEPT: Quantized Elastic Precision Transformers with One-Shot Calibration for Multi-Bit Switching
Llms

[2602.12609] QuEPT: Quantized Elastic Precision Transformers with One-Shot Calibration for Multi-Bit Switching

The paper presents QuEPT, a novel quantization method for Transformers that enables efficient multi-bit switching with one-shot calibrati...

arXiv - AI · 4 min ·
[2602.12601] HyperMLP: An Integrated Perspective for Sequence Modeling
Machine Learning

[2602.12601] HyperMLP: An Integrated Perspective for Sequence Modeling

The paper presents HyperMLP, a novel approach to sequence modeling that reinterprets autoregressive attention as a dynamic two-layer MLP,...

arXiv - Machine Learning · 3 min ·
[2602.12593] RQ-GMM: Residual Quantized Gaussian Mixture Model for Multimodal Semantic Discretization in CTR Prediction
Machine Learning

[2602.12593] RQ-GMM: Residual Quantized Gaussian Mixture Model for Multimodal Semantic Discretization in CTR Prediction

The paper introduces RQ-GMM, a novel model for improving click-through rate (CTR) prediction by effectively discretizing multimodal embed...

arXiv - AI · 3 min ·
[2602.12574] Monte Carlo Tree Search with Reasoning Path Refinement for Small Language Models in Conversational Text-to-NoSQL
Llms

[2602.12574] Monte Carlo Tree Search with Reasoning Path Refinement for Small Language Models in Conversational Text-to-NoSQL

This paper presents a novel framework, Stage-MCTS, which enhances small language models' ability to generate NoSQL queries through conver...

arXiv - AI · 4 min ·
[2602.12546] Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR
Llms

[2602.12546] Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR

The paper presents a decoder-only Conformer model for automatic speech recognition (ASR) that integrates speech and text processing witho...

arXiv - AI · 3 min ·
Previous Page 137 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime