Natural Language Processing

Text understanding and language tasks

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Nlp

Apple’s best product in its first 50 years | The Verge

From the Macintosh to the iPhone to the iMac to the iPod, it’s hard to pick a best Apple product ever. But we tried to do so anyway.

The Verge - AI · 4 min · about 2 hours ago

Nlp

[D] Is lossy compression acceptable for conversational agent memory? Every system today uses knowledge graph triples — here's why I think that's wrong.

Been thinking about this and want to know if others have hit the same issue. The dominant approach for agent memory (Mem0, Zep, most RAG ...

Reddit - Machine Learning · 1 min · about 5 hours ago

Nlp

[2601.11016] Contextual Distributionally Robust Optimization with Causal and Continuous Structure: An Interpretable and Tractable Approach

Abstract page for arXiv paper 2601.11016: Contextual Distributionally Robust Optimization with Causal and Continuous Structure: An Interp...

arXiv - Machine Learning · 4 min · about 9 hours ago

All Content

Llms

[2602.20135] KNIGHT: Knowledge Graph-Driven Multiple-Choice Question Generation with Adaptive Hardness Calibration

The paper introduces KNIGHT, a framework for generating multiple-choice questions using knowledge graphs, enhancing efficiency and adapta...

arXiv - AI · 4 min · about 1 month ago

Llms

[2503.11842] Test-Time Training Provably Improves Transformers as In-context Learners

The paper explores how Test-Time Training (TTT) enhances transformer models as in-context learners, demonstrating significant efficiency ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2502.04591] Are We Measuring Oversmoothing in Graph Neural Networks Correctly?

This article critiques traditional metrics for measuring oversmoothing in Graph Neural Networks (GNNs) and proposes a rank-based approach...

arXiv - AI · 4 min · about 1 month ago

Llms

[2502.03771] vCache: Verified Semantic Prompt Caching

The paper presents vCache, a verified semantic prompt caching system that enhances LLM inference efficiency by dynamically adjusting simi...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.20089] StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues

The paper presents StructXLIP, a novel approach that enhances vision-language models by integrating multimodal structural cues, improving...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.20065] Multilingual Large Language Models do not comprehend all natural languages to equal degrees

This article examines the performance of multilingual large language models (LLMs) across various languages, revealing that comprehension...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.20040] AgenticSum: An Agentic Inference-Time Framework for Faithful Clinical Text Summarization

AgenticSum presents a novel framework for improving clinical text summarization using large language models, focusing on reducing factual...

arXiv - AI · 3 min · about 1 month ago

Nlp

[2602.20151] Conformal Risk Control for Non-Monotonic Losses

This article presents a novel approach to conformal risk control for non-monotonic losses, extending traditional methods to multidimensio...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.20122] NanoKnow: How to Know What Your Language Model Knows

The article discusses NanoKnow, a benchmark dataset designed to understand how large language models (LLMs) acquire knowledge, using the ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.19969] ReAttn: Improving Attention-based Re-ranking via Attention Re-weighting

The paper presents ReAttn, a novel strategy to enhance attention-based re-ranking in large language models by reducing lexical bias and i...

arXiv - AI · 3 min · about 1 month ago

Nlp

[2602.20046] Closing the gap in multimodal medical representation alignment

This paper addresses the modality gap in multimodal medical representation alignment, proposing a framework to enhance alignment between ...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19843] MAS-FIRE: Fault Injection and Reliability Evaluation for LLM-Based Multi-Agent Systems

The paper presents MAS-FIRE, a framework for evaluating the reliability of LLM-based Multi-Agent Systems through fault injection, address...

arXiv - AI · 4 min · about 1 month ago

Generative Ai

[2602.19718] Carbon-Aware Governance Gates: An Architecture for Sustainable GenAI Development

The paper proposes Carbon-Aware Governance Gates (CAGG) to integrate sustainability into Generative AI development, addressing the increa...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19799] Path-conditioned training: a principled way to rescale ReLU neural networks

The paper presents a novel approach to rescale ReLU neural networks through path-conditioned training, enhancing training dynamics and ef...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.19702] DReX: An Explainable Deep Learning-based Multimodal Recommendation Framework

DReX is a novel multimodal recommendation framework that enhances user and item representation through explainable deep learning, address...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19778] Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation

The paper presents a novel two-stage training approach for Automatic Chord Recognition (ACR), utilizing pseudo-labeling and knowledge dis...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.19548] Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining

This paper explores the limitations of using a single extractor for HTML-to-text conversion in LLM pretraining, proposing a union of mult...

arXiv - Machine Learning · 3 min · about 1 month ago

Nlp

[2602.19585] Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis

The paper presents a Tri-Subspace Disentanglement (TSD) framework for Multimodal Sentiment Analysis, enhancing representation by factorin...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19461] Laplacian Multi-scale Flow Matching for Generative Modeling

The paper presents Laplacian Multi-scale Flow Matching (LapFlow), a new framework for image generative modeling that enhances flow matchi...

arXiv - Machine Learning · 3 min · about 1 month ago

Ai Safety

[2602.19569] Temporal-Aware Heterogeneous Graph Reasoning with Multi-View Fusion for Temporal Question Answering

This paper presents a novel framework for Temporal Question Answering over Temporal Knowledge Graphs, addressing limitations in temporal ...

arXiv - AI · 3 min · about 1 month ago

Previous Page 87 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Natural Language Processing

Top This Week

Apple’s best product in its first 50 years | The Verge

[D] Is lossy compression acceptable for conversational agent memory? Every system today uses knowledge graph triples — here's why I think that's wrong.

[2601.11016] Contextual Distributionally Robust Optimization with Causal and Continuous Structure: An Interpretable and Tractable Approach

All Content

[2602.20135] KNIGHT: Knowledge Graph-Driven Multiple-Choice Question Generation with Adaptive Hardness Calibration

[2503.11842] Test-Time Training Provably Improves Transformers as In-context Learners

[2502.04591] Are We Measuring Oversmoothing in Graph Neural Networks Correctly?

[2502.03771] vCache: Verified Semantic Prompt Caching

[2602.20089] StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues

[2602.20065] Multilingual Large Language Models do not comprehend all natural languages to equal degrees

[2602.20040] AgenticSum: An Agentic Inference-Time Framework for Faithful Clinical Text Summarization

[2602.20151] Conformal Risk Control for Non-Monotonic Losses

[2602.20122] NanoKnow: How to Know What Your Language Model Knows

[2602.19969] ReAttn: Improving Attention-based Re-ranking via Attention Re-weighting

[2602.20046] Closing the gap in multimodal medical representation alignment

[2602.19843] MAS-FIRE: Fault Injection and Reliability Evaluation for LLM-Based Multi-Agent Systems

[2602.19718] Carbon-Aware Governance Gates: An Architecture for Sustainable GenAI Development

[2602.19799] Path-conditioned training: a principled way to rescale ReLU neural networks

[2602.19702] DReX: An Explainable Deep Learning-based Multimodal Recommendation Framework

[2602.19778] Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation

[2602.19548] Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining

[2602.19585] Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis

[2602.19461] Laplacian Multi-scale Flow Matching for Generative Modeling

[2602.19569] Temporal-Aware Heterogeneous Graph Reasoning with Multi-View Fusion for Temporal Question Answering

Related Topics

Stay updated with AI News