Natural Language Processing

Text understanding and language tasks

Top This Week

Apple’s best product in its first 50 years | The Verge
Nlp

Apple’s best product in its first 50 years | The Verge

From the Macintosh to the iPhone to the iMac to the iPod, it’s hard to pick a best Apple product ever. But we tried to do so anyway.

The Verge - AI · 4 min ·
Nlp

[D] Is lossy compression acceptable for conversational agent memory? Every system today uses knowledge graph triples — here's why I think that's wrong.

Been thinking about this and want to know if others have hit the same issue. The dominant approach for agent memory (Mem0, Zep, most RAG ...

Reddit - Machine Learning · 1 min ·
[2601.11016] Contextual Distributionally Robust Optimization with Causal and Continuous Structure: An Interpretable and Tractable Approach
Nlp

[2601.11016] Contextual Distributionally Robust Optimization with Causal and Continuous Structure: An Interpretable and Tractable Approach

Abstract page for arXiv paper 2601.11016: Contextual Distributionally Robust Optimization with Causal and Continuous Structure: An Interp...

arXiv - Machine Learning · 4 min ·

All Content

[2602.20135] KNIGHT: Knowledge Graph-Driven Multiple-Choice Question Generation with Adaptive Hardness Calibration
Llms

[2602.20135] KNIGHT: Knowledge Graph-Driven Multiple-Choice Question Generation with Adaptive Hardness Calibration

The paper introduces KNIGHT, a framework for generating multiple-choice questions using knowledge graphs, enhancing efficiency and adapta...

arXiv - AI · 4 min ·
[2503.11842] Test-Time Training Provably Improves Transformers as In-context Learners
Llms

[2503.11842] Test-Time Training Provably Improves Transformers as In-context Learners

The paper explores how Test-Time Training (TTT) enhances transformer models as in-context learners, demonstrating significant efficiency ...

arXiv - Machine Learning · 4 min ·
[2502.04591] Are We Measuring Oversmoothing in Graph Neural Networks Correctly?
Machine Learning

[2502.04591] Are We Measuring Oversmoothing in Graph Neural Networks Correctly?

This article critiques traditional metrics for measuring oversmoothing in Graph Neural Networks (GNNs) and proposes a rank-based approach...

arXiv - AI · 4 min ·
[2502.03771] vCache: Verified Semantic Prompt Caching
Llms

[2502.03771] vCache: Verified Semantic Prompt Caching

The paper presents vCache, a verified semantic prompt caching system that enhances LLM inference efficiency by dynamically adjusting simi...

arXiv - Machine Learning · 4 min ·
[2602.20089] StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues
Llms

[2602.20089] StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues

The paper presents StructXLIP, a novel approach that enhances vision-language models by integrating multimodal structural cues, improving...

arXiv - AI · 4 min ·
[2602.20065] Multilingual Large Language Models do not comprehend all natural languages to equal degrees
Llms

[2602.20065] Multilingual Large Language Models do not comprehend all natural languages to equal degrees

This article examines the performance of multilingual large language models (LLMs) across various languages, revealing that comprehension...

arXiv - AI · 4 min ·
[2602.20040] AgenticSum: An Agentic Inference-Time Framework for Faithful Clinical Text Summarization
Llms

[2602.20040] AgenticSum: An Agentic Inference-Time Framework for Faithful Clinical Text Summarization

AgenticSum presents a novel framework for improving clinical text summarization using large language models, focusing on reducing factual...

arXiv - AI · 3 min ·
[2602.20151] Conformal Risk Control for Non-Monotonic Losses
Nlp

[2602.20151] Conformal Risk Control for Non-Monotonic Losses

This article presents a novel approach to conformal risk control for non-monotonic losses, extending traditional methods to multidimensio...

arXiv - Machine Learning · 3 min ·
[2602.20122] NanoKnow: How to Know What Your Language Model Knows
Llms

[2602.20122] NanoKnow: How to Know What Your Language Model Knows

The article discusses NanoKnow, a benchmark dataset designed to understand how large language models (LLMs) acquire knowledge, using the ...

arXiv - Machine Learning · 4 min ·
[2602.19969] ReAttn: Improving Attention-based Re-ranking via Attention Re-weighting
Llms

[2602.19969] ReAttn: Improving Attention-based Re-ranking via Attention Re-weighting

The paper presents ReAttn, a novel strategy to enhance attention-based re-ranking in large language models by reducing lexical bias and i...

arXiv - AI · 3 min ·
[2602.20046] Closing the gap in multimodal medical representation alignment
Nlp

[2602.20046] Closing the gap in multimodal medical representation alignment

This paper addresses the modality gap in multimodal medical representation alignment, proposing a framework to enhance alignment between ...

arXiv - Machine Learning · 3 min ·
[2602.19843] MAS-FIRE: Fault Injection and Reliability Evaluation for LLM-Based Multi-Agent Systems
Llms

[2602.19843] MAS-FIRE: Fault Injection and Reliability Evaluation for LLM-Based Multi-Agent Systems

The paper presents MAS-FIRE, a framework for evaluating the reliability of LLM-based Multi-Agent Systems through fault injection, address...

arXiv - AI · 4 min ·
[2602.19718] Carbon-Aware Governance Gates: An Architecture for Sustainable GenAI Development
Generative Ai

[2602.19718] Carbon-Aware Governance Gates: An Architecture for Sustainable GenAI Development

The paper proposes Carbon-Aware Governance Gates (CAGG) to integrate sustainability into Generative AI development, addressing the increa...

arXiv - AI · 3 min ·
[2602.19799] Path-conditioned training: a principled way to rescale ReLU neural networks
Machine Learning

[2602.19799] Path-conditioned training: a principled way to rescale ReLU neural networks

The paper presents a novel approach to rescale ReLU neural networks through path-conditioned training, enhancing training dynamics and ef...

arXiv - Machine Learning · 3 min ·
[2602.19702] DReX: An Explainable Deep Learning-based Multimodal Recommendation Framework
Machine Learning

[2602.19702] DReX: An Explainable Deep Learning-based Multimodal Recommendation Framework

DReX is a novel multimodal recommendation framework that enhances user and item representation through explainable deep learning, address...

arXiv - AI · 4 min ·
[2602.19778] Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation
Machine Learning

[2602.19778] Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation

The paper presents a novel two-stage training approach for Automatic Chord Recognition (ACR), utilizing pseudo-labeling and knowledge dis...

arXiv - Machine Learning · 4 min ·
[2602.19548] Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining
Llms

[2602.19548] Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining

This paper explores the limitations of using a single extractor for HTML-to-text conversion in LLM pretraining, proposing a union of mult...

arXiv - Machine Learning · 3 min ·
[2602.19585] Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis
Nlp

[2602.19585] Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis

The paper presents a Tri-Subspace Disentanglement (TSD) framework for Multimodal Sentiment Analysis, enhancing representation by factorin...

arXiv - AI · 3 min ·
[2602.19461] Laplacian Multi-scale Flow Matching for Generative Modeling
Machine Learning

[2602.19461] Laplacian Multi-scale Flow Matching for Generative Modeling

The paper presents Laplacian Multi-scale Flow Matching (LapFlow), a new framework for image generative modeling that enhances flow matchi...

arXiv - Machine Learning · 3 min ·
[2602.19569] Temporal-Aware Heterogeneous Graph Reasoning with Multi-View Fusion for Temporal Question Answering
Ai Safety

[2602.19569] Temporal-Aware Heterogeneous Graph Reasoning with Multi-View Fusion for Temporal Question Answering

This paper presents a novel framework for Temporal Question Answering over Temporal Knowledge Graphs, addressing limitations in temporal ...

arXiv - AI · 3 min ·
Previous Page 87 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime