Natural Language Processing

Text understanding and language tasks

Top This Week

LLMs

[2603.24326] Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing

arXiv - AI · 4 min · about 6 hours ago

NLP

[2601.13508] Autonomous Computational Catalysis Research via Agentic Systems

arXiv - AI · 3 min · about 6 hours ago

Machine Learning

[2510.20847] Integrated representational signatures strengthen specificity in brains and models

arXiv - AI · 4 min · about 6 hours ago

All Content

LLMs

[2506.11087] Enhancing Delta Compression in LLMs via SVD-based Quantization Error Minimization

This article presents PrinMix, a new SVD-based framework for enhancing delta compression in large language models (LLMs), addressing stor...

arXiv - AI · 4 min · about 2 months ago

NLP

[2506.07272] A Cramér-von Mises Approach to Incentivizing Truthful Data Sharing

This paper introduces a novel approach using the Cramér-von Mises statistic to create incentive mechanisms that promote truthful data sha...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2510.10193] SAFER: Risk-Constrained Sample-then-Filter in Large Language Models

The paper presents SAFER, a two-stage risk control framework for large language models (LLMs) that enhances output trustworthiness in ris...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2509.25260] Internal Planning in Language Models: Characterizing Horizon and Branch Awareness

This article explores how decoder-only language models engage in internal planning, focusing on their ability to organize computations fo...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2505.11771] Residual Feature Integration is Sufficient to Prevent Negative Transfer

This paper presents a novel approach to prevent negative transfer in transfer learning by integrating residual features from pretrained m...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2503.08796] Robust Multi-Objective Controlled Decoding of Large Language Models

This article presents Robust Multi-Objective Decoding (RMOD), an innovative algorithm designed to enhance the performance of Large Langua...

arXiv - AI · 3 min · about 2 months ago

LLMs

[2502.05376] LO-BCQ: Block Clustered Quantization for 4-bit (W4A4) LLM Inference

The paper presents LO-BCQ, a novel block clustered quantization method for 4-bit LLM inference, achieving less than 1% accuracy loss whil...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2502.02415] Fast Graph Generation via Autoregressive Noisy Filtration Modeling

This paper presents Autoregressive Noisy Filtration Modeling (ANFM), a new framework for fast graph generation that balances quality and ...

arXiv - Machine Learning · 3 min · about 2 months ago

NLP

[2602.14788] VIPA: Visual Informative Part Attention for Referring Image Segmentation

The paper presents VIPA, a novel framework for Referring Image Segmentation that enhances attention mechanisms by leveraging informative ...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.14778] A Geometric Analysis of Small-sized Language Model Hallucinations

This paper explores hallucinations in small-sized language models through a geometric lens, demonstrating that genuine responses c...

arXiv - AI · 3 min · about 2 months ago

LLMs

[2602.14763] Unlocking Reasoning Capability on Machine Translation in Large Language Models

The paper evaluates the impact of reasoning-oriented large language models on machine translation, revealing that explicit reasoning ofte...

arXiv - AI · 3 min · about 2 months ago

AI Startups

[2602.14710] Orcheo: A Modular Full-Stack Platform for Conversational Search

Orcheo is an open-source platform designed to streamline conversational search by offering a modular architecture, production-ready infra...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.15006] Distributed Quantum Gaussian Processes for Multi-Agent Systems

This article presents a novel Distributed Quantum Gaussian Process (DQGP) method for multi-agent systems, enhancing modeling capabilities...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.14885] Drift-Diffusion Matching: Embedding dynamics in latent manifolds of asymmetric neural networks

The paper introduces Drift-Diffusion Matching, a framework for training recurrent neural networks (RNNs) to model complex stochastic dyna...

arXiv - Machine Learning · 4 min · about 2 months ago

NLP

[2602.14846] Multi-dimensional Persistent Sheaf Laplacians for Image Analysis

This paper introduces a multi-dimensional persistent sheaf Laplacian (MPSL) framework for image analysis, enhancing dimensionality reduct...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2602.14536] Explainable Token-level Noise Filtering for LLM Fine-tuning Datasets

The paper presents XTF, an explainable token-level noise filtering framework designed to enhance the fine-tuning of Large Language Models...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.14828] Exploring the limits of pre-trained embeddings in machine-guided protein design: a case study on predicting AAV vector viability

This study evaluates the effectiveness of pre-trained embeddings in machine-guided protein design, focusing on predicting AAV vector viab...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2602.14488] BETA-Labeling for Multilingual Dataset Construction in Low-Resource IR

This article presents the BETA-labeling framework for constructing a Bangla IR dataset, addressing challenges in low-resource languages a...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.14771] GOT-JEPA: Generic Object Tracking with Model Adaptation and Occlusion Handling using Joint-Embedding Predictive Architecture

GOT-JEPA introduces a novel framework for generic object tracking that enhances model adaptation and occlusion handling, improving robust...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.14464] CoCoDiff: Correspondence-Consistent Diffusion Model for Fine-grained Style Transfer

The paper presents CoCoDiff, a novel framework for fine-grained style transfer in images, emphasizing semantic correspondence and achievi...

arXiv - AI · 3 min · about 2 months ago
