Natural Language Processing

Text understanding and language tasks

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[2603.24326] Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing

Abstract page for arXiv paper 2603.24326: Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing

arXiv - AI · 4 min · 37 minutes ago

Nlp

[2601.13508] Autonomous Computational Catalysis Research via Agentic Systems

Abstract page for arXiv paper 2601.13508: Autonomous Computational Catalysis Research via Agentic Systems

arXiv - AI · 3 min · 37 minutes ago

Machine Learning

[2510.20847] Integrated representational signatures strengthen specificity in brains and models

Abstract page for arXiv paper 2510.20847: Integrated representational signatures strengthen specificity in brains and models

arXiv - AI · 4 min · 37 minutes ago

All Content

Llms

[2602.15183] Seeing to Generalize: How Visual Data Corrects Binding Shortcuts

This article explores how Vision Language Models (VLMs) enhance performance on text-only tasks by correcting binding shortcuts through vi...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15155] Refine Now, Query Fast: A Decoupled Refinement Paradigm for Implicit Neural Fields

The paper presents a Decoupled Representation Refinement (DRR) paradigm for Implicit Neural Representations (INRs), enhancing speed and f...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15089] Hybrid Feature Learning with Time Series Embeddings for Equipment Anomaly Prediction

This study presents a hybrid approach for equipment anomaly prediction by combining time series embeddings with statistical features, ach...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[D] Semantic Compression Vectors in LLMs: A Field Study on Topic Persistence in 5.1 vs 4o Models

This article explores the effectiveness of Semantic Compression Vectors (SCVs) in large language models (LLMs), comparing the 5.1 and 4o ...

Reddit - Machine Learning · 1 min · about 2 months ago

Machine Learning

[D] We tested the same INT8 model on 5 Snapdragon chipsets. Accuracy ranged from 93% to 71%. Same weights, same ONNX file.

This article presents findings from testing an INT8 model across five Snapdragon chipsets, revealing significant variations in accuracy, ...

Reddit - Machine Learning · 1 min · about 2 months ago

Llms

Live demo: This is what AI shopping actually looks like when stores serve structured data via UCP

This article discusses a live demo showcasing AI shopping experiences using structured data via the Universal Commerce Protocol (UCP), hi...

Reddit - Artificial Intelligence · 1 min · about 2 months ago

Open Source Ai

NVIDIA Nemotron 2 Nano 9B Japanese: 日本のソブリンAIを支える最先端小規模言語モデル

NVIDIA has launched the Nemotron-Nano-9B-v2-Japanese, a lightweight language model designed to enhance Japanese language understanding an...

Hugging Face Blog · 2 min · about 2 months ago

Machine Learning

Cohere launches a family of open multilingual models | TechCrunch

Cohere has launched Tiny Aya, a family of open multilingual models that support over 70 languages and can run on everyday devices, enhanc...

TechCrunch - AI · 5 min · about 2 months ago

Machine Learning

[2602.10058] Evaluating Disentangled Representations for Controllable Music Generation

This article evaluates disentangled representations in music generation, focusing on their effectiveness for controllable synthesis and i...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.10551] C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning

The paper presents C^2ROPE, an advanced positional encoding method for 3D Large Multimodal Models, addressing limitations of existing Rot...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.10139] Anonymization-Enhanced Privacy Protection for Mobile GUI Agents: Available but Invisible

The paper presents a novel framework for enhancing privacy protection in mobile GUI agents by anonymizing sensitive data while maintainin...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.08274] Language Modeling and Understanding Through Paraphrase Generation and Detection

The paper explores the role of paraphrase generation and detection in language modeling, emphasizing the need for fine-grained semantic u...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.07672] Debugging code world models

The paper explores Code World Models (CWMs), which simulate program execution and identify error sources, focusing on local semantic exec...

arXiv - AI · 4 min · about 2 months ago

Nlp

[2601.20336] Do Whitepaper Claims Predict Market Behavior? Evidence from Cryptocurrency Factor Analysis

This study examines the relationship between cryptocurrency whitepaper claims and actual market behavior, revealing weak predictive power...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2601.15518] DS@GT at TREC TOT 2025: Bridging Vague Recollection with Fusion Retrieval and Learned Reranking

This paper presents a two-stage retrieval system designed for the TREC Tip-of-the-Tongue task, integrating multiple retrieval methods wit...

arXiv - Machine Learning · 3 min · about 2 months ago

Nlp

[2602.01023] Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment

This paper presents a unified framework for Query Auto-Completion (QAC) that integrates Retrieval-Augmented Generation (RAG) and multi-ob...

arXiv - AI · 4 min · about 2 months ago

Llms

[2601.23232] ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search

ShotFinder introduces a novel benchmark for open-domain video shot retrieval, utilizing LLMs to enhance video search capabilities through...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2511.20974] RosettaSpeech: Zero-Shot Speech-to-Speech Translation without Parallel Speech

RosettaSpeech introduces a zero-shot framework for speech-to-speech translation, overcoming the need for parallel speech data by using mo...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2601.14172] Human Values in a Single Sentence: Moral Presence, Hierarchies, and Transformer Ensembles on the Schwartz Continuum

This article explores the detection of 19 human values in sentences using transformer models, demonstrating the learnability of moral pre...

arXiv - AI · 4 min · about 2 months ago

Llms

[2601.12522] Improved Bug Localization with AI Agents Leveraging Hypothesis and Dynamic Cognition

The paper presents CogniGent, a novel AI technique for bug localization that enhances traditional methods by leveraging causal reasoning ...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 117 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Natural Language Processing

Top This Week

[2603.24326] Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing

[2601.13508] Autonomous Computational Catalysis Research via Agentic Systems

[2510.20847] Integrated representational signatures strengthen specificity in brains and models

All Content

[2602.15183] Seeing to Generalize: How Visual Data Corrects Binding Shortcuts

[2602.15155] Refine Now, Query Fast: A Decoupled Refinement Paradigm for Implicit Neural Fields

[2602.15089] Hybrid Feature Learning with Time Series Embeddings for Equipment Anomaly Prediction

[D] Semantic Compression Vectors in LLMs: A Field Study on Topic Persistence in 5.1 vs 4o Models

[D] We tested the same INT8 model on 5 Snapdragon chipsets. Accuracy ranged from 93% to 71%. Same weights, same ONNX file.

Live demo: This is what AI shopping actually looks like when stores serve structured data via UCP

NVIDIA Nemotron 2 Nano 9B Japanese: 日本のソブリンAIを支える最先端小規模言語モデル

Cohere launches a family of open multilingual models | TechCrunch

[2602.10058] Evaluating Disentangled Representations for Controllable Music Generation

[2602.10551] C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning

[2602.10139] Anonymization-Enhanced Privacy Protection for Mobile GUI Agents: Available but Invisible

[2602.08274] Language Modeling and Understanding Through Paraphrase Generation and Detection

[2602.07672] Debugging code world models

[2601.20336] Do Whitepaper Claims Predict Market Behavior? Evidence from Cryptocurrency Factor Analysis

[2601.15518] DS@GT at TREC TOT 2025: Bridging Vague Recollection with Fusion Retrieval and Learned Reranking

[2602.01023] Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment

[2601.23232] ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search

[2511.20974] RosettaSpeech: Zero-Shot Speech-to-Speech Translation without Parallel Speech

[2601.14172] Human Values in a Single Sentence: Moral Presence, Hierarchies, and Transformer Ensembles on the Schwartz Continuum

[2601.12522] Improved Bug Localization with AI Agents Leveraging Hypothesis and Dynamic Cognition

Related Topics

Stay updated with AI News