Natural Language Processing

Text understanding and language tasks

Top This Week

[2603.24326] Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing
LLMs

arXiv - AI · 4 min

[2601.13508] Autonomous Computational Catalysis Research via Agentic Systems
NLP

arXiv - AI · 3 min

[2510.20847] Integrated representational signatures strengthen specificity in brains and models
Machine Learning

arXiv - AI · 4 min

All Content

[2602.15183] Seeing to Generalize: How Visual Data Corrects Binding Shortcuts
LLMs

This article explores how Vision Language Models (VLMs) enhance performance on text-only tasks by correcting binding shortcuts through vi...

arXiv - Machine Learning · 4 min

[2602.15155] Refine Now, Query Fast: A Decoupled Refinement Paradigm for Implicit Neural Fields
Machine Learning

The paper presents a Decoupled Representation Refinement (DRR) paradigm for Implicit Neural Representations (INRs), enhancing speed and f...

arXiv - Machine Learning · 4 min

[2602.15089] Hybrid Feature Learning with Time Series Embeddings for Equipment Anomaly Prediction
Machine Learning

This study presents a hybrid approach for equipment anomaly prediction by combining time series embeddings with statistical features, ach...

arXiv - Machine Learning · 4 min

LLMs

[D] Semantic Compression Vectors in LLMs: A Field Study on Topic Persistence in 5.1 vs 4o Models

This article explores the effectiveness of Semantic Compression Vectors (SCVs) in large language models (LLMs), comparing the 5.1 and 4o ...

Reddit - Machine Learning · 1 min

Machine Learning

[D] We tested the same INT8 model on 5 Snapdragon chipsets. Accuracy ranged from 93% to 71%. Same weights, same ONNX file.

This article presents findings from testing an INT8 model across five Snapdragon chipsets, revealing significant variations in accuracy, ...

Reddit - Machine Learning · 1 min

LLMs

Live demo: This is what AI shopping actually looks like when stores serve structured data via UCP

This article discusses a live demo showcasing AI shopping experiences using structured data via the Universal Commerce Protocol (UCP), hi...

Reddit - Artificial Intelligence · 1 min

NVIDIA Nemotron 2 Nano 9B Japanese: A State-of-the-Art Small Language Model Supporting Japan's Sovereign AI
Open Source AI

NVIDIA has launched the Nemotron-Nano-9B-v2-Japanese, a lightweight language model designed to enhance Japanese language understanding an...

Hugging Face Blog · 2 min

Cohere launches a family of open multilingual models | TechCrunch
Machine Learning

Cohere has launched Tiny Aya, a family of open multilingual models that support over 70 languages and can run on everyday devices, enhanc...

TechCrunch - AI · 5 min

[2602.10058] Evaluating Disentangled Representations for Controllable Music Generation
Machine Learning

This article evaluates disentangled representations in music generation, focusing on their effectiveness for controllable synthesis and i...

arXiv - Machine Learning · 3 min

[2602.10551] C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning
LLMs

The paper presents C^2ROPE, an advanced positional encoding method for 3D Large Multimodal Models, addressing limitations of existing Rot...

arXiv - AI · 4 min

[2602.10139] Anonymization-Enhanced Privacy Protection for Mobile GUI Agents: Available but Invisible
LLMs

The paper presents a novel framework for enhancing privacy protection in mobile GUI agents by anonymizing sensitive data while maintainin...

arXiv - AI · 4 min

[2602.08274] Language Modeling and Understanding Through Paraphrase Generation and Detection
LLMs

The paper explores the role of paraphrase generation and detection in language modeling, emphasizing the need for fine-grained semantic u...

arXiv - AI · 4 min

[2602.07672] Debugging code world models
LLMs

The paper explores Code World Models (CWMs), which simulate program execution and identify error sources, focusing on local semantic exec...

arXiv - AI · 4 min

[2601.20336] Do Whitepaper Claims Predict Market Behavior? Evidence from Cryptocurrency Factor Analysis
NLP

This study examines the relationship between cryptocurrency whitepaper claims and actual market behavior, revealing weak predictive power...

arXiv - Machine Learning · 3 min

[2601.15518] DS@GT at TREC TOT 2025: Bridging Vague Recollection with Fusion Retrieval and Learned Reranking
LLMs

This paper presents a two-stage retrieval system designed for the TREC Tip-of-the-Tongue task, integrating multiple retrieval methods wit...

arXiv - Machine Learning · 3 min

[2602.01023] Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment
NLP

This paper presents a unified framework for Query Auto-Completion (QAC) that integrates Retrieval-Augmented Generation (RAG) and multi-ob...

arXiv - AI · 4 min

[2601.23232] ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search
LLMs

ShotFinder introduces a novel benchmark for open-domain video shot retrieval, utilizing LLMs to enhance video search capabilities through...

arXiv - AI · 4 min

[2511.20974] RosettaSpeech: Zero-Shot Speech-to-Speech Translation without Parallel Speech
Machine Learning

RosettaSpeech introduces a zero-shot framework for speech-to-speech translation, overcoming the need for parallel speech data by using mo...

arXiv - Machine Learning · 4 min

[2601.14172] Human Values in a Single Sentence: Moral Presence, Hierarchies, and Transformer Ensembles on the Schwartz Continuum
Machine Learning

This article explores the detection of 19 human values in sentences using transformer models, demonstrating the learnability of moral pre...

arXiv - AI · 4 min

[2601.12522] Improved Bug Localization with AI Agents Leveraging Hypothesis and Dynamic Cognition
LLMs

The paper presents CogniGent, a novel AI technique for bug localization that enhances traditional methods by leveraging causal reasoning ...

arXiv - Machine Learning · 4 min