[2603.24326] Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing
Abstract page for arXiv paper 2603.24326: Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing
Text understanding and language tasks
Abstract page for arXiv paper 2603.24326: Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing
Abstract page for arXiv paper 2601.13508: Autonomous Computational Catalysis Research via Agentic Systems
Abstract page for arXiv paper 2510.20847: Integrated representational signatures strengthen specificity in brains and models
This article presents a hybrid framework for improving neural machine translation performance in low-resource languages, specifically add...
This article explores the evolving relationship between human creativity and AI, particularly in writing, highlighting how authors adapt ...
ArtistMus introduces a benchmark for music question answering, leveraging a diverse dataset to enhance retrieval-augmented generation mod...
The paper presents Robust Reward Policy Optimization (RRPO), a novel framework designed to enhance emotional text-to-speech (TTS) systems...
This article presents a study on deep learning techniques for detecting clouds and cloud shadows in methane satellite and airborne imagin...
This paper presents FlexGT, a method for optimizing distributed stochastic problems by balancing communication and computation, achieving...
This paper presents a novel framework for ranking node importance in complex networks using influence-aware causal node embedding, enhanc...
The paper presents AthenaBench, a dynamic benchmark designed to evaluate large language models (LLMs) in the context of Cyber Threat Inte...
The paper presents a novel framework for batch speculative decoding, addressing critical failures in existing methods and achieving signi...
This paper presents a comparative study of context management strategies for end-to-end Spoken Dialogue State Tracking using Speech-LLMs,...
This article presents a novel approach to offline reinforcement learning (RL) using reward-weighted fine-tuning, enhancing conversation o...
The article introduces OmniEarth-Bench, a comprehensive benchmark for evaluating interactions across Earth's six spheres using multimodal...
This article presents EAPrivacy, a benchmark for evaluating the physical-world privacy awareness of large language models (LLMs), reveali...
The paper introduces ReliabilityRAG, a framework designed to enhance the robustness of Retrieval-Augmented Generation (RAG) systems again...
This paper presents a safety steering framework to enhance the robustness of large language models (LLMs) against multi-turn jailbreaking...
This article examines the evolution of concepts in language model pre-training, revealing how feature development influences performance ...
This article presents a diagnostic framework for evaluating synthetic dialogue generation in contact centers, highlighting the limitation...
The paper presents a novel Scaling-Theory-Informed Machine Learning (STIML) framework for predicting company growth by integrating struct...
This article presents a novel approach to simulating cyberattacks by integrating Security Chaos Engineering (SCE) into Breach Attack Simu...
The paper presents PragmaBot, a framework for robotic task planning that utilizes real-world experiences and self-reflection to enhance l...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime