Startup Battlefield 200 applications open until May 27 | TechCrunch
Nominate your startup, or one you know, and apply for a chance at VC access, TechCrunch coverage, and $100K for Startup Battlefield 200.
Text understanding and language tasks
Nominate your startup, or one you know, and apply for a chance at VC access, TechCrunch coverage, and $100K for Startup Battlefield 200.
Abstract page for arXiv paper 2603.24326: Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing
Abstract page for arXiv paper 2601.13508: Autonomous Computational Catalysis Research via Agentic Systems
The paper introduces DenseMLLM, a multimodal large language model designed to perform dense predictions without the need for complex, tas...
The paper presents voice2mode, a method for classifying four singing phonation modes using self-supervised speech models, demonstrating s...
TabTracer introduces a novel Monte Carlo Tree Search framework for enhancing table reasoning in large language models, improving accuracy...
This article presents a methodology for adapting vision-language models to the Polish language using the LLaVA framework, demonstrating s...
This article presents a novel framework for dynamic modeling and forecasting of group-level value evolution using large language models (...
This paper explores Named Entity Recognition (NER) techniques for payment data, presenting advanced models like PaymentBERT that enhance ...
This paper explores the trade-off between sufficiency and conciseness in self-explanations provided by large language models (LLMs), emph...
The paper introduces LiveNewsBench, a benchmark for evaluating the web search capabilities of Large Language Models (LLMs) using freshly ...
The paper discusses a polytopological PDL framework for expressing common knowledge and its implications in epistemic logic, highlighting...
The paper presents SpargeAttention2, a novel trainable sparse attention method that enhances the efficiency of diffusion models by combin...
This paper explores data-driven equation discovery to enhance optimization processes in engineering, introducing the Learned Gradient Flo...
The paper introduces the Generative Speech Reward Model (GSRM), a novel approach to evaluating speech naturalness in AI-generated audio, ...
This paper presents stochastic variance reduced extragradient methods for solving hierarchical variational inequalities, proving converge...
AsyncVLA introduces an asynchronous control framework for robotic navigation, enhancing real-time performance by decoupling semantic reas...
DTBench introduces a synthetic benchmark for evaluating document-to-table extraction capabilities, addressing limitations in existing ben...
The paper introduces OmniScience, a large-scale multi-modal dataset designed to enhance scientific image understanding in AI models, addr...
The paper presents Pailitao-VL, a multi-modal retrieval system designed for real-time industrial search, addressing key challenges in ret...
The paper presents MASFly, a novel framework for dynamic adaptation of LLM-based multi-agent systems at test time, enhancing task perform...
The article presents KorMedMCQA-V, a benchmark dataset for evaluating vision-language models on the Korean Medical Licensing Examination,...
PT-RAG introduces a novel framework for retrieval-augmented generation that maintains the hierarchical structure of academic papers, impr...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime