[2603.17205] OPERA: Online Data Pruning for Efficient Retrieval Model Adaptation
Abstract page for arXiv paper 2603.17205: OPERA: Online Data Pruning for Efficient Retrieval Model Adaptation
Text understanding and language tasks
Abstract page for arXiv paper 2603.17205: OPERA: Online Data Pruning for Efficient Retrieval Model Adaptation
Abstract page for arXiv paper 2512.02418: Leveraging Large Language Models to Bridge Cross-Domain Transparency in Stablecoins
Abstract page for arXiv paper 2510.14377: PluriHopRAG: Exhaustive, Recall-Sensitive QA Through Corpus-Specific Document Structure Learning
This article presents a novel approach to improving masked diffusion models (MDMs) for language modeling by introducing a learned schedul...
This article explores the statistical advantages of softmax attention mechanisms in large language models, particularly in single-locatio...
The paper presents SPARTA, a novel framework for generating scalable benchmarks for tree-structured multi-hop question answering (QA) ove...
This article presents rBridge, a small proxy model that predicts reasoning performance in large language models (LLMs), demonstrating sig...
The paper presents MovieTeller, a novel framework for generating movie synopses using tool-augmented progressive abstraction to enhance c...
This paper investigates why Diffusion Language Models (DLMs) often default to autoregressive decoding instead of utilizing their potentia...
This article explores RL-Obfuscation, a method for training language models to evade latent-space monitors that detect undesirable behavi...
This study examines the relationship between fluency and accuracy in L2 Mandarin prosody, revealing that while learners may achieve quant...
This paper presents a novel approach to long-form Bengali Automatic Speech Recognition (ASR) and speaker diarization, introducing a compr...
The paper introduces Affine-Scaled Attention, a novel approach to Transformer attention that enhances flexibility and stability by modify...
This paper presents a detailed analysis of offline policy learning in contextual bandits, focusing on $f$-divergence regularization and i...
This paper presents a novel framework that utilizes language models to guide symbolic regression in discovering interpretable physical la...
This paper presents a robust framework for Bangla Automatic Speech Recognition (ASR) and Speaker Diarization, addressing challenges in pr...
This paper introduces a framework connecting multi-chart autoencoders with vector bundles and characteristic classes, enhancing manifold ...
The paper presents a novel framework called Stitching Noisy Diffusion Thoughts, which enhances reasoning in large language models by comb...
This paper evaluates the effectiveness of small language models (SLMs) in leader-follower interactions, comparing zero-shot and one-shot ...
The paper presents ManifoldGD, a training-free framework for dataset distillation using hierarchical manifold guidance, improving efficie...
The article presents TCM-DiffRAG, a novel reasoning framework for Traditional Chinese Medicine (TCM) that enhances diagnosis through know...
This article discusses a novel approach to enhancing app store ranking by integrating LLM-generated textual relevance labels with behavio...
This paper explores the impact of fine-tuning on in-context learning in linear attention models, revealing conditions that can enhance or...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime