[2603.17677] Adaptive Guidance for Retrieval-Augmented Masked Diffusion Models
Abstract page for arXiv paper 2603.17677: Adaptive Guidance for Retrieval-Augmented Masked Diffusion Models
Image, video, audio, and text generation
Abstract page for arXiv paper 2603.17677: Adaptive Guidance for Retrieval-Augmented Masked Diffusion Models
Abstract page for arXiv paper 2601.16933: Reward-Forcing: Autoregressive Video Generation with Reward Feedback
Abstract page for arXiv paper 2505.15263: gen2seg: Generative Models Enable Generalizable Instance Segmentation
The paper introduces FeatBench, a new benchmark for evaluating feature-level code generation in Large Language Models (LLMs), addressing ...
The paper presents Quant VideoGen, a framework for autoregressive long video generation that addresses the limitations of KV cache memory...
The article presents PolicyPad, an interactive system designed for collaborative prototyping of policies governing large language models ...
The paper introduces Group-Equivariant Posterior Consistency (GEPC), a method for detecting out-of-distribution data in diffusion models ...
This paper presents a novel framework for out-of-distribution (OOD) detection in molecular complexes using diffusion models tailored for ...
The paper presents FreqPolicy, a novel flow-based visuomotor policy that enhances efficiency in robotic manipulation by imposing frequenc...
This paper presents MedVLSynther, a framework for synthesizing high-quality visual question answering (VQA) from medical documents, enhan...
This article presents a novel approach combining Chain-of-Thought (CoT) and Retrieval Augmented Generation (RAG) to improve rare disease ...
This survey reviews advancements in spatiotemporal consistency in video generation, addressing challenges and methodologies in creating c...
This paper explores the dynamics of Classifier-Free Guidance (CFG) in diffusion models, revealing its effects on sampling processes and d...
This article explores the universal properties of activation sparsity in modern large language models (LLMs), highlighting its implicatio...
FairTabGen introduces a novel framework for generating high-fidelity synthetic healthcare data from limited samples, enhancing fairness a...
This article presents a novel technique for extracting safety classifiers from aligned large language models (LLMs) to address vulnerabil...
PromptGuard introduces a novel method for moderating unsafe content in text-to-image models, enhancing safety without sacrificing image q...
The paper presents MC-LLaVA, a multi-concept personalized vision-language model that enhances user experience by integrating multiple con...
The paper presents Ctrl-GenAug, a novel framework for controllable generative augmentation in medical sequence classification, addressing...
This paper introduces a novel method for evaluating language model agency through negotiation games, addressing limitations of existing b...
The paper introduces DiffusionBlocks, a framework for block-wise training of neural networks that reduces memory bottlenecks while mainta...
The article presents VERA-MH, an open-source evaluation tool designed to assess the safety of AI in mental health contexts, focusing on s...
The paper presents SEISMO, a trajectory-aware LLM agent designed to enhance sample efficiency in molecular optimization, achieving signif...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime