[2602.08277] PISCO: Precise Video Instance Insertion with Sparse Control
Abstract page for arXiv paper 2602.08277: PISCO: Precise Video Instance Insertion with Sparse Control
Image, video, audio, and text generation
Abstract page for arXiv paper 2602.08277: PISCO: Precise Video Instance Insertion with Sparse Control
Abstract page for arXiv paper 2511.18746: Any4D: Open-Prompt 4D Generation from Natural Language and Images
Abstract page for arXiv paper 2512.14549: Dual-objective Language Models: Training Efficiency Without Overfitting
Abstract page for arXiv paper 2507.08965: Improving Classifier-Free Guidance in Masked Diffusion: Low-Dim Theoretical Insights with High-...
Abstract page for arXiv paper 2312.15490: Diffusion-EXR: Controllable Review Generation for Explainable Recommendation via Diffusion Models
Abstract page for arXiv paper 2506.05668: RNE: plug-and-play diffusion inference-time control and energy-based training
Abstract page for arXiv paper 2505.20934: NatADiff: Adversarial Boundary Guidance for Natural Adversarial Diffusion
Abstract page for arXiv paper 2603.03281: CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance
Abstract page for arXiv paper 2603.03163: Conditioned Activation Transport for T2I Safety Steering
Abstract page for arXiv paper 2603.03143: Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing
Abstract page for arXiv paper 2603.02829: Toward Early Quality Assessment of Text-to-Image Diffusion Models
Abstract page for arXiv paper 2603.03074: Design Generative AI for Practitioners: Exploring Interaction Approaches Aligned with Creative ...
Abstract page for arXiv paper 2603.02667: DREAM: Where Visual Understanding Meets Text-to-Image Generation
Abstract page for arXiv paper 2603.02919: Interpretable Motion-Attentive Maps: Spatio-Temporally Localizing Concepts in Video Diffusion T...
Abstract page for arXiv paper 2603.02816: BrandFusion: A Multi-Agent Framework for Seamless Brand Integration in Text-to-Video Generation
Abstract page for arXiv paper 2603.02760: Efficient Self-Evaluation for Diffusion Language Models via Sequence Regeneration
Abstract page for arXiv paper 2603.02417: Fisher-Geometric Diffusion in Stochastic Gradient Descent: Optimal Rates, Oracle Complexity, an...
Abstract page for arXiv paper 2603.02697: ShareVerse: Multi-Agent Consistent Video Generation for Shared World Modeling
Abstract page for arXiv paper 2603.02547: CoDAR: Continuous Diffusion Language Models are More Powerful Than You Think
Abstract page for arXiv paper 2603.03238: On Geometry Regularization in Autoencoder Reduced-Order Models with Latent Neural ODE Dynamics
Abstract page for arXiv paper 2603.02650: Improving Diffusion Planners by Self-Supervised Action Gating with Energies
Abstract page for arXiv paper 2603.02613: Real-Time Generative Policy via Langevin-Guided Flow Matching for Autonomous Driving
Abstract page for arXiv paper 2603.03147: Agentic AI-based Coverage Closure for Formal Verification
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime