Generative AI

Image, video, audio, and text generation

Top This Week

[2602.08277] PISCO: Precise Video Instance Insertion with Sparse Control
Generative AI
arXiv - AI · 4 min
[2511.18746] Any4D: Open-Prompt 4D Generation from Natural Language and Images
Machine Learning
arXiv - AI · 4 min
[2512.14549] Dual-objective Language Models: Training Efficiency Without Overfitting
LLMs
arXiv - AI · 3 min

All Content

[2603.03714] Order Is Not Layout: Order-to-Space Bias in Image Generation
Machine Learning
arXiv - AI · 3 min
[2603.03700] Generalization Properties of Score-matching Diffusion Models for Intrinsically Low-dimensional Data
Machine Learning
arXiv - Machine Learning · 4 min
[2603.03692] Error as Signal: Stiffness-Aware Diffusion Sampling via Embedded Runge-Kutta Guidance
Machine Learning
arXiv - AI · 3 min
[2603.04064] Tuning Just Enough: Lightweight Backdoor Attacks on Multi-Encoder Diffusion Models
Machine Learning
arXiv - Machine Learning · 4 min
[2603.03973] Dual-Solver: A Generalized ODE Solver for Diffusion Models with Dual Prediction
Machine Learning
arXiv - Machine Learning · 3 min
[2603.03505] PhyPrompt: RL-based Prompt Refinement for Physically Plausible Text-to-Video Generation
Machine Learning
arXiv - AI · 4 min
[2603.03485] Phys4D: Fine-Grained Physics-Consistent 4D Modeling from Video Diffusion
Machine Learning
arXiv - AI · 4 min
[2603.03469] Biased Generalization in Diffusion Models
Machine Learning
arXiv - Machine Learning · 4 min
[2603.03970] Generative AI in Managerial Decision-Making: Redefining Boundaries through Ambiguity Resolution and Sycophancy Analysis
Machine Learning
arXiv - AI · 4 min
[2510.08946] Physically Valid Biomolecular Interaction Modeling with Gauss-Seidel Projection
LLMs
arXiv - Machine Learning · 4 min
[2602.04898] Semantic-level Backdoor Attack against Text-to-Image Diffusion Models
Machine Learning
arXiv - AI · 3 min
[2412.09646] RealOSR: Latent Guidance Boosts Diffusion-based Real-world Omnidirectional Image Super-Resolutions
Machine Learning
arXiv - Machine Learning · 4 min
[2510.14765] Inpainting the Red Planet: Diffusion Models for the Reconstruction of Martian Environments in Virtual Reality
Machine Learning
arXiv - AI · 4 min
[2602.12274] Function-Space Decoupled Diffusion for Forward and Inverse Modeling in Carbon Capture and Storage
Machine Learning
arXiv - Machine Learning · 4 min
[2511.07970] Continual Unlearning for Text-to-Image Diffusion Models: A Regularization Perspective
Machine Learning
arXiv - Machine Learning · 4 min
[2510.04573] LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning
LLMs
arXiv - Machine Learning · 4 min
[2510.02692] Fine-Tuning Diffusion Models via Intermediate Distribution Shaping
Machine Learning
arXiv - Machine Learning · 4 min
[2506.07177] Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models
Machine Learning
arXiv - AI · 3 min
[2509.23348] Entering the Era of Discrete Diffusion Models: A Benchmark for Schrödinger Bridges and Entropic Optimal Transport
Machine Learning
arXiv - Machine Learning · 4 min
[2509.23265] CREPE: Controlling Diffusion with Replica Exchange
Machine Learning
arXiv - Machine Learning · 3 min
