Generative AI

Image, video, audio, and text generation

Top This Week

[2602.08277] PISCO: Precise Video Instance Insertion with Sparse Control
Generative AI

arXiv - AI · 4 min ·
[2511.18746] Any4D: Open-Prompt 4D Generation from Natural Language and Images
Machine Learning

arXiv - AI · 4 min ·
[2512.14549] Dual-objective Language Models: Training Efficiency Without Overfitting
LLMs

arXiv - AI · 3 min ·

All Content

[2508.12811] Next Visual Granularity Generation
Generative AI

arXiv - Machine Learning · 4 min ·
[2508.04663] HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models
Machine Learning

arXiv - AI · 4 min ·
[2507.02314] MAGIC: Few-Shot Mask-Guided Anomaly Inpainting with Prompt Perturbation, Spatially Adaptive Guidance, and Context Awareness
Machine Learning

arXiv - AI · 4 min ·
[2507.00445] Iterative Distillation for Reward-Guided Fine-Tuning of Diffusion Models in Biomolecular Design
Machine Learning

arXiv - Machine Learning · 4 min ·
[2506.24108] Navigating with Annealing Guidance Scale in Diffusion Space
Machine Learning

arXiv - Machine Learning · 4 min ·
[2504.14814] A Diagnostic Evaluation of Neural Networks Trained with the Error Diffusion Learning Algorithm
Machine Learning

arXiv - Machine Learning · 4 min ·
[2505.22973] EquiReg: Equivariance Regularized Diffusion for Inverse Problems
Machine Learning

arXiv - Machine Learning · 4 min ·
[2502.21278] Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion
Machine Learning

arXiv - Machine Learning · 4 min ·
[2505.17561] Model Already Knows the Best Noise: Bayesian Active Noise Selection via Attention in Video Diffusion Model
Machine Learning

arXiv - AI · 4 min ·
[2503.09642] Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k
Machine Learning

arXiv - AI · 4 min ·
[2404.08480] Using ChatGPT for Data Science Analyses
LLMs

arXiv - Machine Learning · 3 min ·
[2404.00962] Distributional Priors Guided Diffusion for Generating 3D Molecules in Low Data Regimes
Generative AI

arXiv - Machine Learning · 4 min ·
[2603.01623] Adaptive Spectral Feature Forecasting for Diffusion Sampling Acceleration
Machine Learning

arXiv - Machine Learning · 4 min ·
[2509.23589] BridgeDrive: Diffusion Bridge Policy for Closed-Loop Trajectory Planning in Autonomous Driving
Machine Learning

arXiv - Machine Learning · 4 min ·
[2506.12664] Behavioral Generative Agents for Energy Operations
Machine Learning

arXiv - AI · 4 min ·
[2603.01068] LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model
Machine Learning

arXiv - Machine Learning · 3 min ·
[2603.01019] BadRSSD: Backdoor Attacks on Regularized Self-Supervised Diffusion Models
Machine Learning

arXiv - Machine Learning · 4 min ·
[2603.00772] Initialization-Aware Score-Based Diffusion Sampling
Machine Learning

arXiv - Machine Learning · 3 min ·
[2603.02190] Sketch2Colab: Sketch-Conditioned Multi-Human Animation via Controllable Flow Distillation
Machine Learning

arXiv - Machine Learning · 3 min ·
[2603.02129] LiftAvatar: Kinematic-Space Completion for Expression-Controlled 3D Gaussian Avatar Animation
Machine Learning

arXiv - AI · 4 min ·