Generative AI

Image, video, audio, and text generation

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · about 3 hours ago

Machine Learning

[2603.14294] Seeking Physics in Diffusion Noise

Abstract page for arXiv paper 2603.14294: Seeking Physics in Diffusion Noise

arXiv - Machine Learning · 3 min · about 5 hours ago

Machine Learning

[2512.22854] ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning

Abstract page for arXiv paper 2512.22854: ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum ...

arXiv - Machine Learning · 4 min · about 5 hours ago

All Content

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · about 3 hours ago

Machine Learning

[2603.14294] Seeking Physics in Diffusion Noise

Abstract page for arXiv paper 2603.14294: Seeking Physics in Diffusion Noise

arXiv - Machine Learning · 3 min · about 5 hours ago

Machine Learning

[2512.22854] ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning

Abstract page for arXiv paper 2512.22854: ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum ...

arXiv - Machine Learning · 4 min · about 5 hours ago

Machine Learning

[2601.08881] TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts

Abstract page for arXiv paper 2601.08881: TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts

arXiv - AI · 4 min · about 5 hours ago

Machine Learning

[2510.14989] Constrained Diffusion for Protein Design with Hard Structural Constraints

Abstract page for arXiv paper 2510.14989: Constrained Diffusion for Protein Design with Hard Structural Constraints

arXiv - Machine Learning · 3 min · about 5 hours ago

Llms

[2509.24296] DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models

Abstract page for arXiv paper 2509.24296: DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models

arXiv - AI · 4 min · about 5 hours ago

Machine Learning

[2505.21545] Corruption-Aware Training of Latent Video Diffusion Models for Robust Text-to-Video Generation

Abstract page for arXiv paper 2505.21545: Corruption-Aware Training of Latent Video Diffusion Models for Robust Text-to-Video Generation

arXiv - Machine Learning · 4 min · about 5 hours ago

Machine Learning

[2508.19897] The Information Dynamics of Generative Diffusion

Abstract page for arXiv paper 2508.19897: The Information Dynamics of Generative Diffusion

arXiv - Machine Learning · 4 min · about 5 hours ago

Machine Learning

[2401.11605] Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers

Abstract page for arXiv paper 2401.11605: Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers

arXiv - Machine Learning · 3 min · about 5 hours ago

Machine Learning

[2402.12760] A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis

Abstract page for arXiv paper 2402.12760: A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis

arXiv - AI · 4 min · about 5 hours ago

Llms

[2510.18087] Planned Diffusion

Abstract page for arXiv paper 2510.18087: Planned Diffusion

arXiv - AI · 4 min · about 5 hours ago

Machine Learning

[2511.14961] Graph Memory: A Structured and Interpretable Framework for Modality-Agnostic Embedding-Based Inference

Abstract page for arXiv paper 2511.14961: Graph Memory: A Structured and Interpretable Framework for Modality-Agnostic Embedding-Based In...

arXiv - Machine Learning · 4 min · about 5 hours ago

Machine Learning

[2603.25730] PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Abstract page for arXiv paper 2603.25730: PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

arXiv - AI · 4 min · about 5 hours ago

Generative Ai

[2603.25728] PixelSmile: Toward Fine-Grained Facial Expression Editing

Abstract page for arXiv paper 2603.25728: PixelSmile: Toward Fine-Grained Facial Expression Editing

arXiv - AI · 3 min · about 5 hours ago

Machine Learning

[2510.12453] Time-Correlated Video Bridge Matching

Abstract page for arXiv paper 2510.12453: Time-Correlated Video Bridge Matching

arXiv - Machine Learning · 3 min · about 5 hours ago

Machine Learning

[2603.25462] Temporally Decoupled Diffusion Planning for Autonomous Driving

Abstract page for arXiv paper 2603.25462: Temporally Decoupled Diffusion Planning for Autonomous Driving

arXiv - AI · 3 min · about 5 hours ago

Machine Learning

[2603.25209] Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction

Abstract page for arXiv paper 2603.25209: Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction

arXiv - AI · 4 min · about 5 hours ago

Machine Learning

[2603.25109] MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness

Abstract page for arXiv paper 2603.25109: MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness

arXiv - AI · 4 min · about 5 hours ago

Machine Learning

[2603.24764] Synthetic Cardiac MRI Image Generation using Deep Generative Models

Abstract page for arXiv paper 2603.24764: Synthetic Cardiac MRI Image Generation using Deep Generative Models

arXiv - Machine Learning · 3 min · about 5 hours ago

Generative Ai

[2603.24965] Self-Corrected Image Generation with Explainable Latent Rewards

Abstract page for arXiv paper 2603.24965: Self-Corrected Image Generation with Explainable Latent Rewards

arXiv - AI · 3 min · about 5 hours ago

Page 1 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Generative AI

Top This Week

Accelerating science with AI and simulations

[2603.14294] Seeking Physics in Diffusion Noise

[2512.22854] ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning

All Content

Accelerating science with AI and simulations

[2603.14294] Seeking Physics in Diffusion Noise

[2512.22854] ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning

[2601.08881] TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts

[2510.14989] Constrained Diffusion for Protein Design with Hard Structural Constraints

[2509.24296] DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models

[2505.21545] Corruption-Aware Training of Latent Video Diffusion Models for Robust Text-to-Video Generation

[2508.19897] The Information Dynamics of Generative Diffusion

[2401.11605] Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers

[2402.12760] A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis

[2510.18087] Planned Diffusion

[2511.14961] Graph Memory: A Structured and Interpretable Framework for Modality-Agnostic Embedding-Based Inference

[2603.25730] PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

[2603.25728] PixelSmile: Toward Fine-Grained Facial Expression Editing

[2510.12453] Time-Correlated Video Bridge Matching

[2603.25462] Temporally Decoupled Diffusion Planning for Autonomous Driving

[2603.25209] Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction

[2603.25109] MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness

[2603.24764] Synthetic Cardiac MRI Image Generation using Deep Generative Models

[2603.24965] Self-Corrected Image Generation with Explainable Latent Rewards

Related Topics

Stay updated with AI News