Generative AI

Image, video, audio, and text generation

Top This Week

Generative AI

[2602.08277] PISCO: Precise Video Instance Insertion with Sparse Control

Abstract page for arXiv paper 2602.08277: PISCO: Precise Video Instance Insertion with Sparse Control

arXiv - AI · 4 min · about 15 hours ago

Machine Learning

[2511.18746] Any4D: Open-Prompt 4D Generation from Natural Language and Images

Abstract page for arXiv paper 2511.18746: Any4D: Open-Prompt 4D Generation from Natural Language and Images

arXiv - AI · 4 min · about 15 hours ago

LLMs

[2512.14549] Dual-objective Language Models: Training Efficiency Without Overfitting

Abstract page for arXiv paper 2512.14549: Dual-objective Language Models: Training Efficiency Without Overfitting

arXiv - AI · 3 min · about 15 hours ago

All Content

Machine Learning

[2510.25976] Brain-IT: Image Reconstruction from fMRI via Brain-Interaction Transformer

Abstract page for arXiv paper 2510.25976: Brain-IT: Image Reconstruction from fMRI via Brain-Interaction Transformer

arXiv - AI · 4 min · 27 days ago

LLMs

[2511.19473] WavefrontDiffusion: Dynamic Decoding Schedule for Improved Reasoning

Abstract page for arXiv paper 2511.19473: WavefrontDiffusion: Dynamic Decoding Schedule for Improved Reasoning

arXiv - Machine Learning · 4 min · 27 days ago

Generative AI

[2510.22835] Clustering by Denoising: Latent plug-and-play diffusion for single-cell data

Abstract page for arXiv paper 2510.22835: Clustering by Denoising: Latent plug-and-play diffusion for single-cell data

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2510.15301] Latent Diffusion Model without Variational Autoencoder

Abstract page for arXiv paper 2510.15301: Latent Diffusion Model without Variational Autoencoder

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2510.19304] Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall

Abstract page for arXiv paper 2510.19304: Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall

arXiv - Machine Learning · 3 min · 27 days ago

LLMs

[2510.17206] Soft-Masked Diffusion Language Models

Abstract page for arXiv paper 2510.17206: Soft-Masked Diffusion Language Models

arXiv - Machine Learning · 4 min · 27 days ago

LLMs

[2510.13117] On the Reasoning Abilities of Masked Diffusion Language Models

Abstract page for arXiv paper 2510.13117: On the Reasoning Abilities of Masked Diffusion Language Models

arXiv - Machine Learning · 3 min · 27 days ago

LLMs

[2510.07940] TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

Abstract page for arXiv paper 2510.07940: TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2510.02253] DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing

Abstract page for arXiv paper 2510.02253: DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing

arXiv - Machine Learning · 4 min · 27 days ago

Generative AI

[2510.01478] Purrception: Variational Flow Matching for Vector-Quantized Image Generation

Abstract page for arXiv paper 2510.01478: Purrception: Variational Flow Matching for Vector-Quantized Image Generation

arXiv - Machine Learning · 3 min · 27 days ago

Machine Learning

[2509.26364] Data-to-Energy Stochastic Dynamics

Abstract page for arXiv paper 2509.26364: Data-to-Energy Stochastic Dynamics

arXiv - Machine Learning · 4 min · 27 days ago

LLMs

[2509.26432] AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size

Abstract page for arXiv paper 2509.26432: AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size

arXiv - Machine Learning · 4 min · 27 days ago

Generative AI

[2509.23357] Landing with the Score: Riemannian Optimization through Denoising

Abstract page for arXiv paper 2509.23357: Landing with the Score: Riemannian Optimization through Denoising

arXiv - Machine Learning · 4 min · 27 days ago

LLMs

[2509.22957] Doubly-Robust LLM-as-a-Judge: Externally Valid Estimation with Imperfect Personas

Abstract page for arXiv paper 2509.22957: Doubly-Robust LLM-as-a-Judge: Externally Valid Estimation with Imperfect Personas

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2509.21835] On the $ε$-Free Inference Complexity of Absorbing Discrete Diffusion

Abstract page for arXiv paper 2509.21835: On the $ε$-Free Inference Complexity of Absorbing Discrete Diffusion

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2509.21659] RED-DiffEq: Regularization by denoising diffusion models for solving inverse PDE problems with application to full waveform inversion

Abstract page for arXiv paper 2509.21659: RED-DiffEq: Regularization by denoising diffusion models for solving inverse PDE problems with ...

arXiv - Machine Learning · 3 min · 27 days ago

Machine Learning

[2509.21513] DistillKac: Few-Step Image Generation via Damped Wave Equations

Abstract page for arXiv paper 2509.21513: DistillKac: Few-Step Image Generation via Damped Wave Equations

arXiv - Machine Learning · 3 min · 27 days ago

Machine Learning

[2509.21278] Does FLUX Already Know How to Perform Physically Plausible Image Composition?

Abstract page for arXiv paper 2509.21278: Does FLUX Already Know How to Perform Physically Plausible Image Composition?

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2509.13789] BWCache: Accelerating Video Diffusion Transformers through Block-Wise Caching

Abstract page for arXiv paper 2509.13789: BWCache: Accelerating Video Diffusion Transformers through Block-Wise Caching

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2508.16557] Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution

Abstract page for arXiv paper 2508.16557: Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution

arXiv - AI · 4 min · 27 days ago
