Generative AI

Image, video, audio, and text generation

Top This Week

[2602.08277] PISCO: Precise Video Instance Insertion with Sparse Control
Generative Ai

[2602.08277] PISCO: Precise Video Instance Insertion with Sparse Control

Abstract page for arXiv paper 2602.08277: PISCO: Precise Video Instance Insertion with Sparse Control

arXiv - AI · 4 min ·
[2511.18746] Any4D: Open-Prompt 4D Generation from Natural Language and Images
Machine Learning

[2511.18746] Any4D: Open-Prompt 4D Generation from Natural Language and Images

Abstract page for arXiv paper 2511.18746: Any4D: Open-Prompt 4D Generation from Natural Language and Images

arXiv - AI · 4 min ·
[2512.14549] Dual-objective Language Models: Training Efficiency Without Overfitting
Llms

[2512.14549] Dual-objective Language Models: Training Efficiency Without Overfitting

Abstract page for arXiv paper 2512.14549: Dual-objective Language Models: Training Efficiency Without Overfitting

arXiv - AI · 3 min ·

All Content

[2510.25976] Brain-IT: Image Reconstruction from fMRI via Brain-Interaction Transformer
Machine Learning

[2510.25976] Brain-IT: Image Reconstruction from fMRI via Brain-Interaction Transformer

Abstract page for arXiv paper 2510.25976: Brain-IT: Image Reconstruction from fMRI via Brain-Interaction Transformer

arXiv - AI · 4 min ·
[2511.19473] WavefrontDiffusion: Dynamic Decoding Schedule for Improved Reasoning
Llms

[2511.19473] WavefrontDiffusion: Dynamic Decoding Schedule for Improved Reasoning

Abstract page for arXiv paper 2511.19473: WavefrontDiffusion: Dynamic Decoding Schedule for Improved Reasoning

arXiv - Machine Learning · 4 min ·
[2510.22835] Clustering by Denoising: Latent plug-and-play diffusion for single-cell data
Generative Ai

[2510.22835] Clustering by Denoising: Latent plug-and-play diffusion for single-cell data

Abstract page for arXiv paper 2510.22835: Clustering by Denoising: Latent plug-and-play diffusion for single-cell data

arXiv - Machine Learning · 4 min ·
[2510.15301] Latent Diffusion Model without Variational Autoencoder
Machine Learning

[2510.15301] Latent Diffusion Model without Variational Autoencoder

Abstract page for arXiv paper 2510.15301: Latent Diffusion Model without Variational Autoencoder

arXiv - AI · 4 min ·
[2510.19304] Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall
Machine Learning

[2510.19304] Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall

Abstract page for arXiv paper 2510.19304: Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall

arXiv - Machine Learning · 3 min ·
[2510.17206] Soft-Masked Diffusion Language Models
Llms

[2510.17206] Soft-Masked Diffusion Language Models

Abstract page for arXiv paper 2510.17206: Soft-Masked Diffusion Language Models

arXiv - Machine Learning · 4 min ·
[2510.13117] On the Reasoning Abilities of Masked Diffusion Language Models
Llms

[2510.13117] On the Reasoning Abilities of Masked Diffusion Language Models

Abstract page for arXiv paper 2510.13117: On the Reasoning Abilities of Masked Diffusion Language Models

arXiv - Machine Learning · 3 min ·
[2510.07940] TTOM: Test-Time Optimization and Memorization for Compositional Video Generation
Llms

[2510.07940] TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

Abstract page for arXiv paper 2510.07940: TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

arXiv - Machine Learning · 4 min ·
[2510.02253] DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing
Machine Learning

[2510.02253] DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing

Abstract page for arXiv paper 2510.02253: DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing

arXiv - Machine Learning · 4 min ·
[2510.01478] Purrception: Variational Flow Matching for Vector-Quantized Image Generation
Generative Ai

[2510.01478] Purrception: Variational Flow Matching for Vector-Quantized Image Generation

Abstract page for arXiv paper 2510.01478: Purrception: Variational Flow Matching for Vector-Quantized Image Generation

arXiv - Machine Learning · 3 min ·
[2509.26364] Data-to-Energy Stochastic Dynamics
Machine Learning

[2509.26364] Data-to-Energy Stochastic Dynamics

Abstract page for arXiv paper 2509.26364: Data-to-Energy Stochastic Dynamics

arXiv - Machine Learning · 4 min ·
[2509.26432] AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size
Llms

[2509.26432] AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size

Abstract page for arXiv paper 2509.26432: AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size

arXiv - Machine Learning · 4 min ·
[2509.23357] Landing with the Score: Riemannian Optimization through Denoising
Generative Ai

[2509.23357] Landing with the Score: Riemannian Optimization through Denoising

Abstract page for arXiv paper 2509.23357: Landing with the Score: Riemannian Optimization through Denoising

arXiv - Machine Learning · 4 min ·
[2509.22957] Doubly-Robust LLM-as-a-Judge: Externally Valid Estimation with Imperfect Personas
Llms

[2509.22957] Doubly-Robust LLM-as-a-Judge: Externally Valid Estimation with Imperfect Personas

Abstract page for arXiv paper 2509.22957: Doubly-Robust LLM-as-a-Judge: Externally Valid Estimation with Imperfect Personas

arXiv - Machine Learning · 4 min ·
[2509.21835] On the $ε$-Free Inference Complexity of Absorbing Discrete Diffusion
Machine Learning

[2509.21835] On the $ε$-Free Inference Complexity of Absorbing Discrete Diffusion

Abstract page for arXiv paper 2509.21835: On the $ε$-Free Inference Complexity of Absorbing Discrete Diffusion

arXiv - Machine Learning · 4 min ·
[2509.21659] RED-DiffEq: Regularization by denoising diffusion models for solving inverse PDE problems with application to full waveform inversion
Machine Learning

[2509.21659] RED-DiffEq: Regularization by denoising diffusion models for solving inverse PDE problems with application to full waveform inversion

Abstract page for arXiv paper 2509.21659: RED-DiffEq: Regularization by denoising diffusion models for solving inverse PDE problems with ...

arXiv - Machine Learning · 3 min ·
[2509.21513] DistillKac: Few-Step Image Generation via Damped Wave Equations
Machine Learning

[2509.21513] DistillKac: Few-Step Image Generation via Damped Wave Equations

Abstract page for arXiv paper 2509.21513: DistillKac: Few-Step Image Generation via Damped Wave Equations

arXiv - Machine Learning · 3 min ·
[2509.21278] Does FLUX Already Know How to Perform Physically Plausible Image Composition?
Machine Learning

[2509.21278] Does FLUX Already Know How to Perform Physically Plausible Image Composition?

Abstract page for arXiv paper 2509.21278: Does FLUX Already Know How to Perform Physically Plausible Image Composition?

arXiv - Machine Learning · 4 min ·
[2509.13789] BWCache: Accelerating Video Diffusion Transformers through Block-Wise Caching
Machine Learning

[2509.13789] BWCache: Accelerating Video Diffusion Transformers through Block-Wise Caching

Abstract page for arXiv paper 2509.13789: BWCache: Accelerating Video Diffusion Transformers through Block-Wise Caching

arXiv - AI · 4 min ·
[2508.16557] Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution
Machine Learning

[2508.16557] Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution

Abstract page for arXiv paper 2508.16557: Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution

arXiv - AI · 4 min ·
Previous Page 18 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime