Generative AI

Image, video, audio, and text generation

Top This Week

[2602.08277] PISCO: Precise Video Instance Insertion with Sparse Control
Generative AI

Abstract page for arXiv paper 2602.08277: PISCO: Precise Video Instance Insertion with Sparse Control

arXiv - AI · 4 min ·
[2511.18746] Any4D: Open-Prompt 4D Generation from Natural Language and Images
Machine Learning

Abstract page for arXiv paper 2511.18746: Any4D: Open-Prompt 4D Generation from Natural Language and Images

arXiv - AI · 4 min ·
[2512.14549] Dual-objective Language Models: Training Efficiency Without Overfitting
LLMs

Abstract page for arXiv paper 2512.14549: Dual-objective Language Models: Training Efficiency Without Overfitting

arXiv - AI · 3 min ·

All Content

[2603.02531] Bridging Diffusion Guidance and Anderson Acceleration via Hopfield Dynamics
Machine Learning

Abstract page for arXiv paper 2603.02531: Bridging Diffusion Guidance and Anderson Acceleration via Hopfield Dynamics

arXiv - AI · 3 min ·
[2603.02447] Spectral Regularization for Diffusion Models
Machine Learning

Abstract page for arXiv paper 2603.02447: Spectral Regularization for Diffusion Models

arXiv - Machine Learning · 3 min ·
[2603.02348] Diffusion-MPC in Discrete Domains: Feasibility Constraints, Horizon Effects, and Critic Alignment: Case study with Tetris
Machine Learning

Abstract page for arXiv paper 2603.02348: Diffusion-MPC in Discrete Domains: Feasibility Constraints, Horizon Effects, and Critic Alignme...

arXiv - AI · 4 min ·
[2603.02337] Preconditioned Score and Flow Matching
Machine Learning

Abstract page for arXiv paper 2603.02337: Preconditioned Score and Flow Matching

arXiv - AI · 3 min ·
[2603.02542] AnchorDrive: LLM Scenario Rollout with Anchor-Guided Diffusion Regeneration for Safety-Critical Scenario Generation
LLMs

Abstract page for arXiv paper 2603.02542: AnchorDrive: LLM Scenario Rollout with Anchor-Guided Diffusion Regeneration for Safety-Critical...

arXiv - AI · 4 min ·
[2603.02230] Generalized Discrete Diffusion with Self-Correction
Machine Learning

Abstract page for arXiv paper 2603.02230: Generalized Discrete Diffusion with Self-Correction

arXiv - AI · 3 min ·
LLMs

[D] Quantified analysis of 2,218 Gary Marcus claims - two independent LLM pipelines, scored against evidence

Built a dataset scoring every testable claim from Marcus's 474 Substack posts. Two pipelines (Claude Opus 4.6 and ChatGPT Codex) analyzed...

Reddit - Machine Learning · 1 min ·
PRX Part 3 — Training a Text-to-Image Model in 24h!
Open Source AI

A Blog post by Photoroom on Hugging Face

Hugging Face Blog · 8 min ·
[2601.18685] LLAMA LIMA: A Living Meta-Analysis on the Effects of Generative AI on Learning Mathematics
LLMs

Abstract page for arXiv paper 2601.18685: LLAMA LIMA: A Living Meta-Analysis on the Effects of Generative AI on Learning Mathematics

arXiv - Machine Learning · 3 min ·
[2511.01266] MotionStream: Real-Time Video Generation with Interactive Motion Controls
Machine Learning

Abstract page for arXiv paper 2511.01266: MotionStream: Real-Time Video Generation with Interactive Motion Controls

arXiv - Machine Learning · 4 min ·
[2510.08409] Optimal Stopping in Latent Diffusion Models
Machine Learning

Abstract page for arXiv paper 2510.08409: Optimal Stopping in Latent Diffusion Models

arXiv - Machine Learning · 4 min ·
[2509.22459] Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs)
Machine Learning

Abstract page for arXiv paper 2509.22459: Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs)

arXiv - Machine Learning · 4 min ·
[2507.06547] Concept-TRAK: Understanding how diffusion models learn concepts through concept-level attribution
Machine Learning

Abstract page for arXiv paper 2507.06547: Concept-TRAK: Understanding how diffusion models learn concepts through concept-level attribution

arXiv - Machine Learning · 3 min ·
[2503.07197] Effective and Efficient Masked Image Generation Models
Machine Learning

Abstract page for arXiv paper 2503.07197: Effective and Efficient Masked Image Generation Models

arXiv - Machine Learning · 3 min ·
[2601.08011] TP-Blend: Textual-Prompt Attention Pairing for Precise Object-Style Blending in Diffusion Models
Machine Learning

Abstract page for arXiv paper 2601.08011: TP-Blend: Textual-Prompt Attention Pairing for Precise Object-Style Blending in Diffusion Models

arXiv - Machine Learning · 4 min ·
[2512.14341] Towards Transferable Defense Against Malicious Image Edits
Machine Learning

Abstract page for arXiv paper 2512.14341: Towards Transferable Defense Against Malicious Image Edits

arXiv - Machine Learning · 4 min ·
[2601.23280] Decoupled Diffusion Sampling for Inverse Problems on Function Spaces
Machine Learning

Abstract page for arXiv paper 2601.23280: Decoupled Diffusion Sampling for Inverse Problems on Function Spaces

arXiv - Machine Learning · 3 min ·
[2512.15657] SoFlow: Solution Flow Models for One-Step Generative Modeling
Machine Learning

Abstract page for arXiv paper 2512.15657: SoFlow: Solution Flow Models for One-Step Generative Modeling

arXiv - Machine Learning · 3 min ·
[2510.26818] GACA-DiT: Diffusion-based Dance-to-Music Generation with Genre-Adaptive Rhythm and Context-Aware Alignment
Generative AI

Abstract page for arXiv paper 2510.26818: GACA-DiT: Diffusion-based Dance-to-Music Generation with Genre-Adaptive Rhythm and Context-Awar...

arXiv - AI · 4 min ·
[2510.26585] Stop Wasting Your Tokens: Towards Efficient Runtime Multi-Agent Systems
Generative AI

Abstract page for arXiv paper 2510.26585: Stop Wasting Your Tokens: Towards Efficient Runtime Multi-Agent Systems

arXiv - AI · 3 min ·