Generative AI

Image, video, audio, and text generation

Top This Week

[2602.08277] PISCO: Precise Video Instance Insertion with Sparse Control
Generative Ai

[2602.08277] PISCO: Precise Video Instance Insertion with Sparse Control

Abstract page for arXiv paper 2602.08277: PISCO: Precise Video Instance Insertion with Sparse Control

arXiv - AI · 4 min ·
[2511.18746] Any4D: Open-Prompt 4D Generation from Natural Language and Images
Machine Learning

[2511.18746] Any4D: Open-Prompt 4D Generation from Natural Language and Images

Abstract page for arXiv paper 2511.18746: Any4D: Open-Prompt 4D Generation from Natural Language and Images

arXiv - AI · 4 min ·
[2512.14549] Dual-objective Language Models: Training Efficiency Without Overfitting
Llms

[2512.14549] Dual-objective Language Models: Training Efficiency Without Overfitting

Abstract page for arXiv paper 2512.14549: Dual-objective Language Models: Training Efficiency Without Overfitting

arXiv - AI · 3 min ·

All Content

[2504.08714] Generating Fine Details of Entity Interactions
Llms

[2504.08714] Generating Fine Details of Entity Interactions

Abstract page for arXiv paper 2504.08714: Generating Fine Details of Entity Interactions

arXiv - Machine Learning · 3 min ·
[2509.14858] MeanFlowSE: one-step generative speech enhancement via conditional mean flow
Machine Learning

[2509.14858] MeanFlowSE: one-step generative speech enhancement via conditional mean flow

Abstract page for arXiv paper 2509.14858: MeanFlowSE: one-step generative speech enhancement via conditional mean flow

arXiv - AI · 3 min ·
[2507.13231] VITA: Vision-to-Action Flow Matching Policy
Generative Ai

[2507.13231] VITA: Vision-to-Action Flow Matching Policy

Abstract page for arXiv paper 2507.13231: VITA: Vision-to-Action Flow Matching Policy

arXiv - AI · 4 min ·
[2511.01343] CNFP: Optimizing Cloud-Native Network Function Placement with Diffusion Models on the Cloud Continuum
Machine Learning

[2511.01343] CNFP: Optimizing Cloud-Native Network Function Placement with Diffusion Models on the Cloud Continuum

Abstract page for arXiv paper 2511.01343: CNFP: Optimizing Cloud-Native Network Function Placement with Diffusion Models on the Cloud Con...

arXiv - Machine Learning · 4 min ·
[2509.23405] Planner Aware Path Learning in Diffusion Language Models Training
Llms

[2509.23405] Planner Aware Path Learning in Diffusion Language Models Training

Abstract page for arXiv paper 2509.23405: Planner Aware Path Learning in Diffusion Language Models Training

arXiv - Machine Learning · 4 min ·
[2312.17505] Catch Me If You Can Describe Me: Open-Vocabulary Camouflaged Instance Segmentation with Diffusion
Machine Learning

[2312.17505] Catch Me If You Can Describe Me: Open-Vocabulary Camouflaged Instance Segmentation with Diffusion

Abstract page for arXiv paper 2312.17505: Catch Me If You Can Describe Me: Open-Vocabulary Camouflaged Instance Segmentation with Diffusion

arXiv - AI · 4 min ·
[2410.02601] Diffusion & Adversarial Schrödinger Bridges via Iterative Proportional Markovian Fitting
Machine Learning

[2410.02601] Diffusion & Adversarial Schrödinger Bridges via Iterative Proportional Markovian Fitting

Abstract page for arXiv paper 2410.02601: Diffusion & Adversarial Schrödinger Bridges via Iterative Proportional Markovian Fitting

arXiv - Machine Learning · 4 min ·
[2603.04366] Low-Resource Guidance for Controllable Latent Audio Diffusion
Machine Learning

[2603.04366] Low-Resource Guidance for Controllable Latent Audio Diffusion

Abstract page for arXiv paper 2603.04366: Low-Resource Guidance for Controllable Latent Audio Diffusion

arXiv - AI · 3 min ·
[2603.04343] Enhancing Authorship Attribution with Synthetic Paintings
Machine Learning

[2603.04343] Enhancing Authorship Attribution with Synthetic Paintings

Abstract page for arXiv paper 2603.04343: Enhancing Authorship Attribution with Synthetic Paintings

arXiv - Machine Learning · 3 min ·
[2603.04340] Balancing Fidelity, Utility, and Privacy in Synthetic Cardiac MRI Generation: A Comparative Study
Machine Learning

[2603.04340] Balancing Fidelity, Utility, and Privacy in Synthetic Cardiac MRI Generation: A Comparative Study

Abstract page for arXiv paper 2603.04340: Balancing Fidelity, Utility, and Privacy in Synthetic Cardiac MRI Generation: A Comparative Study

arXiv - Machine Learning · 3 min ·
[2603.04325] Scalable Evaluation of the Realism of Synthetic Environmental Augmentations in Images
Generative Ai

[2603.04325] Scalable Evaluation of the Realism of Synthetic Environmental Augmentations in Images

Abstract page for arXiv paper 2603.04325: Scalable Evaluation of the Realism of Synthetic Environmental Augmentations in Images

arXiv - Machine Learning · 4 min ·
[2603.04291] CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video
Machine Learning

[2603.04291] CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video

Abstract page for arXiv paper 2603.04291: CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video

arXiv - AI · 4 min ·
[2603.04122] FastWave: Optimized Diffusion Model for Audio Super-Resolution
Machine Learning

[2603.04122] FastWave: Optimized Diffusion Model for Audio Super-Resolution

Abstract page for arXiv paper 2603.04122: FastWave: Optimized Diffusion Model for Audio Super-Resolution

arXiv - Machine Learning · 3 min ·
[2603.04005] Training-Free Rate-Distortion-Perception Traversal With Diffusion
Machine Learning

[2603.04005] Training-Free Rate-Distortion-Perception Traversal With Diffusion

Abstract page for arXiv paper 2603.04005: Training-Free Rate-Distortion-Perception Traversal With Diffusion

arXiv - Machine Learning · 3 min ·
[2603.03792] TAP: A Token-Adaptive Predictor Framework for Training-Free Diffusion Acceleration
Machine Learning

[2603.03792] TAP: A Token-Adaptive Predictor Framework for Training-Free Diffusion Acceleration

Abstract page for arXiv paper 2603.03792: TAP: A Token-Adaptive Predictor Framework for Training-Free Diffusion Acceleration

arXiv - Machine Learning · 3 min ·
[2603.04024] Volumetric Directional Diffusion: Anchoring Uncertainty Quantification in Anatomical Consensus for Ambiguous Medical Image Segmentation
Machine Learning

[2603.04024] Volumetric Directional Diffusion: Anchoring Uncertainty Quantification in Anatomical Consensus for Ambiguous Medical Image Segmentation

Abstract page for arXiv paper 2603.04024: Volumetric Directional Diffusion: Anchoring Uncertainty Quantification in Anatomical Consensus ...

arXiv - AI · 4 min ·
[2603.04001] STEM Faculty Perspectives on Generative AI in Higher Education
Generative Ai

[2603.04001] STEM Faculty Perspectives on Generative AI in Higher Education

Abstract page for arXiv paper 2603.04001: STEM Faculty Perspectives on Generative AI in Higher Education

arXiv - AI · 4 min ·
[2603.03626] Riemannian Langevin Dynamics: Strong Convergence of Geometric Euler-Maruyama Scheme
Machine Learning

[2603.03626] Riemannian Langevin Dynamics: Strong Convergence of Geometric Euler-Maruyama Scheme

Abstract page for arXiv paper 2603.03626: Riemannian Langevin Dynamics: Strong Convergence of Geometric Euler-Maruyama Scheme

arXiv - Machine Learning · 3 min ·
[2603.03971] Upholding Epistemic Agency: A Brouwerian Assertibility Constraint for Responsible AI
Generative Ai

[2603.03971] Upholding Epistemic Agency: A Brouwerian Assertibility Constraint for Responsible AI

Abstract page for arXiv paper 2603.03971: Upholding Epistemic Agency: A Brouwerian Assertibility Constraint for Responsible AI

arXiv - AI · 4 min ·
[2603.03727] Understanding Parents' Desires in Moderating Children's Interactions with GenAI Chatbots through LLM-Generated Probes
Llms

[2603.03727] Understanding Parents' Desires in Moderating Children's Interactions with GenAI Chatbots through LLM-Generated Probes

Abstract page for arXiv paper 2603.03727: Understanding Parents' Desires in Moderating Children's Interactions with GenAI Chatbots throug...

arXiv - AI · 3 min ·
Previous Page 14 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime