Generative AI

Image, video, audio, and text generation

Top This Week

Accelerating science with AI and simulations
Machine Learning

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min ·
[2603.12057] Coarse-Guided Visual Generation via Weighted h-Transform Sampling
Machine Learning

arXiv - AI · 4 min ·
[2603.07455] Image Generation Models: A Technical History
Machine Learning

arXiv - AI · 3 min ·

All Content

[2411.11727] Aligning Few-Step Diffusion Models with Dense Reward Difference Learning
Machine Learning

This paper presents Stepwise Diffusion Policy Optimization (SDPO), a novel reinforcement learning framework designed to enhance few-step ...

arXiv - Machine Learning · 4 min ·
[2602.22871] Test-Time Scaling with Diffusion Language Models via Reward-Guided Stitching
LLMs

The paper presents a novel framework called Stitching Noisy Diffusion Thoughts, which enhances reasoning in large language models by comb...

arXiv - AI · 4 min ·
[2602.23295] ManifoldGD: Training-Free Hierarchical Manifold Guidance for Diffusion-Based Dataset Distillation
Machine Learning

The paper presents ManifoldGD, a training-free framework for dataset distillation using hierarchical manifold guidance, improving efficie...

arXiv - Machine Learning · 4 min ·
[2602.23234] Scaling Search Relevance: Augmenting App Store Ranking with LLM-Generated Judgments
LLMs

This article discusses a novel approach to enhancing app store ranking by integrating LLM-generated textual relevance labels with behavio...

arXiv - Machine Learning · 4 min ·
[2602.23214] Plug-and-Play Diffusion Meets ADMM: Dual-Variable Coupling for Robust Medical Image Reconstruction
Machine Learning

This paper presents a novel approach to medical image reconstruction using Dual-Coupled Plug-and-Play Diffusion, addressing limitations i...

arXiv - Machine Learning · 4 min ·
[2602.22790] Natural Language Declarative Prompting (NLD-P): A Modular Governance Method for Prompt Design Under Model Drift
LLMs

The paper introduces Natural Language Declarative Prompting (NLD-P), a governance method for prompt design that addresses challenges pose...

arXiv - AI · 4 min ·
[2602.23136] Modality Collapse as Mismatched Decoding: Information-Theoretic Limits of Multimodal LLMs
LLMs

This article explores the concept of modality collapse in multimodal large language models (LLMs), highlighting the limitations of decode...

arXiv - Machine Learning · 4 min ·
[2602.22775] TherapyProbe: Generating Design Knowledge for Relational Safety in Mental Health Chatbots Through Adversarial Simulation
AI Startups

The paper introduces TherapyProbe, a methodology for enhancing relational safety in mental health chatbots through adversarial simulation...

arXiv - AI · 3 min ·
[2602.23132] From Agnostic to Specific: Latent Preference Diffusion for Multi-Behavior Sequential Recommendation
Machine Learning

This paper presents FatsMB, a novel framework for Multi-Behavior Sequential Recommendation (MBSR) that enhances user preference modeling ...

arXiv - Machine Learning · 4 min ·
[2602.23085] Q-Tag: Watermarking Quantum Circuit Generative Models
Machine Learning

The paper presents Q-Tag, a novel watermarking framework for quantum circuit generative models (QCGMs), addressing the need for secure co...

arXiv - Machine Learning · 4 min ·
[2602.22760] Distributed LLM Pretraining During Renewable Curtailment Windows: A Feasibility Study
LLMs

This study explores the feasibility of pretraining large language models (LLMs) during renewable energy curtailment periods, aiming to re...

arXiv - AI · 3 min ·
[2602.22752] Towards Simulating Social Media Users with LLMs: Evaluating the Operational Validity of Conditioned Comment Prediction
LLMs

This article presents a study on the operational validity of using Large Language Models (LLMs) to simulate social media user behavior th...

arXiv - AI · 4 min ·
[2602.22716] SoPE: Spherical Coordinate-Based Positional Embedding for Enhancing Spatial Perception of 3D LVLMs
LLMs

The paper presents SoPE, a novel Spherical Coordinate-Based Positional Embedding method aimed at improving the spatial perception capabil...

arXiv - AI · 4 min ·
[2602.22700] IMMACULATE: A Practical LLM Auditing Framework via Verifiable Computation
LLMs

The paper presents IMMACULATE, a framework for auditing large language models (LLMs) using verifiable computation to detect economic devi...

arXiv - AI · 3 min ·
[2602.22913] SIGMA: A Semantic-Grounded Instruction-Driven Generative Multi-Task Recommender at AliExpress
LLMs

The paper presents SIGMA, a novel generative multi-task recommender system developed for AliExpress, utilizing semantic grounding and ins...

arXiv - Machine Learning · 3 min ·
[2602.22624] Instruction-based Image Editing with Planning, Reasoning, and Generation
LLMs

This paper presents a novel approach to instruction-based image editing by integrating planning, reasoning, and generation through a mult...

arXiv - AI · 4 min ·
[2602.22801] Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving
Machine Learning

This article explores the application of diffusion models in end-to-end autonomous driving, demonstrating their effectiveness through ext...

arXiv - Machine Learning · 4 min ·
[2602.22606] CoLyricist: Enhancing Lyric Writing with AI through Workflow-Aligned Support
Generative AI

CoLyricist is an AI-assisted tool designed to enhance the lyric writing process by aligning with the common workflows of lyricists, impro...

arXiv - AI · 3 min ·
[2602.22596] BetterScene: 3D Scene Synthesis with Representation-Aligned Generative Model
Machine Learning

BetterScene introduces an innovative approach to 3D scene synthesis, enhancing novel view synthesis quality using sparse photos and a rep...

arXiv - AI · 4 min ·
[2602.22732] Generative Recommendation for Large-Scale Advertising
LLMs

This paper introduces GR4AD, a generative recommendation system designed for large-scale advertising, enhancing ad revenue through innova...

arXiv - Machine Learning · 4 min ·