Generative AI

Image, video, audio, and text generation

Top This Week

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · about 7 hours ago

Machine Learning

[2603.12057] Coarse-Guided Visual Generation via Weighted h-Transform Sampling

Abstract page for arXiv paper 2603.12057: Coarse-Guided Visual Generation via Weighted h-Transform Sampling

arXiv - AI · 4 min · about 9 hours ago

Machine Learning

[2603.07455] Image Generation Models: A Technical History

Abstract page for arXiv paper 2603.07455: Image Generation Models: A Technical History

arXiv - AI · 3 min · about 9 hours ago

All Content

Machine Learning

[2411.11727] Aligning Few-Step Diffusion Models with Dense Reward Difference Learning

This paper presents Stepwise Diffusion Policy Optimization (SDPO), a novel reinforcement learning framework designed to enhance few-step ...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.22871] Test-Time Scaling with Diffusion Language Models via Reward-Guided Stitching

The paper presents a novel framework called Stitching Noisy Diffusion Thoughts, which enhances reasoning in large language models by comb...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.23295] ManifoldGD: Training-Free Hierarchical Manifold Guidance for Diffusion-Based Dataset Distillation

The paper presents ManifoldGD, a training-free framework for dataset distillation using hierarchical manifold guidance, improving efficie...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.23234] Scaling Search Relevance: Augmenting App Store Ranking with LLM-Generated Judgments

This article discusses a novel approach to enhancing app store ranking by integrating LLM-generated textual relevance labels with behavio...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.23214] Plug-and-Play Diffusion Meets ADMM: Dual-Variable Coupling for Robust Medical Image Reconstruction

This paper presents a novel approach to medical image reconstruction using Dual-Coupled Plug-and-Play Diffusion, addressing limitations i...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.22790] Natural Language Declarative Prompting (NLD-P): A Modular Governance Method for Prompt Design Under Model Drift

The paper introduces Natural Language Declarative Prompting (NLD-P), a governance method for prompt design that addresses challenges pose...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.23136] Modality Collapse as Mismatched Decoding: Information-Theoretic Limits of Multimodal LLMs

This article explores the concept of modality collapse in multimodal large language models (LLMs), highlighting the limitations of decode...

arXiv - Machine Learning · 4 min · about 1 month ago

AI Startups

[2602.22775] TherapyProbe: Generating Design Knowledge for Relational Safety in Mental Health Chatbots Through Adversarial Simulation

The paper introduces TherapyProbe, a methodology for enhancing relational safety in mental health chatbots through adversarial simulation...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.23132] From Agnostic to Specific: Latent Preference Diffusion for Multi-Behavior Sequential Recommendation

This paper presents FatsMB, a novel framework for Multi-Behavior Sequential Recommendation (MBSR) that enhances user preference modeling ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.23085] Q-Tag: Watermarking Quantum Circuit Generative Models

The paper presents Q-Tag, a novel watermarking framework for quantum circuit generative models (QCGMs), addressing the need for secure co...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.22760] Distributed LLM Pretraining During Renewable Curtailment Windows: A Feasibility Study

This study explores the feasibility of pretraining large language models (LLMs) during renewable energy curtailment periods, aiming to re...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.22752] Towards Simulating Social Media Users with LLMs: Evaluating the Operational Validity of Conditioned Comment Prediction

This article presents a study on the operational validity of using Large Language Models (LLMs) to simulate social media user behavior th...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.22716] SoPE: Spherical Coordinate-Based Positional Embedding for Enhancing Spatial Perception of 3D LVLMs

The paper presents SoPE, a novel Spherical Coordinate-Based Positional Embedding method aimed at improving the spatial perception capabil...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.22700] IMMACULATE: A Practical LLM Auditing Framework via Verifiable Computation

The paper presents IMMACULATE, a framework for auditing large language models (LLMs) using verifiable computation to detect economic devi...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.22913] SIGMA: A Semantic-Grounded Instruction-Driven Generative Multi-Task Recommender at AliExpress

The paper presents SIGMA, a novel generative multi-task recommender system developed for AliExpress, utilizing semantic grounding and ins...

arXiv - Machine Learning · 3 min · about 1 month ago

LLMs

[2602.22624] Instruction-based Image Editing with Planning, Reasoning, and Generation

This paper presents a novel approach to instruction-based image editing by integrating planning, reasoning, and generation through a mult...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.22801] Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving

This article explores the application of diffusion models in end-to-end autonomous driving, demonstrating their effectiveness through ext...

arXiv - Machine Learning · 4 min · about 1 month ago

Generative AI

[2602.22606] CoLyricist: Enhancing Lyric Writing with AI through Workflow-Aligned Support

CoLyricist is an AI-assisted tool designed to enhance the lyric writing process by aligning with the common workflows of lyricists, impro...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.22596] BetterScene: 3D Scene Synthesis with Representation-Aligned Generative Model

BetterScene introduces an innovative approach to 3D scene synthesis, enhancing novel view synthesis quality using sparse photos and a rep...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.22732] Generative Recommendation for Large-Scale Advertising

This paper introduces GR4AD, a generative recommendation system designed for large-scale advertising, enhancing ad revenue through innova...

arXiv - Machine Learning · 4 min · about 1 month ago
