Generative AI

Image, video, audio, and text generation

Top This Week

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · about 10 hours ago

Machine Learning

[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion

Abstract page for arXiv paper 2603.10202: Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Ap...

arXiv - Machine Learning · 4 min · about 11 hours ago

LLMs

[2602.00388] Safer by Diffusion, Broken by Context: Diffusion LLM's Safety Blessing and Its Failure Mode

Abstract page for arXiv paper 2602.00388: Safer by Diffusion, Broken by Context: Diffusion LLM's Safety Blessing and Its Failure Mode

arXiv - Machine Learning · 4 min · about 11 hours ago

All Content

Machine Learning

[2602.21133] SOM-VQ: Topology-Aware Tokenization for Interactive Generative Models

The paper presents SOM-VQ, a novel tokenization method that enhances interactive generative models by integrating vector quantization wit...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.20758] Deep unfolding of MCMC kernels: scalable, modular & explainable GANs for high-dimensional posterior sampling

This article presents a novel approach to integrating deep unfolding techniques with MCMC methods, enhancing the efficiency and interpret...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.20549] Sample-efficient evidence estimation of score based priors for model selection

The paper presents a novel estimator for model evidence in Bayesian inverse problems, particularly using diffusion models, enhancing samp...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.20360] Momentum Guidance: Plug-and-Play Guidance for Flow Models

The paper introduces Momentum Guidance (MG), a novel technique for enhancing flow-based generative models, achieving significant improvem...

arXiv - Machine Learning · 3 min · about 1 month ago

LLMs

[2602.20338] Emergent Manifold Separability during Reasoning in Large Language Models

This paper explores the dynamics of reasoning in Large Language Models (LLMs) through Manifold Capacity Theory, revealing how latent repr...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.20293] Discrete Diffusion with Sample-Efficient Estimators for Conditionals

This paper presents a novel discrete denoising diffusion framework that utilizes a sample-efficient estimator for single-site conditional...

arXiv - Machine Learning · 3 min · about 1 month ago

LLMs

[2602.11184] KBVQ-MoE: KLT-guided SVD with Bias-Corrected Vector Quantization for MoE Large Language Models

The paper presents KBVQ-MoE, a novel framework for improving vector quantization in Mixture of Experts (MoE) large language models, addre...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.00044] When LLMs Imagine People: A Human-Centered Persona Brainstorm Audit for Bias and Fairness in Creative Applications

This paper introduces the Persona Brainstorm Audit (PBA), a method for assessing bias in Large Language Models (LLMs) used in creative ap...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2601.11675] Generating metamers of human scene understanding

This article presents MetamerGen, a novel tool that generates metamers of human scene understanding by combining low-resolution gist info...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2601.03868] What Matters For Safety Alignment?

This paper investigates safety alignment in large language models (LLMs) and large reasoning models (LRMs), identifying key factors that ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2512.24787] HiGR: Efficient Generative Slate Recommendation via Hierarchical Planning and Multi-Objective Preference Alignment

The paper presents HiGR, a novel framework for generative slate recommendation that enhances efficiency and user preference alignment thr...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2512.16602] Refusal Steering: Fine-grained Control over LLM Refusal Behaviour for Sensitive Topics

The paper introduces Refusal Steering, a method for controlling Large Language Models' refusal behavior on sensitive topics without retra...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2511.17844] Less is More: Data-Efficient Adaptation for Controllable Text-to-Video Generation

This article presents a novel data-efficient approach for fine-tuning text-to-video generation models, demonstrating that low-quality syn...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2510.18114] Latent-Augmented Discrete Diffusion Models

The paper presents Latent-Augmented Discrete Diffusion Models (LADD), which enhance discrete diffusion models for improved language gener...

arXiv - Machine Learning · 3 min · about 1 month ago

LLMs

[2510.08091] Everything is Plausible: Investigating the Impact of LLM Rationales on Human Notions of Plausibility

This article explores how rationales generated by large language models (LLMs) influence human judgments of plausibility in commonsense r...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2509.25774] PCPO: Proportionate Credit Policy Optimization for Aligning Image Generation Models

The paper introduces Proportionate Credit Policy Optimization (PCPO), a novel framework aimed at improving the stability and quality of t...

arXiv - Machine Learning · 3 min · about 1 month ago

LLMs

[2509.15796] Monte Carlo Tree Diffusion with Multiple Experts for Protein Design

The paper presents MCTD-ME, a novel approach combining Monte Carlo Tree Search and masked diffusion models for efficient protein design, ...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2508.03250] RooseBERT: A New Deal For Political Language Modelling

RooseBERT introduces a specialized language model for political discourse, enhancing the analysis of political debates through improved s...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2506.06251] DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation

DesignBench introduces a comprehensive benchmark for evaluating MLLM-based front-end code generation, addressing limitations in existing ...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2506.03922] HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models

HSSBench introduces a benchmark for evaluating Multimodal Large Language Models (MLLMs) in Humanities and Social Sciences, addressing gap...

arXiv - AI · 4 min · about 1 month ago
