Generative AI

Image, video, audio, and text generation

Top This Week

Accelerating science with AI and simulations
Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min ·
[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion
Machine Learning

[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion

Abstract page for arXiv paper 2603.10202: Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Ap...

arXiv - Machine Learning · 4 min ·
[2602.00388] Safer by Diffusion, Broken by Context: Diffusion LLM's Safety Blessing and Its Failure Mode
Llms

[2602.00388] Safer by Diffusion, Broken by Context: Diffusion LLM's Safety Blessing and Its Failure Mode

Abstract page for arXiv paper 2602.00388: Safer by Diffusion, Broken by Context: Diffusion LLM's Safety Blessing and Its Failure Mode

arXiv - Machine Learning · 4 min ·

All Content

[2602.21133] SOM-VQ: Topology-Aware Tokenization for Interactive Generative Models
Machine Learning

[2602.21133] SOM-VQ: Topology-Aware Tokenization for Interactive Generative Models

The paper presents SOM-VQ, a novel tokenization method that enhances interactive generative models by integrating vector quantization wit...

arXiv - Machine Learning · 3 min ·
[2602.20758] Deep unfolding of MCMC kernels: scalable, modular & explainable GANs for high-dimensional posterior sampling
Machine Learning

[2602.20758] Deep unfolding of MCMC kernels: scalable, modular & explainable GANs for high-dimensional posterior sampling

This article presents a novel approach to integrating deep unfolding techniques with MCMC methods, enhancing the efficiency and interpret...

arXiv - Machine Learning · 4 min ·
[2602.20549] Sample-efficient evidence estimation of score based priors for model selection
Machine Learning

[2602.20549] Sample-efficient evidence estimation of score based priors for model selection

The paper presents a novel estimator for model evidence in Bayesian inverse problems, particularly using diffusion models, enhancing samp...

arXiv - Machine Learning · 4 min ·
[2602.20360] Momentum Guidance: Plug-and-Play Guidance for Flow Models
Machine Learning

[2602.20360] Momentum Guidance: Plug-and-Play Guidance for Flow Models

The paper introduces Momentum Guidance (MG), a novel technique for enhancing flow-based generative models, achieving significant improvem...

arXiv - Machine Learning · 3 min ·
[2602.20338] Emergent Manifold Separability during Reasoning in Large Language Models
Llms

[2602.20338] Emergent Manifold Separability during Reasoning in Large Language Models

This paper explores the dynamics of reasoning in Large Language Models (LLMs) through Manifold Capacity Theory, revealing how latent repr...

arXiv - Machine Learning · 3 min ·
[2602.20293] Discrete Diffusion with Sample-Efficient Estimators for Conditionals
Machine Learning

[2602.20293] Discrete Diffusion with Sample-Efficient Estimators for Conditionals

This paper presents a novel discrete denoising diffusion framework that utilizes a sample-efficient estimator for single-site conditional...

arXiv - Machine Learning · 3 min ·
[2602.11184] KBVQ-MoE: KLT-guided SVD with Bias-Corrected Vector Quantization for MoE Large Language Models
Llms

[2602.11184] KBVQ-MoE: KLT-guided SVD with Bias-Corrected Vector Quantization for MoE Large Language Models

The paper presents KBVQ-MoE, a novel framework for improving vector quantization in Mixture of Experts (MoE) large language models, addre...

arXiv - Machine Learning · 4 min ·
[2602.00044] When LLMs Imagine People: A Human-Centered Persona Brainstorm Audit for Bias and Fairness in Creative Applications
Llms

[2602.00044] When LLMs Imagine People: A Human-Centered Persona Brainstorm Audit for Bias and Fairness in Creative Applications

This paper introduces the Persona Brainstorm Audit (PBA), a method for assessing bias in Large Language Models (LLMs) used in creative ap...

arXiv - AI · 4 min ·
[2601.11675] Generating metamers of human scene understanding
Machine Learning

[2601.11675] Generating metamers of human scene understanding

This article presents MetamerGen, a novel tool that generates metamers of human scene understanding by combining low-resolution gist info...

arXiv - AI · 4 min ·
[2601.03868] What Matters For Safety Alignment?
Llms

[2601.03868] What Matters For Safety Alignment?

This paper investigates safety alignment in large language models (LLMs) and large reasoning models (LRMs), identifying key factors that ...

arXiv - AI · 4 min ·
[2512.24787] HiGR: Efficient Generative Slate Recommendation via Hierarchical Planning and Multi-Objective Preference Alignment
Machine Learning

[2512.24787] HiGR: Efficient Generative Slate Recommendation via Hierarchical Planning and Multi-Objective Preference Alignment

The paper presents HiGR, a novel framework for generative slate recommendation that enhances efficiency and user preference alignment thr...

arXiv - AI · 4 min ·
[2512.16602] Refusal Steering: Fine-grained Control over LLM Refusal Behaviour for Sensitive Topics
Llms

[2512.16602] Refusal Steering: Fine-grained Control over LLM Refusal Behaviour for Sensitive Topics

The paper introduces Refusal Steering, a method for controlling Large Language Models' refusal behavior on sensitive topics without retra...

arXiv - AI · 4 min ·
[2511.17844] Less is More: Data-Efficient Adaptation for Controllable Text-to-Video Generation
Machine Learning

[2511.17844] Less is More: Data-Efficient Adaptation for Controllable Text-to-Video Generation

This article presents a novel data-efficient approach for fine-tuning text-to-video generation models, demonstrating that low-quality syn...

arXiv - AI · 3 min ·
[2510.18114] Latent-Augmented Discrete Diffusion Models
Machine Learning

[2510.18114] Latent-Augmented Discrete Diffusion Models

The paper presents Latent-Augmented Discrete Diffusion Models (LADD), which enhance discrete diffusion models for improved language gener...

arXiv - Machine Learning · 3 min ·
[2510.08091] Everything is Plausible: Investigating the Impact of LLM Rationales on Human Notions of Plausibility
Llms

[2510.08091] Everything is Plausible: Investigating the Impact of LLM Rationales on Human Notions of Plausibility

This article explores how rationales generated by large language models (LLMs) influence human judgments of plausibility in commonsense r...

arXiv - AI · 3 min ·
[2509.25774] PCPO: Proportionate Credit Policy Optimization for Aligning Image Generation Models
Machine Learning

[2509.25774] PCPO: Proportionate Credit Policy Optimization for Aligning Image Generation Models

The paper introduces Proportionate Credit Policy Optimization (PCPO), a novel framework aimed at improving the stability and quality of t...

arXiv - Machine Learning · 3 min ·
[2509.15796] Monte Carlo Tree Diffusion with Multiple Experts for Protein Design
Llms

[2509.15796] Monte Carlo Tree Diffusion with Multiple Experts for Protein Design

The paper presents MCTD-ME, a novel approach combining Monte Carlo Tree Search and masked diffusion models for efficient protein design, ...

arXiv - Machine Learning · 4 min ·
[2508.03250] RooseBERT: A New Deal For Political Language Modelling
Llms

[2508.03250] RooseBERT: A New Deal For Political Language Modelling

RooseBERT introduces a specialized language model for political discourse, enhancing the analysis of political debates through improved s...

arXiv - AI · 4 min ·
[2506.06251] DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation
Llms

[2506.06251] DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation

DesignBench introduces a comprehensive benchmark for evaluating MLLM-based front-end code generation, addressing limitations in existing ...

arXiv - AI · 4 min ·
[2506.03922] HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models
Llms

[2506.03922] HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models

HSSBench introduces a benchmark for evaluating Multimodal Large Language Models (MLLMs) in Humanities and Social Sciences, addressing gap...

arXiv - AI · 4 min ·
Previous Page 47 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime