Generative AI

Image, video, audio, and text generation

Top This Week

Generative Ai

Will Generative AI apps remain a revenue powerhouse in 2026?

AI Tools & Products · 1 min ·
[2601.08565] Rewriting Video: Text-Driven Reauthoring of Video Footage
Machine Learning

[2601.08565] Rewriting Video: Text-Driven Reauthoring of Video Footage

Abstract page for arXiv paper 2601.08565: Rewriting Video: Text-Driven Reauthoring of Video Footage

arXiv - AI · 3 min ·
[2512.18388] Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models
Machine Learning

[2512.18388] Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models

Abstract page for arXiv paper 2512.18388: Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creatio...

arXiv - AI · 4 min ·

All Content

[2508.21285] A Financial Brain Scan of the LLM
Llms

[2508.21285] A Financial Brain Scan of the LLM

This article presents a novel approach to analyzing large language models (LLMs) in finance, enabling researchers to identify and manipul...

arXiv - AI · 3 min ·
[2508.18210] Why Synthetic Isn't Real Yet: A Diagnostic Framework for Contact Center Dialogue Generation
Generative Ai

[2508.18210] Why Synthetic Isn't Real Yet: A Diagnostic Framework for Contact Center Dialogue Generation

This article presents a diagnostic framework for evaluating synthetic dialogue generation in contact centers, highlighting the limitation...

arXiv - AI · 4 min ·
[2307.14397] A Survey on Generative Modeling with Limited Data, Few Shots, and Zero Shot
Machine Learning

[2307.14397] A Survey on Generative Modeling with Limited Data, Few Shots, and Zero Shot

This survey explores generative modeling under constraints of limited data, few shots, and zero shots, presenting challenges and methodol...

arXiv - Machine Learning · 4 min ·
[2507.04704] SPATIA: Multimodal Generation and Prediction of Spatial Cell Phenotypes
Machine Learning

[2507.04704] SPATIA: Multimodal Generation and Prediction of Spatial Cell Phenotypes

The paper introduces SPATIA, a novel multimodal model for predicting spatial cell phenotypes by integrating cellular morphology, gene exp...

arXiv - AI · 4 min ·
[2602.06801] On the Non-Identifiability of Steering Vectors in Large Language Models
Llms

[2602.06801] On the Non-Identifiability of Steering Vectors in Large Language Models

This paper explores the non-identifiability of steering vectors in large language models (LLMs), revealing that these vectors cannot be u...

arXiv - AI · 3 min ·
[2506.04051] High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning
Llms

[2506.04051] High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning

The paper presents HALT, a method for finetuning large language models (LLMs) to enhance reliability by generating responses only when co...

arXiv - AI · 4 min ·
[2602.05319] Accelerated Sequential Flow Matching: A Bayesian Filtering Perspective
Machine Learning

[2602.05319] Accelerated Sequential Flow Matching: A Bayesian Filtering Perspective

This paper introduces Accelerated Sequential Flow Matching, a Bayesian filtering framework that enhances real-time inference in stochasti...

arXiv - Machine Learning · 4 min ·
[2506.03407] Multi-Spectral Gaussian Splatting with Neural Color Representation
Machine Learning

[2506.03407] Multi-Spectral Gaussian Splatting with Neural Color Representation

The paper presents MS-Splatting, a novel multi-spectral 3D Gaussian Splatting framework that generates consistent views from images captu...

arXiv - Machine Learning · 4 min ·
[2602.00628] From Associations to Activations: Comparing Behavioral and Hidden-State Semantic Geometry in LLMs
Llms

[2602.00628] From Associations to Activations: Comparing Behavioral and Hidden-State Semantic Geometry in LLMs

This paper examines the relationship between behavioral and hidden-state semantic geometry in large language models (LLMs) through psycho...

arXiv - AI · 3 min ·
[2505.07861] Scalable LLM Reasoning Acceleration with Low-rank Distillation
Llms

[2505.07861] Scalable LLM Reasoning Acceleration with Low-rank Distillation

The paper presents Caprese, a low-rank distillation method designed to enhance reasoning capabilities in large language models (LLMs) whi...

arXiv - Machine Learning · 3 min ·
[2505.07671] Benchmarking Retrieval-Augmented Generation for Chemistry
Llms

[2505.07671] Benchmarking Retrieval-Augmented Generation for Chemistry

This article presents ChemRAG-Bench, a benchmark for evaluating retrieval-augmented generation (RAG) in chemistry, demonstrating signific...

arXiv - AI · 4 min ·
[2601.13190] LAViG-FLOW: Latent Autoregressive Video Generation for Fluid Flow Simulations
Machine Learning

[2601.13190] LAViG-FLOW: Latent Autoregressive Video Generation for Fluid Flow Simulations

LAViG-FLOW introduces a novel framework for generating fluid flow simulations, significantly improving efficiency and consistency in mode...

arXiv - Machine Learning · 4 min ·
[2601.12415] Orthogonalized Policy Optimization:Decoupling Sampling Geometry from Optimization Geometry in RLHF
Llms

[2601.12415] Orthogonalized Policy Optimization:Decoupling Sampling Geometry from Optimization Geometry in RLHF

This paper introduces Orthogonalized Policy Optimization (OPO), a new approach in reinforcement learning that separates sampling and opti...

arXiv - Machine Learning · 4 min ·
[2503.04641] Simulating the Real World: A Unified Survey of Multimodal Generative Models
Machine Learning

[2503.04641] Simulating the Real World: A Unified Survey of Multimodal Generative Models

This article presents a comprehensive survey of multimodal generative models, focusing on their integration from 2D to 4D representations...

arXiv - Machine Learning · 4 min ·
[2601.03213] Critic-Guided Reinforcement Unlearning in Text-to-Image Diffusion
Machine Learning

[2601.03213] Critic-Guided Reinforcement Unlearning in Text-to-Image Diffusion

The paper presents a novel reinforcement learning framework for unlearning targeted concepts in text-to-image diffusion models, enhancing...

arXiv - Machine Learning · 4 min ·
[2502.16730] RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents
Llms

[2502.16730] RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents

RapidPen is a novel automated penetration testing framework that utilizes large language models to autonomously exploit vulnerabilities, ...

arXiv - AI · 4 min ·
[2512.10858] Scaling Behavior of Discrete Diffusion Language Models
Llms

[2512.10858] Scaling Behavior of Discrete Diffusion Language Models

This article explores the scaling behavior of discrete diffusion language models (DLMs) compared to autoregressive language models (ALMs)...

arXiv - Machine Learning · 4 min ·
[2511.22693] Generative Anchored Fields: Controlled Data Generation via Emergent Velocity Fields and Transport Algebra
Machine Learning

[2511.22693] Generative Anchored Fields: Controlled Data Generation via Emergent Velocity Fields and Transport Algebra

The paper introduces Generative Anchored Fields (GAF), a novel generative model that enhances data generation through controlled interpol...

arXiv - Machine Learning · 4 min ·
[2511.17879] Generative Adversarial Post-Training Mitigates Reward Hacking in Live Human-AI Music Interaction
Machine Learning

[2511.17879] Generative Adversarial Post-Training Mitigates Reward Hacking in Live Human-AI Music Interaction

This paper presents a novel method using generative adversarial training to address reward hacking in real-time human-AI music interactio...

arXiv - Machine Learning · 4 min ·
[2511.07833] MURPHY: Multi-Turn GRPO for Self Correcting Code Generation
Llms

[2511.07833] MURPHY: Multi-Turn GRPO for Self Correcting Code Generation

The paper presents MURPHY, a multi-turn reinforcement learning framework that enhances code generation by incorporating execution feedbac...

arXiv - AI · 3 min ·
Previous Page 93 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime