Generative AI

Image, video, audio, and text generation

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Generative Ai

Will Generative AI apps remain a revenue powerhouse in 2026?

AI Tools & Products · 1 min · 41 minutes ago

Machine Learning

[2601.08565] Rewriting Video: Text-Driven Reauthoring of Video Footage

Abstract page for arXiv paper 2601.08565: Rewriting Video: Text-Driven Reauthoring of Video Footage

arXiv - AI · 3 min · about 2 hours ago

Machine Learning

[2512.18388] Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models

Abstract page for arXiv paper 2512.18388: Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creatio...

arXiv - AI · 4 min · about 2 hours ago

All Content

Llms

[2602.03837] Accelerating Scientific Research with Gemini: Case Studies and Common Techniques

This article explores how Google's Gemini models enhance scientific research through case studies, showcasing effective human-AI collabor...

arXiv - AI · 4 min · about 2 months ago

Nlp

[2602.01023] Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment

This paper presents a unified framework for Query Auto-Completion (QAC) that integrates Retrieval-Augmented Generation (RAG) and multi-ob...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2601.21812] A Decomposable Forward Process in Diffusion Models for Time-Series Forecasting

This paper presents a novel forward diffusion process for time-series forecasting that effectively decomposes signals into spectral compo...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2511.20974] RosettaSpeech: Zero-Shot Speech-to-Speech Translation without Parallel Speech

RosettaSpeech introduces a zero-shot framework for speech-to-speech translation, overcoming the need for parallel speech data by using mo...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2601.09982] Context Volume Drives Performance: Tackling Domain Shift in Extremely Low-Resource Translation via RAG

This article presents a hybrid framework for improving neural machine translation performance in low-resource languages, specifically add...

arXiv - AI · 3 min · about 2 months ago

Llms

[2512.22420] Nightjar: Dynamic Adaptive Speculative Decoding for Large Language Models Serving

The paper presents Nightjar, a novel algorithm for dynamic adaptive speculative decoding in large language models, enhancing throughput a...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2510.08431] Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency

This paper presents a novel approach to large-scale diffusion distillation using a score-regularized continuous-time consistency model, a...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2512.14166] IntentMiner: Intent Inversion Attack via Tool Call Analysis in the Model Context Protocol

The paper introduces IntentMiner, a novel approach to detect Intent Inversion Attacks in Large Language Models (LLMs) by analyzing tool c...

arXiv - AI · 4 min · about 2 months ago

Llms

[2512.13697] Writing in Symbiosis: Mapping Human Creative Agency in the AI Era

This article explores the evolving relationship between human creativity and AI, particularly in writing, highlighting how authors adapt ...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2512.09185] Learning Patient-Specific Disease Dynamics with Latent Flow Matching for Longitudinal Imaging Generation

The paper presents a novel framework, $ ext{Δ}$-LFM, for modeling patient-specific disease dynamics using latent flow matching, enhancing...

arXiv - AI · 4 min · about 2 months ago

Llms

[2512.04552] RRPO: Robust Reward Policy Optimization for LLM-based Emotional TTS

The paper presents Robust Reward Policy Optimization (RRPO), a novel framework designed to enhance emotional text-to-speech (TTS) systems...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2509.20928] Conditionally Whitened Generative Models for Probabilistic Time Series Forecasting

The paper introduces Conditionally Whitened Generative Models (CW-Gen) for probabilistic time series forecasting, addressing challenges l...

arXiv - Machine Learning · 4 min · about 2 months ago

Nlp

[2510.22876] Batch Speculative Decoding Done Right

The paper presents a novel framework for batch speculative decoding, addressing critical failures in existing methods and achieving signi...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2507.07139] Image Can Bring Your Memory Back: A Novel Multi-Modal Guided Attack against Image Generation Model Unlearning

The paper presents Recall, a novel adversarial framework that targets the robustness of image generation model unlearning, revealing vuln...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2510.04398] SECA: Semantically Equivalent and Coherent Attacks for Eliciting LLM Hallucinations

The paper presents SECA, a method for eliciting hallucinations in large language models (LLMs) through semantically equivalent and cohere...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2510.02356] Measuring Physical-World Privacy Awareness of Large Language Models: An Evaluation Benchmark

This article presents EAPrivacy, a benchmark for evaluating the physical-world privacy awareness of large language models (LLMs), reveali...

arXiv - AI · 4 min · about 2 months ago

Llms

[2510.00232] BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses

The paper introduces BiasFreeBench, a benchmark designed to evaluate bias mitigation techniques in large language models (LLMs) by provid...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2509.18776] AECBench: A Hierarchical Benchmark for Knowledge Evaluation of Large Language Models in the AEC Field

The paper introduces AECBench, a benchmark for evaluating large language models (LLMs) in the Architecture, Engineering, and Construction...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2503.10522] AudioX: A Unified Framework for Anything-to-Audio Generation

AudioX presents a unified framework for generating audio from various multimodal inputs, enhancing the quality and flexibility of audio g...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2411.01629] Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity

This paper explores denoising diffusions using optimal transport, focusing on localization, curvature, and multi-scale complexity in gene...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 92 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Generative AI

Top This Week

Will Generative AI apps remain a revenue powerhouse in 2026?

[2601.08565] Rewriting Video: Text-Driven Reauthoring of Video Footage

[2512.18388] Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models

All Content

[2602.03837] Accelerating Scientific Research with Gemini: Case Studies and Common Techniques

[2602.01023] Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment

[2601.21812] A Decomposable Forward Process in Diffusion Models for Time-Series Forecasting

[2511.20974] RosettaSpeech: Zero-Shot Speech-to-Speech Translation without Parallel Speech

[2601.09982] Context Volume Drives Performance: Tackling Domain Shift in Extremely Low-Resource Translation via RAG

[2512.22420] Nightjar: Dynamic Adaptive Speculative Decoding for Large Language Models Serving

[2510.08431] Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency

[2512.14166] IntentMiner: Intent Inversion Attack via Tool Call Analysis in the Model Context Protocol

[2512.13697] Writing in Symbiosis: Mapping Human Creative Agency in the AI Era

[2512.09185] Learning Patient-Specific Disease Dynamics with Latent Flow Matching for Longitudinal Imaging Generation

[2512.04552] RRPO: Robust Reward Policy Optimization for LLM-based Emotional TTS

[2509.20928] Conditionally Whitened Generative Models for Probabilistic Time Series Forecasting

[2510.22876] Batch Speculative Decoding Done Right

[2507.07139] Image Can Bring Your Memory Back: A Novel Multi-Modal Guided Attack against Image Generation Model Unlearning

[2510.04398] SECA: Semantically Equivalent and Coherent Attacks for Eliciting LLM Hallucinations

[2510.02356] Measuring Physical-World Privacy Awareness of Large Language Models: An Evaluation Benchmark

[2510.00232] BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses

[2509.18776] AECBench: A Hierarchical Benchmark for Knowledge Evaluation of Large Language Models in the AEC Field

[2503.10522] AudioX: A Unified Framework for Anything-to-Audio Generation

[2411.01629] Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity

Related Topics

Stay updated with AI News