Generative AI

Image, video, audio, and text generation

Top This Week

Generative AI

Navigating Recent Developments in Generative AI and Trade Secret Protection

Two recent federal district court decisions highlight the significant risks of sharing confidential information with a generative AI plat...

AI Tools & Products · 13 min · 39 minutes ago

Generative AI

Midjourney has a new offer on the cancel page: 20 off for 2 months

submitted by /u/RainDragonfly826

Reddit - Artificial Intelligence · 1 min · about 19 hours ago

Generative AI

The Real Reason OpenAI Shut Sora Down Is a Warning to Every AI Startup

AI Tools & Products · 3 min · 1 day ago

All Content

LLMs

[2508.02515] PoeTone: A Framework for Constrained Generation of Structured Chinese Songci with LLMs

The paper presents PoeTone, a framework for generating structured Chinese Songci poetry using large language models (LLMs), evaluating th...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2506.05688] Voice Impression Control in Zero-Shot TTS

This paper presents a novel method for controlling voice impressions in zero-shot text-to-speech (TTS) systems, utilizing a low-dimension...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2511.04694] Reasoning Up the Instruction Ladder for Controllable Language Models

This paper explores the importance of instruction hierarchy in large language models (LLMs) for enhancing their controllability and relia...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2510.20091] CreativityPrism: A Holistic Evaluation Framework for Large Language Model Creativity

The paper presents CreativityPrism, a comprehensive framework for evaluating the creativity of large language models (LLMs) across variou...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2510.15828] GENESIS: A Generative Model of Episodic-Semantic Interaction

The paper introduces GENESIS, a generative model that integrates episodic and semantic memory, addressing a key challenge in cognitive ne...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2510.08102] Lossless Vocabulary Reduction for Auto-Regressive Language Models

This paper introduces a theoretical framework for lossless vocabulary reduction in auto-regressive language models, enabling efficient co...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2509.22237] FeatBench: Towards More Realistic Evaluation of Feature-level Code Generation

The paper introduces FeatBench, a new benchmark for evaluating feature-level code generation in Large Language Models (LLMs), addressing ...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.02958] Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

The paper presents Quant VideoGen, a framework for autoregressive long video generation that addresses the limitations of KV cache memory...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2509.19680] PolicyPad: Collaborative Prototyping of LLM Policies

The article presents PolicyPad, an interactive system designed for collaborative prototyping of policies governing large language models ...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.00191] GEPC: Group-Equivariant Posterior Consistency for Out-of-Distribution Detection in Diffusion Models

The paper introduces Group-Equivariant Posterior Consistency (GEPC), a method for detecting out-of-distribution data in diffusion models ...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2512.18454] Out-of-Distribution Detection in Molecular Complexes via Diffusion Models for Irregular Graphs

This paper presents a novel framework for out-of-distribution (OOD) detection in molecular complexes using diffusion models tailored for ...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2506.08822] FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency

The paper presents FreqPolicy, a novel flow-based visuomotor policy that enhances efficiency in robotic manipulation by imposing frequenc...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2510.25867] Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs

This paper presents MedVLSynther, a framework for synthesizing high-quality visual question answering (VQA) from medical documents, enhan...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2503.12286] Integrating Chain-of-Thought and Retrieval Augmented Generation Enhances Rare Disease Diagnosis from Clinical Notes

This article presents a novel approach combining Chain-of-Thought (CoT) and Retrieval Augmented Generation (RAG) to improve rare disease ...

arXiv - AI · 4 min · about 2 months ago

Generative AI

[2502.17863] A Survey: Spatiotemporal Consistency in Video Generation

This survey reviews advancements in spatiotemporal consistency in video generation, addressing challenges and methodologies in creating c...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2509.22007] Stage-wise Dynamics of Classifier-Free Guidance in Diffusion Models

This paper explores the dynamics of Classifier-Free Guidance (CFG) in diffusion models, revealing its effects on sampling processes and d...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2509.00454] Universal Properties of Activation Sparsity in Modern Large Language Models

This article explores the universal properties of activation sparsity in modern large language models (LLMs), highlighting its implicatio...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2508.11810] FairTabGen: High-Fidelity and Fair Synthetic Health Data Generation from Limited Samples

FairTabGen introduces a novel framework for generating high-fidelity synthetic healthcare data from limited samples, enhancing fairness a...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2501.16534] Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs

This article presents a novel technique for extracting safety classifiers from aligned large language models (LLMs) to address vulnerabil...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2501.03544] PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models

PromptGuard introduces a novel method for moderating unsafe content in text-to-image models, enhancing safety without sacrificing image q...

arXiv - AI · 4 min · about 2 months ago

