Generative AI

Image, video, audio, and text generation

Top This Week

Navigating Recent Developments in Generative AI and Trade Secret Protection
Generative Ai

Navigating Recent Developments in Generative AI and Trade Secret Protection

Two recent federal district court decisions highlight the significant risks of sharing confidential information with a generative AI plat...

AI Tools & Products · 13 min ·
Generative Ai

Midjourney has a new offer on the cancel page there is 20 off for 2 months

submitted by /u/RainDragonfly826 [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
The Real Reason OpenAI Shut Sora Down Is a Warning to Every AI Startup
Generative Ai

The Real Reason OpenAI Shut Sora Down Is a Warning to Every AI Startup

AI Tools & Products · 3 min ·

All Content

[2508.02515] PoeTone: A Framework for Constrained Generation of Structured Chinese Songci with LLMs
Llms

[2508.02515] PoeTone: A Framework for Constrained Generation of Structured Chinese Songci with LLMs

The paper presents PoeTone, a framework for generating structured Chinese Songci poetry using large language models (LLMs), evaluating th...

arXiv - Machine Learning · 4 min ·
[2506.05688] Voice Impression Control in Zero-Shot TTS
Machine Learning

[2506.05688] Voice Impression Control in Zero-Shot TTS

This paper presents a novel method for controlling voice impressions in zero-shot text-to-speech (TTS) systems, utilizing a low-dimension...

arXiv - Machine Learning · 3 min ·
[2511.04694] Reasoning Up the Instruction Ladder for Controllable Language Models
Llms

[2511.04694] Reasoning Up the Instruction Ladder for Controllable Language Models

This paper explores the importance of instruction hierarchy in large language models (LLMs) for enhancing their controllability and relia...

arXiv - AI · 4 min ·
[2510.20091] CreativityPrism: A Holistic Evaluation Framework for Large Language Model Creativity
Llms

[2510.20091] CreativityPrism: A Holistic Evaluation Framework for Large Language Model Creativity

The paper presents CreativityPrism, a comprehensive framework for evaluating the creativity of large language models (LLMs) across variou...

arXiv - AI · 4 min ·
[2510.15828] GENESIS: A Generative Model of Episodic-Semantic Interaction
Machine Learning

[2510.15828] GENESIS: A Generative Model of Episodic-Semantic Interaction

The paper introduces GENESIS, a generative model that integrates episodic and semantic memory, addressing a key challenge in cognitive ne...

arXiv - AI · 4 min ·
[2510.08102] Lossless Vocabulary Reduction for Auto-Regressive Language Models
Llms

[2510.08102] Lossless Vocabulary Reduction for Auto-Regressive Language Models

This paper introduces a theoretical framework for lossless vocabulary reduction in auto-regressive language models, enabling efficient co...

arXiv - Machine Learning · 3 min ·
[2509.22237] FeatBench: Towards More Realistic Evaluation of Feature-level Code Generation
Llms

[2509.22237] FeatBench: Towards More Realistic Evaluation of Feature-level Code Generation

The paper introduces FeatBench, a new benchmark for evaluating feature-level code generation in Large Language Models (LLMs), addressing ...

arXiv - AI · 4 min ·
[2602.02958] Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization
Machine Learning

[2602.02958] Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

The paper presents Quant VideoGen, a framework for autoregressive long video generation that addresses the limitations of KV cache memory...

arXiv - Machine Learning · 4 min ·
[2509.19680] PolicyPad: Collaborative Prototyping of LLM Policies
Llms

[2509.19680] PolicyPad: Collaborative Prototyping of LLM Policies

The article presents PolicyPad, an interactive system designed for collaborative prototyping of policies governing large language models ...

arXiv - AI · 3 min ·
[2602.00191] GEPC: Group-Equivariant Posterior Consistency for Out-of-Distribution Detection in Diffusion Models
Machine Learning

[2602.00191] GEPC: Group-Equivariant Posterior Consistency for Out-of-Distribution Detection in Diffusion Models

The paper introduces Group-Equivariant Posterior Consistency (GEPC), a method for detecting out-of-distribution data in diffusion models ...

arXiv - Machine Learning · 4 min ·
[2512.18454] Out-of-Distribution Detection in Molecular Complexes via Diffusion Models for Irregular Graphs
Machine Learning

[2512.18454] Out-of-Distribution Detection in Molecular Complexes via Diffusion Models for Irregular Graphs

This paper presents a novel framework for out-of-distribution (OOD) detection in molecular complexes using diffusion models tailored for ...

arXiv - Machine Learning · 4 min ·
[2506.08822] FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency
Machine Learning

[2506.08822] FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency

The paper presents FreqPolicy, a novel flow-based visuomotor policy that enhances efficiency in robotic manipulation by imposing frequenc...

arXiv - AI · 4 min ·
[2510.25867] Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs
Machine Learning

[2510.25867] Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs

This paper presents MedVLSynther, a framework for synthesizing high-quality visual question answering (VQA) from medical documents, enhan...

arXiv - Machine Learning · 4 min ·
[2503.12286] Integrating Chain-of-Thought and Retrieval Augmented Generation Enhances Rare Disease Diagnosis from Clinical Notes
Llms

[2503.12286] Integrating Chain-of-Thought and Retrieval Augmented Generation Enhances Rare Disease Diagnosis from Clinical Notes

This article presents a novel approach combining Chain-of-Thought (CoT) and Retrieval Augmented Generation (RAG) to improve rare disease ...

arXiv - AI · 4 min ·
[2502.17863] A Survey: Spatiotemporal Consistency in Video Generation
Generative Ai

[2502.17863] A Survey: Spatiotemporal Consistency in Video Generation

This survey reviews advancements in spatiotemporal consistency in video generation, addressing challenges and methodologies in creating c...

arXiv - AI · 4 min ·
[2509.22007] Stage-wise Dynamics of Classifier-Free Guidance in Diffusion Models
Machine Learning

[2509.22007] Stage-wise Dynamics of Classifier-Free Guidance in Diffusion Models

This paper explores the dynamics of Classifier-Free Guidance (CFG) in diffusion models, revealing its effects on sampling processes and d...

arXiv - Machine Learning · 4 min ·
[2509.00454] Universal Properties of Activation Sparsity in Modern Large Language Models
Llms

[2509.00454] Universal Properties of Activation Sparsity in Modern Large Language Models

This article explores the universal properties of activation sparsity in modern large language models (LLMs), highlighting its implicatio...

arXiv - Machine Learning · 4 min ·
[2508.11810] FairTabGen: High-Fidelity and Fair Synthetic Health Data Generation from Limited Samples
Llms

[2508.11810] FairTabGen: High-Fidelity and Fair Synthetic Health Data Generation from Limited Samples

FairTabGen introduces a novel framework for generating high-fidelity synthetic healthcare data from limited samples, enhancing fairness a...

arXiv - Machine Learning · 3 min ·
[2501.16534] Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs
Llms

[2501.16534] Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs

This article presents a novel technique for extracting safety classifiers from aligned large language models (LLMs) to address vulnerabil...

arXiv - AI · 4 min ·
[2501.03544] PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models
Machine Learning

[2501.03544] PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models

PromptGuard introduces a novel method for moderating unsafe content in text-to-image models, enhancing safety without sacrificing image q...

arXiv - AI · 4 min ·
Previous Page 75 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime