Generative AI

Image, video, audio, and text generation

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · about 4 hours ago

Machine Learning

[2603.12057] Coarse-Guided Visual Generation via Weighted h-Transform Sampling

Abstract page for arXiv paper 2603.12057: Coarse-Guided Visual Generation via Weighted h-Transform Sampling

arXiv - AI · 4 min · about 6 hours ago

Machine Learning

[2603.07455] Image Generation Models: A Technical History

Abstract page for arXiv paper 2603.07455: Image Generation Models: A Technical History

arXiv - AI · 3 min · about 6 hours ago

All Content

Robotics

Employees at Google and OpenAI support Anthropic's Pentagon stand in open letter | TechCrunch

Over 360 employees from Google and OpenAI have signed an open letter supporting Anthropic's stance against the Pentagon's demands for AI ...

TechCrunch - AI · 5 min · about 1 month ago

Ai Agents

Samsung’s Galaxy S26 AI camera features are a photography nightmare | The Verge

The Vergecast discusses Samsung's Galaxy S26 AI camera features, arguing they redefine photography and raise concerns about the essence o...

The Verge - AI · 5 min · about 1 month ago

Ai Infrastructure

OpenAI raises $110B in one of the largest private funding rounds in history | TechCrunch

OpenAI secures $110 billion in private funding, led by Amazon, Nvidia, and SoftBank, marking a significant milestone in AI infrastructure...

TechCrunch - AI · 5 min · about 1 month ago

Llms

OpenAI snags $110 billion in investments from Amazon, Nvidia, and Softbank | The Verge

OpenAI secures $110 billion in new investments from Amazon, Nvidia, and Softbank, enhancing its market position and partnerships while pr...

The Verge - AI · 5 min · about 1 month ago

Llms

Dr Seuss vs Hemingway in LLMs

The article discusses an experiment comparing the writing styles of Dr. Seuss and Hemingway using language models, highlighting the poten...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Machine Learning

Why your AI sounds the same across every platform

The article discusses the uniformity in AI-generated marketing copy across different platforms, highlighting the challenges of creating d...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Ai Startups

Huxe Will Give You a Personalized, Daily Audio Summary Powered by AI | WIRED

Huxe is an AI-powered app that provides personalized daily audio summaries from your email and calendar, helping users save time and stay...

Wired - AI · 8 min · about 1 month ago

Generative Ai

[R] Community Members, Kindly share your opinion on my article. Am I clear in my thoughts? Anything I miss here?

The article seeks feedback on a research piece about AI's role in creative writing, inviting community insights to enhance future work.

Reddit - Machine Learning · 1 min · about 1 month ago

Ai Safety

Anthropic Rejects the Pentagon’s Demand That It Remove AI Safeguards

Anthropic has rejected the Pentagon's demand to remove AI safeguards for its model Claude, aiming to prevent its use in mass surveillance...

AI Tools & Products · 5 min · about 1 month ago

Machine Learning

Mixing generative AI with physics to create personal items that work in the real world

The article discusses the challenges of using generative AI in creating functional designs, emphasizing the need for integrating physics ...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Machine Learning

[2512.01292] Diffusion Model in Latent Space for Medical Image Segmentation Task

This article presents MedSegLatDiff, a novel diffusion model for efficient medical image segmentation that enhances interpretability by g...

arXiv - AI · 4 min · about 1 month ago

Ai Agents

[2510.25726] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

The Tool Decathlon introduces a benchmark for evaluating language agents on diverse, realistic, and complex tasks, highlighting significa...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.19060] PoSh: Using Scene Graphs To Guide LLMs-as-a-Judge For Detailed Image Descriptions

The paper introduces PoSh, a new metric using scene graphs to enhance the evaluation of detailed image descriptions by LLMs, outperformin...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.02334] VQ-Style: Disentangling Style and Content in Motion with Residual Quantized Representations

The paper presents VQ-Style, a method for disentangling style and content in human motion data using Residual Vector Quantized Variationa...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.03495] AgentHub: A Registry for Discoverable, Verifiable, and Reproducible AI Agents

AgentHub proposes a registry for AI agents that enhances discoverability, verifiability, and reproducibility, addressing gaps in current ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2512.05251] One-Step Diffusion Samplers via Self-Distillation and Deterministic Flow

The paper presents a novel one-step diffusion sampler that utilizes self-distillation and deterministic flow to enhance sampling efficien...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2507.17937] Bob's Confetti: Phonetic Memorization Attacks in Music and Video Generation

The paper presents a novel attack method, Adversarial PhoneTic Prompting (APT), that exploits phonetic memorization in generative AI syst...

arXiv - AI · 4 min · about 1 month ago

Llms

[2507.12553] Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility

This paper explores how language models (LMs) categorize event plausibility, revealing that LMs can reliably discern modal categories, wh...

arXiv - AI · 4 min · about 1 month ago

Ai Agents

[2507.00788] Echoes of AI: Investigating the Downstream Effects of AI Assistants on Software Maintainability

This study investigates the impact of AI assistants on software maintainability, revealing no significant differences in code evolution d...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2510.01031] Secure and reversible face anonymization with diffusion models

This paper presents a novel framework for secure and reversible face anonymization using diffusion models, addressing challenges in image...

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 29 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Generative AI

Top This Week

Accelerating science with AI and simulations

[2603.12057] Coarse-Guided Visual Generation via Weighted h-Transform Sampling

[2603.07455] Image Generation Models: A Technical History

All Content

Employees at Google and OpenAI support Anthropic's Pentagon stand in open letter | TechCrunch

Samsung’s Galaxy S26 AI camera features are a photography nightmare | The Verge

OpenAI raises $110B in one of the largest private funding rounds in history | TechCrunch

OpenAI snags $110 billion in investments from Amazon, Nvidia, and Softbank | The Verge

Dr Seuss vs Hemingway in LLMs

Why your AI sounds the same across every platform

Huxe Will Give You a Personalized, Daily Audio Summary Powered by AI | WIRED

[R] Community Members, Kindly share your opinion on my article. Am I clear in my thoughts? Anything I miss here?

Anthropic Rejects the Pentagon’s Demand That It Remove AI Safeguards

Mixing generative AI with physics to create personal items that work in the real world

[2512.01292] Diffusion Model in Latent Space for Medical Image Segmentation Task

[2510.25726] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

[2510.19060] PoSh: Using Scene Graphs To Guide LLMs-as-a-Judge For Detailed Image Descriptions

[2602.02334] VQ-Style: Disentangling Style and Content in Motion with Residual Quantized Representations

[2510.03495] AgentHub: A Registry for Discoverable, Verifiable, and Reproducible AI Agents

[2512.05251] One-Step Diffusion Samplers via Self-Distillation and Deterministic Flow

[2507.17937] Bob's Confetti: Phonetic Memorization Attacks in Music and Video Generation

[2507.12553] Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility

[2507.00788] Echoes of AI: Investigating the Downstream Effects of AI Assistants on Software Maintainability

[2510.01031] Secure and reversible face anonymization with diffusion models

Related Topics

Stay updated with AI News