Generative AI

Image, video, audio, and text generation

Top This Week

Accelerating science with AI and simulations
Machine Learning

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min ·
[2603.12057] Coarse-Guided Visual Generation via Weighted h-Transform Sampling
Machine Learning

Abstract page for arXiv paper 2603.12057: Coarse-Guided Visual Generation via Weighted h-Transform Sampling

arXiv - AI · 4 min ·
[2603.07455] Image Generation Models: A Technical History
Machine Learning

Abstract page for arXiv paper 2603.07455: Image Generation Models: A Technical History

arXiv - AI · 3 min ·

All Content

Employees at Google and OpenAI support Anthropic's Pentagon stand in open letter | TechCrunch
Robotics

Over 360 employees from Google and OpenAI have signed an open letter supporting Anthropic's stance against the Pentagon's demands for AI ...

TechCrunch - AI · 5 min ·
Samsung’s Galaxy S26 AI camera features are a photography nightmare | The Verge
AI Agents

The Vergecast discusses Samsung's Galaxy S26 AI camera features, arguing they redefine photography and raise concerns about the essence o...

The Verge - AI · 5 min ·
OpenAI raises $110B in one of the largest private funding rounds in history | TechCrunch
AI Infrastructure

OpenAI secures $110 billion in private funding, led by Amazon, Nvidia, and SoftBank, marking a significant milestone in AI infrastructure...

TechCrunch - AI · 5 min ·
OpenAI snags $110 billion in investments from Amazon, Nvidia, and Softbank | The Verge
LLMs

OpenAI secures $110 billion in new investments from Amazon, Nvidia, and SoftBank, enhancing its market position and partnerships while pr...

The Verge - AI · 5 min ·
LLMs

Dr Seuss vs Hemingway in LLMs

The article discusses an experiment comparing the writing styles of Dr. Seuss and Hemingway using language models, highlighting the poten...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

Why your AI sounds the same across every platform

The article discusses the uniformity in AI-generated marketing copy across different platforms, highlighting the challenges of creating d...

Reddit - Artificial Intelligence · 1 min ·
Huxe Will Give You a Personalized, Daily Audio Summary Powered by AI | WIRED
AI Startups

Huxe is an AI-powered app that provides personalized daily audio summaries from your email and calendar, helping users save time and stay...

Wired - AI · 8 min ·
Generative AI

[R] Community Members, Kindly share your opinion on my article. Am I clear in my thoughts? Anything I miss here?

The article seeks feedback on a research piece about AI's role in creative writing, inviting community insights to enhance future work.

Reddit - Machine Learning · 1 min ·
Anthropic Rejects the Pentagon’s Demand That It Remove AI Safeguards
AI Safety

Anthropic has rejected the Pentagon's demand to remove AI safeguards for its model Claude, aiming to prevent its use in mass surveillance...

AI Tools & Products · 5 min ·
Machine Learning

Mixing generative AI with physics to create personal items that work in the real world

The article discusses the challenges of using generative AI in creating functional designs, emphasizing the need for integrating physics ...

Reddit - Artificial Intelligence · 1 min ·
[2512.01292] Diffusion Model in Latent Space for Medical Image Segmentation Task
Machine Learning

This article presents MedSegLatDiff, a novel diffusion model for efficient medical image segmentation that enhances interpretability by g...

arXiv - AI · 4 min ·
[2510.25726] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
AI Agents

The Tool Decathlon introduces a benchmark for evaluating language agents on diverse, realistic, and complex tasks, highlighting significa...

arXiv - AI · 4 min ·
[2510.19060] PoSh: Using Scene Graphs To Guide LLMs-as-a-Judge For Detailed Image Descriptions
LLMs

The paper introduces PoSh, a new metric using scene graphs to enhance the evaluation of detailed image descriptions by LLMs, outperformin...

arXiv - AI · 4 min ·
[2602.02334] VQ-Style: Disentangling Style and Content in Motion with Residual Quantized Representations
Machine Learning

The paper presents VQ-Style, a method for disentangling style and content in human motion data using Residual Vector Quantized Variationa...

arXiv - Machine Learning · 4 min ·
[2510.03495] AgentHub: A Registry for Discoverable, Verifiable, and Reproducible AI Agents
LLMs

AgentHub proposes a registry for AI agents that enhances discoverability, verifiability, and reproducibility, addressing gaps in current ...

arXiv - AI · 4 min ·
[2512.05251] One-Step Diffusion Samplers via Self-Distillation and Deterministic Flow
Machine Learning

The paper presents a novel one-step diffusion sampler that utilizes self-distillation and deterministic flow to enhance sampling efficien...

arXiv - Machine Learning · 3 min ·
[2507.17937] Bob's Confetti: Phonetic Memorization Attacks in Music and Video Generation
Machine Learning

The paper presents a novel attack method, Adversarial PhoneTic Prompting (APT), that exploits phonetic memorization in generative AI syst...

arXiv - AI · 4 min ·
[2507.12553] Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility
LLMs

This paper explores how language models (LMs) categorize event plausibility, revealing that LMs can reliably discern modal categories, wh...

arXiv - AI · 4 min ·
[2507.00788] Echoes of AI: Investigating the Downstream Effects of AI Assistants on Software Maintainability
AI Agents

This study investigates the impact of AI assistants on software maintainability, revealing no significant differences in code evolution d...

arXiv - AI · 4 min ·
[2510.01031] Secure and reversible face anonymization with diffusion models
Machine Learning

This paper presents a novel framework for secure and reversible face anonymization using diffusion models, addressing challenges in image...

arXiv - Machine Learning · 4 min ·