Generative AI

Image, video, audio, and text generation

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

AI video generation seems fundamentally more expensive than text, not just less optimized

There’s been a lot of discussion recently about how expensive AI video generation is compared to text, and it feels like this is more tha...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · about 17 hours ago

Machine Learning

[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion

Abstract page for arXiv paper 2603.10202: Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Ap...

arXiv - Machine Learning · 4 min · about 18 hours ago

All Content

Llms

[D] How much are you using LLMs to summarize/read papers now?

The article discusses the evolving effectiveness of LLMs in summarizing research papers, highlighting improvements in quality and usabili...

Reddit - Machine Learning · 1 min · about 1 month ago

Machine Learning

Remote ML Intern – NLP, Diffusion Models, End-to-End Deployment (2nd Year AI/ML Student)

A second-year AI/ML student seeks a remote internship in machine learning, showcasing skills in NLP, diffusion models, and end-to-end pro...

Reddit - ML Jobs · 1 min · about 1 month ago

Llms

Anthropic accuses three Chinese AI labs of abusing Claude to improve their own models

Anthropic accuses three Chinese AI labs of conducting distillation attacks on its Claude chatbot, claiming they illicitly extracted capab...

AI Tools & Products · 2 min · about 1 month ago

Nlp

Anthropic Slams China for AI Theft, But Critics Say the Outrage Is Hypocritical

Anthropic accuses Chinese developers of stealing AI secrets from its Claude chatbot, sparking criticism over its own data scraping practi...

AI Tools & Products · 7 min · about 1 month ago

Llms

CrowdStrike Reassesses Role As Claude Code Security Shifts AI Risk

CrowdStrike reassesses its position in the cybersecurity landscape following the launch of Anthropic's Claude Code Security, an AI tool t...

AI Tools & Products · 6 min · about 1 month ago

Ai Agents

An AI Thought Experiment on Substack Is Sending The Stock Market Spiraling

A thought experiment on Substack predicts a future where AI leads to significant unemployment and economic disruption, causing turmoil in...

AI Tools & Products · 6 min · about 1 month ago

Machine Learning

[2602.10604] Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

The paper introduces Step 3.5 Flash, a sparse Mixture-of-Experts model designed for efficient agentic intelligence with 11B active parame...

arXiv - AI · 6 min · about 1 month ago

Llms

[2602.07774] Generative Reasoning Re-ranker

The paper presents the Generative Reasoning Re-ranker (GR2), an innovative framework for enhancing recommendation systems using Large Lan...

arXiv - AI · 4 min · about 1 month ago

Generative Ai

[2601.16210] PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation

The paper introduces PyraTok, a language-aligned pyramidal tokenizer designed to enhance video understanding and generation by improving ...

arXiv - AI · 3 min · about 1 month ago

Generative Ai

[2601.15500] Low-Dimensional Adaptation of Rectified Flow: A Diffusion and Stochastic Localization Perspective

This paper explores the adaptation of Rectified Flow (RF) to low-dimensional target distributions, demonstrating improved sampling effici...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2601.05280] On the Limits of Self-Improving in Large Language Models: The Singularity Is Not Near Without Symbolic Model Synthesis

This paper explores the limitations of self-improvement in large language models (LLMs), arguing that without symbolic model synthesis, t...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2601.04205] STaRR: Spatial-Temporal Token-Dynamics-Aware Responsive Remasking for Diffusion Language Models

The paper presents STaRR, a novel framework for responsive remasking in diffusion language models that adapts remasking decisions based o...

arXiv - AI · 3 min · about 1 month ago

Llms

[2512.13742] DL$^3$M: A Vision-to-Language Framework for Expert-Level Medical Reasoning through Deep Learning and Large Language Models

The DL$^3$M framework integrates deep learning and large language models to enhance medical reasoning from images, addressing limitations...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2512.01809] Much Ado About Noising: Dispelling the Myths of Generative Robotic Control

This paper evaluates generative control policies in robotics, revealing that their success is due to iterative computation rather than mu...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2511.20629] MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models

The paper presents MapReduce LoRA, a novel framework for optimizing generative models by addressing multi-preference alignment issues. It...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2511.07399] StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation

StreamDiffusionV2 presents a novel system for dynamic and interactive video generation, enhancing live streaming capabilities through opt...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.25850] Debate2Create: Robot Co-design via Multi-Agent LLM Debate

The paper introduces Debate2Create, a framework for robot co-design that utilizes multi-agent LLM debate to optimize robot morphology and...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2511.14478] Agentic AI Systems in Electrical Power Systems Engineering: Current State-of-the-Art and Challenges

This article reviews the state-of-the-art in agentic AI systems within electrical power engineering, providing a taxonomy and practical a...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.27246] Beyond a Million Tokens: Benchmarking and Enhancing Long-Term Memory in LLMs

This paper presents a new framework for evaluating and enhancing long-term memory in large language models (LLMs), introducing the BEAM b...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.14979] From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

The paper discusses the development of native Vision-Language Models (VLMs) that integrate vision and language capabilities more effectiv...

arXiv - AI · 4 min · about 1 month ago

Previous Page 52 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Generative AI

Top This Week

AI video generation seems fundamentally more expensive than text, not just less optimized

Accelerating science with AI and simulations

[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion

All Content

[D] How much are you using LLMs to summarize/read papers now?

Remote ML Intern – NLP, Diffusion Models, End-to-End Deployment (2nd Year AI/ML Student)

Anthropic accuses three Chinese AI labs of abusing Claude to improve their own models

Anthropic Slams China for AI Theft, But Critics Say the Outrage Is Hypocritical

CrowdStrike Reassesses Role As Claude Code Security Shifts AI Risk

An AI Thought Experiment on Substack Is Sending The Stock Market Spiraling

[2602.10604] Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

[2602.07774] Generative Reasoning Re-ranker

[2601.16210] PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation

[2601.15500] Low-Dimensional Adaptation of Rectified Flow: A Diffusion and Stochastic Localization Perspective

[2601.05280] On the Limits of Self-Improving in Large Language Models: The Singularity Is Not Near Without Symbolic Model Synthesis

[2601.04205] STaRR: Spatial-Temporal Token-Dynamics-Aware Responsive Remasking for Diffusion Language Models

[2512.13742] DL$^3$M: A Vision-to-Language Framework for Expert-Level Medical Reasoning through Deep Learning and Large Language Models

[2512.01809] Much Ado About Noising: Dispelling the Myths of Generative Robotic Control

[2511.20629] MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models

[2511.07399] StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation

[2510.25850] Debate2Create: Robot Co-design via Multi-Agent LLM Debate

[2511.14478] Agentic AI Systems in Electrical Power Systems Engineering: Current State-of-the-Art and Challenges

[2510.27246] Beyond a Million Tokens: Benchmarking and Enhancing Long-Term Memory in LLMs

[2510.14979] From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Related Topics

Stay updated with AI News