Generative AI

Image, video, audio, and text generation

Top This Week

Machine Learning

AI video generation seems fundamentally more expensive than text, not just less optimized

There’s been a lot of discussion recently about how expensive AI video generation is compared to text, and it feels like this is more tha...

Reddit - Artificial Intelligence · 1 min ·
Accelerating science with AI and simulations
Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min ·
[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion
Machine Learning

[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion

Abstract page for arXiv paper 2603.10202: Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Ap...

arXiv - Machine Learning · 4 min ·

All Content

Llms

[D] How much are you using LLMs to summarize/read papers now?

The article discusses the evolving effectiveness of LLMs in summarizing research papers, highlighting improvements in quality and usabili...

Reddit - Machine Learning · 1 min ·
Machine Learning

Remote ML Intern – NLP, Diffusion Models, End-to-End Deployment (2nd Year AI/ML Student)

A second-year AI/ML student seeks a remote internship in machine learning, showcasing skills in NLP, diffusion models, and end-to-end pro...

Reddit - ML Jobs · 1 min ·
Anthropic accuses three Chinese AI labs of abusing Claude to improve their own models
Llms

Anthropic accuses three Chinese AI labs of abusing Claude to improve their own models

Anthropic accuses three Chinese AI labs of conducting distillation attacks on its Claude chatbot, claiming they illicitly extracted capab...

AI Tools & Products · 2 min ·
Anthropic Slams China for AI Theft, But Critics Say the Outrage Is Hypocritical
Nlp

Anthropic Slams China for AI Theft, But Critics Say the Outrage Is Hypocritical

Anthropic accuses Chinese developers of stealing AI secrets from its Claude chatbot, sparking criticism over its own data scraping practi...

AI Tools & Products · 7 min ·
CrowdStrike Reassesses Role As Claude Code Security Shifts AI Risk
Llms

CrowdStrike Reassesses Role As Claude Code Security Shifts AI Risk

CrowdStrike reassesses its position in the cybersecurity landscape following the launch of Anthropic's Claude Code Security, an AI tool t...

AI Tools & Products · 6 min ·
An AI Thought Experiment on Substack Is Sending The Stock Market Spiraling
Ai Agents

An AI Thought Experiment on Substack Is Sending The Stock Market Spiraling

A thought experiment on Substack predicts a future where AI leads to significant unemployment and economic disruption, causing turmoil in...

AI Tools & Products · 6 min ·
[2602.10604] Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters
Machine Learning

[2602.10604] Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

The paper introduces Step 3.5 Flash, a sparse Mixture-of-Experts model designed for efficient agentic intelligence with 11B active parame...

arXiv - AI · 6 min ·
[2602.07774] Generative Reasoning Re-ranker
Llms

[2602.07774] Generative Reasoning Re-ranker

The paper presents the Generative Reasoning Re-ranker (GR2), an innovative framework for enhancing recommendation systems using Large Lan...

arXiv - AI · 4 min ·
[2601.16210] PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation
Generative Ai

[2601.16210] PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation

The paper introduces PyraTok, a language-aligned pyramidal tokenizer designed to enhance video understanding and generation by improving ...

arXiv - AI · 3 min ·
[2601.15500] Low-Dimensional Adaptation of Rectified Flow: A Diffusion and Stochastic Localization Perspective
Generative Ai

[2601.15500] Low-Dimensional Adaptation of Rectified Flow: A Diffusion and Stochastic Localization Perspective

This paper explores the adaptation of Rectified Flow (RF) to low-dimensional target distributions, demonstrating improved sampling effici...

arXiv - Machine Learning · 4 min ·
[2601.05280] On the Limits of Self-Improving in Large Language Models: The Singularity Is Not Near Without Symbolic Model Synthesis
Llms

[2601.05280] On the Limits of Self-Improving in Large Language Models: The Singularity Is Not Near Without Symbolic Model Synthesis

This paper explores the limitations of self-improvement in large language models (LLMs), arguing that without symbolic model synthesis, t...

arXiv - Machine Learning · 4 min ·
[2601.04205] STaRR: Spatial-Temporal Token-Dynamics-Aware Responsive Remasking for Diffusion Language Models
Llms

[2601.04205] STaRR: Spatial-Temporal Token-Dynamics-Aware Responsive Remasking for Diffusion Language Models

The paper presents STaRR, a novel framework for responsive remasking in diffusion language models that adapts remasking decisions based o...

arXiv - AI · 3 min ·
[2512.13742] DL$^3$M: A Vision-to-Language Framework for Expert-Level Medical Reasoning through Deep Learning and Large Language Models
Llms

[2512.13742] DL$^3$M: A Vision-to-Language Framework for Expert-Level Medical Reasoning through Deep Learning and Large Language Models

The DL$^3$M framework integrates deep learning and large language models to enhance medical reasoning from images, addressing limitations...

arXiv - AI · 4 min ·
[2512.01809] Much Ado About Noising: Dispelling the Myths of Generative Robotic Control
Machine Learning

[2512.01809] Much Ado About Noising: Dispelling the Myths of Generative Robotic Control

This paper evaluates generative control policies in robotics, revealing that their success is due to iterative computation rather than mu...

arXiv - Machine Learning · 4 min ·
[2511.20629] MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models
Machine Learning

[2511.20629] MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models

The paper presents MapReduce LoRA, a novel framework for optimizing generative models by addressing multi-preference alignment issues. It...

arXiv - Machine Learning · 4 min ·
[2511.07399] StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
Machine Learning

[2511.07399] StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation

StreamDiffusionV2 presents a novel system for dynamic and interactive video generation, enhancing live streaming capabilities through opt...

arXiv - Machine Learning · 4 min ·
[2510.25850] Debate2Create: Robot Co-design via Multi-Agent LLM Debate
Llms

[2510.25850] Debate2Create: Robot Co-design via Multi-Agent LLM Debate

The paper introduces Debate2Create, a framework for robot co-design that utilizes multi-agent LLM debate to optimize robot morphology and...

arXiv - Machine Learning · 3 min ·
[2511.14478] Agentic AI Systems in Electrical Power Systems Engineering: Current State-of-the-Art and Challenges
Machine Learning

[2511.14478] Agentic AI Systems in Electrical Power Systems Engineering: Current State-of-the-Art and Challenges

This article reviews the state-of-the-art in agentic AI systems within electrical power engineering, providing a taxonomy and practical a...

arXiv - AI · 4 min ·
[2510.27246] Beyond a Million Tokens: Benchmarking and Enhancing Long-Term Memory in LLMs
Llms

[2510.27246] Beyond a Million Tokens: Benchmarking and Enhancing Long-Term Memory in LLMs

This paper presents a new framework for evaluating and enhancing long-term memory in large language models (LLMs), introducing the BEAM b...

arXiv - AI · 4 min ·
[2510.14979] From Pixels to Words -- Towards Native Vision-Language Primitives at Scale
Llms

[2510.14979] From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

The paper discusses the development of native Vision-Language Models (VLMs) that integrate vision and language capabilities more effectiv...

arXiv - AI · 4 min ·
Previous Page 52 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime