Generative AI

Image, video, audio, and text generation

Top This Week

Generative AI

Will Generative AI apps remain a revenue powerhouse in 2026?

AI Tools & Products · 1 min ·
[2601.08565] Rewriting Video: Text-Driven Reauthoring of Video Footage
Machine Learning

arXiv - AI · 3 min ·
[2512.18388] Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models
Machine Learning

arXiv - AI · 4 min ·

All Content

[2602.14910] Position: Introspective Experience from Conversational Environments as a Path to Better Learning
Machine Learning

The paper discusses how introspective experiences from conversational environments can enhance learning in AI systems, arguing for the im...

arXiv - AI · 4 min ·
[2602.14740] AI Arms and Influence: Frontier Models Exhibit Sophisticated Reasoning in Simulated Nuclear Crises
Machine Learning

The paper explores how advanced AI models exhibit complex reasoning in simulated nuclear crises, revealing insights into strategic decisi...

arXiv - AI · 4 min ·
[2602.14721] WebWorld: A Large-Scale World Model for Web Agent Training
Machine Learning

WebWorld introduces a large-scale simulator for training web agents, utilizing over 1 million open-web interactions to enhance generaliza...

arXiv - AI · 3 min ·
[2602.14589] MATEO: A Multimodal Benchmark for Temporal Reasoning and Planning in LVLMs
Machine Learning

MATEO introduces a benchmark for assessing temporal reasoning in Large Vision Language Models (LVLMs), focusing on multimodal inputs and ...

arXiv - Machine Learning · 3 min ·
[2602.14529] Disentangling Deception and Hallucination Failures in LLMs
LLMs

This paper explores the distinction between deception and hallucination failures in large language models (LLMs), proposing a mechanism-o...

arXiv - AI · 3 min ·
[2602.14518] Diagnosing Knowledge Conflict in Multimodal Long-Chain Reasoning
LLMs

This paper explores knowledge conflicts in multimodal large language models (MLLMs) during long chain-of-thought reasoning, proposing a f...

arXiv - AI · 3 min ·
[2602.14457] Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5
LLMs

This technical report presents a comprehensive risk analysis framework for frontier AI, focusing on emerging threats and mitigation strat...

arXiv - Machine Learning · 4 min ·
[2602.14451] Precedent-Informed Reasoning: Mitigating Overthinking in Large Reasoning Models via Test-Time Precedent Learning
LLMs

The paper introduces Precedent-Informed Reasoning (PIR) to enhance reasoning in Large Language Models (LLMs) by leveraging past cases, im...

arXiv - AI · 4 min ·
[2602.14370] Competition for attention predicts good-to-bad tipping in AI
LLMs

This paper explores how competition for attention in AI systems can lead to tipping points from beneficial to harmful outcomes, providing...

arXiv - AI · 3 min ·
[2602.14296] AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines
Machine Learning

The paper presents AutoWebWorld, a framework that synthesizes verifiable web environments using Finite State Machines, enhancing the trai...

arXiv - AI · 4 min ·
[2602.14229] CORPGEN: Simulating Corporate Environments with Autonomous Digital Employees in Multi-Horizon Task Environments
Robotics

The paper introduces CORPGEN, a framework for simulating corporate environments using autonomous digital employees, addressing long-horiz...

arXiv - Machine Learning · 4 min ·
[2602.14135] ForesightSafety Bench: A Frontier Risk Evaluation and Governance Framework towards Safe AI
AI Safety

The paper presents the ForesightSafety Bench, a comprehensive framework for evaluating AI safety risks, addressing limitations in current...

arXiv - AI · 4 min ·
[2602.14130] Algebraic Quantum Intelligence: A New Framework for Reproducible Machine Creativity
LLMs

The paper introduces Algebraic Quantum Intelligence (AQI), a framework designed to enhance the creative capabilities of large language mo...

arXiv - Machine Learning · 4 min ·
[2602.14095] NEST: Nascent Encoded Steganographic Thoughts
LLMs

The paper 'NEST: Nascent Encoded Steganographic Thoughts' explores the potential for large language models (LLMs) to conceal reasoning wi...

arXiv - AI · 3 min ·
[2602.14003] Prompt-Driven Low-Altitude Edge Intelligence: Modular Agents and Generative Reasoning
Machine Learning

The paper presents a novel framework for low-altitude edge intelligence, addressing limitations of large AI models through a prompt-to-ag...

arXiv - AI · 4 min ·
[2602.13980] Cognitive Chunking for Soft Prompts: Accelerating Compressor Learning via Block-wise Causal Masking
LLMs

This article presents a novel method called Parallelized Iterative Compression (PIC) for enhancing soft prompt compression in Large Langu...

arXiv - Machine Learning · 4 min ·
[2602.13912] From Pixels to Policies: Reinforcing Spatial Reasoning in Language Models for Content-Aware Layout Design
LLMs

The paper presents LaySPA, a reinforcement learning framework designed to enhance spatial reasoning in large language models for effectiv...

arXiv - AI · 3 min ·
[2602.13904] Diagnosing Pathological Chain-of-Thought in Reasoning Models
LLMs

This paper discusses the identification and diagnosis of pathological chain-of-thought reasoning in AI models, highlighting three specifi...

arXiv - AI · 3 min ·
[2602.13873] Ambient Physics: Training Neural PDE Solvers with Partial Observations
Machine Learning

The paper introduces 'Ambient Physics', a novel framework for training neural PDE solvers using partial observations, achieving significa...

arXiv - Machine Learning · 3 min ·
[2602.13855] From Fluent to Verifiable: Claim-Level Auditability for Deep Research Agents
AI Agents

The paper discusses the need for claim-level auditability in deep research agents, highlighting the shift from factual errors to weak cla...

arXiv - AI · 3 min ·