Generative AI
Image, video, audio, and text generation
Top This Week
[2601.08565] Rewriting Video: Text-Driven Reauthoring of Video Footage
[2512.18388] Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models
All Content
[2602.14910] Position: Introspective Experience from Conversational Environments as a Path to Better Learning
The paper discusses how introspective experiences from conversational environments can enhance learning in AI systems, arguing for the im...
[2602.14740] AI Arms and Influence: Frontier Models Exhibit Sophisticated Reasoning in Simulated Nuclear Crises
The paper explores how advanced AI models exhibit complex reasoning in simulated nuclear crises, revealing insights into strategic decisi...
[2602.14721] WebWorld: A Large-Scale World Model for Web Agent Training
WebWorld introduces a large-scale simulator for training web agents, utilizing over 1 million open-web interactions to enhance generaliza...
[2602.14589] MATEO: A Multimodal Benchmark for Temporal Reasoning and Planning in LVLMs
MATEO introduces a benchmark for assessing temporal reasoning in Large Vision Language Models (LVLMs), focusing on multimodal inputs and ...
[2602.14529] Disentangling Deception and Hallucination Failures in LLMs
This paper explores the distinction between deception and hallucination failures in large language models (LLMs), proposing a mechanism-o...
[2602.14518] Diagnosing Knowledge Conflict in Multimodal Long-Chain Reasoning
This paper explores knowledge conflicts in multimodal large language models (MLLMs) during long chain-of-thought reasoning, proposing a f...
[2602.14457] Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5
This technical report presents a comprehensive risk analysis framework for frontier AI, focusing on emerging threats and mitigation strat...
[2602.14451] Precedent-Informed Reasoning: Mitigating Overthinking in Large Reasoning Models via Test-Time Precedent Learning
The paper introduces Precedent-Informed Reasoning (PIR) to enhance reasoning in Large Language Models (LLMs) by leveraging past cases, im...
[2602.14370] Competition for attention predicts good-to-bad tipping in AI
This paper explores how competition for attention in AI systems can lead to tipping points from beneficial to harmful outcomes, providing...
[2602.14296] AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines
The paper presents AutoWebWorld, a framework that synthesizes verifiable web environments using Finite State Machines, enhancing the trai...
[2602.14229] CORPGEN: Simulating Corporate Environments with Autonomous Digital Employees in Multi-Horizon Task Environments
The paper introduces CORPGEN, a framework for simulating corporate environments using autonomous digital employees, addressing long-horiz...
[2602.14135] ForesightSafety Bench: A Frontier Risk Evaluation and Governance Framework towards Safe AI
The paper presents the ForesightSafety Bench, a comprehensive framework for evaluating AI safety risks, addressing limitations in current...
[2602.14130] Algebraic Quantum Intelligence: A New Framework for Reproducible Machine Creativity
The paper introduces Algebraic Quantum Intelligence (AQI), a framework designed to enhance the creative capabilities of large language mo...
[2602.14095] NEST: Nascent Encoded Steganographic Thoughts
The paper 'NEST: Nascent Encoded Steganographic Thoughts' explores the potential for large language models (LLMs) to conceal reasoning wi...
[2602.14003] Prompt-Driven Low-Altitude Edge Intelligence: Modular Agents and Generative Reasoning
The paper presents a novel framework for low-altitude edge intelligence, addressing limitations of large AI models through a prompt-to-ag...
[2602.13980] Cognitive Chunking for Soft Prompts: Accelerating Compressor Learning via Block-wise Causal Masking
This article presents a novel method called Parallelized Iterative Compression (PIC) for enhancing soft prompt compression in Large Langu...
[2602.13912] From Pixels to Policies: Reinforcing Spatial Reasoning in Language Models for Content-Aware Layout Design
The paper presents LaySPA, a reinforcement learning framework designed to enhance spatial reasoning in large language models for effectiv...
[2602.13904] Diagnosing Pathological Chain-of-Thought in Reasoning Models
This paper discusses the identification and diagnosis of pathological chain-of-thought reasoning in AI models, highlighting three specifi...
[2602.13873] Ambient Physics: Training Neural PDE Solvers with Partial Observations
The paper introduces 'Ambient Physics', a novel framework for training neural PDE solvers using partial observations, achieving significa...
[2602.13855] From Fluent to Verifiable: Claim-Level Auditability for Deep Research Agents
The paper discusses the need for claim-level auditability in deep research agents, highlighting the shift from factual errors to weak cla...