Generative AI

Image, video, audio, and text generation

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Generative Ai

Google's Veo 3.1 Lite Cuts API Costs in Half as OpenAI's Sora Exits the Market

Google just cut Veo 3.1 API prices across the board today (April 7). Lite tier is now $0.05/sec — less than half the cost of Fast. Timing...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Generative Ai

Will Generative AI apps remain a revenue powerhouse in 2026?

AI Tools & Products · 1 min · about 15 hours ago

Machine Learning

[2601.08565] Rewriting Video: Text-Driven Reauthoring of Video Footage

Abstract page for arXiv paper 2601.08565: Rewriting Video: Text-Driven Reauthoring of Video Footage

arXiv - AI · 3 min · about 16 hours ago

All Content

Ai Agents

[2602.13855] From Fluent to Verifiable: Claim-Level Auditability for Deep Research Agents

The paper discusses the need for claim-level auditability in deep research agents, highlighting the shift from factual errors to weak cla...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.13808] An end-to-end agentic pipeline for smart contract translation and quality evaluation

This article presents a comprehensive framework for evaluating smart contracts generated from natural language specifications, focusing o...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.13792] StackingNet: Collective Inference Across Independent AI Foundation Models

StackingNet introduces a meta-ensemble framework that enhances the coordination of independent AI foundation models, improving accuracy, ...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.13695] Can a Lightweight Automated AI Pipeline Solve Research-Level Mathematical Problems?

This article explores the potential of a lightweight AI pipeline to solve complex mathematical problems, demonstrating its effectiveness ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13665] HyFunc: Accelerating LLM-based Function Calls for Agentic AI through Hybrid-Model Cascade and Dynamic Templating

The paper presents HyFunc, a framework designed to enhance the efficiency of LLM-based function calls in agentic AI by reducing computati...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.13418] Text Has Curvature

The paper 'Text Has Curvature' explores the concept of intrinsic curvature in language, proposing a new measurement called Texture to ana...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.13616] DiffusionRollout: Uncertainty-Aware Rollout Planning in Long-Horizon PDE Solving

The paper introduces DiffusionRollout, a strategy for improving long-horizon predictions in physical systems governed by PDEs by addressi...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.13416] High-Resolution Climate Projections Using Diffusion-Based Downscaling of a Lightweight Climate Emulator

This article presents a novel approach to high-resolution climate projections using a diffusion-based downscaling framework applied to a ...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.13264] Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models

The paper introduces Directional Concentration Uncertainty (DCU), a flexible framework for uncertainty quantification in generative model...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13568] Who Do LLMs Trust? Human Experts Matter More Than Other LLMs

This paper explores how large language models (LLMs) prioritize feedback from human experts over other LLMs in decision-making tasks, rev...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.13407] On-Policy Supervised Fine-Tuning for Efficient Reasoning

The paper presents a novel training strategy called on-policy supervised fine-tuning (SFT) for large reasoning models, simplifying the op...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13367] Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

Nanbeige4.1-3B is a novel small generalist language model that excels in reasoning, alignment, and code generation, demonstrating signifi...

arXiv - AI · 4 min · about 2 months ago

Ai Agents

[2602.13318] DECKBench: Benchmarking Multi-Agent Frameworks for Academic Slide Generation and Editing

DECKBench introduces a new evaluation framework for multi-agent systems focused on generating and editing academic slide decks, addressin...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.13274] ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs

The paper introduces ProMoral-Bench, a benchmark for evaluating prompting strategies in large language models (LLMs) focused on moral rea...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.13258] MAPLE: A Sub-Agent Architecture for Memory, Learning, and Personalization in Agentic AI Systems

The paper presents MAPLE, a novel sub-agent architecture designed to enhance memory, learning, and personalization in AI systems, address...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.13234] Stay in Character, Stay Safe: Dual-Cycle Adversarial Self-Evolution for Safety Role-Playing Agents

The paper presents a novel framework, Dual-Cycle Adversarial Self-Evolution, aimed at enhancing the safety and fidelity of role-playing a...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13232] PlotChain: Deterministic Checkpointed Evaluation of Multimodal LLMs on Engineering Plot Reading

PlotChain introduces a deterministic benchmark for evaluating multimodal large language models (MLLMs) on engineering plot reading, focus...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13226] Variation is the Key: A Variation-Based Framework for LLM-Generated Text Detection

This paper presents VaryBalance, a novel framework for detecting text generated by large language models (LLMs), outperforming existing m...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.13224] A Geometric Taxonomy of Hallucinations in LLMs

This article presents a geometric taxonomy of hallucinations in large language models (LLMs), categorizing them into three types: unfaith...

arXiv - AI · 3 min · about 2 months ago

Ai Startups

[2602.13217] VeRA: Verified Reasoning Data Augmentation at Scale

VeRA introduces a framework for generating verified reasoning data at scale, enhancing AI evaluation by creating dynamic, executable benc...

arXiv - AI · 4 min · about 2 months ago

Previous Page 100 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Generative AI

Top This Week

Google's Veo 3.1 Lite Cuts API Costs in Half as OpenAI's Sora Exits the Market

Will Generative AI apps remain a revenue powerhouse in 2026?

[2601.08565] Rewriting Video: Text-Driven Reauthoring of Video Footage

All Content

[2602.13855] From Fluent to Verifiable: Claim-Level Auditability for Deep Research Agents

[2602.13808] An end-to-end agentic pipeline for smart contract translation and quality evaluation

[2602.13792] StackingNet: Collective Inference Across Independent AI Foundation Models

[2602.13695] Can a Lightweight Automated AI Pipeline Solve Research-Level Mathematical Problems?

[2602.13665] HyFunc: Accelerating LLM-based Function Calls for Agentic AI through Hybrid-Model Cascade and Dynamic Templating

[2602.13418] Text Has Curvature

[2602.13616] DiffusionRollout: Uncertainty-Aware Rollout Planning in Long-Horizon PDE Solving

[2602.13416] High-Resolution Climate Projections Using Diffusion-Based Downscaling of a Lightweight Climate Emulator

[2602.13264] Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models

[2602.13568] Who Do LLMs Trust? Human Experts Matter More Than Other LLMs

[2602.13407] On-Policy Supervised Fine-Tuning for Efficient Reasoning

[2602.13367] Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

[2602.13318] DECKBench: Benchmarking Multi-Agent Frameworks for Academic Slide Generation and Editing

[2602.13274] ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs

[2602.13258] MAPLE: A Sub-Agent Architecture for Memory, Learning, and Personalization in Agentic AI Systems

[2602.13234] Stay in Character, Stay Safe: Dual-Cycle Adversarial Self-Evolution for Safety Role-Playing Agents

[2602.13232] PlotChain: Deterministic Checkpointed Evaluation of Multimodal LLMs on Engineering Plot Reading

[2602.13226] Variation is the Key: A Variation-Based Framework for LLM-Generated Text Detection

[2602.13224] A Geometric Taxonomy of Hallucinations in LLMs

[2602.13217] VeRA: Verified Reasoning Data Augmentation at Scale

Related Topics

Stay updated with AI News