Generative AI

Image, video, audio, and text generation

Top This Week

Generative Ai

Google's Veo 3.1 Lite Cuts API Costs in Half as OpenAI's Sora Exits the Market

Google just cut Veo 3.1 API prices across the board today (April 7). Lite tier is now $0.05/sec — less than half the cost of Fast. Timing...

Reddit - Artificial Intelligence · 1 min ·
Generative Ai

Will Generative AI apps remain a revenue powerhouse in 2026?

AI Tools & Products · 1 min ·
[2601.08565] Rewriting Video: Text-Driven Reauthoring of Video Footage
Machine Learning

[2601.08565] Rewriting Video: Text-Driven Reauthoring of Video Footage

Abstract page for arXiv paper 2601.08565: Rewriting Video: Text-Driven Reauthoring of Video Footage

arXiv - AI · 3 min ·

All Content

[2602.13855] From Fluent to Verifiable: Claim-Level Auditability for Deep Research Agents
Ai Agents

[2602.13855] From Fluent to Verifiable: Claim-Level Auditability for Deep Research Agents

The paper discusses the need for claim-level auditability in deep research agents, highlighting the shift from factual errors to weak cla...

arXiv - AI · 3 min ·
[2602.13808] An end-to-end agentic pipeline for smart contract translation and quality evaluation
Llms

[2602.13808] An end-to-end agentic pipeline for smart contract translation and quality evaluation

This article presents a comprehensive framework for evaluating smart contracts generated from natural language specifications, focusing o...

arXiv - AI · 3 min ·
[2602.13792] StackingNet: Collective Inference Across Independent AI Foundation Models
Llms

[2602.13792] StackingNet: Collective Inference Across Independent AI Foundation Models

StackingNet introduces a meta-ensemble framework that enhances the coordination of independent AI foundation models, improving accuracy, ...

arXiv - AI · 3 min ·
[2602.13695] Can a Lightweight Automated AI Pipeline Solve Research-Level Mathematical Problems?
Llms

[2602.13695] Can a Lightweight Automated AI Pipeline Solve Research-Level Mathematical Problems?

This article explores the potential of a lightweight AI pipeline to solve complex mathematical problems, demonstrating its effectiveness ...

arXiv - AI · 4 min ·
[2602.13665] HyFunc: Accelerating LLM-based Function Calls for Agentic AI through Hybrid-Model Cascade and Dynamic Templating
Llms

[2602.13665] HyFunc: Accelerating LLM-based Function Calls for Agentic AI through Hybrid-Model Cascade and Dynamic Templating

The paper presents HyFunc, a framework designed to enhance the efficiency of LLM-based function calls in agentic AI by reducing computati...

arXiv - AI · 4 min ·
[2602.13418] Text Has Curvature
Machine Learning

[2602.13418] Text Has Curvature

The paper 'Text Has Curvature' explores the concept of intrinsic curvature in language, proposing a new measurement called Texture to ana...

arXiv - Machine Learning · 4 min ·
[2602.13616] DiffusionRollout: Uncertainty-Aware Rollout Planning in Long-Horizon PDE Solving
Machine Learning

[2602.13616] DiffusionRollout: Uncertainty-Aware Rollout Planning in Long-Horizon PDE Solving

The paper introduces DiffusionRollout, a strategy for improving long-horizon predictions in physical systems governed by PDEs by addressi...

arXiv - Machine Learning · 3 min ·
[2602.13416] High-Resolution Climate Projections Using Diffusion-Based Downscaling of a Lightweight Climate Emulator
Machine Learning

[2602.13416] High-Resolution Climate Projections Using Diffusion-Based Downscaling of a Lightweight Climate Emulator

This article presents a novel approach to high-resolution climate projections using a diffusion-based downscaling framework applied to a ...

arXiv - Machine Learning · 4 min ·
[2602.13264] Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models
Machine Learning

[2602.13264] Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models

The paper introduces Directional Concentration Uncertainty (DCU), a flexible framework for uncertainty quantification in generative model...

arXiv - AI · 4 min ·
[2602.13568] Who Do LLMs Trust? Human Experts Matter More Than Other LLMs
Llms

[2602.13568] Who Do LLMs Trust? Human Experts Matter More Than Other LLMs

This paper explores how large language models (LLMs) prioritize feedback from human experts over other LLMs in decision-making tasks, rev...

arXiv - AI · 3 min ·
[2602.13407] On-Policy Supervised Fine-Tuning for Efficient Reasoning
Machine Learning

[2602.13407] On-Policy Supervised Fine-Tuning for Efficient Reasoning

The paper presents a novel training strategy called on-policy supervised fine-tuning (SFT) for large reasoning models, simplifying the op...

arXiv - AI · 4 min ·
[2602.13367] Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts
Llms

[2602.13367] Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

Nanbeige4.1-3B is a novel small generalist language model that excels in reasoning, alignment, and code generation, demonstrating signifi...

arXiv - AI · 4 min ·
[2602.13318] DECKBench: Benchmarking Multi-Agent Frameworks for Academic Slide Generation and Editing
Ai Agents

[2602.13318] DECKBench: Benchmarking Multi-Agent Frameworks for Academic Slide Generation and Editing

DECKBench introduces a new evaluation framework for multi-agent systems focused on generating and editing academic slide decks, addressin...

arXiv - Machine Learning · 4 min ·
[2602.13274] ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs
Llms

[2602.13274] ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs

The paper introduces ProMoral-Bench, a benchmark for evaluating prompting strategies in large language models (LLMs) focused on moral rea...

arXiv - AI · 3 min ·
[2602.13258] MAPLE: A Sub-Agent Architecture for Memory, Learning, and Personalization in Agentic AI Systems
Llms

[2602.13258] MAPLE: A Sub-Agent Architecture for Memory, Learning, and Personalization in Agentic AI Systems

The paper presents MAPLE, a novel sub-agent architecture designed to enhance memory, learning, and personalization in AI systems, address...

arXiv - AI · 3 min ·
[2602.13234] Stay in Character, Stay Safe: Dual-Cycle Adversarial Self-Evolution for Safety Role-Playing Agents
Llms

[2602.13234] Stay in Character, Stay Safe: Dual-Cycle Adversarial Self-Evolution for Safety Role-Playing Agents

The paper presents a novel framework, Dual-Cycle Adversarial Self-Evolution, aimed at enhancing the safety and fidelity of role-playing a...

arXiv - AI · 4 min ·
[2602.13232] PlotChain: Deterministic Checkpointed Evaluation of Multimodal LLMs on Engineering Plot Reading
Llms

[2602.13232] PlotChain: Deterministic Checkpointed Evaluation of Multimodal LLMs on Engineering Plot Reading

PlotChain introduces a deterministic benchmark for evaluating multimodal large language models (MLLMs) on engineering plot reading, focus...

arXiv - AI · 4 min ·
[2602.13226] Variation is the Key: A Variation-Based Framework for LLM-Generated Text Detection
Llms

[2602.13226] Variation is the Key: A Variation-Based Framework for LLM-Generated Text Detection

This paper presents VaryBalance, a novel framework for detecting text generated by large language models (LLMs), outperforming existing m...

arXiv - AI · 3 min ·
[2602.13224] A Geometric Taxonomy of Hallucinations in LLMs
Llms

[2602.13224] A Geometric Taxonomy of Hallucinations in LLMs

This article presents a geometric taxonomy of hallucinations in large language models (LLMs), categorizing them into three types: unfaith...

arXiv - AI · 3 min ·
[2602.13217] VeRA: Verified Reasoning Data Augmentation at Scale
Ai Startups

[2602.13217] VeRA: Verified Reasoning Data Augmentation at Scale

VeRA introduces a framework for generating verified reasoning data at scale, enhancing AI evaluation by creating dynamic, executable benc...

arXiv - AI · 4 min ·
Previous Page 100 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime