AI Startups

AI startup funding, launches, and acquisitions

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

What to expect from AlphaZero's value predictions [D]

An AlphaZero agent has learnt to predict the value of a game state by training on data generated by self-play by the model and a series o...

Reddit - Machine Learning · 1 min · about 1 hour ago

Ai Startups

There aren't enough rockets for space data centers. Cowboy Space raised $275 million to build them. | TechCrunch

Cowboy Space Corporation wants to put data centers in orbit. First, it has to build the rockets to get them there.

TechCrunch - AI · about 1 hour ago

Ai Agents

AWS just gave AI agents their own wallets. Your agent can now pay for itself.

This dropped 4 days ago and I haven't seen enough people talking about it. AWS launched Amazon Bedrock AgentCore Payments in partnership ...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

All Content

Machine Learning

What to expect from AlphaZero's value predictions [D]

An AlphaZero agent has learnt to predict the value of a game state by training on data generated by self-play by the model and a series o...

Reddit - Machine Learning · 1 min · about 1 hour ago

Ai Startups

There aren't enough rockets for space data centers. Cowboy Space raised $275 million to build them. | TechCrunch

Cowboy Space Corporation wants to put data centers in orbit. First, it has to build the rockets to get them there.

TechCrunch - AI · about 1 hour ago

Ai Agents

AWS just gave AI agents their own wallets. Your agent can now pay for itself.

This dropped 4 days ago and I haven't seen enough people talking about it. AWS launched Amazon Bedrock AgentCore Payments in partnership ...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Ai Startups

Seo District in Gwangju Launches Customized 'AI Digital Learning Center' for Residents

AI News - General · 4 min · about 4 hours ago

Llms

[2511.15204] Physics-Based Benchmarking Metrics for Multimodal Synthetic Images

Abstract page for arXiv paper 2511.15204: Physics-Based Benchmarking Metrics for Multimodal Synthetic Images

arXiv - AI · 3 min · about 8 hours ago

Llms

[2506.21582] VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents

Abstract page for arXiv paper 2506.21582: VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with I...

arXiv - AI · 4 min · about 8 hours ago

Llms

[2502.01941] Semantic Integrity Matters: Benchmarking and Preserving High-Density Reasoning in KV Cache Compression

Abstract page for arXiv paper 2502.01941: Semantic Integrity Matters: Benchmarking and Preserving High-Density Reasoning in KV Cache Comp...

arXiv - AI · 4 min · about 8 hours ago

Ai Startups

[2510.00436] Automated Evaluation can Distinguish the Good and Bad AI Responses to Patient Questions about Hospitalization

Abstract page for arXiv paper 2510.00436: Automated Evaluation can Distinguish the Good and Bad AI Responses to Patient Questions about H...

arXiv - AI · 3 min · about 8 hours ago

Ai Startups

[2605.07986] Towards Apples to Apples for AI Evaluations: From Real-World Use Cases to Evaluation Scenarios

Abstract page for arXiv paper 2605.07986: Towards Apples to Apples for AI Evaluations: From Real-World Use Cases to Evaluation Scenarios

arXiv - AI · 4 min · about 8 hours ago

Llms

[2605.07985] Dooly: Configuration-Agnostic, Redundancy-Aware Profiling for LLM Inference Simulation

Abstract page for arXiv paper 2605.07985: Dooly: Configuration-Agnostic, Redundancy-Aware Profiling for LLM Inference Simulation

arXiv - AI · 4 min · about 8 hours ago

Ai Startups

[2605.07905] CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers

Abstract page for arXiv paper 2605.07905: CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers

arXiv - AI · 3 min · about 8 hours ago

Machine Learning

[2605.07872] Video Understanding Reward Modeling: A Robust Benchmark and Performant Reward Models

Abstract page for arXiv paper 2605.07872: Video Understanding Reward Modeling: A Robust Benchmark and Performant Reward Models

arXiv - AI · 3 min · about 8 hours ago

Machine Learning

[2605.07786] APEX: Assumption-free Projection-based Embedding eXamination Metric for Image Quality Assessment

Abstract page for arXiv paper 2605.07786: APEX: Assumption-free Projection-based Embedding eXamination Metric for Image Quality Assessment

arXiv - AI · 3 min · about 8 hours ago

Ai Startups

[2605.07751] Vibe coding before the trend

Abstract page for arXiv paper 2605.07751: Vibe coding before the trend

arXiv - AI · 3 min · about 8 hours ago

Llms

[2605.07699] DRIP-R: A Benchmark for Decision-Making and Reasoning Under Real-World Policy Ambiguity in the Retail Domain

Abstract page for arXiv paper 2605.07699: DRIP-R: A Benchmark for Decision-Making and Reasoning Under Real-World Policy Ambiguity in the ...

arXiv - AI · 3 min · about 8 hours ago

Llms

[2605.07394] BalCapRL: A Balanced Framework for RL-Based MLLM Image Captioning

Abstract page for arXiv paper 2605.07394: BalCapRL: A Balanced Framework for RL-Based MLLM Image Captioning

arXiv - AI · 4 min · about 8 hours ago

Ai Startups

[2605.07379] RELO: Reinforcement Learning to Localize for Visual Object Tracking

Abstract page for arXiv paper 2605.07379: RELO: Reinforcement Learning to Localize for Visual Object Tracking

arXiv - AI · 3 min · about 8 hours ago

Llms

[2605.07186] The Text Uncanny Valley: Non-Monotonic Performance Degradation in LLM Information Retrieval

Abstract page for arXiv paper 2605.07186: The Text Uncanny Valley: Non-Monotonic Performance Degradation in LLM Information Retrieval

arXiv - AI · 4 min · about 8 hours ago

Llms

[2605.07111] Beyond LoRA vs. Full Fine-Tuning: Gradient-Guided Optimizer Routing for LLM Adaptation

Abstract page for arXiv paper 2605.07111: Beyond LoRA vs. Full Fine-Tuning: Gradient-Guided Optimizer Routing for LLM Adaptation

arXiv - AI · 4 min · about 8 hours ago

Llms

[2605.06707] The Single-File Test: A Longitudinal Public-Interface Evaluation of First-Output LLM Web Generation with Social Reach Tracking

Abstract page for arXiv paper 2605.06707: The Single-File Test: A Longitudinal Public-Interface Evaluation of First-Output LLM Web Genera...

arXiv - AI · 4 min · about 8 hours ago

Page 1 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Startups

Top This Week

What to expect from AlphaZero's value predictions [D]

There aren't enough rockets for space data centers. Cowboy Space raised $275 million to build them. | TechCrunch

AWS just gave AI agents their own wallets. Your agent can now pay for itself.

All Content

What to expect from AlphaZero's value predictions [D]

There aren't enough rockets for space data centers. Cowboy Space raised $275 million to build them. | TechCrunch

AWS just gave AI agents their own wallets. Your agent can now pay for itself.

Seo District in Gwangju Launches Customized 'AI Digital Learning Center' for Residents

[2511.15204] Physics-Based Benchmarking Metrics for Multimodal Synthetic Images

[2506.21582] VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents

[2502.01941] Semantic Integrity Matters: Benchmarking and Preserving High-Density Reasoning in KV Cache Compression

[2510.00436] Automated Evaluation can Distinguish the Good and Bad AI Responses to Patient Questions about Hospitalization

[2605.07986] Towards Apples to Apples for AI Evaluations: From Real-World Use Cases to Evaluation Scenarios

[2605.07985] Dooly: Configuration-Agnostic, Redundancy-Aware Profiling for LLM Inference Simulation

[2605.07905] CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers

[2605.07872] Video Understanding Reward Modeling: A Robust Benchmark and Performant Reward Models

[2605.07786] APEX: Assumption-free Projection-based Embedding eXamination Metric for Image Quality Assessment

[2605.07751] Vibe coding before the trend

[2605.07699] DRIP-R: A Benchmark for Decision-Making and Reasoning Under Real-World Policy Ambiguity in the Retail Domain

[2605.07394] BalCapRL: A Balanced Framework for RL-Based MLLM Image Captioning

[2605.07379] RELO: Reinforcement Learning to Localize for Visual Object Tracking

[2605.07186] The Text Uncanny Valley: Non-Monotonic Performance Degradation in LLM Information Retrieval

[2605.07111] Beyond LoRA vs. Full Fine-Tuning: Gradient-Guided Optimizer Routing for LLM Adaptation

[2605.06707] The Single-File Test: A Longitudinal Public-Interface Evaluation of First-Output LLM Web Generation with Social Reach Tracking

Related Topics

Stay updated with AI News