AI Startups

AI startup funding, launches, and acquisitions


Top This Week

LLMs

[P] LLM with a 9-line seed + 5 rounds of contrastive feedback outperforms Optuna on 96% of benchmarks

submitted by /u/se4u

Reddit - Machine Learning · 1 min · about 1 hour ago

Generative AI

Why OpenAI really shut down Sora | TechCrunch

OpenAI's decision last week to shut down Sora, its AI video-generation tool, just six months after releasing it to the public raised imme...

TechCrunch - AI · 3 min · about 1 hour ago

Machine Learning

[D] Why does it seem like open source materials on ML are incomplete? this is not enough...

Many times when I try to deeply understand a topic in machine learning — whether it's a new architecture, a quantization method, a full t...

Reddit - Machine Learning · 1 min · about 13 hours ago

All Content

AI Startups

Netflix is buying Ben Affleck’s AI startup | The Verge

Netflix will use AI models developed by Affleck’s company InterPositive to change the way filmmakers produce projects.

The Verge - AI · 5 min · 24 days ago

LLMs

OpenAI launches GPT-5.4 with Pro and Thinking versions | TechCrunch

GPT-5.4 is billed as "our most capable and efficient frontier model for professional work."

TechCrunch - AI · 4 min · 24 days ago

Machine Learning

EXCLUSIVE: Luma launches creative AI agents powered by its new ‘Unified Intelligence’ models | TechCrunch

Luma introduced Luma Agents, powered by its new “Unified Intelligence” models, designed to coordinate multiple AI systems and generate en...

TechCrunch - AI · 5 min · 24 days ago

AI Agents

Cursor is rolling out a new kind of agentic coding tool | TechCrunch

Called Automations, the new system gives users a way to automatically launch agents within their coding environment, triggered by a new a...

TechCrunch - AI · 5 min · 24 days ago

AI Startups

Lio raises $30M from Andreessen Horowitz and others to automate enterprise procurement | TechCrunch

AI procurement startup Lio announced a $30 million Series A in a round led by Andreessen Horowitz.

TechCrunch - AI · 5 min · 25 days ago

AI Startups

How 1,000+ customer calls shaped a breakout enterprise AI startup | TechCrunch

On this episode of Build Mode, David Park joins Isabelle Johannessen to discuss how he and his team are intentionally iterating, fundrais...

TechCrunch - AI · 6 min · 25 days ago

AI Startups

[2602.10541] FastLSQ: A Framework for One-Shot PDE Solving

Abstract page for arXiv paper 2602.10541: FastLSQ: A Framework for One-Shot PDE Solving

arXiv - Machine Learning · 3 min · 25 days ago

LLMs

[2511.09396] Multimodal Large Language Models for Low-Resource Languages: A Case Study for Basque

Abstract page for arXiv paper 2511.09396: Multimodal Large Language Models for Low-Resource Languages: A Case Study for Basque

arXiv - AI · 3 min · 25 days ago

AI Startups

[2510.26840] SpotIt: Evaluating Text-to-SQL Evaluation with Formal Verification

Abstract page for arXiv paper 2510.26840: SpotIt: Evaluating Text-to-SQL Evaluation with Formal Verification

arXiv - AI · 4 min · 25 days ago

Robotics

[2509.25106] Towards Personalized Deep Research: Benchmarks and Evaluations

Abstract page for arXiv paper 2509.25106: Towards Personalized Deep Research: Benchmarks and Evaluations

arXiv - AI · 4 min · 25 days ago

Machine Learning

[2602.05286] HealthMamba: An Uncertainty-aware Spatiotemporal Graph State Space Model for Effective and Reliable Healthcare Facility Visit Prediction

Abstract page for arXiv paper 2602.05286: HealthMamba: An Uncertainty-aware Spatiotemporal Graph State Space Model for Effective and Reli...

arXiv - AI · 4 min · 25 days ago

LLMs

[2412.13091] LMUnit: Fine-grained Evaluation with Natural Language Unit Tests

Abstract page for arXiv paper 2412.13091: LMUnit: Fine-grained Evaluation with Natural Language Unit Tests

arXiv - AI · 3 min · 25 days ago

Machine Learning

[2509.22580] The Lie of the Average: How Class Incremental Learning Evaluation Deceives You?

Abstract page for arXiv paper 2509.22580: The Lie of the Average: How Class Incremental Learning Evaluation Deceives You?

arXiv - Machine Learning · 4 min · 25 days ago

AI Startups

[2508.06066] Effective Sample Size and Generalization Bounds for Temporal Networks

Abstract page for arXiv paper 2508.06066: Effective Sample Size and Generalization Bounds for Temporal Networks

arXiv - AI · 4 min · 25 days ago

LLMs

[2602.09937] Why Do AI Agents Systematically Fail at Cloud Root Cause Analysis?

Abstract page for arXiv paper 2602.09937: Why Do AI Agents Systematically Fail at Cloud Root Cause Analysis?

arXiv - AI · 4 min · 25 days ago

LLMs

[2601.16529] SycoEval-EM: Sycophancy Evaluation of Large Language Models in Simulated Clinical Encounters for Emergency Care

Abstract page for arXiv paper 2601.16529: SycoEval-EM: Sycophancy Evaluation of Large Language Models in Simulated Clinical Encounters fo...

arXiv - AI · 3 min · 25 days ago

LLMs

[2509.21782] Benchmarking MLLM-based Web Understanding: Reasoning, Robustness and Safety

Abstract page for arXiv paper 2509.21782: Benchmarking MLLM-based Web Understanding: Reasoning, Robustness and Safety

arXiv - AI · 4 min · 25 days ago

Machine Learning

[2505.13033] TSPulse: Tiny Pre-Trained Models with Disentangled Representations for Rapid Time-Series Analysis

Abstract page for arXiv paper 2505.13033: TSPulse: Tiny Pre-Trained Models with Disentangled Representations for Rapid Time-Series Analysis

arXiv - AI · 4 min · 25 days ago

LLMs

[2502.01534] Preference Leakage: A Contamination Problem in LLM-as-a-judge

Abstract page for arXiv paper 2502.01534: Preference Leakage: A Contamination Problem in LLM-as-a-judge

arXiv - AI · 4 min · 25 days ago

AI Startups

[2412.06531] Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Abstract page for arXiv paper 2412.06531: Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

arXiv - AI · 4 min · 25 days ago

