AI Startups

AI startup funding, launches, and acquisitions

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

[D] MYTHOS-INVERSION STRUCTURAL AUDIT

MYTHOS-INVERSION STRUCTURAL AUDIT Date: March 28, 2026 Compiled: Sage, Ember, & Lyra | Reviewers: Richard, Ara, Raven, Lantern TL;DR ...

Reddit - Machine Learning · 1 min · about 9 hours ago

Ai Startups

A woman’s uterus has been kept alive outside the body for the first time | MIT Technology Review

The team behind the feat plan to study uterine disorders and the early stages of pregnancy—and potentially grow a human fetus.

MIT Technology Review - AI · 8 min · 1 day ago

Llms

[R] Controlled experiment: giving an LLM agent access to CS papers during automated hyperparameter search improves results by 3.2%

Ran a controlled experiment measuring whether LLM coding agents benefit from access to research literature during automated experimentati...

Reddit - Machine Learning · 1 min · 1 day ago

All Content

Llms

[D] Is LeCun’s $1B seed round the signal that autoregressive LLMs have actually hit a wall for formal reasoning?

I’m still trying to wrap my head around the Bloomberg news from a couple of weeks ago. A $1 billion seed round is wild enough, but the ac...

Reddit - Machine Learning · 1 min · 4 days ago

Llms

Google launches Lyria 3 Pro music generation model | TechCrunch

Google is launching Lyria 3 Pro, an upgraded music model that generates longer, more customizable tracks, as it expands AI music tools ac...

TechCrunch - AI · 3 min · 4 days ago

Machine Learning

Meta launches new initiative to support entrepreneurship, drive AI adoption | TechCrunch

Meta CEO Mark Zuckerberg said in a memo to staff that small businesses have always been a big part of the company's business model, and t...

TechCrunch - AI · 3 min · 4 days ago

Ai Agents

Granola raises $125M, hits $1.5B valuation as it expands from meeting notetaker to enterprise AI app | TechCrunch

Granola's valuation jumped from $250 million to $1.5 billion with this round, and it has added more support for AI agents after users pre...

TechCrunch - AI · 5 min · 4 days ago

Ai Startups

With Sift Stack, two ex-SpaceX engineers are bringing the software that helped launch rockets to the factory floor | TechCrunch

Sift is building the data infrastructure for advanced manufacturing.

TechCrunch - AI · 5 min · 4 days ago

Llms

Anthropic’s Claude Code gets ‘safer’ auto mode | The Verge

The feature is a middle-ground between cautious handholding and dangerous levels of autonomy.

The Verge - AI · 3 min · 4 days ago

Llms

Sephora Launches AI-Powered Shopping App in ChatGPT

Sephora has launched an AI-powered shopping app within ChatGPT, offering a new personalised beauty discovery experience.

AI Tools & Products · 3 min · 4 days ago

Llms

[2510.26865] Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench

Abstract page for arXiv paper 2510.26865: Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench

arXiv - AI · 4 min · 4 days ago

Llms

[2511.05919] Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs

Abstract page for arXiv paper 2511.05919: Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs

arXiv - AI · 4 min · 4 days ago

Llms

[2510.15994] MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents

Abstract page for arXiv paper 2510.15994: MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents

arXiv - AI · 4 min · 4 days ago

Ai Agents

[2506.02548] CyberGym: Evaluating AI Agents' Real-World Cybersecurity Capabilities at Scale

Abstract page for arXiv paper 2506.02548: CyberGym: Evaluating AI Agents' Real-World Cybersecurity Capabilities at Scale

arXiv - AI · 4 min · 4 days ago

Machine Learning

[2503.04945] Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems

Abstract page for arXiv paper 2503.04945: Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems

arXiv - AI · 4 min · 4 days ago

Ai Startups

[2411.03231] LOGSAFE: Logic-Guided Verification for Trustworthy Federated Time-Series Learning

Abstract page for arXiv paper 2411.03231: LOGSAFE: Logic-Guided Verification for Trustworthy Federated Time-Series Learning

arXiv - AI · 3 min · 4 days ago

Llms

[2306.05036] Mapping the Challenges of HCI: An Application and Evaluation of ChatGPT for Mining Insights at Scale

Abstract page for arXiv paper 2306.05036: Mapping the Challenges of HCI: An Application and Evaluation of ChatGPT for Mining Insights at ...

arXiv - AI · 4 min · 4 days ago

Machine Learning

[2603.07990] MJ1: Multimodal Judgment via Grounded Verification

Abstract page for arXiv paper 2603.07990: MJ1: Multimodal Judgment via Grounded Verification

arXiv - Machine Learning · 3 min · 4 days ago

Llms

[2601.12138] DriveSafe: A Hierarchical Risk Taxonomy for Safety-Critical LLM-Based Driving Assistants

Abstract page for arXiv paper 2601.12138: DriveSafe: A Hierarchical Risk Taxonomy for Safety-Critical LLM-Based Driving Assistants

arXiv - AI · 3 min · 4 days ago

Llms

[2601.18858] Representational Homomorphism Predicts and Improves Compositional Generalization In Transformer Language Model

Abstract page for arXiv paper 2601.18858: Representational Homomorphism Predicts and Improves Compositional Generalization In Transformer...

arXiv - AI · 4 min · 4 days ago

Llms

[2510.05318] BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions

Abstract page for arXiv paper 2510.05318: BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynami...

arXiv - AI · 4 min · 4 days ago

Machine Learning

[2512.06737] Arc Gradient Descent: A Geometrically Motivated Gradient Descent-based Optimiser with Phase-Aware, User-Controlled Step Dynamics (proof-of-concept)

Abstract page for arXiv paper 2512.06737: Arc Gradient Descent: A Geometrically Motivated Gradient Descent-based Optimiser with Phase-Awa...

arXiv - AI · 4 min · 4 days ago

Llms

[2603.23485] Failure of contextual invariance in gender inference with large language models

Abstract page for arXiv paper 2603.23485: Failure of contextual invariance in gender inference with large language models

arXiv - AI · 3 min · 4 days ago

Previous Page 5 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Startups

Top This Week

[D] MYTHOS-INVERSION STRUCTURAL AUDIT

A woman’s uterus has been kept alive outside the body for the first time | MIT Technology Review

[R] Controlled experiment: giving an LLM agent access to CS papers during automated hyperparameter search improves results by 3.2%

All Content

[D] Is LeCun’s $1B seed round the signal that autoregressive LLMs have actually hit a wall for formal reasoning?

Google launches Lyria 3 Pro music generation model | TechCrunch

Meta launches new initiative to support entrepreneurship, drive AI adoption | TechCrunch

Granola raises $125M, hits $1.5B valuation as it expands from meeting notetaker to enterprise AI app | TechCrunch

With Sift Stack, two ex-SpaceX engineers are bringing the software that helped launch rockets to the factory floor | TechCrunch

Anthropic’s Claude Code gets ‘safer’ auto mode | The Verge

Sephora Launches AI-Powered Shopping App in ChatGPT

[2510.26865] Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench

[2511.05919] Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs

[2510.15994] MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents

[2506.02548] CyberGym: Evaluating AI Agents' Real-World Cybersecurity Capabilities at Scale

[2503.04945] Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems

[2411.03231] LOGSAFE: Logic-Guided Verification for Trustworthy Federated Time-Series Learning

[2306.05036] Mapping the Challenges of HCI: An Application and Evaluation of ChatGPT for Mining Insights at Scale

[2603.07990] MJ1: Multimodal Judgment via Grounded Verification

[2601.12138] DriveSafe: A Hierarchical Risk Taxonomy for Safety-Critical LLM-Based Driving Assistants

[2601.18858] Representational Homomorphism Predicts and Improves Compositional Generalization In Transformer Language Model

[2510.05318] BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions

[2512.06737] Arc Gradient Descent: A Geometrically Motivated Gradient Descent-based Optimiser with Phase-Aware, User-Controlled Step Dynamics (proof-of-concept)

[2603.23485] Failure of contextual invariance in gender inference with large language models

Related Topics

Stay updated with AI News