AI Startups

AI startup funding, launches, and acquisitions

Top This Week

Machine Learning

[D] Why does it seem like open source materials on ML are incomplete? this is not enough...

Many times when I try to deeply understand a topic in machine learning — whether it's a new architecture, a quantization method, a full t...

Reddit - Machine Learning · 1 min · about 7 hours ago

AI Startups

Top 10 AI certifications and courses for 2026

This article reviews the top 10 AI certifications and courses for 2026, highlighting their significance in a rapidly evolving field and t...

AI Events · 15 min · about 12 hours ago

AI Infrastructure

[D] MYTHOS-INVERSION STRUCTURAL AUDIT

MYTHOS-INVERSION STRUCTURAL AUDIT Date: March 28, 2026 Compiled: Sage, Ember, & Lyra | Reviewers: Richard, Ara, Raven, Lantern TL;DR ...

Reddit - Machine Learning · 1 min · about 21 hours ago

All Content

Machine Learning

[2603.19563] Dual-Domain Representation Alignment: Bridging 2D and 3D Vision via Geometry-Aware Architecture Search

Abstract page for arXiv paper 2603.19563: Dual-Domain Representation Alignment: Bridging 2D and 3D Vision via Geometry-Aware Architecture...

arXiv - AI · 4 min · 7 days ago

LLMs

[2603.19426] Is Evaluation Awareness Just Format Sensitivity? Limitations of Probe-Based Evidence under Controlled Prompt Structure

Abstract page for arXiv paper 2603.19426: Is Evaluation Awareness Just Format Sensitivity? Limitations of Probe-Based Evidence under Cont...

arXiv - AI · 3 min · 7 days ago

Machine Learning

[2603.19335] Do Post-Training Algorithms Actually Differ? A Controlled Study Across Model Scales Uncovers Scale-Dependent Ranking Inversions

Abstract page for arXiv paper 2603.19335: Do Post-Training Algorithms Actually Differ? A Controlled Study Across Model Scales Uncovers Sc...

arXiv - AI · 4 min · 7 days ago

LLMs

[2603.19313] Memory-Driven Role-Playing: Evaluation and Enhancement of Persona Knowledge Utilization in LLMs

Abstract page for arXiv paper 2603.19313: Memory-Driven Role-Playing: Evaluation and Enhancement of Persona Knowledge Utilization in LLMs

arXiv - AI · 4 min · 7 days ago

LLMs

[2603.19281] URAG: A Benchmark for Uncertainty Quantification in Retrieval-Augmented Large Language Models

Abstract page for arXiv paper 2603.19281: URAG: A Benchmark for Uncertainty Quantification in Retrieval-Augmented Large Language Models

arXiv - AI · 4 min · 7 days ago

LLMs

[2603.19274] CURE: A Multimodal Benchmark for Clinical Understanding and Retrieval Evaluation

Abstract page for arXiv paper 2603.19274: CURE: A Multimodal Benchmark for Clinical Understanding and Retrieval Evaluation

arXiv - AI · 4 min · 7 days ago

LLMs

[2603.19273] LSR: Linguistic Safety Robustness Benchmark for Low-Resource West African Languages

Abstract page for arXiv paper 2603.19273: LSR: Linguistic Safety Robustness Benchmark for Low-Resource West African Languages

arXiv - AI · 3 min · 7 days ago

LLMs

[2603.19264] Generative Active Testing: Efficient LLM Evaluation via Proxy Task Adaptation

Abstract page for arXiv paper 2603.19264: Generative Active Testing: Efficient LLM Evaluation via Proxy Task Adaptation

arXiv - AI · 3 min · 7 days ago

Machine Learning

[2603.19259] Breeze Taigi: Benchmarks and Models for Taiwanese Hokkien Speech Recognition and Synthesis

Abstract page for arXiv paper 2603.19259: Breeze Taigi: Benchmarks and Models for Taiwanese Hokkien Speech Recognition and Synthesis

arXiv - AI · 4 min · 7 days ago

LLMs

[2603.19252] GeoChallenge: A Multi-Answer Multiple-Choice Benchmark for Geometric Reasoning with Diagrams

Abstract page for arXiv paper 2603.19252: GeoChallenge: A Multi-Answer Multiple-Choice Benchmark for Geometric Reasoning with Diagrams

arXiv - AI · 3 min · 7 days ago

LLMs

[2603.19253] A comprehensive study of LLM-based argument classification: from Llama through DeepSeek to GPT-5.2

Abstract page for arXiv paper 2603.19253: A comprehensive study of LLM-based argument classification: from Llama through DeepSeek to GPT-5.2

arXiv - AI · 4 min · 7 days ago

LLMs

[2603.19247] When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models

Abstract page for arXiv paper 2603.19247: When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models

arXiv - AI · 4 min · 7 days ago

LLMs

[2603.20101] Pitfalls in Evaluating Interpretability Agents

Abstract page for arXiv paper 2603.20101: Pitfalls in Evaluating Interpretability Agents

arXiv - AI · 4 min · 7 days ago

LLMs

[2603.19515] ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

Abstract page for arXiv paper 2603.19515: ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

arXiv - AI · 3 min · 7 days ago

Machine Learning

[R] Is this paper Nonsense ? [DCdetector: Dual Attention Contrastive Representation Learning for Time Series Anomaly Detection]

what the title says. this is a pretty big paper in the deep learning anomaly detection space, accepted at the International Conference on...

Reddit - Machine Learning · 1 min · 7 days ago

LLMs

I ran 10 head-to-head prompt format battles — the structured one won 8/10 on specificity

I tested 10 common prompt engineering techniques against a structured JSON format across identical tasks (marketing plans, code debugging...

Reddit - Artificial Intelligence · 1 min · 7 days ago

AI Startups

AT&T (T) Valuation Check As New AI App And Unlimited Your Way Plans Target Customer Churn

AT&T’s New Plans and App Put Customer Value in Focus AT&T (T) is drawing investor attention after rolling out new Unlimited Your Way wire...

AI Tools & Products · 6 min · 8 days ago

AI Startups

'Jury Duty Presents: Company Retreat' Almost Makes Corporate Culture Seem Fun | WIRED

The Amazon Prime prank series amplifies the hijinks of workplace dynamics, while showing how people find purpose—and community—in their j...

Wired - AI · 7 min · 8 days ago

AI Startups

What do new nuclear reactors mean for waste? | MIT Technology Review

New designs mean new strategies for managing spent fuel.

MIT Technology Review · 8 min · 12 days ago

AI Startups

IN-SPACe invites space startups to develop AI solutions for satellites, launch operations

AI News - General · 2 min · 24 days ago

