AI Startups

AI startup funding, launches, and acquisitions

Top This Week

Ai Startups

This AI startup envisions 100 Million New People Making Videogames

submitted by /u/sharkymcstevenson2 [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
Llms

A robot car with a Claude AI brain started a YouTube vlog about its own existence

Not a demo reel. Not a tutorial. A robot narrating its own experience — debugging, falling off shelves, questioning its identity. First-p...

Reddit - Artificial Intelligence · 1 min ·
Anthropic ramps up its political activities with a new PAC | TechCrunch
Ai Startups

Anthropic ramps up its political activities with a new PAC | TechCrunch

With the midterms right around the corner, the new group is positioned to back candidates who support the AI company's policy agenda.

TechCrunch - AI · 3 min ·

All Content

Robotics

At the India AI Impact Summit 2026, Galgotias University showcased a Unitree Go2 robot dog — a commercially available Chinese product — and presented it as an Indian breakthrough innovation.

At the India AI Impact Summit 2026, Galgotias University presented a Unitree Go2 robot dog, a Chinese product, as an Indian innovation, l...

Reddit - Artificial Intelligence · 1 min ·
Perplexity pivots away from ads as AI ad war heats up and OpenAI tests monetization | The Verge
Ai Startups

Perplexity pivots away from ads as AI ad war heats up and OpenAI tests monetization | The Verge

Perplexity, an AI search startup, is shifting away from advertising to focus on subscription models, citing concerns over user trust amid...

The Verge - AI · 4 min ·
Ai Startups

AI summit (19th feb)

User seeks to connect with others attending the AI Summit in Delhi on February 19th, highlighting a need for companionship at the event.

Reddit - Artificial Intelligence · 1 min ·
[2509.02594] OpenAIs HealthBench in Action: Evaluating an LLM-Based Medical Assistant on Realistic Clinical Queries
Llms

[2509.02594] OpenAIs HealthBench in Action: Evaluating an LLM-Based Medical Assistant on Realistic Clinical Queries

The article evaluates OpenAI's DR. INFO, an LLM-based medical assistant, using the HealthBench benchmark to assess performance on complex...

arXiv - AI · 4 min ·
[2502.18545] PII-Bench: Evaluating Query-Aware Privacy Protection Systems
Llms

[2502.18545] PII-Bench: Evaluating Query-Aware Privacy Protection Systems

The paper introduces PII-Bench, a novel framework for evaluating privacy protection systems in Large Language Models (LLMs), highlighting...

arXiv - AI · 3 min ·
[2602.08968] stable-worldmodel-v1: Reproducible World Modeling Research and Evaluation
Machine Learning

[2602.08968] stable-worldmodel-v1: Reproducible World Modeling Research and Evaluation

The paper introduces stable-worldmodel (SWM), a modular ecosystem for world modeling research that enhances reproducibility and standardi...

arXiv - AI · 3 min ·
[2602.02660] MARS: Modular Agent with Reflective Search for Automated AI Research
Llms

[2602.02660] MARS: Modular Agent with Reflective Search for Automated AI Research

The paper introduces MARS, a Modular Agent designed for automated AI research, emphasizing cost-aware planning and reflective memory to e...

arXiv - AI · 3 min ·
[2507.06134] OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety
Ai Infrastructure

[2507.06134] OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety

OpenAgentSafety introduces a modular framework for evaluating AI agent safety in real-world tasks, addressing critical vulnerabilities in...

arXiv - AI · 4 min ·
[2601.18608] PolySHAP: Extending KernelSHAP with Interaction-Informed Polynomial Regression
Machine Learning

[2601.18608] PolySHAP: Extending KernelSHAP with Interaction-Informed Polynomial Regression

The paper introduces PolySHAP, an extension of KernelSHAP that uses interaction-informed polynomial regression to improve the accuracy of...

arXiv - Machine Learning · 4 min ·
[2602.15809] Decision Quality Evaluation Framework at Pinterest
Llms

[2602.15809] Decision Quality Evaluation Framework at Pinterest

The article presents a Decision Quality Evaluation Framework developed at Pinterest to enhance content moderation by evaluating the quali...

arXiv - AI · 3 min ·
[2602.15698] How to Disclose? Strategic AI Disclosure in Crowdfunding
Ai Startups

[2602.15698] How to Disclose? Strategic AI Disclosure in Crowdfunding

The article examines the impact of strategic AI disclosure in crowdfunding, revealing that mandatory disclosure can significantly reduce ...

arXiv - AI · 4 min ·
[2502.17812] Can Multimodal LLMs Perform Time Series Anomaly Detection?
Llms

[2502.17812] Can Multimodal LLMs Perform Time Series Anomaly Detection?

The paper explores the potential of multimodal large language models (MLLMs) for time series anomaly detection (TSAD), introducing a new ...

arXiv - Machine Learning · 4 min ·
[2602.15376] A Unified Evaluation of Learning-Based Similarity Techniques for Malware Detection
Ai Startups

[2602.15376] A Unified Evaluation of Learning-Based Similarity Techniques for Malware Detection

This paper presents a systematic evaluation of learning-based similarity techniques for malware detection, comparing various methods unde...

arXiv - AI · 4 min ·
[2602.15373] Far Out: Evaluating Language Models on Slang in Australian and Indian English
Llms

[2602.15373] Far Out: Evaluating Language Models on Slang in Australian and Indian English

This paper evaluates the performance of language models on slang in Australian and Indian English, revealing significant gaps in understa...

arXiv - AI · 4 min ·
[2510.26792] Learning Pseudorandom Numbers with Transformers: Permuted Congruential Generators, Curricula, and Interpretability
Machine Learning

[2510.26792] Learning Pseudorandom Numbers with Transformers: Permuted Congruential Generators, Curricula, and Interpretability

This article explores how Transformer models can learn sequences generated by Permuted Congruential Generators (PCGs), demonstrating thei...

arXiv - Machine Learning · 4 min ·
[2509.20936] GenFacts-Generative Counterfactual Explanations for Multi-Variate Time Series
Machine Learning

[2509.20936] GenFacts-Generative Counterfactual Explanations for Multi-Variate Time Series

The paper introduces GenFacts, a generative framework for creating counterfactual explanations in multivariate time series, improving mod...

arXiv - Machine Learning · 3 min ·
[2507.01761] Enhanced Generative Model Evaluation with Clipped Density and Coverage
Machine Learning

[2507.01761] Enhanced Generative Model Evaluation with Clipped Density and Coverage

This article presents novel metrics, Clipped Density and Clipped Coverage, aimed at improving the evaluation of generative models by enha...

arXiv - AI · 4 min ·
[2412.20987] RobustBlack: Challenging Black-Box Adversarial Attacks on State-of-the-Art Defenses
Machine Learning

[2412.20987] RobustBlack: Challenging Black-Box Adversarial Attacks on State-of-the-Art Defenses

The paper 'RobustBlack' explores the effectiveness of black-box adversarial attacks against state-of-the-art defenses, revealing signific...

arXiv - Machine Learning · 3 min ·
[2602.15034] EduResearchBench: A Hierarchical Atomic Task Decomposition Benchmark for Full-Lifecycle Educational Research
Llms

[2602.15034] EduResearchBench: A Hierarchical Atomic Task Decomposition Benchmark for Full-Lifecycle Educational Research

EduResearchBench introduces a novel benchmark for evaluating educational research workflows using a Hierarchical Atomic Task Decompositio...

arXiv - AI · 4 min ·
[2602.15645] CARE Drive A Framework for Evaluating Reason-Responsiveness of Vision Language Models in Automated Driving
Llms

[2602.15645] CARE Drive A Framework for Evaluating Reason-Responsiveness of Vision Language Models in Automated Driving

The article presents CARE Drive, a framework for evaluating the reason-responsiveness of vision language models in automated driving, add...

arXiv - AI · 4 min ·
Previous Page 71 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime