AI Startups

AI startup funding, launches, and acquisitions

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Ai Startups

This AI startup envisions 100 Million New People Making Videogames

submitted by /u/sharkymcstevenson2 [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 8 hours ago

Llms

A robot car with a Claude AI brain started a YouTube vlog about its own existence

Not a demo reel. Not a tutorial. A robot narrating its own experience — debugging, falling off shelves, questioning its identity. First-p...

Reddit - Artificial Intelligence · 1 min · about 11 hours ago

Ai Startups

Anthropic ramps up its political activities with a new PAC | TechCrunch

With the midterms right around the corner, the new group is positioned to back candidates who support the AI company's policy agenda.

TechCrunch - AI · 3 min · about 11 hours ago

All Content

Robotics

At the India AI Impact Summit 2026, Galgotias University showcased a Unitree Go2 robot dog — a commercially available Chinese product — and presented it as an Indian breakthrough innovation.

At the India AI Impact Summit 2026, Galgotias University presented a Unitree Go2 robot dog, a Chinese product, as an Indian innovation, l...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Ai Startups

Perplexity pivots away from ads as AI ad war heats up and OpenAI tests monetization | The Verge

Perplexity, an AI search startup, is shifting away from advertising to focus on subscription models, citing concerns over user trust amid...

The Verge - AI · 4 min · about 1 month ago

Ai Startups

AI summit (19th feb)

User seeks to connect with others attending the AI Summit in Delhi on February 19th, highlighting a need for companionship at the event.

Reddit - Artificial Intelligence · 1 min · about 2 months ago

Llms

[2509.02594] OpenAIs HealthBench in Action: Evaluating an LLM-Based Medical Assistant on Realistic Clinical Queries

The article evaluates OpenAI's DR. INFO, an LLM-based medical assistant, using the HealthBench benchmark to assess performance on complex...

arXiv - AI · 4 min · about 2 months ago

Llms

[2502.18545] PII-Bench: Evaluating Query-Aware Privacy Protection Systems

The paper introduces PII-Bench, a novel framework for evaluating privacy protection systems in Large Language Models (LLMs), highlighting...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.08968] stable-worldmodel-v1: Reproducible World Modeling Research and Evaluation

The paper introduces stable-worldmodel (SWM), a modular ecosystem for world modeling research that enhances reproducibility and standardi...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.02660] MARS: Modular Agent with Reflective Search for Automated AI Research

The paper introduces MARS, a Modular Agent designed for automated AI research, emphasizing cost-aware planning and reflective memory to e...

arXiv - AI · 3 min · about 2 months ago

Ai Infrastructure

[2507.06134] OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety

OpenAgentSafety introduces a modular framework for evaluating AI agent safety in real-world tasks, addressing critical vulnerabilities in...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2601.18608] PolySHAP: Extending KernelSHAP with Interaction-Informed Polynomial Regression

The paper introduces PolySHAP, an extension of KernelSHAP that uses interaction-informed polynomial regression to improve the accuracy of...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.15809] Decision Quality Evaluation Framework at Pinterest

The article presents a Decision Quality Evaluation Framework developed at Pinterest to enhance content moderation by evaluating the quali...

arXiv - AI · 3 min · about 2 months ago

Ai Startups

[2602.15698] How to Disclose? Strategic AI Disclosure in Crowdfunding

The article examines the impact of strategic AI disclosure in crowdfunding, revealing that mandatory disclosure can significantly reduce ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2502.17812] Can Multimodal LLMs Perform Time Series Anomaly Detection?

The paper explores the potential of multimodal large language models (MLLMs) for time series anomaly detection (TSAD), introducing a new ...

arXiv - Machine Learning · 4 min · about 2 months ago

Ai Startups

[2602.15376] A Unified Evaluation of Learning-Based Similarity Techniques for Malware Detection

This paper presents a systematic evaluation of learning-based similarity techniques for malware detection, comparing various methods unde...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.15373] Far Out: Evaluating Language Models on Slang in Australian and Indian English

This paper evaluates the performance of language models on slang in Australian and Indian English, revealing significant gaps in understa...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2510.26792] Learning Pseudorandom Numbers with Transformers: Permuted Congruential Generators, Curricula, and Interpretability

This article explores how Transformer models can learn sequences generated by Permuted Congruential Generators (PCGs), demonstrating thei...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2509.20936] GenFacts-Generative Counterfactual Explanations for Multi-Variate Time Series

The paper introduces GenFacts, a generative framework for creating counterfactual explanations in multivariate time series, improving mod...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2507.01761] Enhanced Generative Model Evaluation with Clipped Density and Coverage

This article presents novel metrics, Clipped Density and Clipped Coverage, aimed at improving the evaluation of generative models by enha...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2412.20987] RobustBlack: Challenging Black-Box Adversarial Attacks on State-of-the-Art Defenses

The paper 'RobustBlack' explores the effectiveness of black-box adversarial attacks against state-of-the-art defenses, revealing signific...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.15034] EduResearchBench: A Hierarchical Atomic Task Decomposition Benchmark for Full-Lifecycle Educational Research

EduResearchBench introduces a novel benchmark for evaluating educational research workflows using a Hierarchical Atomic Task Decompositio...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.15645] CARE Drive A Framework for Evaluating Reason-Responsiveness of Vision Language Models in Automated Driving

The article presents CARE Drive, a framework for evaluating the reason-responsiveness of vision language models in automated driving, add...

arXiv - AI · 4 min · about 2 months ago

Previous Page 71 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Startups

Top This Week

This AI startup envisions 100 Million New People Making Videogames

A robot car with a Claude AI brain started a YouTube vlog about its own existence

Anthropic ramps up its political activities with a new PAC | TechCrunch

All Content

At the India AI Impact Summit 2026, Galgotias University showcased a Unitree Go2 robot dog — a commercially available Chinese product — and presented it as an Indian breakthrough innovation.

Perplexity pivots away from ads as AI ad war heats up and OpenAI tests monetization | The Verge

AI summit (19th feb)

[2509.02594] OpenAIs HealthBench in Action: Evaluating an LLM-Based Medical Assistant on Realistic Clinical Queries

[2502.18545] PII-Bench: Evaluating Query-Aware Privacy Protection Systems

[2602.08968] stable-worldmodel-v1: Reproducible World Modeling Research and Evaluation

[2602.02660] MARS: Modular Agent with Reflective Search for Automated AI Research

[2507.06134] OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety

[2601.18608] PolySHAP: Extending KernelSHAP with Interaction-Informed Polynomial Regression

[2602.15809] Decision Quality Evaluation Framework at Pinterest

[2602.15698] How to Disclose? Strategic AI Disclosure in Crowdfunding

[2502.17812] Can Multimodal LLMs Perform Time Series Anomaly Detection?

[2602.15376] A Unified Evaluation of Learning-Based Similarity Techniques for Malware Detection

[2602.15373] Far Out: Evaluating Language Models on Slang in Australian and Indian English

[2510.26792] Learning Pseudorandom Numbers with Transformers: Permuted Congruential Generators, Curricula, and Interpretability

[2509.20936] GenFacts-Generative Counterfactual Explanations for Multi-Variate Time Series

[2507.01761] Enhanced Generative Model Evaluation with Clipped Density and Coverage

[2412.20987] RobustBlack: Challenging Black-Box Adversarial Attacks on State-of-the-Art Defenses

[2602.15034] EduResearchBench: A Hierarchical Atomic Task Decomposition Benchmark for Full-Lifecycle Educational Research

[2602.15645] CARE Drive A Framework for Evaluating Reason-Responsiveness of Vision Language Models in Automated Driving

Related Topics

Stay updated with AI News