AI Startups

AI startup funding, launches, and acquisitions

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Ai Startups

This AI startup envisions 100 Million New People Making Videogames

submitted by /u/sharkymcstevenson2 [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 19 hours ago

Llms

A robot car with a Claude AI brain started a YouTube vlog about its own existence

Not a demo reel. Not a tutorial. A robot narrating its own experience — debugging, falling off shelves, questioning its identity. First-p...

Reddit - Artificial Intelligence · 1 min · about 21 hours ago

Ai Startups

Anthropic ramps up its political activities with a new PAC | TechCrunch

With the midterms right around the corner, the new group is positioned to back candidates who support the AI company's policy agenda.

TechCrunch - AI · 3 min · about 21 hours ago

All Content

Llms

[2602.13576] Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges

The paper identifies a vulnerability in large language model (LLM) evaluation processes, termed Rubric-Induced Preference Drift (RIPD), w...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13504] From Perceptions To Evidence: Detecting AI-Generated Content In Turkish News Media With A Fine-Tuned Bert Classifier

This study presents a fine-tuned BERT classifier for detecting AI-generated content in Turkish news media, achieving a high F1 score and ...

arXiv - AI · 4 min · about 2 months ago

Nlp

[2602.14914] Additive Control Variates Dominate Self-Normalisation in Off-Policy Evaluation

This paper presents a theoretical analysis demonstrating that additive control variates outperform self-normalisation techniques in off-p...

arXiv - Machine Learning · 3 min · about 2 months ago

Nlp

[2602.14855] A Pragmatic Method for Comparing Clusterings with Overlaps and Outliers

This paper presents a new method for comparing clustering results that accommodates overlaps and outliers, addressing a gap in existing e...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.13376] An Online Reference-Free Evaluation Framework for Flowchart Image-to-Code Generation

This article presents a novel reference-free evaluation framework for assessing the quality of flowchart image-to-code generation, utiliz...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.13349] From Prompt to Production:Automating Brand-Safe Marketing Imagery with Text-to-Image Models

This paper discusses a new automated pipeline for generating brand-safe marketing imagery using text-to-image models, balancing automatio...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.14571] DCTracks: An Open Dataset for Machine Learning-Based Drift Chamber Track Reconstruction

The article presents DCTracks, a new open dataset designed for machine learning-based track reconstruction in drift chambers, featuring s...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.13312] PeroMAS: A Multi-agent System of Perovskite Material Discovery

PeroMAS introduces a multi-agent system for discovering perovskite materials, enhancing efficiency in photovoltaic research through a com...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.13244] Responsible AI in Business

The paper discusses the concept of Responsible AI in business, focusing on its implementation in small and medium-sized enterprises. It c...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13243] Judging the Judges: Human Validation of Multi-LLM Evaluation for High-Quality K--12 Science Instructional Materials

This study evaluates AI-generated assessments of K-12 science instructional materials, comparing them with expert reviews to enhance futu...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.14233] Evaluating LLMs in Finance Requires Explicit Bias Consideration

This paper discusses the need for explicit bias consideration in evaluating Large Language Models (LLMs) used in finance, identifying fiv...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.14200] TS-Haystack: A Multi-Scale Retrieval Benchmark for Time Series Language Models

The paper introduces TS-Haystack, a benchmark for evaluating Time Series Language Models (TSLMs) on long-context retrieval tasks, address...

arXiv - Machine Learning · 4 min · about 2 months ago

Ai Infrastructure

[2602.15019] Hunt Globally: Deep Research AI Agents for Drug Asset Scouting in Investing, Business Development, and Search & Evaluation

The paper discusses the development of a Deep Research AI agent, Bioptic Agent, designed for drug asset scouting, particularly in non-U.S...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.14161] When Benchmarks Lie: Evaluating Malicious Prompt Classifiers Under True Distribution Shift

This paper evaluates the effectiveness of malicious prompt classifiers under true distribution shifts, revealing significant performance ...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.14795] Return of the Schema: Building Complete Datasets for Machine Learning and Reasoning on Knowledge Graphs

This paper presents a novel resource for building complete datasets that integrate schema and ground facts for machine learning and reaso...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.14721] WebWorld: A Large-Scale World Model for Web Agent Training

WebWorld introduces a large-scale simulator for training web agents, utilizing over 1 million open-web interactions to enhance generaliza...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.14024] EIDOS: Latent-Space Predictive Learning for Time Series Foundation Models

The paper introduces EIDOS, a novel approach to time series modeling that focuses on latent-space predictive learning, enhancing the stru...

arXiv - AI · 3 min · about 2 months ago

Robotics

[2602.14691] Removing Planner Bias in Goal Recognition Through Multi-Plan Dataset Generation

This paper presents a method to eliminate planner bias in goal recognition using multi-plan dataset generation, enhancing the evaluation ...

arXiv - AI · 3 min · about 2 months ago

Ai Agents

[2602.13807] AnomaMind: Agentic Time Series Anomaly Detection with Tool-Augmented Reasoning

AnomaMind presents a novel framework for time series anomaly detection, enhancing traditional methods by incorporating tool-augmented rea...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.13802] Cast-R1: Learning Tool-Augmented Sequential Decision Policies for Time Series Forecasting

The paper presents Cast-R1, a novel framework for time series forecasting that reformulates the problem as a sequential decision-making t...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 77 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Startups

Top This Week

This AI startup envisions 100 Million New People Making Videogames

A robot car with a Claude AI brain started a YouTube vlog about its own existence

Anthropic ramps up its political activities with a new PAC | TechCrunch

All Content

[2602.13576] Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges

[2602.13504] From Perceptions To Evidence: Detecting AI-Generated Content In Turkish News Media With A Fine-Tuned Bert Classifier

[2602.14914] Additive Control Variates Dominate Self-Normalisation in Off-Policy Evaluation

[2602.14855] A Pragmatic Method for Comparing Clusterings with Overlaps and Outliers

[2602.13376] An Online Reference-Free Evaluation Framework for Flowchart Image-to-Code Generation

[2602.13349] From Prompt to Production:Automating Brand-Safe Marketing Imagery with Text-to-Image Models

[2602.14571] DCTracks: An Open Dataset for Machine Learning-Based Drift Chamber Track Reconstruction

[2602.13312] PeroMAS: A Multi-agent System of Perovskite Material Discovery

[2602.13244] Responsible AI in Business

[2602.13243] Judging the Judges: Human Validation of Multi-LLM Evaluation for High-Quality K--12 Science Instructional Materials

[2602.14233] Evaluating LLMs in Finance Requires Explicit Bias Consideration

[2602.14200] TS-Haystack: A Multi-Scale Retrieval Benchmark for Time Series Language Models

[2602.15019] Hunt Globally: Deep Research AI Agents for Drug Asset Scouting in Investing, Business Development, and Search & Evaluation

[2602.14161] When Benchmarks Lie: Evaluating Malicious Prompt Classifiers Under True Distribution Shift

[2602.14795] Return of the Schema: Building Complete Datasets for Machine Learning and Reasoning on Knowledge Graphs

[2602.14721] WebWorld: A Large-Scale World Model for Web Agent Training

[2602.14024] EIDOS: Latent-Space Predictive Learning for Time Series Foundation Models

[2602.14691] Removing Planner Bias in Goal Recognition Through Multi-Plan Dataset Generation

[2602.13807] AnomaMind: Agentic Time Series Anomaly Detection with Tool-Augmented Reasoning

[2602.13802] Cast-R1: Learning Tool-Augmented Sequential Decision Policies for Time Series Forecasting

Related Topics

Stay updated with AI News