This AI startup envisions 100 Million New People Making Videogames
submitted by /u/sharkymcstevenson2
AI startup funding, launches, and acquisitions
Not a demo reel. Not a tutorial. A robot narrating its own experience: debugging, falling off shelves, questioning its identity. First-p...
With the midterms right around the corner, the new group is positioned to back candidates who support the AI company's policy agenda.
The paper presents MEMTS, a novel method for domain adaptation in time series forecasting that internalizes domain knowledge through a Kn...
The paper presents the ForesightSafety Bench, a comprehensive framework for evaluating AI safety risks, addressing limitations in current...
This paper introduces Head Entropy, a method for predicting answer correctness in large language models (LLMs) by analyzing attention ent...
The paper presents Neuromem, a framework for evaluating external memory modules in large language models (LLMs) under a dynamic streaming...
This article presents a novel zero-order optimization framework for fine-tuning large language models (LLMs) using learnable direction sa...
The paper introduces the Joint Time Series Chain (JTSC) concept, enhancing the detection of unusual evolving trends across interrupted or...
This paper examines benchmark data leakage in LLM-based recommendation systems, revealing how it can distort performance metrics and misl...
This article presents a comprehensive framework for evaluating smart contracts generated from natural language specifications, focusing o...
The paper introduces QuaRK, a novel quantum reservoir computing framework designed for efficient time series learning, emphasizing its em...
TrasMuon introduces a novel optimization technique that enhances the stability and efficiency of orthogonalized momentum optimizers, outp...
This paper presents a federated learning framework for understanding nonlinear temporal dynamics across decentralized systems, enhancing ...
This paper explores the PyCM library for evaluating multi-class classifiers, emphasizing the importance of diverse evaluation metrics in ...
This article introduces the Speed-up Factor, a new performance metric for evaluating multi-iteration active learning methods, demonstrati...
This article evaluates the performance of advanced machine learning architectures on the MNIST-1D dataset, demonstrating their effectiven...
DECKBench introduces a new evaluation framework for multi-agent systems focused on generating and editing academic slide decks, addressin...
TemporalBench introduces a benchmark for evaluating LLM-based agents on time series tasks, focusing on contextual and event-informed reas...
PlotChain introduces a deterministic benchmark for evaluating multimodal large language models (MLLMs) on engineering plot reading, focus...
VeRA introduces a framework for generating verified reasoning data at scale, enhancing AI evaluation by creating dynamic, executable benc...
The paper presents BotzoneBench, a scalable framework for evaluating Large Language Models (LLMs) using graded AI anchors, addressing the...
OpenAI has hired the creator of OpenClaw, an innovative open-source AI assistant that performs various tasks, marking a significant devel...