AI Startups

AI startup funding, launches, and acquisitions

Top This Week

Google Launches Gemini Import Tools to Poach Users From Rival AI Apps
LLMs

Anyone looking to switch their AI assistant will find it surprisingly easy, as it only takes a few steps to move from A to B. This is not...

AI Tools & Products · 4 min ·
AI Startups

Could factories run faster and greener? How AI 'digital twins' reshape production

Researchers at Örebro University have developed a new production system that uses artificial intelligence (AI) to improve efficiency and ...

Reddit - Artificial Intelligence · 1 min ·
[2603.11687] SemBench: A Universal Semantic Framework for LLM Evaluation
LLMs

Abstract page for arXiv paper 2603.11687: SemBench: A Universal Semantic Framework for LLM Evaluation

arXiv - AI · 4 min ·

All Content

[2603.11413] Evaluation format, not model capability, drives triage failure in the assessment of consumer health AI
LLMs

Abstract page for arXiv paper 2603.11413: Evaluation format, not model capability, drives triage failure in the assessment of consumer he...

arXiv - AI · 4 min ·
[2510.10415] CQA-Eval: Designing Reliable Evaluations of Multi-paragraph Clinical QA under Resource Constraints
NLP

Abstract page for arXiv paper 2510.10415: CQA-Eval: Designing Reliable Evaluations of Multi-paragraph Clinical QA under Resource Constraints

arXiv - AI · 3 min ·
[2505.19046] When Models Don't Collapse: On the Consistency of Iterative MLE
Machine Learning

Abstract page for arXiv paper 2505.19046: When Models Don't Collapse: On the Consistency of Iterative MLE

arXiv - Machine Learning · 4 min ·
[2601.00428] Interpretable ML Under the Microscope: Performance, Meta-Features, and the Regression-Classification Predictability Gap
Machine Learning

Abstract page for arXiv paper 2601.00428: Interpretable ML Under the Microscope: Performance, Meta-Features, and the Regression-Classific...

arXiv - Machine Learning · 4 min ·
[2509.03345] Do Language Models Follow Occam's Razor? An Evaluation of Parsimony in Inductive and Abductive Reasoning
LLMs

Abstract page for arXiv paper 2509.03345: Do Language Models Follow Occam's Razor? An Evaluation of Parsimony in Inductive and Abductive ...

arXiv - AI · 4 min ·
[2512.10152] Rethinking Bivariate Causal Discovery Through the Lens of Exchangeability
Machine Learning

Abstract page for arXiv paper 2512.10152: Rethinking Bivariate Causal Discovery Through the Lens of Exchangeability

arXiv - Machine Learning · 4 min ·
[2510.06790] Get RICH or Die Scaling: Profitably Trading Inference Compute for Robustness
LLMs

Abstract page for arXiv paper 2510.06790: Get RICH or Die Scaling: Profitably Trading Inference Compute for Robustness

arXiv - Machine Learning · 4 min ·
[2510.04900] Benchmarking M-LTSF: Frequency and Noise-Based Evaluation of Multivariate Long Time Series Forecasting Models
Machine Learning

Abstract page for arXiv paper 2510.04900: Benchmarking M-LTSF: Frequency and Noise-Based Evaluation of Multivariate Long Time Series Fore...

arXiv - Machine Learning · 4 min ·
[2603.25333] Adaptive Chunking: Optimizing Chunking-Method Selection for RAG
NLP

Abstract page for arXiv paper 2603.25333: Adaptive Chunking: Optimizing Chunking-Method Selection for RAG

arXiv - AI · 4 min ·
[2603.25253] MolQuest: A Benchmark for Agentic Evaluation of Abductive Reasoning in Chemical Structure Elucidation
LLMs

Abstract page for arXiv paper 2603.25253: MolQuest: A Benchmark for Agentic Evaluation of Abductive Reasoning in Chemical Structure Eluci...

arXiv - AI · 4 min ·
[2603.25397] A Causal Framework for Evaluating ICU Discharge Strategies
Machine Learning

Abstract page for arXiv paper 2603.25397: A Causal Framework for Evaluating ICU Discharge Strategies

arXiv - Machine Learning · 3 min ·
[2603.25251] Does Explanation Correctness Matter? Linking Computational XAI Evaluation to Human Understanding
Machine Learning

Abstract page for arXiv paper 2603.25251: Does Explanation Correctness Matter? Linking Computational XAI Evaluation to Human Understanding

arXiv - Machine Learning · 4 min ·
[2603.25222] Translation or Recitation? Calibrating Evaluation Scores for Machine Translation of Extremely Low-Resource Languages
AI Startups

Abstract page for arXiv paper 2603.25222: Translation or Recitation? Calibrating Evaluation Scores for Machine Translation of Extremely L...

arXiv - Machine Learning · 4 min ·
[2603.25150] Goodness-of-pronunciation without phoneme time alignment
Machine Learning

Abstract page for arXiv paper 2603.25150: Goodness-of-pronunciation without phoneme time alignment

arXiv - Machine Learning · 3 min ·
[2603.25024] Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method
Machine Learning

Abstract page for arXiv paper 2603.25024: Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method

arXiv - Machine Learning · 3 min ·
[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory
LLMs

Abstract page for arXiv paper 2603.25112: Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

arXiv - AI · 4 min ·
[2603.24999] Efficient Detection of Bad Benchmark Items with Novel Scalability Coefficients
AI Startups

Abstract page for arXiv paper 2603.24999: Efficient Detection of Bad Benchmark Items with Novel Scalability Coefficients

arXiv - AI · 4 min ·