AI Startups

AI startup funding, launches, and acquisitions

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Are LLMs a Dead End? (Investors Just Bet $1 Billion on “Yes”)

| AI Reality Check | Cal Newport Chapters 0:00 What is Yan LeCun Up To? 14:55 How is it possible that LeCun could be right about LLM’s be...

Reddit - Artificial Intelligence · 1 min · 37 minutes ago

Ai Startups

20+ Best AI Project Ideas for 2026: Trending AI Projects

This article presents over 20 AI project ideas tailored for various skill levels, providing a roadmap for building portfolio-ready projec...

AI Events · about 1 hour ago

Ai Startups

Top 10 AI certifications and courses for 2026

This article reviews the top 10 AI certifications and courses for 2026, highlighting their significance in a rapidly evolving field and t...

AI Events · 15 min · about 1 hour ago

All Content

Llms

[2602.20528] Stop-Think-AutoRegress: Language Modeling with Latent Diffusion Planning

The paper presents STAR-LDM, a novel language model that integrates latent diffusion planning with autoregressive generation, enhancing n...

arXiv - Machine Learning · 3 min · about 1 month ago

Ai Startups

[2602.21043] T1: One-to-One Channel-Head Binding for Multivariate Time-Series Imputation

The paper presents T1, a CNN-Transformer hybrid model for robust multivariate time-series imputation, achieving state-of-the-art performa...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.20671] Bikelution: Federated Gradient-Boosting for Scalable Shared Micro-Mobility Demand Forecasting

The paper presents Bikelution, a federated learning approach for predicting demand in shared micro-mobility systems, addressing privacy c...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.20629] QEDBENCH: Quantifying the Alignment Gap in Automated Evaluation of University-Level Mathematical Proofs

The paper presents QEDBench, a benchmark for evaluating the alignment of automated systems in assessing university-level mathematical pro...

arXiv - Machine Learning · 4 min · about 1 month ago

Ai Safety

[2602.20468] CGSTA: Cross-Scale Graph Contrast with Stability-Aware Alignment for Multivariate Time-Series Anomaly Detection

The CGSTA framework enhances multivariate time-series anomaly detection by utilizing dynamic layered graphs and stability-aware alignment...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.20307] In-context Pre-trained Time-Series Foundation Models adapt to Unseen Tasks

This paper presents In-Context Time-series Pre-training (ICTP), a framework that enhances time-series foundation models (TSFMs) with in-c...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2601.03868] What Matters For Safety Alignment?

This paper investigates safety alignment in large language models (LLMs) and large reasoning models (LRMs), identifying key factors that ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2510.22500] Towards Scalable Oversight via Partitioned Human Supervision

The paper proposes a scalable oversight framework for AI systems using partitioned human supervision, addressing challenges in obtaining ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2508.03250] RooseBERT: A New Deal For Political Language Modelling

RooseBERT introduces a specialized language model for political discourse, enhancing the analysis of political debates through improved s...

arXiv - AI · 4 min · about 1 month ago

Ai Startups

[2506.08660] Towards Robust Real-World Multivariate Time Series Forecasting: A Unified Framework for Dependency, Asynchrony, and Missingness

This article presents a novel framework, ChannelTokenFormer, for robust multivariate time series forecasting, addressing challenges of de...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.12462] Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems

This article evaluates biases in Large Language Models (LLMs) used as judges in communication systems, assessing their reliability and pr...

arXiv - AI · 4 min · about 1 month ago

Llms

[2509.25609] A Framework for Studying AI Agent Behavior: Evidence from Consumer Choice Experiments

This article presents a framework for evaluating AI agent behavior through consumer choice experiments, highlighting biases in decision-m...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.21054] VAUQ: Vision-Aware Uncertainty Quantification for LVLM Self-Evaluation

The paper introduces VAUQ, a framework for vision-aware uncertainty quantification in large vision-language models (LVLMs), enhancing sel...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.20945] The Art of Efficient Reasoning: Data, Reward, and Optimization

This article explores efficient reasoning in Large Language Models (LLMs), focusing on optimizing computational resources through reward ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.20720] AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs

The paper presents AdapTools, a novel framework for adaptive indirect prompt injection attacks on agentic large language models (LLMs), h...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.20379] Case-Aware LLM-as-a-Judge Evaluation for Enterprise-Scale RAG Systems

The paper presents a case-aware evaluation framework for enterprise-scale Retrieval-Augmented Generation (RAG) systems, addressing the li...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.20294] InterviewSim: A Scalable Framework for Interview-Grounded Personality Simulation

The paper presents InterviewSim, a framework for simulating personalities using large language models grounded in real interview data, en...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.20224] Exploring Anti-Aging Literature via ConvexTopics and Large Language Models

This article presents a novel clustering algorithm for analyzing anti-aging literature, improving topic modeling through convex optimizat...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.20213] CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions

CodeHacker is an automated framework designed to generate test cases that identify vulnerabilities in competitive programming solutions, ...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.20193] When Backdoors Go Beyond Triggers: Semantic Drift in Diffusion Models Under Encoder Attacks

This paper investigates the impact of encoder-side poisoning on text-to-image models, revealing that traditional evaluations of backdoor ...

arXiv - AI · 3 min · about 1 month ago

Previous Page 47 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Startups

Top This Week

Are LLMs a Dead End? (Investors Just Bet $1 Billion on “Yes”)

20+ Best AI Project Ideas for 2026: Trending AI Projects

Top 10 AI certifications and courses for 2026

All Content

[2602.20528] Stop-Think-AutoRegress: Language Modeling with Latent Diffusion Planning

[2602.21043] T1: One-to-One Channel-Head Binding for Multivariate Time-Series Imputation

[2602.20671] Bikelution: Federated Gradient-Boosting for Scalable Shared Micro-Mobility Demand Forecasting

[2602.20629] QEDBENCH: Quantifying the Alignment Gap in Automated Evaluation of University-Level Mathematical Proofs

[2602.20468] CGSTA: Cross-Scale Graph Contrast with Stability-Aware Alignment for Multivariate Time-Series Anomaly Detection

[2602.20307] In-context Pre-trained Time-Series Foundation Models adapt to Unseen Tasks

[2601.03868] What Matters For Safety Alignment?

[2510.22500] Towards Scalable Oversight via Partitioned Human Supervision

[2508.03250] RooseBERT: A New Deal For Political Language Modelling

[2506.08660] Towards Robust Real-World Multivariate Time Series Forecasting: A Unified Framework for Dependency, Asynchrony, and Missingness

[2510.12462] Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems

[2509.25609] A Framework for Studying AI Agent Behavior: Evidence from Consumer Choice Experiments

[2602.21054] VAUQ: Vision-Aware Uncertainty Quantification for LVLM Self-Evaluation

[2602.20945] The Art of Efficient Reasoning: Data, Reward, and Optimization

[2602.20720] AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs

[2602.20379] Case-Aware LLM-as-a-Judge Evaluation for Enterprise-Scale RAG Systems

[2602.20294] InterviewSim: A Scalable Framework for Interview-Grounded Personality Simulation

[2602.20224] Exploring Anti-Aging Literature via ConvexTopics and Large Language Models

[2602.20213] CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions

[2602.20193] When Backdoors Go Beyond Triggers: Semantic Drift in Diffusion Models Under Encoder Attacks

Related Topics

Stay updated with AI News