WHO/Europe launches Technical Advisory Group on Artificial Intelligence for Health
WHO/Europe has established the Technical Advisory Group on Artificial Intelligence for Health to ensure the ethical use of AI in health a...
AI startup funding, launches, and acquisitions
WHO/Europe has established the Technical Advisory Group on Artificial Intelligence for Health to ensure the ethical use of AI in health a...
Abstract page for arXiv paper 2603.10202: Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Ap...
Abstract page for arXiv paper 2511.09204: Resource-Efficient Variational Quantum Classifier
This paper presents a benchmark for evaluating positive-unlabeled (PU) learning algorithms, addressing inconsistencies in experimental se...
Aurora introduces a Multimodal Time Series Foundation Model that enhances cross-domain generalization in time series forecasting by integ...
This article introduces Conflict-Aware Fusion, a framework designed to address Logic Inertia in large language models (LLMs) by integrati...
This article presents a novel approach to predicting the virality of memes on Reddit using a multimodal dataset and advanced machine lear...
The paper presents a decision-theoretic framework for evaluating explanations in AI, emphasizing their role as information signals that i...
GraphOmni introduces a benchmark framework for evaluating large language models on graph-theoretic tasks, highlighting performance variab...
This article presents a comprehensive study on Graph Neural Networks (GNNs) for graph-level tasks, categorizing them into five types and ...
This paper explores methods to reduce biases in record matching through score calibration, proposing two model-agnostic post-processing t...
This article presents a framework for assessing the risks associated with using large language models (LLMs) in mental health support, hi...
This article presents a Normal Behavior Model (NBM) for forecasting monitoring data from the ASTRI-Horn telescope, demonstrating effectiv...
The paper presents MAS-FIRE, a framework for evaluating the reliability of LLM-based Multi-Agent Systems through fault injection, address...
SplitLight is an open-source toolkit designed to enhance the evaluation of recommender systems by providing measurable and comparable dat...
This article analyzes the dynamic relationship between forest loss and carbon emissions in the U.S. using a comprehensive dataset from 20...
This article presents a comprehensive analysis of agentic memory systems in large language models, highlighting their architectural frame...
Habilis-$β$ is a new on-device vision-language-action model that excels in fast-motion tasks, demonstrating superior performance in real-...
This paper evaluates the effectiveness of generative metrics in predicting the performance of YOLO object detection models across various...
The paper discusses the limitations of current agent caching methods in AI, proposing a new framework, W5H2, that improves efficiency and...
This pilot study explores the orchestration of LLM agents in scientific research, focusing on the generation and evaluation of multiple-c...
The paper presents a metacognitive framework for Large Language Models (LLMs) that enhances their reasoning capabilities by integrating p...
The paper presents MiSCHiEF, a benchmark for evaluating fine-grained image-caption alignment, focusing on safety and cultural contexts, h...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime