[D] Tested model routing on financial AI datasets — good savings and curious what benchmarks others use.
Ran a benchmark evaluating whether prompt complexity-based routing delivers meaningful savings. Used public HuggingFace datasets. Here's ...
Data analysis, statistics, and data engineering
Ran a benchmark evaluating whether prompt complexity-based routing delivers meaningful savings. Used public HuggingFace datasets. Here's ...
Zero Shot, a new venture capital fund with deep ties to OpenAI, is aiming to raise $100 million for its first fund. It has already writte...
Google's new offline-first dictation app uses Gemma AI models to take on the apps like Wispr Flow.
The paper presents CCAGNN, a novel Confounder-Aware causal GNN framework designed to improve predictions in graph causal classification b...
This paper presents a tighter lower bound on cumulative regret for Gaussian process bandits using a squared exponential kernel in a hyper...
The paper presents Causal Neighbourhood Learning (CNL-GNN), a novel framework for improving Graph Neural Networks (GNNs) by addressing sp...
This paper explores the limitations of attention-based regression models, particularly the phenomenon of the Pearson correlation coeffici...
The paper presents COMBA, a novel approach for learning large graphs using state space models, emphasizing cross batch aggregation and gr...
MantisV2 introduces advanced techniques for zero-shot time series classification, utilizing synthetic data and refined test-time strategi...
This article explores the application of machine learning to predict surgical outcomes in patients with chronic rhinosinusitis, demonstra...
The paper presents ADAPT, a hybrid method for optimizing prompts in LLM feature visualization, addressing challenges in local minima and ...
This article explores the use of transformer-based GANs for augmenting financial time series data, enhancing predictive accuracy in forec...
The paper introduces Neural Prior Estimator (NPE), a framework for learning class priors from latent representations, addressing class im...
The paper presents Iprox, a two-stage framework for gradient-based data selection in LLM fine-tuning, which constructs influence-preservi...
This paper introduces ruleXplain, a framework utilizing Large Language Models to extract causal rules from multivariate timeseries data, ...
The paper presents Grassmannian Mixture-of-Experts (GrMoE), a novel routing framework that enhances expert assignment in machine learning...
This paper introduces Stiefel-Bayes Adapters (SBA), a Bayesian framework for parameter-efficient fine-tuning of large language models, en...
This paper presents a novel framework for multi-material, multi-physics topology optimization using physics-informed Gaussian processes, ...
This paper explores the impact of target class selection on the compressibility of neural networks for avian monitoring using energy-auto...
The paper presents PaCoDi, a novel approach to time series generation using parallel complex diffusion, enhancing efficiency and quality ...
This article explores the effectiveness of identifying 'safety regions' in large language models (LLMs) by evaluating various methods acr...
The paper presents Duality Models (DuMo), a novel approach in generative modeling that enhances stability and efficiency by using a share...
This paper presents a probabilistic framework for forecasting NDVI from sparse satellite data and weather covariates, enhancing precision...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime