Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

[P] Looking for people who have had training runs fail unexpectedly to beta test a stability monitor. Free, takes 5 minutes to add to your existing loop. DM me.

Anyone actively training models want to try a stability monitor on a real run? Trying to get real world validation outside my own benchma...

Reddit - Machine Learning · 1 min · 30 minutes ago

Llms

Is the Mirage Effect a bug, or is it Geometric Reconstruction in action? A framework for why VLMs perform better "hallucinating" than guessing, and what that may tell us about what's really inside these models

Last week, a team from Stanford and UCSF (Asadi, O'Sullivan, Fei-Fei Li, Euan Ashley et al.) dropped two companion papers. The first, MAR...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Machine Learning

Yupp shuts down after raising $33M from a16z crypto's Chris Dixon | TechCrunch

Less than a year after launching, with checks from some of the biggest names in Silicon Valley, crowdsourced AI model feedback startup Yu...

TechCrunch - AI · 4 min · about 4 hours ago

All Content

Machine Learning

[2510.13772] Tensor Gaussian Processes: Efficient Solvers for Nonlinear PDEs

Abstract page for arXiv paper 2510.13772: Tensor Gaussian Processes: Efficient Solvers for Nonlinear PDEs

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2603.25730] PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Abstract page for arXiv paper 2603.25730: PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

arXiv - AI · 4 min · 5 days ago

Machine Learning

[2510.12453] Time-Correlated Video Bridge Matching

Abstract page for arXiv paper 2510.12453: Time-Correlated Video Bridge Matching

arXiv - Machine Learning · 3 min · 5 days ago

Llms

[2510.06790] Get RICH or Die Scaling: Profitably Trading Inference Compute for Robustness

Abstract page for arXiv paper 2510.06790: Get RICH or Die Scaling: Profitably Trading Inference Compute for Robustness

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2510.04900] Benchmarking M-LTSF: Frequency and Noise-Based Evaluation of Multivariate Long Time Series Forecasting Models

Abstract page for arXiv paper 2510.04900: Benchmarking M-LTSF: Frequency and Noise-Based Evaluation of Multivariate Long Time Series Fore...

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2603.25716] Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Abstract page for arXiv paper 2603.25716: Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

arXiv - AI · 4 min · 5 days ago

Machine Learning

[2509.15199] CausalPre: Scalable and Effective Data Pre-Processing for Causal Fairness

Abstract page for arXiv paper 2509.15199: CausalPre: Scalable and Effective Data Pre-Processing for Causal Fairness

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2508.09223] Hierarchical Adaptive networks with Task vectors for Test-Time Adaptation

Abstract page for arXiv paper 2508.09223: Hierarchical Adaptive networks with Task vectors for Test-Time Adaptation

arXiv - AI · 4 min · 5 days ago

Machine Learning

[2509.08617] Towards Interpretable Deep Neural Networks for Tabular Data

Abstract page for arXiv paper 2509.08617: Towards Interpretable Deep Neural Networks for Tabular Data

arXiv - Machine Learning · 3 min · 5 days ago

Llms

[2603.25697] The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase

Abstract page for arXiv paper 2603.25697: The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase

arXiv - AI · 3 min · 5 days ago

Llms

[2507.19737] Predicting Human Mobility during Extreme Events via LLM-Enhanced Cross-City Learning

Abstract page for arXiv paper 2507.19737: Predicting Human Mobility during Extreme Events via LLM-Enhanced Cross-City Learning

arXiv - AI · 4 min · 5 days ago

Llms

[2505.23004] QLIP: A Dynamic Quadtree Vision Prior Enhances MLLM Performance Without Retraining

Abstract page for arXiv paper 2505.23004: QLIP: A Dynamic Quadtree Vision Prior Enhances MLLM Performance Without Retraining

arXiv - Machine Learning · 4 min · 5 days ago

Llms

[2603.25646] A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots

Abstract page for arXiv paper 2603.25646: A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots

arXiv - AI · 3 min · 5 days ago

Machine Learning

[2411.17501] The Limits of Inference Scaling Through Resampling

Abstract page for arXiv paper 2411.17501: The Limits of Inference Scaling Through Resampling

arXiv - AI · 4 min · 5 days ago

Llms

[2603.25613] Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verification

Abstract page for arXiv paper 2603.25613: Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verif...

arXiv - AI · 4 min · 5 days ago

Machine Learning

[2410.21764] Adaptive Online Mirror Descent for Tchebycheff Scalarization in Multi-Objective Learning

Abstract page for arXiv paper 2410.21764: Adaptive Online Mirror Descent for Tchebycheff Scalarization in Multi-Objective Learning

arXiv - AI · 4 min · 5 days ago

Machine Learning

[2603.25607] DeepFAN, a transformer-based deep learning model for human-artificial intelligence collaborative assessment of incidental pulmonary nodules in CT scans: a multi-reader, multi-case trial

Abstract page for arXiv paper 2603.25607: DeepFAN, a transformer-based deep learning model for human-artificial intelligence collaborativ...

arXiv - AI · 4 min · 5 days ago

Llms

[2408.05696] SMILES-Mamba: Chemical Mamba Foundation Models for Drug ADMET Prediction

Abstract page for arXiv paper 2408.05696: SMILES-Mamba: Chemical Mamba Foundation Models for Drug ADMET Prediction

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2403.04545] Branch Scaling Manifests as Implicit Architectural Regularization for Improving Generalization in Overparameterized ResNets

Abstract page for arXiv paper 2403.04545: Branch Scaling Manifests as Implicit Architectural Regularization for Improving Generalization ...

arXiv - Machine Learning · 3 min · 5 days ago

Machine Learning

[2401.12546] On Building Myopic MPC Policies using Supervised Learning

Abstract page for arXiv paper 2401.12546: On Building Myopic MPC Policies using Supervised Learning

arXiv - Machine Learning · 4 min · 5 days ago

Previous Page 48 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

[P] Looking for people who have had training runs fail unexpectedly to beta test a stability monitor. Free, takes 5 minutes to add to your existing loop. DM me.

Is the Mirage Effect a bug, or is it Geometric Reconstruction in action? A framework for why VLMs perform better "hallucinating" than guessing, and what that may tell us about what's really inside these models

Yupp shuts down after raising $33M from a16z crypto's Chris Dixon | TechCrunch

All Content

[2510.13772] Tensor Gaussian Processes: Efficient Solvers for Nonlinear PDEs

[2603.25730] PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

[2510.12453] Time-Correlated Video Bridge Matching

[2510.06790] Get RICH or Die Scaling: Profitably Trading Inference Compute for Robustness

[2510.04900] Benchmarking M-LTSF: Frequency and Noise-Based Evaluation of Multivariate Long Time Series Forecasting Models

[2603.25716] Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

[2509.15199] CausalPre: Scalable and Effective Data Pre-Processing for Causal Fairness

[2508.09223] Hierarchical Adaptive networks with Task vectors for Test-Time Adaptation

[2509.08617] Towards Interpretable Deep Neural Networks for Tabular Data

[2603.25697] The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase

[2507.19737] Predicting Human Mobility during Extreme Events via LLM-Enhanced Cross-City Learning

[2505.23004] QLIP: A Dynamic Quadtree Vision Prior Enhances MLLM Performance Without Retraining

[2603.25646] A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots

[2411.17501] The Limits of Inference Scaling Through Resampling

[2603.25613] Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verification

[2410.21764] Adaptive Online Mirror Descent for Tchebycheff Scalarization in Multi-Objective Learning

[2603.25607] DeepFAN, a transformer-based deep learning model for human-artificial intelligence collaborative assessment of incidental pulmonary nodules in CT scans: a multi-reader, multi-case trial

[2408.05696] SMILES-Mamba: Chemical Mamba Foundation Models for Drug ADMET Prediction

[2403.04545] Branch Scaling Manifests as Implicit Architectural Regularization for Improving Generalization in Overparameterized ResNets

[2401.12546] On Building Myopic MPC Policies using Supervised Learning

Related Topics

Stay updated with AI News