Microsoft takes on AI rivals with three new foundational models | TechCrunch
Six months after the group's formation, MAI has released models that can transcribe speech to text as well as generate audio and images.
ML algorithms, training, and inference
It's something between a vent and a learning exercise. I tried training an RWKV v6 model with my own code on my RTX 4050. I trained over 50k steps on batch...
Most AI products are still judged like answer machines. People ask whether the model is smart, fast, creative, cheap, or good at sounding...
Abstract page for arXiv paper 2510.12453: Time-Correlated Video Bridge Matching
Abstract page for arXiv paper 2510.06790: Get RICH or Die Scaling: Profitably Trading Inference Compute for Robustness
Abstract page for arXiv paper 2510.04900: Benchmarking M-LTSF: Frequency and Noise-Based Evaluation of Multivariate Long Time Series Fore...
Abstract page for arXiv paper 2603.25716: Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
Abstract page for arXiv paper 2509.15199: CausalPre: Scalable and Effective Data Pre-Processing for Causal Fairness
Abstract page for arXiv paper 2508.09223: Hierarchical Adaptive networks with Task vectors for Test-Time Adaptation
Abstract page for arXiv paper 2509.08617: Towards Interpretable Deep Neural Networks for Tabular Data
Abstract page for arXiv paper 2603.25697: The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase
Abstract page for arXiv paper 2507.19737: Predicting Human Mobility during Extreme Events via LLM-Enhanced Cross-City Learning
Abstract page for arXiv paper 2505.23004: QLIP: A Dynamic Quadtree Vision Prior Enhances MLLM Performance Without Retraining
Abstract page for arXiv paper 2603.25646: A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots
Abstract page for arXiv paper 2411.17501: The Limits of Inference Scaling Through Resampling
Abstract page for arXiv paper 2603.25613: Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verif...
Abstract page for arXiv paper 2410.21764: Adaptive Online Mirror Descent for Tchebycheff Scalarization in Multi-Objective Learning
Abstract page for arXiv paper 2603.25607: DeepFAN, a transformer-based deep learning model for human-artificial intelligence collaborativ...
Abstract page for arXiv paper 2408.05696: SMILES-Mamba: Chemical Mamba Foundation Models for Drug ADMET Prediction
Abstract page for arXiv paper 2403.04545: Branch Scaling Manifests as Implicit Architectural Regularization for Improving Generalization ...
Abstract page for arXiv paper 2401.12546: On Building Myopic MPC Policies using Supervised Learning
Abstract page for arXiv paper 2603.25740: Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving
Abstract page for arXiv paper 2603.25722: No Hard Negatives Required: Concept Centric Learning Leads to Compositionality without Degradin...