Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[P] Gemma 4 running on NVIDIA B200 and AMD MI355X from the same inference stack, 15% throughput gain over vLLM on Blackwell

Google DeepMind dropped Gemma 4 today: Gemma 4 31B: dense, 256K context, redesigned architecture targeting efficiency and long-context qu...

Reddit - Machine Learning · 1 min · 41 minutes ago

Machine Learning

Google releases Gemma 4 models.

submitted by /u/jferments [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

Google has published its new open-weight model Gemma 4. And made it commercially available under Apache 2.0 License

The model is also available here: 🤗 HuggingFace: https://huggingface.co/collections/google/gemma-4 🦙 Ollama: https://ollama.com/library/g...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

All Content

Machine Learning

[2401.12546] On Building Myopic MPC Policies using Supervised Learning

Abstract page for arXiv paper 2401.12546: On Building Myopic MPC Policies using Supervised Learning

arXiv - Machine Learning · 4 min · 7 days ago

Machine Learning

[2603.25740] Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving

Abstract page for arXiv paper 2603.25740: Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving

arXiv - AI · 4 min · 7 days ago

Machine Learning

[2603.25722] No Hard Negatives Required: Concept Centric Learning Leads to Compositionality without Degrading Zero-shot Capabilities of Contrastive Models

Abstract page for arXiv paper 2603.25722: No Hard Negatives Required: Concept Centric Learning Leads to Compositionality without Degradin...

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2603.25638] Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers

Abstract page for arXiv paper 2603.25638: Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers

arXiv - AI · 3 min · 7 days ago

Machine Learning

[2603.25462] Temporally Decoupled Diffusion Planning for Autonomous Driving

Abstract page for arXiv paper 2603.25462: Temporally Decoupled Diffusion Planning for Autonomous Driving

arXiv - AI · 3 min · 7 days ago

Machine Learning

[2603.25629] LanteRn: Latent Visual Structured Reasoning

Abstract page for arXiv paper 2603.25629: LanteRn: Latent Visual Structured Reasoning

arXiv - Machine Learning · 3 min · 7 days ago

Machine Learning

[2603.25423] From Manipulation to Mistrust: Explaining Diverse Micro-Video Misinformation for Robust Debunking in the Wild

Abstract page for arXiv paper 2603.25423: From Manipulation to Mistrust: Explaining Diverse Micro-Video Misinformation for Robust Debunki...

arXiv - AI · 4 min · 7 days ago

Machine Learning

[2603.25622] The Geometry of Efficient Nonconvex Sampling

Abstract page for arXiv paper 2603.25622: The Geometry of Efficient Nonconvex Sampling

arXiv - Machine Learning · 3 min · 7 days ago

Machine Learning

[2603.25579] The Rules-and-Facts Model for Simultaneous Generalization and Memorization in Neural Networks

Abstract page for arXiv paper 2603.25579: The Rules-and-Facts Model for Simultaneous Generalization and Memorization in Neural Networks

arXiv - Machine Learning · 4 min · 7 days ago

Machine Learning

[2603.25573] Hierarchy-Guided Multimodal Representation Learning for Taxonomic Inference

Abstract page for arXiv paper 2603.25573: Hierarchy-Guided Multimodal Representation Learning for Taxonomic Inference

arXiv - Machine Learning · 4 min · 7 days ago

Machine Learning

[2603.25535] Insights on back marking for the automated identification of animals

Abstract page for arXiv paper 2603.25535: Insights on back marking for the automated identification of animals

arXiv - Machine Learning · 4 min · 7 days ago

Machine Learning

[2603.25366] Integrating Deep RL and Bayesian Inference for ObjectNav in Mobile Robotics

Abstract page for arXiv paper 2603.25366: Integrating Deep RL and Bayesian Inference for ObjectNav in Mobile Robotics

arXiv - AI · 3 min · 7 days ago

Machine Learning

[2603.25517] NERO-Net: A Neuroevolutionary Approach for the Design of Adversarially Robust CNNs

Abstract page for arXiv paper 2603.25517: NERO-Net: A Neuroevolutionary Approach for the Design of Adversarially Robust CNNs

arXiv - AI · 4 min · 7 days ago

Machine Learning

[2603.25509] Conformal Prediction for Nonparametric Instrumental Regression

Abstract page for arXiv paper 2603.25509: Conformal Prediction for Nonparametric Instrumental Regression

arXiv - Machine Learning · 3 min · 7 days ago

Machine Learning

[2603.25507] Lightweight GenAI for Network Traffic Synthesis: Fidelity, Augmentation, and Classification

Abstract page for arXiv paper 2603.25507: Lightweight GenAI for Network Traffic Synthesis: Fidelity, Augmentation, and Classification

arXiv - AI · 4 min · 7 days ago

Llms

[2603.25322] AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer's Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study

Abstract page for arXiv paper 2603.25322: AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer's Disease D...

arXiv - AI · 4 min · 7 days ago

Machine Learning

[2603.25466] Residual-as-Teacher: Mitigating Bias Propagation in Student--Teacher Estimation

Abstract page for arXiv paper 2603.25466: Residual-as-Teacher: Mitigating Bias Propagation in Student--Teacher Estimation

arXiv - Machine Learning · 3 min · 7 days ago

Machine Learning

[2603.25289] Revealing the influence of participant failures on model quality in cross-silo Federated Learning

Abstract page for arXiv paper 2603.25289: Revealing the influence of participant failures on model quality in cross-silo Federated Learning

arXiv - AI · 4 min · 7 days ago

Machine Learning

[2603.25440] The Symmetric Perceptron: a Teacher-Student Scenario

Abstract page for arXiv paper 2603.25440: The Symmetric Perceptron: a Teacher-Student Scenario

arXiv - Machine Learning · 4 min · 7 days ago

Machine Learning

[2603.25414] Decidable By Construction: Design-Time Verification for Trustworthy AI

Abstract page for arXiv paper 2603.25414: Decidable By Construction: Design-Time Verification for Trustworthy AI

arXiv - AI · 4 min · 7 days ago

Previous Page 75 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

[P] Gemma 4 running on NVIDIA B200 and AMD MI355X from the same inference stack, 15% throughput gain over vLLM on Blackwell

Google releases Gemma 4 models.

Google has published its new open-weight model Gemma 4. And made it commercially available under Apache 2.0 License

All Content

[2401.12546] On Building Myopic MPC Policies using Supervised Learning

[2603.25740] Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving

[2603.25722] No Hard Negatives Required: Concept Centric Learning Leads to Compositionality without Degrading Zero-shot Capabilities of Contrastive Models

[2603.25638] Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers

[2603.25462] Temporally Decoupled Diffusion Planning for Autonomous Driving

[2603.25629] LanteRn: Latent Visual Structured Reasoning

[2603.25423] From Manipulation to Mistrust: Explaining Diverse Micro-Video Misinformation for Robust Debunking in the Wild

[2603.25622] The Geometry of Efficient Nonconvex Sampling

[2603.25579] The Rules-and-Facts Model for Simultaneous Generalization and Memorization in Neural Networks

[2603.25573] Hierarchy-Guided Multimodal Representation Learning for Taxonomic Inference

[2603.25535] Insights on back marking for the automated identification of animals

[2603.25366] Integrating Deep RL and Bayesian Inference for ObjectNav in Mobile Robotics

[2603.25517] NERO-Net: A Neuroevolutionary Approach for the Design of Adversarially Robust CNNs

[2603.25509] Conformal Prediction for Nonparametric Instrumental Regression

[2603.25507] Lightweight GenAI for Network Traffic Synthesis: Fidelity, Augmentation, and Classification

[2603.25322] AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer's Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study

[2603.25466] Residual-as-Teacher: Mitigating Bias Propagation in Student--Teacher Estimation

[2603.25289] Revealing the influence of participant failures on model quality in cross-silo Federated Learning

[2603.25440] The Symmetric Perceptron: a Teacher-Student Scenario

[2603.25414] Decidable By Construction: Design-Time Verification for Trustworthy AI

Related Topics

Stay updated with AI News