Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Qwen3 4B outperforms cloud agents on code tasks—with Mahoraga research [R]

Hey everyone in ML. I've been working on Mahoraga, an open-source orchestrator that routes tasks across local and cloud AI agents using a...

Reddit - Machine Learning · 1 min · 31 minutes ago

Machine Learning

Auroch - The Future of AI Memory

Auroch Engine is an external memory layer for AI assistants — designed to give models better long-term recall, personalization, and conte...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Machine Learning

Project Aurelia — A 3-model architecture (80B + 13B + 9B) that physically reacts to my real-time heart rate via mmWave radar, spatial awareness via Lidar, and Vibration via Accelerometer. All on a Framework Desktop + eGPU

Hey everyone, I’ve been building a multi-agent system in my spare time, and I just open-sourced the repository. I was getting tired of th...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

All Content

Machine Learning

[2307.09366] Sparse Gaussian Graphical Models with Discrete Optimization: Computational and Statistical Perspectives

Abstract page for arXiv paper 2307.09366: Sparse Gaussian Graphical Models with Discrete Optimization: Computational and Statistical Pers...

arXiv - Machine Learning · 4 min · 21 days ago

Machine Learning

[2305.03784] Neural Exploitation and Exploration of Contextual Bandits

Abstract page for arXiv paper 2305.03784: Neural Exploitation and Exploration of Contextual Bandits

arXiv - Machine Learning · 3 min · 21 days ago

Machine Learning

[2301.01741] Graph State-Space Models and Latent Relational Inference

Abstract page for arXiv paper 2301.01741: Graph State-Space Models and Latent Relational Inference

arXiv - Machine Learning · 3 min · 21 days ago

Machine Learning

[2006.04363] Mitigating Value Hallucination in Dyna Planning via Multistep Predecessor Models

Abstract page for arXiv paper 2006.04363: Mitigating Value Hallucination in Dyna Planning via Multistep Predecessor Models

arXiv - AI · 4 min · 21 days ago

Machine Learning

[2604.04930] Early Stopping for Large Reasoning Models via Confidence Dynamics

Abstract page for arXiv paper 2604.04930: Early Stopping for Large Reasoning Models via Confidence Dynamics

arXiv - AI · 3 min · 21 days ago

Machine Learning

[2604.04920] PINNs in PDE Constrained Optimal Control Problems: Direct vs Indirect Methods

Abstract page for arXiv paper 2604.04920: PINNs in PDE Constrained Optimal Control Problems: Direct vs Indirect Methods

arXiv - Machine Learning · 4 min · 21 days ago

Machine Learning

[2604.04898] QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

Abstract page for arXiv paper 2604.04898: QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

arXiv - AI · 4 min · 21 days ago

Llms

[2604.04894] Rethinking Exploration in RLVR: From Entropy Regularization to Refinement via Bidirectional Entropy Modulation

Abstract page for arXiv paper 2604.04894: Rethinking Exploration in RLVR: From Entropy Regularization to Refinement via Bidirectional Ent...

arXiv - AI · 4 min · 21 days ago

Llms

[2604.04872] Synthetic Sandbox for Training Machine Learning Engineering Agents

Abstract page for arXiv paper 2604.04872: Synthetic Sandbox for Training Machine Learning Engineering Agents

arXiv - Machine Learning · 4 min · 21 days ago

Machine Learning

[2604.04829] A Robust SINDy Autoencoder for Noisy Dynamical System Identification

Abstract page for arXiv paper 2604.04829: A Robust SINDy Autoencoder for Noisy Dynamical System Identification

arXiv - Machine Learning · 3 min · 21 days ago

Llms

[2604.04804] SkillX: Automatically Constructing Skill Knowledge Bases for Agents

Abstract page for arXiv paper 2604.04804: SkillX: Automatically Constructing Skill Knowledge Bases for Agents

arXiv - Machine Learning · 4 min · 21 days ago

Machine Learning

[2604.04828] Hybrid Fourier Neural Operator for Surrogate Modeling of Laser Processing with a Quantum-Circuit Mixer

Abstract page for arXiv paper 2604.04828: Hybrid Fourier Neural Operator for Surrogate Modeling of Laser Processing with a Quantum-Circui...

arXiv - Machine Learning · 4 min · 21 days ago

Machine Learning

[2604.04802] Partially deterministic sampling for compressed sensing with denoising guarantees

Abstract page for arXiv paper 2604.04802: Partially deterministic sampling for compressed sensing with denoising guarantees

arXiv - Machine Learning · 3 min · 21 days ago

Llms

[2604.04790] HUKUKBERT: Domain-Specific Language Model for Turkish Law

Abstract page for arXiv paper 2604.04790: HUKUKBERT: Domain-Specific Language Model for Turkish Law

arXiv - Machine Learning · 4 min · 21 days ago

Machine Learning

[2604.04757] Undetectable Conversations Between AI Agents via Pseudorandom Noise-Resilient Key Exchange

Abstract page for arXiv paper 2604.04757: Undetectable Conversations Between AI Agents via Pseudorandom Noise-Resilient Key Exchange

arXiv - AI · 4 min · 21 days ago

Machine Learning

[2604.04738] Fine-Tuning Integrity for Modern Neural Networks: Structured Drift Proofs via Norm, Rank, and Sparsity Certificates

Abstract page for arXiv paper 2604.04738: Fine-Tuning Integrity for Modern Neural Networks: Structured Drift Proofs via Norm, Rank, and S...

arXiv - Machine Learning · 4 min · 21 days ago

Machine Learning

[2604.04726] A Muon-Accelerated Algorithm for Low Separation Rank Tensor Generalized Linear Models

Abstract page for arXiv paper 2604.04726: A Muon-Accelerated Algorithm for Low Separation Rank Tensor Generalized Linear Models

arXiv - Machine Learning · 3 min · 21 days ago

Machine Learning

[2604.04677] Towards protein folding pathways by reconstructing protein residue networks with a policy-driven model

Abstract page for arXiv paper 2604.04677: Towards protein folding pathways by reconstructing protein residue networks with a policy-drive...

arXiv - Machine Learning · 3 min · 21 days ago

Machine Learning

[2604.04673] Minimaxity and Admissibility of Bayesian Neural Networks

Abstract page for arXiv paper 2604.04673: Minimaxity and Admissibility of Bayesian Neural Networks

arXiv - Machine Learning · 3 min · 21 days ago

Machine Learning

[2604.04667] ZeD-MAP: Bundle Adjustment Guided Zero-Shot Depth Maps for Real-Time Aerial Imaging

Abstract page for arXiv paper 2604.04667: ZeD-MAP: Bundle Adjustment Guided Zero-Shot Depth Maps for Real-Time Aerial Imaging

arXiv - Machine Learning · 4 min · 21 days ago

Previous Page 252 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

Qwen3 4B outperforms cloud agents on code tasks—with Mahoraga research [R]

Auroch - The Future of AI Memory

Project Aurelia — A 3-model architecture (80B + 13B + 9B) that physically reacts to my real-time heart rate via mmWave radar, spatial awareness via Lidar, and Vibration via Accelerometer. All on a Framework Desktop + eGPU

All Content

[2307.09366] Sparse Gaussian Graphical Models with Discrete Optimization: Computational and Statistical Perspectives

[2305.03784] Neural Exploitation and Exploration of Contextual Bandits

[2301.01741] Graph State-Space Models and Latent Relational Inference

[2006.04363] Mitigating Value Hallucination in Dyna Planning via Multistep Predecessor Models

[2604.04930] Early Stopping for Large Reasoning Models via Confidence Dynamics

[2604.04920] PINNs in PDE Constrained Optimal Control Problems: Direct vs Indirect Methods

[2604.04898] QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

[2604.04894] Rethinking Exploration in RLVR: From Entropy Regularization to Refinement via Bidirectional Entropy Modulation

[2604.04872] Synthetic Sandbox for Training Machine Learning Engineering Agents

[2604.04829] A Robust SINDy Autoencoder for Noisy Dynamical System Identification

[2604.04804] SkillX: Automatically Constructing Skill Knowledge Bases for Agents

[2604.04828] Hybrid Fourier Neural Operator for Surrogate Modeling of Laser Processing with a Quantum-Circuit Mixer

[2604.04802] Partially deterministic sampling for compressed sensing with denoising guarantees

[2604.04790] HUKUKBERT: Domain-Specific Language Model for Turkish Law

[2604.04757] Undetectable Conversations Between AI Agents via Pseudorandom Noise-Resilient Key Exchange

[2604.04738] Fine-Tuning Integrity for Modern Neural Networks: Structured Drift Proofs via Norm, Rank, and Sparsity Certificates

[2604.04726] A Muon-Accelerated Algorithm for Low Separation Rank Tensor Generalized Linear Models

[2604.04677] Towards protein folding pathways by reconstructing protein residue networks with a policy-driven model

[2604.04673] Minimaxity and Admissibility of Bayesian Neural Networks

[2604.04667] ZeD-MAP: Bundle Adjustment Guided Zero-Shot Depth Maps for Real-Time Aerial Imaging

Related Topics

Stay updated with AI News