Machine Learning

ML algorithms, training, and inference

Top This Week

Llms

Qwen3 4B outperforms cloud agents on code tasks—with Mahoraga research [R]

Hey everyone in ML. I've been working on Mahoraga, an open-source orchestrator that routes tasks across local and cloud AI agents using a...

Reddit - Machine Learning · 1 min ·
Machine Learning

Auroch - The Future of AI Memory

Auroch Engine is an external memory layer for AI assistants — designed to give models better long-term recall, personalization, and conte...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

Project Aurelia — A 3-model architecture (80B + 13B + 9B) that physically reacts to my real-time heart rate via mmWave radar, spatial awareness via Lidar, and Vibration via Accelerometer. All on a Framework Desktop + eGPU

Hey everyone, I’ve been building a multi-agent system in my spare time, and I just open-sourced the repository. I was getting tired of th...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2307.09366] Sparse Gaussian Graphical Models with Discrete Optimization: Computational and Statistical Perspectives
Machine Learning

[2307.09366] Sparse Gaussian Graphical Models with Discrete Optimization: Computational and Statistical Perspectives

Abstract page for arXiv paper 2307.09366: Sparse Gaussian Graphical Models with Discrete Optimization: Computational and Statistical Pers...

arXiv - Machine Learning · 4 min ·
[2305.03784] Neural Exploitation and Exploration of Contextual Bandits
Machine Learning

[2305.03784] Neural Exploitation and Exploration of Contextual Bandits

Abstract page for arXiv paper 2305.03784: Neural Exploitation and Exploration of Contextual Bandits

arXiv - Machine Learning · 3 min ·
[2301.01741] Graph State-Space Models and Latent Relational Inference
Machine Learning

[2301.01741] Graph State-Space Models and Latent Relational Inference

Abstract page for arXiv paper 2301.01741: Graph State-Space Models and Latent Relational Inference

arXiv - Machine Learning · 3 min ·
[2006.04363] Mitigating Value Hallucination in Dyna Planning via Multistep Predecessor Models
Machine Learning

[2006.04363] Mitigating Value Hallucination in Dyna Planning via Multistep Predecessor Models

Abstract page for arXiv paper 2006.04363: Mitigating Value Hallucination in Dyna Planning via Multistep Predecessor Models

arXiv - AI · 4 min ·
[2604.04930] Early Stopping for Large Reasoning Models via Confidence Dynamics
Machine Learning

[2604.04930] Early Stopping for Large Reasoning Models via Confidence Dynamics

Abstract page for arXiv paper 2604.04930: Early Stopping for Large Reasoning Models via Confidence Dynamics

arXiv - AI · 3 min ·
[2604.04920] PINNs in PDE Constrained Optimal Control Problems: Direct vs Indirect Methods
Machine Learning

[2604.04920] PINNs in PDE Constrained Optimal Control Problems: Direct vs Indirect Methods

Abstract page for arXiv paper 2604.04920: PINNs in PDE Constrained Optimal Control Problems: Direct vs Indirect Methods

arXiv - Machine Learning · 4 min ·
[2604.04898] QED-Nano: Teaching a Tiny Model to Prove Hard Theorems
Machine Learning

[2604.04898] QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

Abstract page for arXiv paper 2604.04898: QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

arXiv - AI · 4 min ·
[2604.04894] Rethinking Exploration in RLVR: From Entropy Regularization to Refinement via Bidirectional Entropy Modulation
Llms

[2604.04894] Rethinking Exploration in RLVR: From Entropy Regularization to Refinement via Bidirectional Entropy Modulation

Abstract page for arXiv paper 2604.04894: Rethinking Exploration in RLVR: From Entropy Regularization to Refinement via Bidirectional Ent...

arXiv - AI · 4 min ·
[2604.04872] Synthetic Sandbox for Training Machine Learning Engineering Agents
Llms

[2604.04872] Synthetic Sandbox for Training Machine Learning Engineering Agents

Abstract page for arXiv paper 2604.04872: Synthetic Sandbox for Training Machine Learning Engineering Agents

arXiv - Machine Learning · 4 min ·
[2604.04829] A Robust SINDy Autoencoder for Noisy Dynamical System Identification
Machine Learning

[2604.04829] A Robust SINDy Autoencoder for Noisy Dynamical System Identification

Abstract page for arXiv paper 2604.04829: A Robust SINDy Autoencoder for Noisy Dynamical System Identification

arXiv - Machine Learning · 3 min ·
[2604.04804] SkillX: Automatically Constructing Skill Knowledge Bases for Agents
Llms

[2604.04804] SkillX: Automatically Constructing Skill Knowledge Bases for Agents

Abstract page for arXiv paper 2604.04804: SkillX: Automatically Constructing Skill Knowledge Bases for Agents

arXiv - Machine Learning · 4 min ·
[2604.04828] Hybrid Fourier Neural Operator for Surrogate Modeling of Laser Processing with a Quantum-Circuit Mixer
Machine Learning

[2604.04828] Hybrid Fourier Neural Operator for Surrogate Modeling of Laser Processing with a Quantum-Circuit Mixer

Abstract page for arXiv paper 2604.04828: Hybrid Fourier Neural Operator for Surrogate Modeling of Laser Processing with a Quantum-Circui...

arXiv - Machine Learning · 4 min ·
[2604.04802] Partially deterministic sampling for compressed sensing with denoising guarantees
Machine Learning

[2604.04802] Partially deterministic sampling for compressed sensing with denoising guarantees

Abstract page for arXiv paper 2604.04802: Partially deterministic sampling for compressed sensing with denoising guarantees

arXiv - Machine Learning · 3 min ·
[2604.04790] HUKUKBERT: Domain-Specific Language Model for Turkish Law
Llms

[2604.04790] HUKUKBERT: Domain-Specific Language Model for Turkish Law

Abstract page for arXiv paper 2604.04790: HUKUKBERT: Domain-Specific Language Model for Turkish Law

arXiv - Machine Learning · 4 min ·
[2604.04757] Undetectable Conversations Between AI Agents via Pseudorandom Noise-Resilient Key Exchange
Machine Learning

[2604.04757] Undetectable Conversations Between AI Agents via Pseudorandom Noise-Resilient Key Exchange

Abstract page for arXiv paper 2604.04757: Undetectable Conversations Between AI Agents via Pseudorandom Noise-Resilient Key Exchange

arXiv - AI · 4 min ·
[2604.04738] Fine-Tuning Integrity for Modern Neural Networks: Structured Drift Proofs via Norm, Rank, and Sparsity Certificates
Machine Learning

[2604.04738] Fine-Tuning Integrity for Modern Neural Networks: Structured Drift Proofs via Norm, Rank, and Sparsity Certificates

Abstract page for arXiv paper 2604.04738: Fine-Tuning Integrity for Modern Neural Networks: Structured Drift Proofs via Norm, Rank, and S...

arXiv - Machine Learning · 4 min ·
[2604.04726] A Muon-Accelerated Algorithm for Low Separation Rank Tensor Generalized Linear Models
Machine Learning

[2604.04726] A Muon-Accelerated Algorithm for Low Separation Rank Tensor Generalized Linear Models

Abstract page for arXiv paper 2604.04726: A Muon-Accelerated Algorithm for Low Separation Rank Tensor Generalized Linear Models

arXiv - Machine Learning · 3 min ·
[2604.04677] Towards protein folding pathways by reconstructing protein residue networks with a policy-driven model
Machine Learning

[2604.04677] Towards protein folding pathways by reconstructing protein residue networks with a policy-driven model

Abstract page for arXiv paper 2604.04677: Towards protein folding pathways by reconstructing protein residue networks with a policy-drive...

arXiv - Machine Learning · 3 min ·
[2604.04673] Minimaxity and Admissibility of Bayesian Neural Networks
Machine Learning

[2604.04673] Minimaxity and Admissibility of Bayesian Neural Networks

Abstract page for arXiv paper 2604.04673: Minimaxity and Admissibility of Bayesian Neural Networks

arXiv - Machine Learning · 3 min ·
[2604.04667] ZeD-MAP: Bundle Adjustment Guided Zero-Shot Depth Maps for Real-Time Aerial Imaging
Machine Learning

[2604.04667] ZeD-MAP: Bundle Adjustment Guided Zero-Shot Depth Maps for Real-Time Aerial Imaging

Abstract page for arXiv paper 2604.04667: ZeD-MAP: Bundle Adjustment Guided Zero-Shot Depth Maps for Real-Time Aerial Imaging

arXiv - Machine Learning · 4 min ·
Previous Page 252 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime