Qwen3 4B outperforms cloud agents on code tasks—with Mahoraga research [R]
Hey everyone in ML. I've been working on Mahoraga, an open-source orchestrator that routes tasks across local and cloud AI agents using a...
ML algorithms, training, and inference
Hey everyone in ML. I've been working on Mahoraga, an open-source orchestrator that routes tasks across local and cloud AI agents using a...
Auroch Engine is an external memory layer for AI assistants — designed to give models better long-term recall, personalization, and conte...
Hey everyone, I’ve been building a multi-agent system in my spare time, and I just open-sourced the repository. I was getting tired of th...
Abstract page for arXiv paper 2307.09366: Sparse Gaussian Graphical Models with Discrete Optimization: Computational and Statistical Pers...
Abstract page for arXiv paper 2305.03784: Neural Exploitation and Exploration of Contextual Bandits
Abstract page for arXiv paper 2301.01741: Graph State-Space Models and Latent Relational Inference
Abstract page for arXiv paper 2006.04363: Mitigating Value Hallucination in Dyna Planning via Multistep Predecessor Models
Abstract page for arXiv paper 2604.04930: Early Stopping for Large Reasoning Models via Confidence Dynamics
Abstract page for arXiv paper 2604.04920: PINNs in PDE Constrained Optimal Control Problems: Direct vs Indirect Methods
Abstract page for arXiv paper 2604.04898: QED-Nano: Teaching a Tiny Model to Prove Hard Theorems
Abstract page for arXiv paper 2604.04894: Rethinking Exploration in RLVR: From Entropy Regularization to Refinement via Bidirectional Ent...
Abstract page for arXiv paper 2604.04872: Synthetic Sandbox for Training Machine Learning Engineering Agents
Abstract page for arXiv paper 2604.04829: A Robust SINDy Autoencoder for Noisy Dynamical System Identification
Abstract page for arXiv paper 2604.04804: SkillX: Automatically Constructing Skill Knowledge Bases for Agents
Abstract page for arXiv paper 2604.04828: Hybrid Fourier Neural Operator for Surrogate Modeling of Laser Processing with a Quantum-Circui...
Abstract page for arXiv paper 2604.04802: Partially deterministic sampling for compressed sensing with denoising guarantees
Abstract page for arXiv paper 2604.04790: HUKUKBERT: Domain-Specific Language Model for Turkish Law
Abstract page for arXiv paper 2604.04757: Undetectable Conversations Between AI Agents via Pseudorandom Noise-Resilient Key Exchange
Abstract page for arXiv paper 2604.04738: Fine-Tuning Integrity for Modern Neural Networks: Structured Drift Proofs via Norm, Rank, and S...
Abstract page for arXiv paper 2604.04726: A Muon-Accelerated Algorithm for Low Separation Rank Tensor Generalized Linear Models
Abstract page for arXiv paper 2604.04677: Towards protein folding pathways by reconstructing protein residue networks with a policy-drive...
Abstract page for arXiv paper 2604.04673: Minimaxity and Admissibility of Bayesian Neural Networks
Abstract page for arXiv paper 2604.04667: ZeD-MAP: Bundle Adjustment Guided Zero-Shot Depth Maps for Real-Time Aerial Imaging
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime