Auroch - The Future of AI Memory
Auroch Engine is an external memory layer for AI assistants — designed to give models better long-term recall, personalization, and conte...
ML algorithms, training, and inference
Auroch Engine is an external memory layer for AI assistants — designed to give models better long-term recall, personalization, and conte...
Hey everyone, I’ve been building a multi-agent system in my spare time, and I just open-sourced the repository. I was getting tired of th...
Heyy guyss... I had made the image dataset and was currently working on its training using the srnet model... I made it train on batches ...
Abstract page for arXiv paper 2506.08125: Not All Tokens Matter: Towards Efficient LLM Reasoning via Token Significance in Reinforcement ...
Abstract page for arXiv paper 2506.02371: SFBD Flow: A Continuous-Optimization Framework for Training Diffusion Models with Noisy Samples
Abstract page for arXiv paper 2506.01897: MLorc: Momentum Low-rank Compression for Memory Efficient Large Language Model Adaptation
Abstract page for arXiv paper 2505.24535: Beyond Linear Steering: Unified Multi-Attribute Control for Language Models
Abstract page for arXiv paper 2505.21972: LLMs Judging LLMs: A Simplex Perspective
Abstract page for arXiv paper 2505.21605: SoSBench: Benchmarking Safety Alignment on Six Scientific Domains
Abstract page for arXiv paper 2505.14202: MSDformer: Multi-scale Discrete Transformer For Time Series Generation
Abstract page for arXiv paper 2505.13742: Understanding Task Representations in Neural Networks via Bayesian Ablation
Abstract page for arXiv paper 2505.12530: Enforcing Fair Predicted Scores on Intervals of Percentiles by Difference-of-Convex Constraints
Abstract page for arXiv paper 2505.12167: FABLE: A Localized, Targeted Adversarial Attack on Weather Forecasting Models
Abstract page for arXiv paper 2505.03530: A Multi-Level Causal Intervention Framework for Mechanistic Interpretability in Variational Aut...
Abstract page for arXiv paper 2503.03206: An Analytical Theory of Spectral Bias in the Learning Dynamics of Diffusion Models
Abstract page for arXiv paper 2502.15567: Model Privacy: A Unified Framework for Understanding Model Stealing Attacks and Defenses
Abstract page for arXiv paper 2502.07977: RESIST: Resilient Decentralized Learning Using Consensus Gradient Descent
Abstract page for arXiv paper 2502.02020: Causal Bandit Over Unknown Graphs: Upper Confidence Bounds With Backdoor Adjustment
Abstract page for arXiv paper 2501.15458: Amortized Safe Active Learning for Real-Time Data Acquisition: Pretrained Neural Policies From ...
Abstract page for arXiv paper 2411.18235: Certified Training with Branch-and-Bound for Lyapunov-stable Neural Control
Abstract page for arXiv paper 2410.07430: EventFlow: Forecasting Temporal Point Processes with Flow Matching
Abstract page for arXiv paper 2410.02260: FedScalar: Federated Learning with Scalar Communication for Bandwidth-Constrained Networks
Abstract page for arXiv paper 2307.09366: Sparse Gaussian Graphical Models with Discrete Optimization: Computational and Statistical Pers...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime