Associative memory system for LLMs that learns during inference [P]
I've been working on MDA (Modular Dynamic Architecture), an online associative memory system for LLMs. Here's what I learned building it....
GPT, Claude, Gemini, and other LLMs
I've been building **Autodidact**, a local-first AI agent framework. The central piece is a **confidence evaluator** - something that dec...
Seriously, I just audited my stack and realized I’m spending more on rotating residential proxies than I am on the actual Claude and Open...
Abstract page for arXiv paper 2603.05121: Measuring the Redundancy of Decoder Layers in SpeechLLMs
Abstract page for arXiv paper 2603.04982: Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis
Abstract page for arXiv paper 2603.04976: 3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding
Abstract page for arXiv paper 2603.04968: When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger
Abstract page for arXiv paper 2603.04918: BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforc...
Abstract page for arXiv paper 2603.04893: Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models
Abstract page for arXiv paper 2603.04819: On the Strengths and Weaknesses of Data for Open-set Embodied Assistance
Abstract page for arXiv paper 2603.04805: Attention's Gravitational Field: A Power-Law Interpretation of Positional Correlation
Abstract page for arXiv paper 2603.04799: Beyond Linear LLM Invocation: An Efficient and Effective Semantic Filter Paradigm
Abstract page for arXiv paper 2603.04772: TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings
Abstract page for arXiv paper 2603.04763: Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary
Abstract page for arXiv paper 2603.04743: DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval
Abstract page for arXiv paper 2603.04759: Stacked from One: Multi-Scale Self-Injection for Context Window Extension
Abstract page for arXiv paper 2603.04727: Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in t...
Abstract page for arXiv paper 2603.04707: Detection of Illicit Content on Online Marketplaces using Large Language Models
Abstract page for arXiv paper 2603.04698: Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement
Abstract page for arXiv paper 2603.04678: Optimizing Language Models for Crosslingual Knowledge Consistency
Abstract page for arXiv paper 2603.04676: Decoding the Pulse of Reasoning VLMs in Multi-Image Understanding Tasks
Abstract page for arXiv paper 2603.04663: Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency H...
Abstract page for arXiv paper 2603.04597: Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning