[P] Implemented ACT-R cognitive decay and hyperdimensional computing for AI agent memory (open source)
Built a memory server for AI agents (MCP protocol) and implemented two cognitive science techniques in v7.5 I wanted to share. ACT-R Cogn...
Text understanding and language tasks
Built a memory server for AI agents (MCP protocol) and implemented two cognitive science techniques in v7.5 I wanted to share. ACT-R Cogn...
🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses These hypotheses are not meant to be believed. They are meant to be **held lig...
This project enables the idea of tasking remote sensing models to acquire embeddings like we task satellites to acquire data! https://git...
A recruiter reveals that 'Data Scientist' is the lowest-paying title in machine learning across Europe, based on an analysis of over 350,...
The article discusses how two structural properties of virtual weight matrices can predict edge importance in GPT-2's induction circuit, ...
The article discusses the challenges and methods of fine-tuning sentence embeddings from transformer models, particularly focusing on agg...
A Reddit user shares their first implementation of a Transformer architecture using PyTorch, detailing the structure and parameters used,...
The paper introduces Randomized Masked Finetuning (RMFT), a technique designed to reduce the memorization of personally identifiable info...
The paper discusses Weighted Birkhoff Averages, a method that accelerates convergence in data-driven algorithms for dynamical systems, de...
This article presents the GEneral Synthetic-Powered Inference (GESPI) framework, which enhances statistical inference by integrating synt...
The paper presents PoeTone, a framework for generating structured Chinese Songci poetry using large language models (LLMs), evaluating th...
This paper presents a novel method for controlling voice impressions in zero-shot text-to-speech (TTS) systems, utilizing a low-dimension...
This paper explores the effectiveness of test-time verification over policy learning in enhancing Vision-Language-Action (VLA) alignment,...
This article presents a novel demand estimation method that utilizes unstructured data from text and images to enhance substitution patte...
This article explores the relationship between vocabulary activation and self-referential processing in large language models, introducin...
This paper explores the integration of vision-language models in autonomous driving, focusing on safety assessment and decision-making th...
The paper presents LMSeg, a novel approach for open-vocabulary semantic segmentation that enhances visual and linguistic feature alignmen...
The paper presents CreativityPrism, a comprehensive framework for evaluating the creativity of large language models (LLMs) across variou...
The paper introduces GENESIS, a generative model that integrates episodic and semantic memory, addressing a key challenge in cognitive ne...
This paper introduces a theoretical framework for lossless vocabulary reduction in auto-regressive language models, enabling efficient co...
The paper explores the challenges of spatio-temporal models in machine learning, focusing on biases in temporal attention mechanisms and ...
The paper introduces a novel approach to using features as rewards in reinforcement learning for open-ended tasks, focusing on reducing h...
This paper explores multilingual routing in Mixture-of-Experts (MoE) architectures, revealing how these models handle multilingual data a...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime