Natural Language Processing

Text understanding and language tasks

Top This Week

NLP

[P] Implemented ACT-R cognitive decay and hyperdimensional computing for AI agent memory (open source)

Built a memory server for AI agents (MCP protocol) and implemented two cognitive science techniques in v7.5 I wanted to share. ACT-R Cogn...

Reddit - Machine Learning · 1 min ·
NLP

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses These hypotheses are not meant to be believed. They are meant to be **held lig...

Reddit - Artificial Intelligence · 1 min ·
LLMs

[P] Remote sensing foundation models made easy to use.

This project enables the idea of tasking remote sensing models to acquire embeddings like we task satellites to acquire data! https://git...

Reddit - Machine Learning · 1 min ·

All Content

Machine Learning

[R] The "Data Scientist" title is the worst paying title in ML (EMEA).

A recruiter reveals that 'Data Scientist' is the lowest-paying title in machine learning across Europe, based on an analysis of over 350,...

Reddit - Machine Learning · 1 min ·
LLMs

[R] Predicting Edge Importance in GPT-2's Induction Circuit from Weights Alone (ρ=0.623, 125x speedup)

The article discusses how two structural properties of virtual weight matrices can predict edge importance in GPT-2's induction circuit, ...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] Research on Self-supervised fine-tuning of "sentence" embeddings?

The article discusses the challenges and methods of fine-tuning sentence embeddings from transformer models, particularly focusing on agg...

Reddit - Machine Learning · 1 min ·
Machine Learning

[p] I Made my first Transformer architecture code

A Reddit user shares their first implementation of a Transformer architecture using PyTorch, detailing the structure and parameters used,...

Reddit - Machine Learning · 1 min ·
LLMs

[2512.03310] Randomized Masked Finetuning: An Efficient Way to Mitigate Memorization of PIIs in LLMs

The paper introduces Randomized Masked Finetuning (RMFT), a technique designed to reduce the memorization of personally identifiable info...

arXiv - Machine Learning · 3 min ·
Machine Learning

[2511.17772] Weighted Birkhoff Averages Accelerate Data-Driven Methods

The paper discusses Weighted Birkhoff Averages, a method that accelerates convergence in data-driven algorithms for dynamical systems, de...

arXiv - Machine Learning · 3 min ·
Machine Learning

[2509.20345] Statistical Inference Leveraging Synthetic Data with Distribution-Free Guarantees

This article presents the GEneral Synthetic-Powered Inference (GESPI) framework, which enhances statistical inference by integrating synt...

arXiv - Machine Learning · 4 min ·
LLMs

[2508.02515] PoeTone: A Framework for Constrained Generation of Structured Chinese Songci with LLMs

The paper presents PoeTone, a framework for generating structured Chinese Songci poetry using large language models (LLMs), evaluating th...

arXiv - Machine Learning · 4 min ·
Machine Learning

[2506.05688] Voice Impression Control in Zero-Shot TTS

This paper presents a novel method for controlling voice impressions in zero-shot text-to-speech (TTS) systems, utilizing a low-dimension...

arXiv - Machine Learning · 3 min ·
Machine Learning

[2602.12281] Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Action Alignment

This paper explores the effectiveness of test-time verification over policy learning in enhancing Vision-Language-Action (VLA) alignment,...

arXiv - AI · 4 min ·
Machine Learning

[2503.20711] Demand Estimation with Text and Image Data

This article presents a novel demand estimation method that utilizes unstructured data from text and images to enhance substitution patte...

arXiv - Machine Learning · 3 min ·
LLMs

[2602.11358] When Models Examine Themselves: Vocabulary-Activation Correspondence in Self-Referential Processing

This article explores the relationship between vocabulary activation and self-referential processing in large language models, introducin...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.07680] Vision and Language: Novel Representations and Artificial intelligence for Driving Scene Safety Assessment and Autonomous Vehicle Planning

This paper explores the integration of vision-language models in autonomous driving, focusing on safety assessment and decision-making th...

arXiv - Machine Learning · 4 min ·
LLMs

[2412.00364] LMSeg: Unleashing the Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation

The paper presents LMSeg, a novel approach for open-vocabulary semantic segmentation that enhances visual and linguistic feature alignmen...

arXiv - Machine Learning · 4 min ·
LLMs

[2510.20091] CreativityPrism: A Holistic Evaluation Framework for Large Language Model Creativity

The paper presents CreativityPrism, a comprehensive framework for evaluating the creativity of large language models (LLMs) across variou...

arXiv - AI · 4 min ·
Machine Learning

[2510.15828] GENESIS: A Generative Model of Episodic-Semantic Interaction

The paper introduces GENESIS, a generative model that integrates episodic and semantic memory, addressing a key challenge in cognitive ne...

arXiv - AI · 4 min ·
LLMs

[2510.08102] Lossless Vocabulary Reduction for Auto-Regressive Language Models

This paper introduces a theoretical framework for lossless vocabulary reduction in auto-regressive language models, enabling efficient co...

arXiv - Machine Learning · 3 min ·
Machine Learning

[2602.10956] Stochastic Parroting in Temporal Attention -- Regulating the Diagonal Sink

The paper explores the challenges of spatio-temporal models in machine learning, focusing on biases in temporal attention mechanisms and ...

arXiv - Machine Learning · 3 min ·
LLMs

[2602.10067] Features as Rewards: Scalable Supervision for Open-Ended Tasks via Interpretability

The paper introduces a novel approach to using features as rewards in reinforcement learning for open-ended tasks, focusing on reducing h...

arXiv - Machine Learning · 4 min ·
LLMs

[2510.04694] Multilingual Routing in Mixture-of-Experts

This paper explores multilingual routing in Mixture-of-Experts (MoE) architectures, revealing how these models handle multilingual data a...

arXiv - Machine Learning · 4 min ·