[2603.26671] Mitigating Forgetting in Continual Learning with

[2603.26671] Mitigating Forgetting in Continual Learning with Selective Gradient Projection

arXiv - Machine Learning March 31, 2026 3 min read

About this article

Abstract page for arXiv paper 2603.26671: Mitigating Forgetting in Continual Learning with Selective Gradient Projection

Computer Science > Machine Learning arXiv:2603.26671 (cs) [Submitted on 8 Feb 2026] Title:Mitigating Forgetting in Continual Learning with Selective Gradient Projection Authors:Anika Singh, Aayush Dhaulakhandi, Varun Chopade, Likhith Malipati, David Martinez, Kevin Zhu View a PDF of the paper titled Mitigating Forgetting in Continual Learning with Selective Gradient Projection, by Anika Singh and 5 other authors View PDF HTML (experimental) Abstract:As neural networks are increasingly deployed in dynamic environments, they face the challenge of catastrophic forgetting, the tendency to overwrite previously learned knowledge when adapting to new tasks, resulting in severe performance degradation on earlier tasks. We propose Selective Forgetting-Aware Optimization (SFAO), a dynamic method that regulates gradient directions via cosine similarity and per-layer gating, enabling controlled forgetting while balancing plasticity and stability. SFAO selectively projects, accepts, or discards updates using a tunable mechanism with efficient Monte Carlo approximation. Experiments on standard continual learning benchmarks show that SFAO achieves competitive accuracy with markedly lower memory cost, a 90$\%$ reduction, and improved forgetting on MNIST datasets, making it suitable for resource-constrained scenarios. Comments: Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC) Cite as: arXiv:2603.26671 [cs.LG] (or arXiv:2603.26671v1 [cs.LG] for this version) https:...

Originally published on March 31, 2026. Curated by AI News.

Machine Learning

[D] How does distributed proof of work computing handle the coordination needs of neural network training?

[D] Ive been trying to understand the technical setup of a project called Qubic. It claims to use distributed proof of work computing for...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

[R] VLMs Behavior for Long Video Understanding

I have extensively searched on long video understanding datasets such as Video-MME, MLVU, VideoBench, LongVideoBench and etc. What I have...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

My AI spent last night modifying its own codebase

I've been working on a local AI system called Apis that runs completely offline through Ollama. During a background run, Apis identified ...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Llms

Fake users generated by AI can't simulate humans — review of 182 research papers. Your thoughts?

https://www.researchsquare.com/article/rs-9057643/v1 There’s a massive trend right now where tech companies, businesses, even researchers...