[D] KDD Review Discussion
KDD 2026 (Feb Cycle) reviews will release today (4-April AoE), This thread is open to discuss about reviews and importantly celebrate suc...
Text understanding and language tasks
KDD 2026 (Feb Cycle) reviews will release today (4-April AoE), This thread is open to discuss about reviews and importantly celebrate suc...
Built a memory server for AI agents (MCP protocol) and implemented two cognitive science techniques in v7.5 I wanted to share. ACT-R Cogn...
🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses These hypotheses are not meant to be believed. They are meant to be **held lig...
This paper explores multilingual routing in Mixture-of-Experts (MoE) architectures, revealing how these models handle multilingual data a...
The paper introduces MedReasoner, a framework that utilizes reinforcement learning for precise medical reasoning and pixel-level groundin...
This paper explores the expressive power of graph transformers, comparing their capabilities under different logical frameworks, particul...
This paper presents MedVLSynther, a framework for synthesizing high-quality visual question answering (VQA) from medical documents, enhan...
This article presents a novel approach combining Chain-of-Thought (CoT) and Retrieval Augmented Generation (RAG) to improve rare disease ...
The paper presents EVOL-RL, a novel framework for evolving language models without labels, balancing majority-driven stability with novel...
This article explores the universal properties of activation sparsity in modern large language models (LLMs), highlighting its implicatio...
The paper presents FedARA, an innovative framework for federated parameter-efficient fine-tuning of language models, addressing data hete...
The paper introduces $ ext{Pinet}$, a novel output layer for neural networks that optimizes hard constraints using orthogonal projection ...
The paper presents Cocoa, a system designed to enhance human-agent collaboration in AI tasks by allowing flexible co-planning and co-exec...
The paper presents MC-LLaVA, a multi-concept personalized vision-language model that enhances user experience by integrating multiple con...
The paper introduces WINA, a novel framework for efficient inference in large language models (LLMs) that optimally combines hidden state...
DIAGPaper introduces a multi-agent framework for identifying and prioritizing weaknesses in scientific papers, addressing limitations of ...
CaveAgent introduces a novel framework that transforms LLMs into stateful runtime operators, enhancing their ability to manage complex ta...
The paper explores continual learning (CL) in AI, proposing a shift from minimizing memory usage to leveraging abundant memory while addr...
This paper explores the optimization challenges of Transformer models, focusing on gradient heterogeneity and its impact on convergence w...
This article presents a computational model that explores how humans and AI can integrate linguistic guidance and direct experience for e...
The paper presents GDGB, a benchmark for Generative Dynamic Text-Attributed Graph Learning, addressing the limitations of existing datase...
This paper demonstrates that parameter-free representations can outperform single-cell foundation models in various benchmarks, suggestin...
This article discusses the integration of Large Language Models (LLMs) into water distribution system management, introducing LLM-EPANET,...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime