Natural Language Processing

Text understanding and language tasks


Top This Week

NLP

[D] KDD Review Discussion

KDD 2026 (Feb Cycle) reviews will be released today (4 April AoE). This thread is open to discuss reviews and, importantly, celebrate suc...

Reddit - Machine Learning · 1 min · about 3 hours ago

NLP

[P] Implemented ACT-R cognitive decay and hyperdimensional computing for AI agent memory (open source)

Built a memory server for AI agents (MCP protocol) and implemented two cognitive science techniques in v7.5 that I wanted to share. ACT-R Cogn...

Reddit - Machine Learning · 1 min · about 9 hours ago

NLP

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses

These hypotheses are not meant to be believed. They are meant to be held lig...

Reddit - Artificial Intelligence · 1 min · about 18 hours ago

All Content

LLMs

[2510.04694] Multilingual Routing in Mixture-of-Experts

This paper explores multilingual routing in Mixture-of-Experts (MoE) architectures, revealing how these models handle multilingual data a...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2508.08177] MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision

The paper introduces MedReasoner, a framework that utilizes reinforcement learning for precise medical reasoning and pixel-level groundin...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2508.01067] Expressive Power of Graph Transformers via Logic

This paper explores the expressive power of graph transformers, comparing their capabilities under different logical frameworks, particul...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2510.25867] Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs

This paper presents MedVLSynther, a framework for synthesizing high-quality visual question answering (VQA) from medical documents, enhan...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2503.12286] Integrating Chain-of-Thought and Retrieval Augmented Generation Enhances Rare Disease Diagnosis from Clinical Notes

This article presents a novel approach combining Chain-of-Thought (CoT) and Retrieval Augmented Generation (RAG) to improve rare disease ...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2509.15194] Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

The paper presents EVOL-RL, a novel framework for evolving language models without labels, balancing majority-driven stability with novel...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2509.00454] Universal Properties of Activation Sparsity in Modern Large Language Models

This article explores the universal properties of activation sparsity in modern large language models (LLMs), highlighting its implicatio...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2501.14406] Adaptive Rank Allocation for Federated Parameter-Efficient Fine-Tuning of Language Models

The paper presents FedARA, an innovative framework for federated parameter-efficient fine-tuning of language models, addressing data hete...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2508.10480] Pinet: Optimizing hard-constrained neural networks with orthogonal projection layers

The paper introduces Pinet, a novel output layer for neural networks that optimizes hard constraints using orthogonal projection ...

arXiv - Machine Learning · 3 min · about 1 month ago

NLP

[2412.10999] Cocoa: Co-Planning and Co-Execution with AI Agents

The paper presents Cocoa, a system designed to enhance human-agent collaboration in AI tasks by allowing flexible co-planning and co-exec...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2411.11706] MC-LLaVA: Multi-Concept Personalized Vision-Language Model

The paper presents MC-LLaVA, a multi-concept personalized vision-language model that enhances user experience by integrating multiple con...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2505.19427] WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference

The paper introduces WINA, a novel framework for efficient inference in large language models (LLMs) that optimally combines hidden state...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2601.07611] DIAGPaper: Diagnosing Valid and Specific Weaknesses in Scientific Papers via Multi-Agent Reasoning

DIAGPaper introduces a multi-agent framework for identifying and prioritizing weaknesses in scientific papers, addressing limitations of ...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2601.01569] CaveAgent: Transforming LLMs into Stateful Runtime Operators

CaveAgent introduces a novel framework that transforms LLMs into stateful runtime operators, enhancing their ability to manage complex ta...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2502.07274] Forget Forgetting: Continual Learning in a World of Abundant Memory

The paper explores continual learning (CL) in AI, proposing a shift from minimizing memory usage to leveraging abundant memory while addr...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2502.00213] Understanding Transformer Optimization via Gradient Heterogeneity

This paper explores the optimization challenges of Transformer models, focusing on gradient heterogeneity and its impact on convergence w...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2509.00074] Language and Experience: A Computational Model of Social Learning in Complex Tasks

This article presents a computational model that explores how humans and AI can integrate linguistic guidance and direct experience for e...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2507.03267] GDGB: A Benchmark for Generative Dynamic Text-Attributed Graph Learning

The paper presents GDGB, a benchmark for Generative Dynamic Text-Attributed Graph Learning, addressing the limitations of existing datase...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.16696] Parameter-free representations outperform single-cell foundation models on downstream benchmarks

This paper demonstrates that parameter-free representations can outperform single-cell foundation models in various benchmarks, suggestin...

arXiv - Machine Learning · 3 min · about 1 month ago

LLMs

[2503.16191] Large Language Models for Water Distribution Systems Modeling and Decision-Making

This article discusses the integration of Large Language Models (LLMs) into water distribution system management, introducing LLM-EPANET,...

arXiv - Machine Learning · 4 min · about 1 month ago


