Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

If AI is about to get 10x smarter, how do we prevent the internet from collapsing under synthetic noise?

Im all for acceleration. I think the faster we hit AGI the better. but theres a bottleneck nobody here talks about enough-training data. ...

Reddit - Artificial Intelligence · 1 min · 12 minutes ago

Llms

Qwen3 4B outperforms cloud agents on code tasks—with Mahoraga research [R]

Hey everyone in ML. I've been working on Mahoraga, an open-source orchestrator that routes tasks across local and cloud AI agents using a...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

Associative memory system for LLMs that learns during inference [P]

I've been working on MDA (Modular Dynamic Architecture), an online associative memory system for LLMs. Here's what I learned building it....

Reddit - Machine Learning · 1 min · about 6 hours ago

All Content

Llms

[2603.04663] Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency Hallucination Detector

Abstract page for arXiv paper 2603.04663: Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency H...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.04597] Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Abstract page for arXiv paper 2603.04597: Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.04474] From Spark to Fire: Modeling and Mitigating Error Cascades in LLM-Based Multi-Agent Collaboration

Abstract page for arXiv paper 2603.04474: From Spark to Fire: Modeling and Mitigating Error Cascades in LLM-Based Multi-Agent Collaboration

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.04464] Understanding the Dynamics of Demonstration Conflict in In-Context Learning

Abstract page for arXiv paper 2603.04464: Understanding the Dynamics of Demonstration Conflict in In-Context Learning

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.04459] Benchmark of Benchmarks: Unpacking Influence and Code Repository Quality in LLM Safety Benchmarks

Abstract page for arXiv paper 2603.04459: Benchmark of Benchmarks: Unpacking Influence and Code Repository Quality in LLM Safety Benchmarks

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.04460] VSPrefill: Vertical-Slash Sparse Attention with Lightweight Indexing for Long-Context Prefilling

Abstract page for arXiv paper 2603.04460: VSPrefill: Vertical-Slash Sparse Attention with Lightweight Indexing for Long-Context Prefilling

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.04455] Large Language Models as Bidding Agents in Repeated HetNet Auction

Abstract page for arXiv paper 2603.04455: Large Language Models as Bidding Agents in Repeated HetNet Auction

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.04454] Query Disambiguation via Answer-Free Context: Doubling Performance on Humanity's Last Exam

Abstract page for arXiv paper 2603.04454: Query Disambiguation via Answer-Free Context: Doubling Performance on Humanity's Last Exam

arXiv - AI · 3 min · about 2 months ago

Llms

[2603.04453] Induced Numerical Instability: Hidden Costs in Multimodal Large Language Models

Abstract page for arXiv paper 2603.04453: Induced Numerical Instability: Hidden Costs in Multimodal Large Language Models

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.04452] A unified foundational framework for knowledge injection and evaluation of Large Language Models in Combustion Science

Abstract page for arXiv paper 2603.04452: A unified foundational framework for knowledge injection and evaluation of Large Language Model...

arXiv - AI · 3 min · about 2 months ago

Llms

[2603.04444] vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models

Abstract page for arXiv paper 2603.04444: vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.04436] ZorBA: Zeroth-order Federated Fine-tuning of LLMs with Heterogeneous Block Activation

Abstract page for arXiv paper 2603.04436: ZorBA: Zeroth-order Federated Fine-tuning of LLMs with Heterogeneous Block Activation

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.04443] AMV-L: Lifecycle-Managed Agent Memory for Tail-Latency Control in Long-Running LLM Systems

Abstract page for arXiv paper 2603.04443: AMV-L: Lifecycle-Managed Agent Memory for Tail-Latency Control in Long-Running LLM Systems

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.04429] What Is Missing: Interpretable Ratings for Large Language Model Outputs

Abstract page for arXiv paper 2603.04429: What Is Missing: Interpretable Ratings for Large Language Model Outputs

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.04428] Agent Memory Below the Prompt: Persistent Q4 KV Cache for Multi-Agent LLM Inference on Edge Devices

Abstract page for arXiv paper 2603.04428: Agent Memory Below the Prompt: Persistent Q4 KV Cache for Multi-Agent LLM Inference on Edge Dev...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.04421] Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?

Abstract page for arXiv paper 2603.04421: Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?

arXiv - AI · 3 min · about 2 months ago

Llms

[2603.04419] Context-Dependent Affordance Computation in Vision-Language Models

Abstract page for arXiv paper 2603.04419: Context-Dependent Affordance Computation in Vision-Language Models

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.04413] Simulating Meaning, Nevermore! Introducing ICR: A Semiotic-Hermeneutic Metric for Evaluating Meaning in LLM Text Summaries

Abstract page for arXiv paper 2603.04413: Simulating Meaning, Nevermore! Introducing ICR: A Semiotic-Hermeneutic Metric for Evaluating Me...

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.04411] One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache

Abstract page for arXiv paper 2603.04411: One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.04410] SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models

Abstract page for arXiv paper 2603.04410: SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models

arXiv - AI · 4 min · about 2 months ago

Previous Page 241 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

If AI is about to get 10x smarter, how do we prevent the internet from collapsing under synthetic noise?

Qwen3 4B outperforms cloud agents on code tasks—with Mahoraga research [R]

Associative memory system for LLMs that learns during inference [P]

All Content

[2603.04663] Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency Hallucination Detector

[2603.04597] Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

[2603.04474] From Spark to Fire: Modeling and Mitigating Error Cascades in LLM-Based Multi-Agent Collaboration

[2603.04464] Understanding the Dynamics of Demonstration Conflict in In-Context Learning

[2603.04459] Benchmark of Benchmarks: Unpacking Influence and Code Repository Quality in LLM Safety Benchmarks

[2603.04460] VSPrefill: Vertical-Slash Sparse Attention with Lightweight Indexing for Long-Context Prefilling

[2603.04455] Large Language Models as Bidding Agents in Repeated HetNet Auction

[2603.04454] Query Disambiguation via Answer-Free Context: Doubling Performance on Humanity's Last Exam

[2603.04453] Induced Numerical Instability: Hidden Costs in Multimodal Large Language Models

[2603.04452] A unified foundational framework for knowledge injection and evaluation of Large Language Models in Combustion Science

[2603.04444] vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models

[2603.04436] ZorBA: Zeroth-order Federated Fine-tuning of LLMs with Heterogeneous Block Activation

[2603.04443] AMV-L: Lifecycle-Managed Agent Memory for Tail-Latency Control in Long-Running LLM Systems

[2603.04429] What Is Missing: Interpretable Ratings for Large Language Model Outputs

[2603.04428] Agent Memory Below the Prompt: Persistent Q4 KV Cache for Multi-Agent LLM Inference on Edge Devices

[2603.04421] Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?

[2603.04419] Context-Dependent Affordance Computation in Vision-Language Models

[2603.04413] Simulating Meaning, Nevermore! Introducing ICR: A Semiotic-Hermeneutic Metric for Evaluating Meaning in LLM Text Summaries

[2603.04411] One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache

[2603.04410] SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models

Related Topics

Stay updated with AI News