Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Have Companies Began Adopting Claude Co-Work at an Enterprise Level?

Hi Guys, My company is considering purchasing the Claude Enterprise plan. The main two constraints are: - Being able to block usage of Cl...

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

Llms

What I learned about multi-agent coordination running 9 specialized Claude agents

I've been experimenting with multi-agent AI systems and ended up building something more ambitious than I originally planned: a fully ope...

Reddit - Artificial Intelligence · 1 min · about 6 hours ago

Llms

[D] The problem with comparing AI memory system benchmarks — different evaluation methods make scores meaningless

I've been reviewing how various AI memory systems evaluate their performance and noticed a fundamental issue with cross-system comparison...

Reddit - Machine Learning · 1 min · about 6 hours ago

All Content

Llms

[2603.21105] ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models

Abstract page for arXiv paper 2603.21105: ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Lan...

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.21014] CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs

Abstract page for arXiv paper 2603.21014: CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2603.20969] Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning over Pretraining Knowledge

Abstract page for arXiv paper 2603.20969: Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning ov...

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.20921] Discriminative Representation Learning for Clinical Prediction

Abstract page for arXiv paper 2603.20921: Discriminative Representation Learning for Clinical Prediction

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2603.20910] LLM-ODE: Data-driven Discovery of Dynamical Systems with Large Language Models

Abstract page for arXiv paper 2603.20910: LLM-ODE: Data-driven Discovery of Dynamical Systems with Large Language Models

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2603.20825] Cross-Granularity Representations for Biological Sequences: Insights from ESM and BiGCARP

Abstract page for arXiv paper 2603.20825: Cross-Granularity Representations for Biological Sequences: Insights from ESM and BiGCARP

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.20632] Optimal low-rank stochastic gradient estimation for LLM training

Abstract page for arXiv paper 2603.20632: Optimal low-rank stochastic gradient estimation for LLM training

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2603.20587] Neural collapse in the orthoplex regime

Abstract page for arXiv paper 2603.20587: Neural collapse in the orthoplex regime

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2603.20572] LJ-Bench: Ontology-Based Benchmark for U.S. Crime

Abstract page for arXiv paper 2603.20572: LJ-Bench: Ontology-Based Benchmark for U.S. Crime

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2603.20538] Understanding Behavior Cloning with Action Quantization

Abstract page for arXiv paper 2603.20538: Understanding Behavior Cloning with Action Quantization

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2603.20492] AE-LLM: Adaptive Efficiency Optimization for Large Language Models

Abstract page for arXiv paper 2603.20492: AE-LLM: Adaptive Efficiency Optimization for Large Language Models

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.20405] Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

Abstract page for arXiv paper 2603.20405: Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2603.19225] FinTradeBench: A Financial Reasoning Benchmark for LLMs

Abstract page for arXiv paper 2603.19225: FinTradeBench: A Financial Reasoning Benchmark for LLMs

arXiv - AI · 4 min · 8 days ago

Llms

[2603.19220] Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Abstract page for arXiv paper 2603.19220: Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.18873] Evaluating LLM-Generated Lessons from the Language Learning Students' Perspective: A Short Case Study on Duolingo

Abstract page for arXiv paper 2603.18873: Evaluating LLM-Generated Lessons from the Language Learning Students' Perspective: A Short Case...

arXiv - AI · 4 min · 8 days ago

Llms

[2603.18415] The Spillover Effects of Peer AI Rinsing on Corporate Green Innovation

Abstract page for arXiv paper 2603.18415: The Spillover Effects of Peer AI Rinsing on Corporate Green Innovation

arXiv - AI · 4 min · 8 days ago

Llms

[2603.17775] CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution

Abstract page for arXiv paper 2603.17775: CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.17655] Interpretable Cross-Domain Few-Shot Learning with Rectified Target-Domain Local Alignment

Abstract page for arXiv paper 2603.17655: Interpretable Cross-Domain Few-Shot Learning with Rectified Target-Domain Local Alignment

arXiv - AI · 4 min · 8 days ago

Llms

[2603.16960] Adversarial attacks against Modern Vision-Language Models

Abstract page for arXiv paper 2603.16960: Adversarial attacks against Modern Vision-Language Models

arXiv - AI · 3 min · 8 days ago

Llms

[2603.14635] Compute Allocation for Reasoning-Intensive Retrieval Agents

Abstract page for arXiv paper 2603.14635: Compute Allocation for Reasoning-Intensive Retrieval Agents

arXiv - AI · 3 min · 8 days ago

Previous Page 44 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Have Companies Began Adopting Claude Co-Work at an Enterprise Level?

What I learned about multi-agent coordination running 9 specialized Claude agents

[D] The problem with comparing AI memory system benchmarks — different evaluation methods make scores meaningless

All Content

[2603.21105] ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models

[2603.21014] CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs

[2603.20969] Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning over Pretraining Knowledge

[2603.20921] Discriminative Representation Learning for Clinical Prediction

[2603.20910] LLM-ODE: Data-driven Discovery of Dynamical Systems with Large Language Models

[2603.20825] Cross-Granularity Representations for Biological Sequences: Insights from ESM and BiGCARP

[2603.20632] Optimal low-rank stochastic gradient estimation for LLM training

[2603.20587] Neural collapse in the orthoplex regime

[2603.20572] LJ-Bench: Ontology-Based Benchmark for U.S. Crime

[2603.20538] Understanding Behavior Cloning with Action Quantization

[2603.20492] AE-LLM: Adaptive Efficiency Optimization for Large Language Models

[2603.20405] Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

[2603.19225] FinTradeBench: A Financial Reasoning Benchmark for LLMs

[2603.19220] Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

[2603.18873] Evaluating LLM-Generated Lessons from the Language Learning Students' Perspective: A Short Case Study on Duolingo

[2603.18415] The Spillover Effects of Peer AI Rinsing on Corporate Green Innovation

[2603.17775] CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution

[2603.17655] Interpretable Cross-Domain Few-Shot Learning with Rectified Target-Domain Local Alignment

[2603.16960] Adversarial attacks against Modern Vision-Language Models

[2603.14635] Compute Allocation for Reasoning-Intensive Retrieval Agents

Related Topics

Stay updated with AI News