Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

[2603.17839] How do LLMs Compute Verbal Confidence
arXiv - AI · 4 min

[2603.15970] 100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models
arXiv - AI · 4 min

[2603.10062] Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead
arXiv - AI · 3 min

All Content

[2603.20969] Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning over Pretraining Knowledge
arXiv - Machine Learning · 4 min

[2603.20921] Discriminative Representation Learning for Clinical Prediction
arXiv - Machine Learning · 3 min

[2603.20910] LLM-ODE: Data-driven Discovery of Dynamical Systems with Large Language Models
arXiv - Machine Learning · 3 min

[2603.20825] Cross-Granularity Representations for Biological Sequences: Insights from ESM and BiGCARP
arXiv - Machine Learning · 4 min

[2603.20632] Optimal low-rank stochastic gradient estimation for LLM training
arXiv - Machine Learning · 3 min

[2603.20587] Neural collapse in the orthoplex regime
arXiv - Machine Learning · 3 min

[2603.20572] LJ-Bench: Ontology-Based Benchmark for U.S. Crime
arXiv - Machine Learning · 3 min

[2603.20538] Understanding Behavior Cloning with Action Quantization
arXiv - Machine Learning · 3 min

[2603.20492] AE-LLM: Adaptive Efficiency Optimization for Large Language Models
arXiv - Machine Learning · 4 min

[2603.20405] Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP
arXiv - Machine Learning · 3 min

[2603.19225] FinTradeBench: A Financial Reasoning Benchmark for LLMs
arXiv - AI · 4 min

[2603.19220] Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
arXiv - Machine Learning · 4 min

[2603.18873] Evaluating LLM-Generated Lessons from the Language Learning Students' Perspective: A Short Case Study on Duolingo
arXiv - AI · 4 min

[2603.18415] The Spillover Effects of Peer AI Rinsing on Corporate Green Innovation
arXiv - AI · 4 min

[2603.17775] CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution
arXiv - Machine Learning · 4 min

[2603.17655] Interpretable Cross-Domain Few-Shot Learning with Rectified Target-Domain Local Alignment
arXiv - AI · 4 min

[2603.16960] Adversarial attacks against Modern Vision-Language Models
arXiv - AI · 3 min

[2603.14635] Compute Allocation for Reasoning-Intensive Retrieval Agents
arXiv - AI · 3 min

[2603.16065] Large Reward Models: Generalizable Online Robot Reward Generation with Vision-Language Models
arXiv - AI · 4 min

[2603.14672] Seamless Deception: Larger Language Models Are Better Knowledge Concealers
arXiv - AI · 3 min