Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Anthropic’s new cybersecurity model could get it back in the government’s good graces | The Verge
Llms

Anthropic’s new cybersecurity model could get it back in the government’s good graces | The Verge

After Anthropic announced Claude Mythos Preview, the Trump administration reportedly took notice. It may inspire change in the Anthropic-...

The Verge - AI · 6 min ·
Llms

What is the current landscape on AI agents knowledge

Recently used "free" rates codex to give me a quick fastapi project sample. It gave me deprecated (a)app.on_event("startup). What are you...

Reddit - Artificial Intelligence · 1 min ·
OpenAI Executive Kevin Weil Is Leaving the Company | WIRED
Llms

OpenAI Executive Kevin Weil Is Leaving the Company | WIRED

The former Instagram VP is departing the ChatGPT-maker, which is folding the AI science application he led into Codex.

Wired - AI · 5 min ·

All Content

[2603.04459] Benchmark of Benchmarks: Unpacking Influence and Code Repository Quality in LLM Safety Benchmarks
Llms

[2603.04459] Benchmark of Benchmarks: Unpacking Influence and Code Repository Quality in LLM Safety Benchmarks

Abstract page for arXiv paper 2603.04459: Benchmark of Benchmarks: Unpacking Influence and Code Repository Quality in LLM Safety Benchmarks

arXiv - AI · 4 min ·
[2603.04460] VSPrefill: Vertical-Slash Sparse Attention with Lightweight Indexing for Long-Context Prefilling
Llms

[2603.04460] VSPrefill: Vertical-Slash Sparse Attention with Lightweight Indexing for Long-Context Prefilling

Abstract page for arXiv paper 2603.04460: VSPrefill: Vertical-Slash Sparse Attention with Lightweight Indexing for Long-Context Prefilling

arXiv - Machine Learning · 3 min ·
[2603.04455] Large Language Models as Bidding Agents in Repeated HetNet Auction
Llms

[2603.04455] Large Language Models as Bidding Agents in Repeated HetNet Auction

Abstract page for arXiv paper 2603.04455: Large Language Models as Bidding Agents in Repeated HetNet Auction

arXiv - AI · 4 min ·
[2603.04454] Query Disambiguation via Answer-Free Context: Doubling Performance on Humanity's Last Exam
Llms

[2603.04454] Query Disambiguation via Answer-Free Context: Doubling Performance on Humanity's Last Exam

Abstract page for arXiv paper 2603.04454: Query Disambiguation via Answer-Free Context: Doubling Performance on Humanity's Last Exam

arXiv - AI · 3 min ·
[2603.04453] Induced Numerical Instability: Hidden Costs in Multimodal Large Language Models
Llms

[2603.04453] Induced Numerical Instability: Hidden Costs in Multimodal Large Language Models

Abstract page for arXiv paper 2603.04453: Induced Numerical Instability: Hidden Costs in Multimodal Large Language Models

arXiv - Machine Learning · 3 min ·
[2603.04452] A unified foundational framework for knowledge injection and evaluation of Large Language Models in Combustion Science
Llms

[2603.04452] A unified foundational framework for knowledge injection and evaluation of Large Language Models in Combustion Science

Abstract page for arXiv paper 2603.04452: A unified foundational framework for knowledge injection and evaluation of Large Language Model...

arXiv - AI · 3 min ·
[2603.04444] vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models
Llms

[2603.04444] vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models

Abstract page for arXiv paper 2603.04444: vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models

arXiv - AI · 4 min ·
[2603.04436] ZorBA: Zeroth-order Federated Fine-tuning of LLMs with Heterogeneous Block Activation
Llms

[2603.04436] ZorBA: Zeroth-order Federated Fine-tuning of LLMs with Heterogeneous Block Activation

Abstract page for arXiv paper 2603.04436: ZorBA: Zeroth-order Federated Fine-tuning of LLMs with Heterogeneous Block Activation

arXiv - Machine Learning · 4 min ·
[2603.04443] AMV-L: Lifecycle-Managed Agent Memory for Tail-Latency Control in Long-Running LLM Systems
Llms

[2603.04443] AMV-L: Lifecycle-Managed Agent Memory for Tail-Latency Control in Long-Running LLM Systems

Abstract page for arXiv paper 2603.04443: AMV-L: Lifecycle-Managed Agent Memory for Tail-Latency Control in Long-Running LLM Systems

arXiv - Machine Learning · 4 min ·
[2603.04429] What Is Missing: Interpretable Ratings for Large Language Model Outputs
Llms

[2603.04429] What Is Missing: Interpretable Ratings for Large Language Model Outputs

Abstract page for arXiv paper 2603.04429: What Is Missing: Interpretable Ratings for Large Language Model Outputs

arXiv - AI · 4 min ·
[2603.04428] Agent Memory Below the Prompt: Persistent Q4 KV Cache for Multi-Agent LLM Inference on Edge Devices
Llms

[2603.04428] Agent Memory Below the Prompt: Persistent Q4 KV Cache for Multi-Agent LLM Inference on Edge Devices

Abstract page for arXiv paper 2603.04428: Agent Memory Below the Prompt: Persistent Q4 KV Cache for Multi-Agent LLM Inference on Edge Dev...

arXiv - Machine Learning · 4 min ·
[2603.04421] Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?
Llms

[2603.04421] Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?

Abstract page for arXiv paper 2603.04421: Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?

arXiv - AI · 3 min ·
[2603.04419] Context-Dependent Affordance Computation in Vision-Language Models
Llms

[2603.04419] Context-Dependent Affordance Computation in Vision-Language Models

Abstract page for arXiv paper 2603.04419: Context-Dependent Affordance Computation in Vision-Language Models

arXiv - Machine Learning · 4 min ·
[2603.04413] Simulating Meaning, Nevermore! Introducing ICR: A Semiotic-Hermeneutic Metric for Evaluating Meaning in LLM Text Summaries
Llms

[2603.04413] Simulating Meaning, Nevermore! Introducing ICR: A Semiotic-Hermeneutic Metric for Evaluating Meaning in LLM Text Summaries

Abstract page for arXiv paper 2603.04413: Simulating Meaning, Nevermore! Introducing ICR: A Semiotic-Hermeneutic Metric for Evaluating Me...

arXiv - AI · 4 min ·
[2603.04411] One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache
Llms

[2603.04411] One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache

Abstract page for arXiv paper 2603.04411: One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache

arXiv - Machine Learning · 3 min ·
[2603.04410] SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models
Llms

[2603.04410] SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models

Abstract page for arXiv paper 2603.04410: SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models

arXiv - AI · 4 min ·
[2603.04409] Unpacking Human Preference for LLMs: Demographically Aware Evaluation with the HUMAINE Framework
Llms

[2603.04409] Unpacking Human Preference for LLMs: Demographically Aware Evaluation with the HUMAINE Framework

Abstract page for arXiv paper 2603.04409: Unpacking Human Preference for LLMs: Demographically Aware Evaluation with the HUMAINE Framework

arXiv - AI · 4 min ·
[2603.04406] CTRL-RAG: Contrastive Likelihood Reward Based Reinforcement Learning for Context-Faithful RAG Models
Llms

[2603.04406] CTRL-RAG: Contrastive Likelihood Reward Based Reinforcement Learning for Context-Faithful RAG Models

Abstract page for arXiv paper 2603.04406: CTRL-RAG: Contrastive Likelihood Reward Based Reinforcement Learning for Context-Faithful RAG M...

arXiv - AI · 4 min ·
[2603.04407] Semantic Containment as a Fundamental Property of Emergent Misalignment
Llms

[2603.04407] Semantic Containment as a Fundamental Property of Emergent Misalignment

Abstract page for arXiv paper 2603.04407: Semantic Containment as a Fundamental Property of Emergent Misalignment

arXiv - AI · 3 min ·
[2603.04405] Lost in Translation: How Language Re-Aligns Vision for Cross-Species Pathology
Llms

[2603.04405] Lost in Translation: How Language Re-Aligns Vision for Cross-Species Pathology

Abstract page for arXiv paper 2603.04405: Lost in Translation: How Language Re-Aligns Vision for Cross-Species Pathology

arXiv - Machine Learning · 4 min ·
Previous Page 188 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime