Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

I am seeing Claude everywhere

Every single Instagram reel or TikTok I scroll i see people mentioning Claude and glazing it like it’s some kind of master tool that’s be...

Reddit - Artificial Intelligence · 1 min ·
Llms

Claude Opus 4.6 API at 40% below Anthropic pricing – try free before you pay anything

Hey everyone I've set up a self-hosted API gateway using [New-API](QuantumNous/new-ap) to manage and distribute Claude Opus 4.6 access ac...

Reddit - Artificial Intelligence · 1 min ·
Hackers Are Posting the Claude Code Leak With Bonus Malware | WIRED
Llms

Hackers Are Posting the Claude Code Leak With Bonus Malware | WIRED

Plus: The FBI says a recent hack of its wiretap tools poses a national security risk, attackers stole Cisco source code as part of an ong...

Wired - AI · 9 min ·

All Content

[2603.21636] Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks
Llms

[2603.21636] Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks

Abstract page for arXiv paper 2603.21636: Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confide...

arXiv - AI · 4 min ·
[2603.21630] EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises
Llms

[2603.21630] EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises

Abstract page for arXiv paper 2603.21630: EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises

arXiv - AI · 3 min ·
[2603.21607] INTRYGUE: Induction-Aware Entropy Gating for Reliable RAG Uncertainty Estimation
Llms

[2603.21607] INTRYGUE: Induction-Aware Entropy Gating for Reliable RAG Uncertainty Estimation

Abstract page for arXiv paper 2603.21607: INTRYGUE: Induction-Aware Entropy Gating for Reliable RAG Uncertainty Estimation

arXiv - AI · 3 min ·
[2603.21597] A Multidisciplinary AI Board for Multimodal Dementia Characterization and Risk Assessment
Llms

[2603.21597] A Multidisciplinary AI Board for Multimodal Dementia Characterization and Risk Assessment

Abstract page for arXiv paper 2603.21597: A Multidisciplinary AI Board for Multimodal Dementia Characterization and Risk Assessment

arXiv - AI · 4 min ·
[2603.21574] Adaptive Robust Estimator for Multi-Agent Reinforcement Learning
Llms

[2603.21574] Adaptive Robust Estimator for Multi-Agent Reinforcement Learning

Abstract page for arXiv paper 2603.21574: Adaptive Robust Estimator for Multi-Agent Reinforcement Learning

arXiv - AI · 3 min ·
[2603.21577] Mind over Space: Can Multimodal Large Language Models Mentally Navigate?
Llms

[2603.21577] Mind over Space: Can Multimodal Large Language Models Mentally Navigate?

Abstract page for arXiv paper 2603.21577: Mind over Space: Can Multimodal Large Language Models Mentally Navigate?

arXiv - AI · 4 min ·
[2603.21563] Counterfactual Credit Policy Optimization for Multi-Agent Collaboration
Llms

[2603.21563] Counterfactual Credit Policy Optimization for Multi-Agent Collaboration

Abstract page for arXiv paper 2603.21563: Counterfactual Credit Policy Optimization for Multi-Agent Collaboration

arXiv - AI · 3 min ·
[2603.21430] DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation
Llms

[2603.21430] DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation

Abstract page for arXiv paper 2603.21430: DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation

arXiv - AI · 4 min ·
[2603.21415] Silent Commitment Failure in Instruction-Tuned Language Models: Evidence of Governability Divergence Across Architectures
Llms

[2603.21415] Silent Commitment Failure in Instruction-Tuned Language Models: Evidence of Governability Divergence Across Architectures

Abstract page for arXiv paper 2603.21415: Silent Commitment Failure in Instruction-Tuned Language Models: Evidence of Governability Diver...

arXiv - Machine Learning · 4 min ·
[2603.21398] Persona Vectors in Games: Measuring and Steering Strategies via Activation Vectors
Llms

[2603.21398] Persona Vectors in Games: Measuring and Steering Strategies via Activation Vectors

Abstract page for arXiv paper 2603.21398: Persona Vectors in Games: Measuring and Steering Strategies via Activation Vectors

arXiv - AI · 3 min ·
[2603.21376] A transformer architecture alteration to incentivise externalised reasoning
Llms

[2603.21376] A transformer architecture alteration to incentivise externalised reasoning

Abstract page for arXiv paper 2603.21376: A transformer architecture alteration to incentivise externalised reasoning

arXiv - AI · 3 min ·
[2603.21362] AdaRubric: Task-Adaptive Rubrics for LLM Agent Evaluation
Llms

[2603.21362] AdaRubric: Task-Adaptive Rubrics for LLM Agent Evaluation

Abstract page for arXiv paper 2603.21362: AdaRubric: Task-Adaptive Rubrics for LLM Agent Evaluation

arXiv - AI · 3 min ·
[2603.21357] AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling
Llms

[2603.21357] AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling

Abstract page for arXiv paper 2603.21357: AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling

arXiv - AI · 4 min ·
[2603.21341] RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models
Llms

[2603.21341] RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models

Abstract page for arXiv paper 2603.21341: RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action...

arXiv - AI · 4 min ·
[2603.21321] Improving Coherence and Persistence in Agentic AI for System Optimization
Llms

[2603.21321] Improving Coherence and Persistence in Agentic AI for System Optimization

Abstract page for arXiv paper 2603.21321: Improving Coherence and Persistence in Agentic AI for System Optimization

arXiv - AI · 4 min ·
[2603.21250] Graph of States: Solving Abductive Tasks with Large Language Models
Llms

[2603.21250] Graph of States: Solving Abductive Tasks with Large Language Models

Abstract page for arXiv paper 2603.21250: Graph of States: Solving Abductive Tasks with Large Language Models

arXiv - AI · 3 min ·
[2603.21237] ConsRoute:Consistency-Aware Adaptive Query Routing for Cloud-Edge-Device Large Language Models
Llms

[2603.21237] ConsRoute:Consistency-Aware Adaptive Query Routing for Cloud-Edge-Device Large Language Models

Abstract page for arXiv paper 2603.21237: ConsRoute:Consistency-Aware Adaptive Query Routing for Cloud-Edge-Device Large Language Models

arXiv - AI · 4 min ·
[2603.21162] Revisiting Tree Search for LLMs: Gumbel and Sequential Halving for Budget-Scalable Reasoning
Llms

[2603.21162] Revisiting Tree Search for LLMs: Gumbel and Sequential Halving for Budget-Scalable Reasoning

Abstract page for arXiv paper 2603.21162: Revisiting Tree Search for LLMs: Gumbel and Sequential Halving for Budget-Scalable Reasoning

arXiv - Machine Learning · 3 min ·
[2603.21155] Can LLMs Fool Graph Learning? Exploring Universal Adversarial Attacks on Text-Attributed Graphs
Llms

[2603.21155] Can LLMs Fool Graph Learning? Exploring Universal Adversarial Attacks on Text-Attributed Graphs

Abstract page for arXiv paper 2603.21155: Can LLMs Fool Graph Learning? Exploring Universal Adversarial Attacks on Text-Attributed Graphs

arXiv - AI · 4 min ·
[2603.21140] ORACLE: Optimizing Reasoning Abilities of Large Language Models via Constraint-Led Synthetic Data Elicitation
Llms

[2603.21140] ORACLE: Optimizing Reasoning Abilities of Large Language Models via Constraint-Led Synthetic Data Elicitation

Abstract page for arXiv paper 2603.21140: ORACLE: Optimizing Reasoning Abilities of Large Language Models via Constraint-Led Synthetic Da...

arXiv - AI · 4 min ·
Previous Page 72 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime