Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

What does Gemini think of you?

I noticed that Gemini was referring back to a lot of queries I've made in the past and was using that knowledge to drive follow up prompt...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

This app helps you see what LLMs you can run on your hardware

Submitted by /u/dev_is_active

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

TRACER: Learn-to-Defer for LLM Classification with Formal Teacher-Agreement Guarantees

I'm releasing TRACER (Trace-Based Adaptive Cost-Efficient Routing), a library for learning cost-efficient routing policies from LLM trace...

Reddit - Machine Learning · 1 min · about 2 hours ago

All Content

Llms

[2603.22882] TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration

arXiv - Machine Learning · 4 min · 5 days ago

Llms

[2603.22784] Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models

arXiv - Machine Learning · 4 min · 5 days ago

Llms

[2603.22295] Whether, Not Which: Mechanistic Interpretability Reveals Dissociable Affect Reception and Emotion Categorization in LLMs

arXiv - AI · 4 min · 5 days ago

Llms

[2603.22293] TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs

arXiv - AI · 3 min · 5 days ago

Llms

[2603.22289] MERIT: Memory-Enhanced Retrieval for Interpretable Knowledge Tracing

arXiv - AI · 3 min · 5 days ago

Llms

[2603.22713] Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Constraints

arXiv - Machine Learning · 4 min · 5 days ago

Llms

[2603.22288] Evaluating Prompting Strategies for Chart Question Answering with Large Language Models

arXiv - AI · 3 min · 5 days ago

Llms

[2603.22287] Founder effects shape the evolutionary dynamics of multimodality in open LLM families

arXiv - AI · 4 min · 5 days ago

Llms

[2502.04188] Automated Microservice Pattern Instance Detection Using Infrastructure-as-Code Artifacts and Large Language Models

arXiv - AI · 4 min · 5 days ago

Llms

[2603.23406] Beyond Preset Identities: How Agents Form Stances and Boundaries in Generative Societies

arXiv - AI · 4 min · 5 days ago

Llms

[2603.23346] RelayS2S: A Dual-Path Speculative Generation for Real-Time Dialogue

arXiv - AI · 4 min · 5 days ago

Llms

[2603.23292] LLM Olympiad: Why Model Evaluation Needs a Sealed Exam

arXiv - AI · 3 min · 5 days ago

Llms

[2603.22586] A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks

arXiv - Machine Learning · 3 min · 5 days ago

Llms

[2603.23234] MemCollab: Cross-Agent Memory Collaboration via Contrastive Trajectory Distillation

arXiv - AI · 4 min · 5 days ago

Llms

[2603.23231] PERMA: Benchmarking Personalized Memory Agents via Event-Driven Preference and Realistic Task Environments

arXiv - AI · 4 min · 5 days ago

Llms

[2603.23114] Between Rules and Reality: On the Context Sensitivity of LLM Moral Judgment

arXiv - AI · 3 min · 5 days ago

Llms

[2603.23085] MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models

arXiv - AI · 4 min · 5 days ago

Llms

[2603.22455] SkillRouter: Retrieve-and-Rerank Skill Selection for LLM Agents at Scale

arXiv - Machine Learning · 4 min · 5 days ago

Llms

[2603.23004] Can Large Language Models Reason and Optimize Under Constraints?

arXiv - AI · 3 min · 5 days ago

Llms

[2603.22978] JFTA-Bench: Evaluate LLM's Ability of Tracking and Analyzing Malfunctions Using Fault Trees

arXiv - AI · 3 min · 5 days ago
