Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Building knowledge bases from YouTube data using LLMs -- my workflow after 52 guides

I've been building a system that turns YouTube channels into structured knowledge bases. Thought I'd share the workflow since Karpathy's ...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

What is AI, how do apps like ChatGPT work and why are there concerns?

AI is transforming modern life, but some critics worry about its potential misuse and environmental impact.

AI News - General · 7 min · about 5 hours ago

[2603.29957] Think Anywhere in Code Generation

Abstract page for arXiv paper 2603.29957: Think Anywhere in Code Generation

arXiv - Machine Learning · 3 min · about 8 hours ago

All Content

[2603.22784] Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models

Abstract page for arXiv paper 2603.22784: Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models

arXiv - Machine Learning · 4 min · 9 days ago

[2603.22295] Whether, Not Which: Mechanistic Interpretability Reveals Dissociable Affect Reception and Emotion Categorization in LLMs

Abstract page for arXiv paper 2603.22295: Whether, Not Which: Mechanistic Interpretability Reveals Dissociable Affect Reception and Emoti...

arXiv - AI · 4 min · 9 days ago

[2603.22293] TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs

Abstract page for arXiv paper 2603.22293: TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs

arXiv - AI · 3 min · 9 days ago

[2603.22289] MERIT: Memory-Enhanced Retrieval for Interpretable Knowledge Tracing

Abstract page for arXiv paper 2603.22289: MERIT: Memory-Enhanced Retrieval for Interpretable Knowledge Tracing

arXiv - AI · 3 min · 9 days ago

[2603.22713] Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Constraints

Abstract page for arXiv paper 2603.22713: Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Con...

arXiv - Machine Learning · 4 min · 9 days ago

[2603.22288] Evaluating Prompting Strategies for Chart Question Answering with Large Language Models

Abstract page for arXiv paper 2603.22288: Evaluating Prompting Strategies for Chart Question Answering with Large Language Models

arXiv - AI · 3 min · 9 days ago

[2603.22287] Founder effects shape the evolutionary dynamics of multimodality in open LLM families

Abstract page for arXiv paper 2603.22287: Founder effects shape the evolutionary dynamics of multimodality in open LLM families

arXiv - AI · 4 min · 9 days ago

[2502.04188] Automated Microservice Pattern Instance Detection Using Infrastructure-as-Code Artifacts and Large Language Models

Abstract page for arXiv paper 2502.04188: Automated Microservice Pattern Instance Detection Using Infrastructure-as-Code Artifacts and La...

arXiv - AI · 4 min · 9 days ago

[2603.23406] Beyond Preset Identities: How Agents Form Stances and Boundaries in Generative Societies

Abstract page for arXiv paper 2603.23406: Beyond Preset Identities: How Agents Form Stances and Boundaries in Generative Societies

arXiv - AI · 4 min · 9 days ago

[2603.23346] RelayS2S: A Dual-Path Speculative Generation for Real-Time Dialogue

Abstract page for arXiv paper 2603.23346: RelayS2S: A Dual-Path Speculative Generation for Real-Time Dialogue

arXiv - AI · 4 min · 9 days ago

[2603.23292] LLM Olympiad: Why Model Evaluation Needs a Sealed Exam

Abstract page for arXiv paper 2603.23292: LLM Olympiad: Why Model Evaluation Needs a Sealed Exam

arXiv - AI · 3 min · 9 days ago

[2603.22586] A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks

Abstract page for arXiv paper 2603.22586: A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks

arXiv - Machine Learning · 3 min · 9 days ago

[2603.23234] MemCollab: Cross-Agent Memory Collaboration via Contrastive Trajectory Distillation

Abstract page for arXiv paper 2603.23234: MemCollab: Cross-Agent Memory Collaboration via Contrastive Trajectory Distillation

arXiv - AI · 4 min · 9 days ago

[2603.23231] PERMA: Benchmarking Personalized Memory Agents via Event-Driven Preference and Realistic Task Environments

Abstract page for arXiv paper 2603.23231: PERMA: Benchmarking Personalized Memory Agents via Event-Driven Preference and Realistic Task E...

arXiv - AI · 4 min · 9 days ago

[2603.23114] Between Rules and Reality: On the Context Sensitivity of LLM Moral Judgment

Abstract page for arXiv paper 2603.23114: Between Rules and Reality: On the Context Sensitivity of LLM Moral Judgment

arXiv - AI · 3 min · 9 days ago

[2603.23085] MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models

Abstract page for arXiv paper 2603.23085: MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Langu...

arXiv - AI · 4 min · 9 days ago

[2603.22455] SkillRouter: Retrieve-and-Rerank Skill Selection for LLM Agents at Scale

Abstract page for arXiv paper 2603.22455: SkillRouter: Retrieve-and-Rerank Skill Selection for LLM Agents at Scale

arXiv - Machine Learning · 4 min · 9 days ago

[2603.23004] Can Large Language Models Reason and Optimize Under Constraints?

Abstract page for arXiv paper 2603.23004: Can Large Language Models Reason and Optimize Under Constraints?

arXiv - AI · 3 min · 9 days ago

[2603.22978] JFTA-Bench: Evaluate LLM's Ability of Tracking and Analyzing Malfunctions Using Fault Trees

Abstract page for arXiv paper 2603.22978: JFTA-Bench: Evaluate LLM's Ability of Tracking and Analyzing Malfunctions Using Fault Trees

arXiv - AI · 3 min · 9 days ago

[2603.22942] Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning

Abstract page for arXiv paper 2603.22942: Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning

arXiv - AI · 3 min · 9 days ago
