Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

Building knowledge bases from YouTube data using LLMs -- my workflow after 52 guides

I've been building a system that turns YouTube channels into structured knowledge bases. Thought I'd share the workflow since Karpathy's ...

Reddit - Artificial Intelligence · 1 min ·
What is AI, how do apps like ChatGPT work and why are there concerns?
Llms

What is AI, how do apps like ChatGPT work and why are there concerns?

AI is transforming modern life, but some critics worry about its potential misuse and environmental impact.

AI News - General · 7 min ·
[2603.29957] Think Anywhere in Code Generation
Llms

[2603.29957] Think Anywhere in Code Generation

Abstract page for arXiv paper 2603.29957: Think Anywhere in Code Generation

arXiv - Machine Learning · 3 min ·

All Content

[2603.22784] Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models
Llms

[2603.22784] Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models

Abstract page for arXiv paper 2603.22784: Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models

arXiv - Machine Learning · 4 min ·
[2603.22295] Whether, Not Which: Mechanistic Interpretability Reveals Dissociable Affect Reception and Emotion Categorization in LLMs
Llms

[2603.22295] Whether, Not Which: Mechanistic Interpretability Reveals Dissociable Affect Reception and Emotion Categorization in LLMs

Abstract page for arXiv paper 2603.22295: Whether, Not Which: Mechanistic Interpretability Reveals Dissociable Affect Reception and Emoti...

arXiv - AI · 4 min ·
[2603.22293] TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs
Llms

[2603.22293] TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs

Abstract page for arXiv paper 2603.22293: TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs

arXiv - AI · 3 min ·
[2603.22289] MERIT: Memory-Enhanced Retrieval for Interpretable Knowledge Tracing
Llms

[2603.22289] MERIT: Memory-Enhanced Retrieval for Interpretable Knowledge Tracing

Abstract page for arXiv paper 2603.22289: MERIT: Memory-Enhanced Retrieval for Interpretable Knowledge Tracing

arXiv - AI · 3 min ·
[2603.22713] Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Constraints
Llms

[2603.22713] Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Constraints

Abstract page for arXiv paper 2603.22713: Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Con...

arXiv - Machine Learning · 4 min ·
[2603.22288] Evaluating Prompting Strategies for Chart Question Answering with Large Language Models
Llms

[2603.22288] Evaluating Prompting Strategies for Chart Question Answering with Large Language Models

Abstract page for arXiv paper 2603.22288: Evaluating Prompting Strategies for Chart Question Answering with Large Language Models

arXiv - AI · 3 min ·
[2603.22287] Founder effects shape the evolutionary dynamics of multimodality in open LLM families
Llms

[2603.22287] Founder effects shape the evolutionary dynamics of multimodality in open LLM families

Abstract page for arXiv paper 2603.22287: Founder effects shape the evolutionary dynamics of multimodality in open LLM families

arXiv - AI · 4 min ·
[2502.04188] Automated Microservice Pattern Instance Detection Using Infrastructure-as-Code Artifacts and Large Language Models
Llms

[2502.04188] Automated Microservice Pattern Instance Detection Using Infrastructure-as-Code Artifacts and Large Language Models

Abstract page for arXiv paper 2502.04188: Automated Microservice Pattern Instance Detection Using Infrastructure-as-Code Artifacts and La...

arXiv - AI · 4 min ·
[2603.23406] Beyond Preset Identities: How Agents Form Stances and Boundaries in Generative Societies
Llms

[2603.23406] Beyond Preset Identities: How Agents Form Stances and Boundaries in Generative Societies

Abstract page for arXiv paper 2603.23406: Beyond Preset Identities: How Agents Form Stances and Boundaries in Generative Societies

arXiv - AI · 4 min ·
[2603.23346] RelayS2S: A Dual-Path Speculative Generation for Real-Time Dialogue
Llms

[2603.23346] RelayS2S: A Dual-Path Speculative Generation for Real-Time Dialogue

Abstract page for arXiv paper 2603.23346: RelayS2S: A Dual-Path Speculative Generation for Real-Time Dialogue

arXiv - AI · 4 min ·
[2603.23292] LLM Olympiad: Why Model Evaluation Needs a Sealed Exam
Llms

[2603.23292] LLM Olympiad: Why Model Evaluation Needs a Sealed Exam

Abstract page for arXiv paper 2603.23292: LLM Olympiad: Why Model Evaluation Needs a Sealed Exam

arXiv - AI · 3 min ·
[2603.22586] A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks
Llms

[2603.22586] A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks

Abstract page for arXiv paper 2603.22586: A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks

arXiv - Machine Learning · 3 min ·
[2603.23234] MemCollab: Cross-Agent Memory Collaboration via Contrastive Trajectory Distillation
Llms

[2603.23234] MemCollab: Cross-Agent Memory Collaboration via Contrastive Trajectory Distillation

Abstract page for arXiv paper 2603.23234: MemCollab: Cross-Agent Memory Collaboration via Contrastive Trajectory Distillation

arXiv - AI · 4 min ·
[2603.23231] PERMA: Benchmarking Personalized Memory Agents via Event-Driven Preference and Realistic Task Environments
Llms

[2603.23231] PERMA: Benchmarking Personalized Memory Agents via Event-Driven Preference and Realistic Task Environments

Abstract page for arXiv paper 2603.23231: PERMA: Benchmarking Personalized Memory Agents via Event-Driven Preference and Realistic Task E...

arXiv - AI · 4 min ·
[2603.23114] Between Rules and Reality: On the Context Sensitivity of LLM Moral Judgment
Llms

[2603.23114] Between Rules and Reality: On the Context Sensitivity of LLM Moral Judgment

Abstract page for arXiv paper 2603.23114: Between Rules and Reality: On the Context Sensitivity of LLM Moral Judgment

arXiv - AI · 3 min ·
[2603.23085] MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models
Llms

[2603.23085] MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models

Abstract page for arXiv paper 2603.23085: MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Langu...

arXiv - AI · 4 min ·
[2603.22455] SkillRouter: Retrieve-and-Rerank Skill Selection for LLM Agents at Scale
Llms

[2603.22455] SkillRouter: Retrieve-and-Rerank Skill Selection for LLM Agents at Scale

Abstract page for arXiv paper 2603.22455: SkillRouter: Retrieve-and-Rerank Skill Selection for LLM Agents at Scale

arXiv - Machine Learning · 4 min ·
[2603.23004] Can Large Language Models Reason and Optimize Under Constraints?
Llms

[2603.23004] Can Large Language Models Reason and Optimize Under Constraints?

Abstract page for arXiv paper 2603.23004: Can Large Language Models Reason and Optimize Under Constraints?

arXiv - AI · 3 min ·
[2603.22978] JFTA-Bench: Evaluate LLM's Ability of Tracking and Analyzing Malfunctions Using Fault Trees
Llms

[2603.22978] JFTA-Bench: Evaluate LLM's Ability of Tracking and Analyzing Malfunctions Using Fault Trees

Abstract page for arXiv paper 2603.22978: JFTA-Bench: Evaluate LLM's Ability of Tracking and Analyzing Malfunctions Using Fault Trees

arXiv - AI · 3 min ·
[2603.22942] Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning
Llms

[2603.22942] Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning

Abstract page for arXiv paper 2603.22942: Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning

arXiv - AI · 3 min ·
Previous Page 55 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime