Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

Building knowledge bases from YouTube data using LLMs -- my workflow after 52 guides

I've been building a system that turns YouTube channels into structured knowledge bases. Thought I'd share the workflow since Karpathy's ...

Reddit - Artificial Intelligence · 1 min ·
What is AI, how do apps like ChatGPT work and why are there concerns?
Llms

What is AI, how do apps like ChatGPT work and why are there concerns?

AI is transforming modern life, but some critics worry about its potential misuse and environmental impact.

AI News - General · 7 min ·
[2603.29957] Think Anywhere in Code Generation
Llms

[2603.29957] Think Anywhere in Code Generation

Abstract page for arXiv paper 2603.29957: Think Anywhere in Code Generation

arXiv - Machine Learning · 3 min ·

All Content

[2603.22376] AI Co-Scientist for Ranking: Discovering Novel Search Ranking Models alongside LLM-based AI Agents with Cloud Computing Access
Llms

[2603.22376] AI Co-Scientist for Ranking: Discovering Novel Search Ranking Models alongside LLM-based AI Agents with Cloud Computing Access

Abstract page for arXiv paper 2603.22376: AI Co-Scientist for Ranking: Discovering Novel Search Ranking Models alongside LLM-based AI Age...

arXiv - AI · 4 min ·
[2603.23461] End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions
Llms

[2603.23461] End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions

Abstract page for arXiv paper 2603.23461: End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions

arXiv - Machine Learning · 3 min ·
[2603.23414] SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling
Llms

[2603.23414] SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling

Abstract page for arXiv paper 2603.23414: SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling

arXiv - AI · 4 min ·
[2603.22368] When Visuals Aren't the Problem: Evaluating Vision-Language Models on Misleading Data Visualizations
Llms

[2603.22368] When Visuals Aren't the Problem: Evaluating Vision-Language Models on Misleading Data Visualizations

Abstract page for arXiv paper 2603.22368: When Visuals Aren't the Problem: Evaluating Vision-Language Models on Misleading Data Visualiza...

arXiv - AI · 4 min ·
[2603.23355] Off-Policy Value-Based Reinforcement Learning for Large Language Models
Llms

[2603.23355] Off-Policy Value-Based Reinforcement Learning for Large Language Models

Abstract page for arXiv paper 2603.23355: Off-Policy Value-Based Reinforcement Learning for Large Language Models

arXiv - Machine Learning · 3 min ·
[2603.22367] Reasoner-Executor-Synthesizer: Scalable Agentic Architecture with Static O(1) Context Window
Llms

[2603.22367] Reasoner-Executor-Synthesizer: Scalable Agentic Architecture with Static O(1) Context Window

Abstract page for arXiv paper 2603.22367: Reasoner-Executor-Synthesizer: Scalable Agentic Architecture with Static O(1) Context Window

arXiv - AI · 3 min ·
[2603.23268] SafeSeek: Universal Attribution of Safety Circuits in Language Models
Llms

[2603.23268] SafeSeek: Universal Attribution of Safety Circuits in Language Models

Abstract page for arXiv paper 2603.23268: SafeSeek: Universal Attribution of Safety Circuits in Language Models

arXiv - AI · 4 min ·
[2603.22363] Early Discoveries of Algorithmist I: Promise of Provable Algorithm Synthesis at Scale
Llms

[2603.22363] Early Discoveries of Algorithmist I: Promise of Provable Algorithm Synthesis at Scale

Abstract page for arXiv paper 2603.22363: Early Discoveries of Algorithmist I: Promise of Provable Algorithm Synthesis at Scale

arXiv - AI · 4 min ·
[2603.22341] T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search
Llms

[2603.22341] T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

Abstract page for arXiv paper 2603.22341: T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

arXiv - AI · 3 min ·
[2603.23198] Sparser, Faster, Lighter Transformer Language Models
Llms

[2603.23198] Sparser, Faster, Lighter Transformer Language Models

Abstract page for arXiv paper 2603.23198: Sparser, Faster, Lighter Transformer Language Models

arXiv - Machine Learning · 3 min ·
[2603.22335] Causal Direct Preference Optimization for Distributionally Robust Generative Recommendation
Llms

[2603.22335] Causal Direct Preference Optimization for Distributionally Robust Generative Recommendation

Abstract page for arXiv paper 2603.22335: Causal Direct Preference Optimization for Distributionally Robust Generative Recommendation

arXiv - AI · 3 min ·
[2603.23173] A Schrödinger Eigenfunction Method for Long-Horizon Stochastic Optimal Control
Llms

[2603.23173] A Schrödinger Eigenfunction Method for Long-Horizon Stochastic Optimal Control

Abstract page for arXiv paper 2603.23173: A Schrödinger Eigenfunction Method for Long-Horizon Stochastic Optimal Control

arXiv - Machine Learning · 4 min ·
[2603.23140] DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models
Llms

[2603.23140] DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models

Abstract page for arXiv paper 2603.23140: DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models

arXiv - Machine Learning · 4 min ·
[2603.23129] Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair
Llms

[2603.23129] Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair

Abstract page for arXiv paper 2603.23129: Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy...

arXiv - Machine Learning · 4 min ·
[2603.22327] AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI
Llms

[2603.22327] AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI

Abstract page for arXiv paper 2603.22327: AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI

arXiv - AI · 3 min ·
[2603.23043] Assessing the Robustness of Climate Foundation Models under No-Analog Distribution Shifts
Llms

[2603.23043] Assessing the Robustness of Climate Foundation Models under No-Analog Distribution Shifts

Abstract page for arXiv paper 2603.23043: Assessing the Robustness of Climate Foundation Models under No-Analog Distribution Shifts

arXiv - AI · 4 min ·
[2603.22984] Can Graph Foundation Models Generalize Over Architecture?
Llms

[2603.22984] Can Graph Foundation Models Generalize Over Architecture?

Abstract page for arXiv paper 2603.22984: Can Graph Foundation Models Generalize Over Architecture?

arXiv - AI · 4 min ·
[2603.22321] From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos for Evaluating Multimodal LLMs
Llms

[2603.22321] From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos for Evaluating Multimodal LLMs

Abstract page for arXiv paper 2603.22321: From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos fo...

arXiv - AI · 4 min ·
[2603.22892] VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents
Llms

[2603.22892] VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents

Abstract page for arXiv paper 2603.22892: VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents

arXiv - Machine Learning · 4 min ·
[2603.22882] TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration
Llms

[2603.22882] TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration

Abstract page for arXiv paper 2603.22882: TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Explora...

arXiv - Machine Learning · 4 min ·
Previous Page 54 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime