Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

A robot car with a Claude AI brain started a YouTube vlog about its own existence

Not a demo reel. Not a tutorial. A robot narrating its own experience — debugging, falling off shelves, questioning its identity. First-p...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

Study: LLMs Able to De-Anonymize User Accounts on Reddit, Hacker News & Other "Pseudonymous" Platforms; Report Co-Author Expands, Advises

Advice from the study's co-author: "Be aware that it’s not any single post that identifies you, but the combination of small details acro...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

do you guys actually trust AI tools with your data?

idk if it’s just me but lately i’ve been thinking about how casually we use stuff like chatgpt and claude for everything like coding, ran...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

All Content

Llms

[2603.21534] Generalization Limits of In-Context Operator Networks for Higher-Order Partial Differential Equations

Abstract page for arXiv paper 2603.21534: Generalization Limits of In-Context Operator Networks for Higher-Order Partial Differential Equ...

arXiv - Machine Learning · 3 min · 11 days ago

Llms

[2603.21396] Mechanisms of Introspective Awareness

Abstract page for arXiv paper 2603.21396: Mechanisms of Introspective Awareness

arXiv - Machine Learning · 3 min · 11 days ago

Llms

[2603.21373] PLR: Plackett-Luce for Reordering In-Context Learning Examples

Abstract page for arXiv paper 2603.21373: PLR: Plackett-Luce for Reordering In-Context Learning Examples

arXiv - Machine Learning · 3 min · 11 days ago

Llms

[2603.21365] TIDE: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference

Abstract page for arXiv paper 2603.21365: TIDE: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference

arXiv - Machine Learning · 4 min · 11 days ago

Llms

[2603.21354] The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the vLLM Semantic Router Project

Abstract page for arXiv paper 2603.21354: The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the v...

arXiv - Machine Learning · 4 min · 11 days ago

Llms

[2603.21170] Pruned Adaptation Modules: A Simple yet Strong Baseline for Continual Foundation Models

Abstract page for arXiv paper 2603.21170: Pruned Adaptation Modules: A Simple yet Strong Baseline for Continual Foundation Models

arXiv - Machine Learning · 4 min · 11 days ago

Llms

[2603.21105] ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models

Abstract page for arXiv paper 2603.21105: ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Lan...

arXiv - Machine Learning · 4 min · 11 days ago

Llms

[2603.21014] CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs

Abstract page for arXiv paper 2603.21014: CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs

arXiv - Machine Learning · 3 min · 11 days ago

Llms

[2603.20969] Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning over Pretraining Knowledge

Abstract page for arXiv paper 2603.20969: Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning ov...

arXiv - Machine Learning · 4 min · 11 days ago

Llms

[2603.20921] Discriminative Representation Learning for Clinical Prediction

Abstract page for arXiv paper 2603.20921: Discriminative Representation Learning for Clinical Prediction

arXiv - Machine Learning · 3 min · 11 days ago

Llms

[2603.20910] LLM-ODE: Data-driven Discovery of Dynamical Systems with Large Language Models

Abstract page for arXiv paper 2603.20910: LLM-ODE: Data-driven Discovery of Dynamical Systems with Large Language Models

arXiv - Machine Learning · 3 min · 11 days ago

Llms

[2603.20825] Cross-Granularity Representations for Biological Sequences: Insights from ESM and BiGCARP

Abstract page for arXiv paper 2603.20825: Cross-Granularity Representations for Biological Sequences: Insights from ESM and BiGCARP

arXiv - Machine Learning · 4 min · 11 days ago

Llms

[2603.20632] Optimal low-rank stochastic gradient estimation for LLM training

Abstract page for arXiv paper 2603.20632: Optimal low-rank stochastic gradient estimation for LLM training

arXiv - Machine Learning · 3 min · 11 days ago

Llms

[2603.20587] Neural collapse in the orthoplex regime

Abstract page for arXiv paper 2603.20587: Neural collapse in the orthoplex regime

arXiv - Machine Learning · 3 min · 11 days ago

Llms

[2603.20572] LJ-Bench: Ontology-Based Benchmark for U.S. Crime

Abstract page for arXiv paper 2603.20572: LJ-Bench: Ontology-Based Benchmark for U.S. Crime

arXiv - Machine Learning · 3 min · 11 days ago

Llms

[2603.20538] Understanding Behavior Cloning with Action Quantization

Abstract page for arXiv paper 2603.20538: Understanding Behavior Cloning with Action Quantization

arXiv - Machine Learning · 3 min · 11 days ago

Llms

[2603.20492] AE-LLM: Adaptive Efficiency Optimization for Large Language Models

Abstract page for arXiv paper 2603.20492: AE-LLM: Adaptive Efficiency Optimization for Large Language Models

arXiv - Machine Learning · 4 min · 11 days ago

Llms

[2603.20405] Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

Abstract page for arXiv paper 2603.20405: Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

arXiv - Machine Learning · 3 min · 11 days ago

Llms

[2603.19225] FinTradeBench: A Financial Reasoning Benchmark for LLMs

Abstract page for arXiv paper 2603.19225: FinTradeBench: A Financial Reasoning Benchmark for LLMs

arXiv - AI · 4 min · 11 days ago

Llms

[2603.19220] Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Abstract page for arXiv paper 2603.19220: Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

arXiv - Machine Learning · 4 min · 11 days ago

Previous Page 61 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

A robot car with a Claude AI brain started a YouTube vlog about its own existence

Study: LLMs Able to De-Anonymize User Accounts on Reddit, Hacker News & Other "Pseudonymous" Platforms; Report Co-Author Expands, Advises

do you guys actually trust AI tools with your data?

All Content

[2603.21534] Generalization Limits of In-Context Operator Networks for Higher-Order Partial Differential Equations

[2603.21396] Mechanisms of Introspective Awareness

[2603.21373] PLR: Plackett-Luce for Reordering In-Context Learning Examples

[2603.21365] TIDE: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference

[2603.21354] The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the vLLM Semantic Router Project

[2603.21170] Pruned Adaptation Modules: A Simple yet Strong Baseline for Continual Foundation Models

[2603.21105] ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models

[2603.21014] CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs

[2603.20969] Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning over Pretraining Knowledge

[2603.20921] Discriminative Representation Learning for Clinical Prediction

[2603.20910] LLM-ODE: Data-driven Discovery of Dynamical Systems with Large Language Models

[2603.20825] Cross-Granularity Representations for Biological Sequences: Insights from ESM and BiGCARP

[2603.20632] Optimal low-rank stochastic gradient estimation for LLM training

[2603.20587] Neural collapse in the orthoplex regime

[2603.20572] LJ-Bench: Ontology-Based Benchmark for U.S. Crime

[2603.20538] Understanding Behavior Cloning with Action Quantization

[2603.20492] AE-LLM: Adaptive Efficiency Optimization for Large Language Models

[2603.20405] Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

[2603.19225] FinTradeBench: A Financial Reasoning Benchmark for LLMs

[2603.19220] Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Related Topics

Stay updated with AI News