Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

A robot car with a Claude AI brain started a YouTube vlog about its own existence

Not a demo reel. Not a tutorial. A robot narrating its own experience — debugging, falling off shelves, questioning its identity. First-p...

Reddit - Artificial Intelligence · 1 min ·
Llms

Study: LLMs Able to De-Anonymize User Accounts on Reddit, Hacker News & Other "Pseudonymous" Platforms; Report Co-Author Expands, Advises

Advice from the study's co-author: "Be aware that it’s not any single post that identifies you, but the combination of small details acro...

Reddit - Artificial Intelligence · 1 min ·
Llms

do you guys actually trust AI tools with your data?

idk if it’s just me but lately i’ve been thinking about how casually we use stuff like chatgpt and claude for everything like coding, ran...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.21534] Generalization Limits of In-Context Operator Networks for Higher-Order Partial Differential Equations
Llms

[2603.21534] Generalization Limits of In-Context Operator Networks for Higher-Order Partial Differential Equations

Abstract page for arXiv paper 2603.21534: Generalization Limits of In-Context Operator Networks for Higher-Order Partial Differential Equ...

arXiv - Machine Learning · 3 min ·
[2603.21396] Mechanisms of Introspective Awareness
Llms

[2603.21396] Mechanisms of Introspective Awareness

Abstract page for arXiv paper 2603.21396: Mechanisms of Introspective Awareness

arXiv - Machine Learning · 3 min ·
[2603.21373] PLR: Plackett-Luce for Reordering In-Context Learning Examples
Llms

[2603.21373] PLR: Plackett-Luce for Reordering In-Context Learning Examples

Abstract page for arXiv paper 2603.21373: PLR: Plackett-Luce for Reordering In-Context Learning Examples

arXiv - Machine Learning · 3 min ·
[2603.21365] TIDE: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference
Llms

[2603.21365] TIDE: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference

Abstract page for arXiv paper 2603.21365: TIDE: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference

arXiv - Machine Learning · 4 min ·
[2603.21354] The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the vLLM Semantic Router Project
Llms

[2603.21354] The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the vLLM Semantic Router Project

Abstract page for arXiv paper 2603.21354: The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the v...

arXiv - Machine Learning · 4 min ·
[2603.21170] Pruned Adaptation Modules: A Simple yet Strong Baseline for Continual Foundation Models
Llms

[2603.21170] Pruned Adaptation Modules: A Simple yet Strong Baseline for Continual Foundation Models

Abstract page for arXiv paper 2603.21170: Pruned Adaptation Modules: A Simple yet Strong Baseline for Continual Foundation Models

arXiv - Machine Learning · 4 min ·
[2603.21105] ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models
Llms

[2603.21105] ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models

Abstract page for arXiv paper 2603.21105: ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Lan...

arXiv - Machine Learning · 4 min ·
[2603.21014] CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs
Llms

[2603.21014] CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs

Abstract page for arXiv paper 2603.21014: CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs

arXiv - Machine Learning · 3 min ·
[2603.20969] Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning over Pretraining Knowledge
Llms

[2603.20969] Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning over Pretraining Knowledge

Abstract page for arXiv paper 2603.20969: Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning ov...

arXiv - Machine Learning · 4 min ·
[2603.20921] Discriminative Representation Learning for Clinical Prediction
Llms

[2603.20921] Discriminative Representation Learning for Clinical Prediction

Abstract page for arXiv paper 2603.20921: Discriminative Representation Learning for Clinical Prediction

arXiv - Machine Learning · 3 min ·
[2603.20910] LLM-ODE: Data-driven Discovery of Dynamical Systems with Large Language Models
Llms

[2603.20910] LLM-ODE: Data-driven Discovery of Dynamical Systems with Large Language Models

Abstract page for arXiv paper 2603.20910: LLM-ODE: Data-driven Discovery of Dynamical Systems with Large Language Models

arXiv - Machine Learning · 3 min ·
[2603.20825] Cross-Granularity Representations for Biological Sequences: Insights from ESM and BiGCARP
Llms

[2603.20825] Cross-Granularity Representations for Biological Sequences: Insights from ESM and BiGCARP

Abstract page for arXiv paper 2603.20825: Cross-Granularity Representations for Biological Sequences: Insights from ESM and BiGCARP

arXiv - Machine Learning · 4 min ·
[2603.20632] Optimal low-rank stochastic gradient estimation for LLM training
Llms

[2603.20632] Optimal low-rank stochastic gradient estimation for LLM training

Abstract page for arXiv paper 2603.20632: Optimal low-rank stochastic gradient estimation for LLM training

arXiv - Machine Learning · 3 min ·
[2603.20587] Neural collapse in the orthoplex regime
Llms

[2603.20587] Neural collapse in the orthoplex regime

Abstract page for arXiv paper 2603.20587: Neural collapse in the orthoplex regime

arXiv - Machine Learning · 3 min ·
[2603.20572] LJ-Bench: Ontology-Based Benchmark for U.S. Crime
Llms

[2603.20572] LJ-Bench: Ontology-Based Benchmark for U.S. Crime

Abstract page for arXiv paper 2603.20572: LJ-Bench: Ontology-Based Benchmark for U.S. Crime

arXiv - Machine Learning · 3 min ·
[2603.20538] Understanding Behavior Cloning with Action Quantization
Llms

[2603.20538] Understanding Behavior Cloning with Action Quantization

Abstract page for arXiv paper 2603.20538: Understanding Behavior Cloning with Action Quantization

arXiv - Machine Learning · 3 min ·
[2603.20492] AE-LLM: Adaptive Efficiency Optimization for Large Language Models
Llms

[2603.20492] AE-LLM: Adaptive Efficiency Optimization for Large Language Models

Abstract page for arXiv paper 2603.20492: AE-LLM: Adaptive Efficiency Optimization for Large Language Models

arXiv - Machine Learning · 4 min ·
[2603.20405] Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP
Llms

[2603.20405] Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

Abstract page for arXiv paper 2603.20405: Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

arXiv - Machine Learning · 3 min ·
[2603.19225] FinTradeBench: A Financial Reasoning Benchmark for LLMs
Llms

[2603.19225] FinTradeBench: A Financial Reasoning Benchmark for LLMs

Abstract page for arXiv paper 2603.19225: FinTradeBench: A Financial Reasoning Benchmark for LLMs

arXiv - AI · 4 min ·
[2603.19220] Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
Llms

[2603.19220] Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Abstract page for arXiv paper 2603.19220: Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

arXiv - Machine Learning · 4 min ·
Previous Page 61 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime