Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

How do you test AI agents in production? The unpredictability is overwhelming.[D]

I’ve been in QA for almost a decade. My mental model for quality was always: given input X, assert output Y. Now I’m on a team that’s shi...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

Confusing Website

i'm trying to find a video online and couldn't so i asked ChatGPT by describing the video and i was given a link and i'm trying to make s...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Llms

I tested the same prompt across multiple AI models… the differences surprised me

I’ve been experimenting with different AI models lately (ChatGPT, Claude, etc.), and I tried something simple: Using the exact same promp...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

All Content

Llms

[2506.09016] SPEED-RL: Faster Training of Reasoning Models via Online Curriculum Learning

Abstract page for arXiv paper 2506.09016: SPEED-RL: Faster Training of Reasoning Models via Online Curriculum Learning

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2505.23648] Continuous Chain of Thought Enables Parallel Exploration and Reasoning

Abstract page for arXiv paper 2505.23648: Continuous Chain of Thought Enables Parallel Exploration and Reasoning

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.05280] Layer by layer, module by module: Choose both for optimal OOD probing of ViT

Abstract page for arXiv paper 2603.05280: Layer by layer, module by module: Choose both for optimal OOD probing of ViT

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.05143] Feature Resemblance: On the Theoretical Understanding of Analogical Reasoning in Transformers

Abstract page for arXiv paper 2603.05143: Feature Resemblance: On the Theoretical Understanding of Analogical Reasoning in Transformers

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.05035] Good-Enough LLM Obfuscation (GELO)

Abstract page for arXiv paper 2603.05035: Good-Enough LLM Obfuscation (GELO)

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.05026] RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform

Abstract page for arXiv paper 2603.05026: RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.04964] Replaying pre-training data improves fine-tuning

Abstract page for arXiv paper 2603.04964: Replaying pre-training data improves fine-tuning

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.04716] SLO-Aware Compute Resource Allocation for Prefill-Decode Disaggregated LLM Inference

Abstract page for arXiv paper 2603.04716: SLO-Aware Compute Resource Allocation for Prefill-Decode Disaggregated LLM Inference

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.04480] AbAffinity: A Large Language Model for Predicting Antibody Binding Affinity against SARS-CoV-2

Abstract page for arXiv paper 2603.04480: AbAffinity: A Large Language Model for Predicting Antibody Binding Affinity against SARS-CoV-2

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.04466] Act-Observe-Rewrite: Multimodal Coding Agents as In-Context Policy Learners for Robot Manipulation

Abstract page for arXiv paper 2603.04466: Act-Observe-Rewrite: Multimodal Coding Agents as In-Context Policy Learners for Robot Manipulation

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.05232] SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity

Abstract page for arXiv paper 2603.05232: SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.04972] Functionality-Oriented LLM Merging on the Fisher--Rao Manifold

Abstract page for arXiv paper 2603.04972: Functionality-Oriented LLM Merging on the Fisher--Rao Manifold

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.04956] WaterSIC: information-theoretically (near) optimal linear layer quantization

Abstract page for arXiv paper 2603.04956: WaterSIC: information-theoretically (near) optimal linear layer quantization

arXiv - Machine Learning · 3 min · about 2 months ago

$[2603.04948] $\nabla$-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space$

Llms

[2603.04948] $\nabla$-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space

Abstract page for arXiv paper 2603.04948: $\nabla$-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.04898] U-Parking: Distributed UWB-Assisted Autonomous Parking System with Robust Localization and Intelligent Planning

Abstract page for arXiv paper 2603.04898: U-Parking: Distributed UWB-Assisted Autonomous Parking System with Robust Localization and Inte...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.04851] Why Is RLHF Alignment Shallow? A Gradient Analysis

Abstract page for arXiv paper 2603.04851: Why Is RLHF Alignment Shallow? A Gradient Analysis

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.04692] Engineering Regression Without Real-Data Training: Domain Adaptation for Tabular Foundation Models Using Multi-Dataset Embeddings

Abstract page for arXiv paper 2603.04692: Engineering Regression Without Real-Data Training: Domain Adaptation for Tabular Foundation Mod...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.04606] PDE foundation model-accelerated inverse estimation of system parameters in inertial confinement fusion

Abstract page for arXiv paper 2603.04606: PDE foundation model-accelerated inverse estimation of system parameters in inertial confinemen...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.04545] An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs

Abstract page for arXiv paper 2603.04545: An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.04478] Standing on the Shoulders of Giants: Rethinking EEG Foundation Model Pretraining via Multi-Teacher Distillation

Abstract page for arXiv paper 2603.04478: Standing on the Shoulders of Giants: Rethinking EEG Foundation Model Pretraining via Multi-Teac...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 236 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

How do you test AI agents in production? The unpredictability is overwhelming.[D]

Confusing Website

I tested the same prompt across multiple AI models… the differences surprised me

All Content

[2506.09016] SPEED-RL: Faster Training of Reasoning Models via Online Curriculum Learning

[2505.23648] Continuous Chain of Thought Enables Parallel Exploration and Reasoning

[2603.05280] Layer by layer, module by module: Choose both for optimal OOD probing of ViT

[2603.05143] Feature Resemblance: On the Theoretical Understanding of Analogical Reasoning in Transformers

[2603.05035] Good-Enough LLM Obfuscation (GELO)

[2603.05026] RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform

[2603.04964] Replaying pre-training data improves fine-tuning

[2603.04716] SLO-Aware Compute Resource Allocation for Prefill-Decode Disaggregated LLM Inference

[2603.04480] AbAffinity: A Large Language Model for Predicting Antibody Binding Affinity against SARS-CoV-2

[2603.04466] Act-Observe-Rewrite: Multimodal Coding Agents as In-Context Policy Learners for Robot Manipulation

[2603.05232] SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity

[2603.04972] Functionality-Oriented LLM Merging on the Fisher--Rao Manifold

[2603.04956] WaterSIC: information-theoretically (near) optimal linear layer quantization

[2603.04948] $\nabla$-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space

[2603.04898] U-Parking: Distributed UWB-Assisted Autonomous Parking System with Robust Localization and Intelligent Planning

[2603.04851] Why Is RLHF Alignment Shallow? A Gradient Analysis

[2603.04692] Engineering Regression Without Real-Data Training: Domain Adaptation for Tabular Foundation Models Using Multi-Dataset Embeddings

[2603.04606] PDE foundation model-accelerated inverse estimation of system parameters in inertial confinement fusion

[2603.04545] An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs

[2603.04478] Standing on the Shoulders of Giants: Rethinking EEG Foundation Model Pretraining via Multi-Teacher Distillation

Related Topics

Stay updated with AI News