Large Language Models
GPT, Claude, Gemini, and other LLMs
Top This Week
I matched Meta AI against ChatGPT and one clearly lives on the internet more
Muse Spark gives Meta AI an eye for what's trending and an instinct to influence
Walmart’s AI Push Links Gemini App Experience With U.S. Manufacturing Shift
Walmart (NasdaqGS:WMT) is expanding its partnership with Google to integrate Gemini AI into the Walmart mobile app, aiming to support ins...
All Content
[2603.02512] Human-Certified Module Repositories for the AI Age
[2603.03206] Understanding and Mitigating Dataset Corruption in LLM Steering
[2603.02420] Slurry-as-a-Service: A Modest Proposal on Scalable Pluralistic Alignment for Nutrient Optimization
[2603.03155] Information Routing in Atomistic Foundation Models: How Equivariance Creates Linearly Disentangled Representations
[2603.02297] ZeroDayBench: Evaluating LLM Agents on Unseen Zero-Day Vulnerabilities for Cyberdefense
[2603.02345] RIVA: Leveraging LLM Agents for Reliable Configuration Drift Detection
[2603.03031] Step-Level Sparse Autoencoder for Reasoning Process Interpretation
[2603.02277] Quantifying Frontier LLM Capabilities for Container Sandbox Escape
[2603.03000] Why Does RLAIF Work At All?
[2603.02266] When Scaling Fails: Mitigating Audio Perception Decay of LALMs via Multi-Step Perception-Aware Reasoning
[2603.02262] Silent Sabotage During Fine-Tuning: Few-Shot Rationale Poisoning of Compact Medical LLMs
[2603.02951] CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning
[2603.02938] Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models
[2603.02913] Eliciting Numerical Predictive Distributions of LLMs Without Autoregression
[2603.02840] Adapting Time Series Foundation Models through Data Mixtures
[2603.02792] From Heuristic Selection to Automated Algorithm Design: LLMs Benefit from Strong Priors
[2603.02675] From Shallow to Deep: Pinning Semantic Intent via Causal GRPO
[2504.21023] ParamΔ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost
[2603.03258] Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals
[2603.02635] SaFeR-ToolKit: Structured Reasoning via Virtual Tool Calling for Multimodal Safety