Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Llms

It’s finally happened: I’m now worried about AI. And consulting ChatGPT did nothing to allay my fears | Emma Brockes

AI Tools & Products · 5 min · 38 minutes ago

Llms

I matched Meta AI against ChatGPT and one clearly lives on the internet more

Muse Spark gives Meta AI an eye for what's trending and an instinct to influence

AI Tools & Products · 10 min · 38 minutes ago

Llms

Walmart’s AI Push Links Gemini App Experience With U.S. Manufacturing Shift

Walmart (NasdaqGS:WMT) is expanding its partnership with Google to integrate Gemini AI into the Walmart mobile app, aiming to support ins...

AI Tools & Products · 6 min · 38 minutes ago

All Content

Llms

[2603.02512] Human-Certified Module Repositories for the AI Age

Abstract page for arXiv paper 2603.02512: Human-Certified Module Repositories for the AI Age

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.03206] Understanding and Mitigating Dataset Corruption in LLM Steering

Abstract page for arXiv paper 2603.03206: Understanding and Mitigating Dataset Corruption in LLM Steering

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.02420] Slurry-as-a-Service: A Modest Proposal on Scalable Pluralistic Alignment for Nutrient Optimization

Abstract page for arXiv paper 2603.02420: Slurry-as-a-Service: A Modest Proposal on Scalable Pluralistic Alignment for Nutrient Optimization

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.03155] Information Routing in Atomistic Foundation Models: How Equivariance Creates Linearly Disentangled Representations

Abstract page for arXiv paper 2603.03155: Information Routing in Atomistic Foundation Models: How Equivariance Creates Linearly Disentang...

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.02297] ZeroDayBench: Evaluating LLM Agents on Unseen Zero-Day Vulnerabilities for Cyberdefense

Abstract page for arXiv paper 2603.02297: ZeroDayBench: Evaluating LLM Agents on Unseen Zero-Day Vulnerabilities for Cyberdefense

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.02345] RIVA: Leveraging LLM Agents for Reliable Configuration Drift Detection

Abstract page for arXiv paper 2603.02345: RIVA: Leveraging LLM Agents for Reliable Configuration Drift Detection

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.03031] Step-Level Sparse Autoencoder for Reasoning Process Interpretation

Abstract page for arXiv paper 2603.03031: Step-Level Sparse Autoencoder for Reasoning Process Interpretation

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.02277] Quantifying Frontier LLM Capabilities for Container Sandbox Escape

Abstract page for arXiv paper 2603.02277: Quantifying Frontier LLM Capabilities for Container Sandbox Escape

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.03000] Why Does RLAIF Work At All?

Abstract page for arXiv paper 2603.03000: Why Does RLAIF Work At All?

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.02266] When Scaling Fails: Mitigating Audio Perception Decay of LALMs via Multi-Step Perception-Aware Reasoning

Abstract page for arXiv paper 2603.02266: When Scaling Fails: Mitigating Audio Perception Decay of LALMs via Multi-Step Perception-Aware ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.02262] Silent Sabotage During Fine-Tuning: Few-Shot Rationale Poisoning of Compact Medical LLMs

Abstract page for arXiv paper 2603.02262: Silent Sabotage During Fine-Tuning: Few-Shot Rationale Poisoning of Compact Medical LLMs

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2603.02951] CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

Abstract page for arXiv paper 2603.02951: CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.02938] Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models

Abstract page for arXiv paper 2603.02938: Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large L...

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.02913] Eliciting Numerical Predictive Distributions of LLMs Without Autoregression

Abstract page for arXiv paper 2603.02913: Eliciting Numerical Predictive Distributions of LLMs Without Autoregression

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.02840] Adapting Time Series Foundation Models through Data Mixtures

Abstract page for arXiv paper 2603.02840: Adapting Time Series Foundation Models through Data Mixtures

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.02792] From Heuristic Selection to Automated Algorithm Design: LLMs Benefit from Strong Priors

Abstract page for arXiv paper 2603.02792: From Heuristic Selection to Automated Algorithm Design: LLMs Benefit from Strong Priors

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2603.02675] From Shallow to Deep: Pinning Semantic Intent via Causal GRPO

Abstract page for arXiv paper 2603.02675: From Shallow to Deep: Pinning Semantic Intent via Causal GRPO

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2504.21023] Param$Δ$ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost

Abstract page for arXiv paper 2504.21023: Param$Δ$ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.03258] Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals

Abstract page for arXiv paper 2603.03258: Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.02635] SaFeR-ToolKit: Structured Reasoning via Virtual Tool Calling for Multimodal Safety

Abstract page for arXiv paper 2603.02635: SaFeR-ToolKit: Structured Reasoning via Virtual Tool Calling for Multimodal Safety

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 150 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

It’s finally happened: I’m now worried about AI. And consulting ChatGPT did nothing to allay my fears | Emma Brockes

I matched Meta AI against ChatGPT and one clearly lives on the internet more

Walmart’s AI Push Links Gemini App Experience With U.S. Manufacturing Shift

All Content

[2603.02512] Human-Certified Module Repositories for the AI Age

[2603.03206] Understanding and Mitigating Dataset Corruption in LLM Steering

[2603.02420] Slurry-as-a-Service: A Modest Proposal on Scalable Pluralistic Alignment for Nutrient Optimization

[2603.03155] Information Routing in Atomistic Foundation Models: How Equivariance Creates Linearly Disentangled Representations

[2603.02297] ZeroDayBench: Evaluating LLM Agents on Unseen Zero-Day Vulnerabilities for Cyberdefense

[2603.02345] RIVA: Leveraging LLM Agents for Reliable Configuration Drift Detection

[2603.03031] Step-Level Sparse Autoencoder for Reasoning Process Interpretation

[2603.02277] Quantifying Frontier LLM Capabilities for Container Sandbox Escape

[2603.03000] Why Does RLAIF Work At All?

[2603.02266] When Scaling Fails: Mitigating Audio Perception Decay of LALMs via Multi-Step Perception-Aware Reasoning

[2603.02262] Silent Sabotage During Fine-Tuning: Few-Shot Rationale Poisoning of Compact Medical LLMs

[2603.02951] CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

[2603.02938] Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models

[2603.02913] Eliciting Numerical Predictive Distributions of LLMs Without Autoregression

[2603.02840] Adapting Time Series Foundation Models through Data Mixtures

[2603.02792] From Heuristic Selection to Automated Algorithm Design: LLMs Benefit from Strong Priors

[2603.02675] From Shallow to Deep: Pinning Semantic Intent via Causal GRPO

[2504.21023] Param$Δ$ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost

[2603.03258] Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals

[2603.02635] SaFeR-ToolKit: Structured Reasoning via Virtual Tool Calling for Multimodal Safety

Related Topics

Stay updated with AI News