Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Claude Max 20x usage hit 40% by Monday noon — how does Codex CLI compare?

I'm on Claude Max (the $100/mo plan) and noticed something that surprised me. By Monday noon I had already used 40% of the 20x monthly li...

Reddit - Artificial Intelligence · 1 min · 43 minutes ago

Llms

How to use the new ChatGPT app integrations, including DoorDash, Spotify, Uber, and others | TechCrunch

Learn how to use Spotify, Canva, Figma, Expedia, and other apps directly in ChatGPT.

TechCrunch - AI · 10 min · about 3 hours ago

Llms

Anthropic Restricts Claude Agent Access Amid AI Automation Boom in Crypto

AI Tools & Products · 7 min · about 9 hours ago

All Content

Llms

[2509.25149] Pretraining Large Language Models with NVFP4

Abstract page for arXiv paper 2509.25149: Pretraining Large Language Models with NVFP4

arXiv - Machine Learning · 5 min · about 1 month ago

Llms

[2510.00177] PrefDisco: Benchmarking Proactive Personalized Reasoning

Abstract page for arXiv paper 2510.00177: PrefDisco: Benchmarking Proactive Personalized Reasoning

arXiv - AI · 4 min · about 1 month ago

Llms

[2509.24210] BeyondBench: Contamination-Resistant Evaluation of Reasoning in Language Models

Abstract page for arXiv paper 2509.24210: BeyondBench: Contamination-Resistant Evaluation of Reasoning in Language Models

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2509.23886] Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer

Abstract page for arXiv paper 2509.23886: Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2509.20321] Conversational Speech Reveals Structural Robustness Failures in SpeechLLM Backbones

Abstract page for arXiv paper 2509.20321: Conversational Speech Reveals Structural Robustness Failures in SpeechLLM Backbones

arXiv - AI · 3 min · about 1 month ago

Llms

[2508.06249] In-Training Defenses against Emergent Misalignment in Language Models

Abstract page for arXiv paper 2508.06249: In-Training Defenses against Emergent Misalignment in Language Models

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2507.01785] MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining

Abstract page for arXiv paper 2507.01785: MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2506.23508] Why Reinforcement Fine-Tuning Enables MLLMs Preserve Prior Knowledge Better: A Data Perspective

Abstract page for arXiv paper 2506.23508: Why Reinforcement Fine-Tuning Enables MLLMs Preserve Prior Knowledge Better: A Data Perspective

arXiv - AI · 4 min · about 1 month ago

Llms

[2506.01062] SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models

Abstract page for arXiv paper 2506.01062: SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2505.19255] VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use

Abstract page for arXiv paper 2505.19255: VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2504.04372] Assessing the Impact of Code Changes on the Fault Localizability of Large Language Models

Abstract page for arXiv paper 2504.04372: Assessing the Impact of Code Changes on the Fault Localizability of Large Language Models

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.00485] Replacing Parameters with Preferences: Federated Alignment of Heterogeneous Vision-Language Models

Abstract page for arXiv paper 2602.00485: Replacing Parameters with Preferences: Federated Alignment of Heterogeneous Vision-Language Models

arXiv - AI · 4 min · about 1 month ago

Llms

[2601.03604] Interleaved Tool-Call Reasoning for Protein Function Understanding

Abstract page for arXiv paper 2601.03604: Interleaved Tool-Call Reasoning for Protein Function Understanding

arXiv - AI · 3 min · about 1 month ago

Llms

[2512.10534] Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

Abstract page for arXiv paper 2512.10534: Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforceme...

arXiv - AI · 4 min · about 1 month ago

Llms

[2601.22571] PerfGuard: A Performance-Aware Agent for Visual Content Generation

Abstract page for arXiv paper 2601.22571: PerfGuard: A Performance-Aware Agent for Visual Content Generation

arXiv - AI · 4 min · about 1 month ago

Llms

[2512.14106] HydroGEM: A Self Supervised Zero Shot Hybrid TCN Transformer Foundation Model for Continental Scale Streamflow Quality Control

Abstract page for arXiv paper 2512.14106: HydroGEM: A Self Supervised Zero Shot Hybrid TCN Transformer Foundation Model for Continental S...

arXiv - AI · 4 min · about 1 month ago

Llms

[2512.07081] ClinNoteAgents: An LLM Multi-Agent System for Predicting and Interpreting Heart Failure 30-Day Readmission from Clinical Notes

Abstract page for arXiv paper 2512.07081: ClinNoteAgents: An LLM Multi-Agent System for Predicting and Interpreting Heart Failure 30-Day ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2505.13770] Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference

Abstract page for arXiv paper 2505.13770: Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Infe...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2511.21033] Towards Trustworthy Legal AI through LLM Agents and Formal Reasoning

Abstract page for arXiv paper 2511.21033: Towards Trustworthy Legal AI through LLM Agents and Formal Reasoning

arXiv - AI · 4 min · about 1 month ago

Llms

[2511.04439] CoRPO: Adding a Correctness Bias to GRPO Improves Generalization

Abstract page for arXiv paper 2511.04439: CoRPO: Adding a Correctness Bias to GRPO Improves Generalization

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 93 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Claude Max 20x usage hit 40% by Monday noon — how does Codex CLI compare?

How to use the new ChatGPT app integrations, including DoorDash, Spotify, Uber, and others | TechCrunch

Anthropic Restricts Claude Agent Access Amid AI Automation Boom in Crypto

All Content

[2509.25149] Pretraining Large Language Models with NVFP4

[2510.00177] PrefDisco: Benchmarking Proactive Personalized Reasoning

[2509.24210] BeyondBench: Contamination-Resistant Evaluation of Reasoning in Language Models

[2509.23886] Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer

[2509.20321] Conversational Speech Reveals Structural Robustness Failures in SpeechLLM Backbones

[2508.06249] In-Training Defenses against Emergent Misalignment in Language Models

[2507.01785] MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining

[2506.23508] Why Reinforcement Fine-Tuning Enables MLLMs Preserve Prior Knowledge Better: A Data Perspective

[2506.01062] SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models

[2505.19255] VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use

[2504.04372] Assessing the Impact of Code Changes on the Fault Localizability of Large Language Models

[2602.00485] Replacing Parameters with Preferences: Federated Alignment of Heterogeneous Vision-Language Models

[2601.03604] Interleaved Tool-Call Reasoning for Protein Function Understanding

[2512.10534] Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

[2601.22571] PerfGuard: A Performance-Aware Agent for Visual Content Generation

[2512.14106] HydroGEM: A Self Supervised Zero Shot Hybrid TCN Transformer Foundation Model for Continental Scale Streamflow Quality Control

[2512.07081] ClinNoteAgents: An LLM Multi-Agent System for Predicting and Interpreting Heart Failure 30-Day Readmission from Clinical Notes

[2505.13770] Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference

[2511.21033] Towards Trustworthy Legal AI through LLM Agents and Formal Reasoning

[2511.04439] CoRPO: Adding a Correctness Bias to GRPO Improves Generalization

Related Topics

Stay updated with AI News