Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Built a multiplayer map where you can see everyone's Claude Code activity as creatures battling it out

Hello r/artificial I built this specifically for Claude Code users - every prompt you run feeds a digital pet called a Prompt Creature. T...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

The loss curve said tie. The judges said otherwise. Seeking replication for an early LLM training result [R]

TL;DR - I've written two novel functions that shape the training signal for LLMs. Early tests show people prefer responses from models tr...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

Karpathy dropped a 200-line GPT, so I used the math to turn pandas DataFrames into searchable context windows and open sourced it (and automated my stats pipeline). [P]

TL;DR: I got tired of manually running Shapiro-Wilk tests and copy-pasting p-values at 2 AM. I built an open-source, async Python pipelin...

Reddit - Machine Learning · 1 min · about 4 hours ago

All Content

Llms

Claude AI: Why are there so many internet outages?

AI Tools & Products · 6 min · about 2 months ago

Llms

Claude outages lay bare software developers' growing reliance on AI: 'I guess I'll write code like a caveman'

AI Tools & Products · 5 min · about 2 months ago

Llms

Google hit with shocking wrongful death lawsuit over Gemini AI chatbot

AI Tools & Products · 7 min · about 2 months ago

Llms

[2602.06412] Stopping Computation for Converged Tokens in Masked Diffusion-LM Decoding

Abstract page for arXiv paper 2602.06412: Stopping Computation for Converged Tokens in Masked Diffusion-LM Decoding

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2601.23157] No More, No Less: Least-Privilege Language Models

Abstract page for arXiv paper 2601.23157: No More, No Less: Least-Privilege Language Models

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.04755] When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond?

Abstract page for arXiv paper 2602.04755: When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond?

arXiv - AI · 4 min · about 2 months ago

Llms

[2601.19933] NRR-Phi: Text-to-State Mapping for Ambiguity Preservation in LLM Inference

Abstract page for arXiv paper 2601.19933: NRR-Phi: Text-to-State Mapping for Ambiguity Preservation in LLM Inference

arXiv - AI · 4 min · about 2 months ago

Llms

[2512.12594] ceLLMate: Sandboxing Browser AI Agents

Abstract page for arXiv paper 2512.12594: ceLLMate: Sandboxing Browser AI Agents

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2512.19570] The Epistemological Consequences of Large Language Models: Rethinking collective intelligence and institutional knowledge

Abstract page for arXiv paper 2512.19570: The Epistemological Consequences of Large Language Models: Rethinking collective intelligence a...

arXiv - AI · 4 min · about 2 months ago

Llms

[2510.15040] Composition-Grounded Data Synthesis for Visual Reasoning

Abstract page for arXiv paper 2510.15040: Composition-Grounded Data Synthesis for Visual Reasoning

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2512.18925] Beyond the Prompt: An Empirical Study of Cursor Rules

Abstract page for arXiv paper 2512.18925: Beyond the Prompt: An Empirical Study of Cursor Rules

arXiv - AI · 4 min · about 2 months ago

Llms

[2512.15792] A Systematic Analysis of Biases in Large Language Models

Abstract page for arXiv paper 2512.15792: A Systematic Analysis of Biases in Large Language Models

arXiv - AI · 3 min · about 2 months ago

Llms

[2510.02578] FLOWR.root: A flow matching based foundation model for joint multi-purpose structure-aware 3D ligand generation and affinity prediction

Abstract page for arXiv paper 2510.02578: FLOWR.root: A flow matching based foundation model for joint multi-purpose structure-aware 3D l...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2511.09396] Multimodal Large Language Models for Low-Resource Languages: A Case Study for Basque

Abstract page for arXiv paper 2511.09396: Multimodal Large Language Models for Low-Resource Languages: A Case Study for Basque

arXiv - AI · 3 min · about 2 months ago

Llms

[2511.03441] CareMedEval dataset: Evaluating Critical Appraisal and Reasoning in the Biomedical Field

Abstract page for arXiv paper 2511.03441: CareMedEval dataset: Evaluating Critical Appraisal and Reasoning in the Biomedical Field

arXiv - AI · 4 min · about 2 months ago

Llms

[2510.24702] Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Abstract page for arXiv paper 2510.24702: Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

arXiv - AI · 4 min · about 2 months ago

Llms

[2510.24178] MuSaG: A Multimodal German Sarcasm Dataset with Full-Modal Annotations

Abstract page for arXiv paper 2510.24178: MuSaG: A Multimodal German Sarcasm Dataset with Full-Modal Annotations

arXiv - AI · 4 min · about 2 months ago

Llms

[2510.10889] Topological Alignment of Shared Vision-Language Embedding Space

Abstract page for arXiv paper 2510.10889: Topological Alignment of Shared Vision-Language Embedding Space

arXiv - AI · 3 min · about 2 months ago

Llms

[2510.07181] TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics

Abstract page for arXiv paper 2510.07181: TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics

arXiv - AI · 4 min · about 2 months ago

Llms

[2505.06046] Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information

Abstract page for arXiv paper 2505.06046: Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 251 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Built a multiplayer map where you can see everyone's Claude Code activity as creatures battling it out

The loss curve said tie. The judges said otherwise. Seeking replication for an early LLM training result [R]

Karpathy dropped a 200-line GPT, so I used the math to turn pandas DataFrames into searchable context windows and open sourced it (and automated my stats pipeline). [P]

All Content

Claude AI: Why are there so many internet outages?

Claude outages lay bare software developers' growing reliance on AI: 'I guess I'll write code like a caveman'

Google hit with shocking wrongful death lawsuit over Gemini AI chatbot

[2602.06412] Stopping Computation for Converged Tokens in Masked Diffusion-LM Decoding

[2601.23157] No More, No Less: Least-Privilege Language Models

[2602.04755] When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond?

[2601.19933] NRR-Phi: Text-to-State Mapping for Ambiguity Preservation in LLM Inference

[2512.12594] ceLLMate: Sandboxing Browser AI Agents

[2512.19570] The Epistemological Consequences of Large Language Models: Rethinking collective intelligence and institutional knowledge

[2510.15040] Composition-Grounded Data Synthesis for Visual Reasoning

[2512.18925] Beyond the Prompt: An Empirical Study of Cursor Rules

[2512.15792] A Systematic Analysis of Biases in Large Language Models

[2510.02578] FLOWR.root: A flow matching based foundation model for joint multi-purpose structure-aware 3D ligand generation and affinity prediction

[2511.09396] Multimodal Large Language Models for Low-Resource Languages: A Case Study for Basque

[2511.03441] CareMedEval dataset: Evaluating Critical Appraisal and Reasoning in the Biomedical Field

[2510.24702] Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

[2510.24178] MuSaG: A Multimodal German Sarcasm Dataset with Full-Modal Annotations

[2510.10889] Topological Alignment of Shared Vision-Language Embedding Space

[2510.07181] TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics

[2505.06046] Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information

Related Topics

Stay updated with AI News