Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Deterministic vs. probabilistic guardrails for agentic AI — our approach and an open-source tool [D]

We've been thinking hard about whether safety guardrails for AI agents should be LLM-based (probabilistic) or rule-based (deterministic)....

Reddit - Machine Learning · 1 min · 8 minutes ago

Llms

The 12-month window | TechCrunch

A lot of AI startups exist partly because the foundation models haven't expanded into their category yet. As many jokingly acknowledge, t...

TechCrunch - AI · 3 min · 8 minutes ago

Llms

How LLMs decide which pages to cite — and how to optimize for it

When ChatGPT or Perplexity answers a question, it runs RAG: retrieves top candidates from a crawled index, then scores them. The scoring ...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

All Content

Llms

Sam Altman responds after mass ChatGPT uninstalls help Claude AI become the most popular iPhone app

submitted by /u/Tiny-Independent273 [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 2 months ago

Llms

Is ChatGPT Softening Its Coverage of the US Government? I Ran an Experiment.

I have suspected something fundamental has changed within OpenAI and ChatGPT since 5.2 came out, I noticed it would become blunt and appe...

Reddit - Artificial Intelligence · 1 min · about 2 months ago

Llms

Pentagon Used Claude AI to Attack Iran Just Hours After Trump’s Ban on Anthropic

AI Tools & Products · 2 min · about 2 months ago

Llms

Anthropic’s Claude is suddenly the most popular iPhone app following Pentagon feud

AI Tools & Products · 6 min · about 2 months ago

Llms

LLMs can unmask pseudonymous users at scale with surprising accuracy - Ars Technica

Pseudonymity has never been perfect for preserving privacy. Soon it may be pointless.

Ars Technica - AI · 7 min · about 2 months ago

Llms

[R] Phase-Only Language Model via O(N Log N) FFT Mixing (PRISM): Exploring Interference Under Unit-Magnitude Constraints

Hello everyone, this is about https://arxiv.org/abs/2512.01208 I have decided to share it to get some feedback. I think it is interesting...

Reddit - Machine Learning · 1 min · about 2 months ago

Llms

[D] frontier models are a zero sum game for a few tasks - what they gain in reasoning they lose in your specific thing

when Google shipped Gemini 3 last November, it set new benchmarks on reasoning and coding. but it also removed pixel-level image segmenta...

Reddit - Machine Learning · 1 min · about 2 months ago

Llms

[2602.11909] Echo: Towards Advanced Audio Comprehension via Audio-Interleaved Reasoning

Abstract page for arXiv paper 2602.11909: Echo: Towards Advanced Audio Comprehension via Audio-Interleaved Reasoning

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2601.18685] LLAMA LIMA: A Living Meta-Analysis on the Effects of Generative AI on Learning Mathematics

Abstract page for arXiv paper 2601.18685: LLAMA LIMA: A Living Meta-Analysis on the Effects of Generative AI on Learning Mathematics

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2601.08427] Silence the Judge: Reinforcement Learning with Self-Verifier via Latent Geometric Clustering

Abstract page for arXiv paper 2601.08427: Silence the Judge: Reinforcement Learning with Self-Verifier via Latent Geometric Clustering

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2510.20095] BioCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models

Abstract page for arXiv paper 2510.20095: BioCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2510.13849] Language steering in latent space to mitigate unintended code-switching

Abstract page for arXiv paper 2510.13849: Language steering in latent space to mitigate unintended code-switching

arXiv - Machine Learning · 3 min · about 2 months ago

$[2510.08919] PHyCLIP: $\ell_1$-Product of Hyperbolic Factors Unifies Hierarchy and Compositionality in Vision-Language Representation Learning$

Llms

[2510.08919] PHyCLIP: $\ell_1$-Product of Hyperbolic Factors Unifies Hierarchy and Compositionality in Vision-Language Representation Learning

Abstract page for arXiv paper 2510.08919: PHyCLIP: $\ell_1$-Product of Hyperbolic Factors Unifies Hierarchy and Compositionality in Visio...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2506.16411] When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework

Abstract page for arXiv paper 2506.16411: When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2506.05639] FictionalQA: A Dataset for Studying Memorization and Knowledge Acquisition

Abstract page for arXiv paper 2506.05639: FictionalQA: A Dataset for Studying Memorization and Knowledge Acquisition

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2505.09655] DRA-GRPO: Your GRPO Needs to Know Diverse Reasoning Paths for Mathematical Reasoning

Abstract page for arXiv paper 2505.09655: DRA-GRPO: Your GRPO Needs to Know Diverse Reasoning Paths for Mathematical Reasoning

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2501.06762] Improving the adaptive and continuous learning capabilities of artificial neural networks: Lessons from multi-neuromodulatory dynamics

Abstract page for arXiv paper 2501.06762: Improving the adaptive and continuous learning capabilities of artificial neural networks: Less...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.11761] MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling

Abstract page for arXiv paper 2602.11761: MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.10609] Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Abstract page for arXiv paper 2602.10609: Online Causal Kalman Filtering for Stable and Effective Policy Optimization

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.02185] Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Abstract page for arXiv paper 2602.02185: Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Langua...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 211 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Deterministic vs. probabilistic guardrails for agentic AI — our approach and an open-source tool [D]

The 12-month window | TechCrunch

How LLMs decide which pages to cite — and how to optimize for it

All Content

Sam Altman responds after mass ChatGPT uninstalls help Claude AI become the most popular iPhone app

Is ChatGPT Softening Its Coverage of the US Government? I Ran an Experiment.

Pentagon Used Claude AI to Attack Iran Just Hours After Trump’s Ban on Anthropic

Anthropic’s Claude is suddenly the most popular iPhone app following Pentagon feud

LLMs can unmask pseudonymous users at scale with surprising accuracy - Ars Technica

[R] Phase-Only Language Model via O(N Log N) FFT Mixing (PRISM): Exploring Interference Under Unit-Magnitude Constraints

[D] frontier models are a zero sum game for a few tasks - what they gain in reasoning they lose in your specific thing

[2602.11909] Echo: Towards Advanced Audio Comprehension via Audio-Interleaved Reasoning

[2601.18685] LLAMA LIMA: A Living Meta-Analysis on the Effects of Generative AI on Learning Mathematics

[2601.08427] Silence the Judge: Reinforcement Learning with Self-Verifier via Latent Geometric Clustering

[2510.20095] BioCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models

[2510.13849] Language steering in latent space to mitigate unintended code-switching

[2510.08919] PHyCLIP: $\ell_1$-Product of Hyperbolic Factors Unifies Hierarchy and Compositionality in Vision-Language Representation Learning

[2506.16411] When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework

[2506.05639] FictionalQA: A Dataset for Studying Memorization and Knowledge Acquisition

[2505.09655] DRA-GRPO: Your GRPO Needs to Know Diverse Reasoning Paths for Mathematical Reasoning

[2501.06762] Improving the adaptive and continuous learning capabilities of artificial neural networks: Lessons from multi-neuromodulatory dynamics

[2602.11761] MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling

[2602.10609] Online Causal Kalman Filtering for Stable and Effective Policy Optimization

[2602.02185] Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Related Topics

Stay updated with AI News