Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

LLMs

Deterministic vs. probabilistic guardrails for agentic AI — our approach and an open-source tool [D]

We've been thinking hard about whether safety guardrails for AI agents should be LLM-based (probabilistic) or rule-based (deterministic)....

Reddit - Machine Learning · 1 min ·
LLMs

The 12-month window | TechCrunch

A lot of AI startups exist partly because the foundation models haven't expanded into their category yet. As many jokingly acknowledge, t...

TechCrunch - AI · 3 min ·
LLMs

How LLMs decide which pages to cite — and how to optimize for it

When ChatGPT or Perplexity answers a question, it runs RAG: retrieves top candidates from a crawled index, then scores them. The scoring ...

Reddit - Artificial Intelligence · 1 min ·

All Content

LLMs

Sam Altman responds after mass ChatGPT uninstalls help Claude AI become the most popular iPhone app

Submitted by /u/Tiny-Independent273

Reddit - Artificial Intelligence · 1 min ·
LLMs

Is ChatGPT Softening Its Coverage of the US Government? I Ran an Experiment.

I have suspected something fundamental has changed within OpenAI and ChatGPT since 5.2 came out. I noticed it would become blunt and appe...

Reddit - Artificial Intelligence · 1 min ·
LLMs

Pentagon Used Claude AI to Attack Iran Just Hours After Trump’s Ban on Anthropic

AI Tools & Products · 2 min ·
LLMs

Anthropic’s Claude is suddenly the most popular iPhone app following Pentagon feud

AI Tools & Products · 6 min ·
LLMs

LLMs can unmask pseudonymous users at scale with surprising accuracy - Ars Technica

Pseudonymity has never been perfect for preserving privacy. Soon it may be pointless.

Ars Technica - AI · 7 min ·
LLMs

[R] Phase-Only Language Model via O(N Log N) FFT Mixing (PRISM): Exploring Interference Under Unit-Magnitude Constraints

Hello everyone, this is about https://arxiv.org/abs/2512.01208. I have decided to share it to get some feedback. I think it is interesting...

Reddit - Machine Learning · 1 min ·
LLMs

[D] frontier models are a zero sum game for a few tasks - what they gain in reasoning they lose in your specific thing

When Google shipped Gemini 3 last November, it set new benchmarks on reasoning and coding. But it also removed pixel-level image segmenta...

Reddit - Machine Learning · 1 min ·
LLMs

[2602.11909] Echo: Towards Advanced Audio Comprehension via Audio-Interleaved Reasoning

Abstract page for arXiv paper 2602.11909: Echo: Towards Advanced Audio Comprehension via Audio-Interleaved Reasoning

arXiv - Machine Learning · 4 min ·
LLMs

[2601.18685] LLAMA LIMA: A Living Meta-Analysis on the Effects of Generative AI on Learning Mathematics

Abstract page for arXiv paper 2601.18685: LLAMA LIMA: A Living Meta-Analysis on the Effects of Generative AI on Learning Mathematics

arXiv - Machine Learning · 3 min ·
LLMs

[2601.08427] Silence the Judge: Reinforcement Learning with Self-Verifier via Latent Geometric Clustering

Abstract page for arXiv paper 2601.08427: Silence the Judge: Reinforcement Learning with Self-Verifier via Latent Geometric Clustering

arXiv - Machine Learning · 4 min ·
LLMs

[2510.20095] BioCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models

Abstract page for arXiv paper 2510.20095: BioCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models

arXiv - Machine Learning · 4 min ·
LLMs

[2510.13849] Language steering in latent space to mitigate unintended code-switching

Abstract page for arXiv paper 2510.13849: Language steering in latent space to mitigate unintended code-switching

arXiv - Machine Learning · 3 min ·
LLMs

[2510.08919] PHyCLIP: $\ell_1$-Product of Hyperbolic Factors Unifies Hierarchy and Compositionality in Vision-Language Representation Learning

Abstract page for arXiv paper 2510.08919: PHyCLIP: $\ell_1$-Product of Hyperbolic Factors Unifies Hierarchy and Compositionality in Visio...

arXiv - Machine Learning · 4 min ·
LLMs

[2506.16411] When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework

Abstract page for arXiv paper 2506.16411: When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework

arXiv - Machine Learning · 4 min ·
LLMs

[2506.05639] FictionalQA: A Dataset for Studying Memorization and Knowledge Acquisition

Abstract page for arXiv paper 2506.05639: FictionalQA: A Dataset for Studying Memorization and Knowledge Acquisition

arXiv - Machine Learning · 3 min ·
LLMs

[2505.09655] DRA-GRPO: Your GRPO Needs to Know Diverse Reasoning Paths for Mathematical Reasoning

Abstract page for arXiv paper 2505.09655: DRA-GRPO: Your GRPO Needs to Know Diverse Reasoning Paths for Mathematical Reasoning

arXiv - Machine Learning · 4 min ·
LLMs

[2501.06762] Improving the adaptive and continuous learning capabilities of artificial neural networks: Lessons from multi-neuromodulatory dynamics

Abstract page for arXiv paper 2501.06762: Improving the adaptive and continuous learning capabilities of artificial neural networks: Less...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.11761] MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling

Abstract page for arXiv paper 2602.11761: MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling

arXiv - Machine Learning · 4 min ·
LLMs

[2602.10609] Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Abstract page for arXiv paper 2602.10609: Online Causal Kalman Filtering for Stable and Effective Policy Optimization

arXiv - AI · 4 min ·
LLMs

[2602.02185] Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Abstract page for arXiv paper 2602.02185: Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Langua...

arXiv - Machine Learning · 4 min ·
