Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

LLMs

The Rationing: AI companies are using the "subsidize, addict, extract" playbook — and developers are the product

Anthropic just ran the classic platform playbook on developers: offer generous limits to build dependency, then tighten the screws once t...

Reddit - Artificial Intelligence · 1 min ·
LLMs

CLI for Google AI Search (gai.google) — run AI-powered code/tech searches headlessly from your terminal

Google AI (gai.google) gives Gemini-powered answers for technical queries — think AI-enhanced search with code understanding. I built a C...

Reddit - Artificial Intelligence · 1 min ·
LLMs

Why are we blindly trusting AI companies with our data?

Lately I’ve been seeing a story floating around that really made me pause. Apparently, there were claims that the US government asked Ant...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.22623] To Agree or To Be Right? The Grounding-Sycophancy Tradeoff in Medical Vision-Language Models
LLMs

arXiv - AI · 4 min ·
[2603.22563] Privacy-Preserving Reinforcement Learning from Human Feedback via Decoupled Reward Modeling
LLMs

arXiv - Machine Learning · 3 min ·
[2603.22499] OrgForge-IT: A Verifiable Synthetic Benchmark for LLM-Based Insider Threat Detection
LLMs

arXiv - Machine Learning · 4 min ·
[2603.22593] Language Models Can Explain Visual Features via Steering
LLMs

arXiv - AI · 3 min ·
[2603.22582] Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?
LLMs

arXiv - AI · 4 min ·
[2603.22577] STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving
LLMs

arXiv - AI · 4 min ·
[2603.22528] GraphRAG for Engineering Diagrams: ChatP&ID Enables LLM Interaction with P&IDs
LLMs

arXiv - AI · 4 min ·
[2603.22519] LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface
LLMs

arXiv - AI · 4 min ·
[2603.22510] Do Large Language Models Reduce Research Novelty? Evidence from Information Systems Journals
LLMs

arXiv - AI · 3 min ·
[2603.22492] Tiny Inference-Time Scaling with Latent Verifiers
LLMs

arXiv - AI · 4 min ·
[2603.22479] Cognitive Training for Language Models: Towards General Capabilities via Cross-Entropy Games
LLMs

arXiv - AI · 3 min ·
[2603.22473] Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures
LLMs

arXiv - AI · 3 min ·
[2603.22355] Demystifying Low-Rank Knowledge Distillation in Large Language Models: Convergence, Generalization, and Information-Theoretic Guarantees
LLMs

arXiv - Machine Learning · 4 min ·
[2603.22344] Errors in AI-Assisted Retrieval of Medical Literature: A Comparative Study
LLMs

arXiv - Machine Learning · 4 min ·
[2603.22459] LLM-guided headline rewriting for clickability enhancement without clickbait
LLMs

arXiv - AI · 4 min ·
[2603.22446] Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs
LLMs

arXiv - AI · 4 min ·
[2603.22330] Fair splits flip the leaderboard: CHANRG reveals limited generalization in RNA secondary-structure prediction
LLMs

arXiv - Machine Learning · 3 min ·
[2603.22376] AI Co-Scientist for Ranking: Discovering Novel Search Ranking Models alongside LLM-based AI Agents with Cloud Computing Access
LLMs

arXiv - AI · 4 min ·
[2603.23461] End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions
LLMs

arXiv - Machine Learning · 3 min ·
[2603.23414] SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling
LLMs

arXiv - AI · 4 min ·