Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

Claude Opus 4.6 API at 40% below Anthropic pricing – try free before you pay anything

Hey everyone I've set up a self-hosted API gateway using [New-API](QuantumNous/new-ap) to manage and distribute Claude Opus 4.6 access ac...

Reddit - Artificial Intelligence · 1 min ·
Hackers Are Posting the Claude Code Leak With Bonus Malware | WIRED
Llms

Hackers Are Posting the Claude Code Leak With Bonus Malware | WIRED

Plus: The FBI says a recent hack of its wiretap tools poses a national security risk, attackers stole Cisco source code as part of an ong...

Wired - AI · 9 min ·
Llms

People anxious about deviating from what AI tells them to do?

My friend came over yesterday to dye her hair. She had asked ChatGPT for the 'correct' way to do it. Chat told her to dye the ends first,...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.20229] Characterizing the ability of LLMs to recapitulate Americans' distributional responses to public opinion polling questions across political issues
Llms

[2603.20229] Characterizing the ability of LLMs to recapitulate Americans' distributional responses to public opinion polling questions across political issues

Abstract page for arXiv paper 2603.20229: Characterizing the ability of LLMs to recapitulate Americans' distributional responses to publi...

arXiv - AI · 4 min ·
[2603.20225] The Arrival of AGI? When Expert Personas Exceed Expert Benchmarks
Llms

[2603.20225] The Arrival of AGI? When Expert Personas Exceed Expert Benchmarks

Abstract page for arXiv paper 2603.20225: The Arrival of AGI? When Expert Personas Exceed Expert Benchmarks

arXiv - AI · 4 min ·
[2603.20216] Locally Coherent Parallel Decoding in Diffusion Language Models
Llms

[2603.20216] Locally Coherent Parallel Decoding in Diffusion Language Models

Abstract page for arXiv paper 2603.20216: Locally Coherent Parallel Decoding in Diffusion Language Models

arXiv - Machine Learning · 3 min ·
[2603.20211] Exploring Teacher-Chatbot Interaction and Affect in Block-Based Programming
Llms

[2603.20211] Exploring Teacher-Chatbot Interaction and Affect in Block-Based Programming

Abstract page for arXiv paper 2603.20211: Exploring Teacher-Chatbot Interaction and Affect in Block-Based Programming

arXiv - AI · 3 min ·
[2603.20209] Children's Intelligence Tests Pose Challenges for MLLMs? KidGym: A 2D Grid-Based Reasoning Benchmark for MLLMs
Llms

[2603.20209] Children's Intelligence Tests Pose Challenges for MLLMs? KidGym: A 2D Grid-Based Reasoning Benchmark for MLLMs

Abstract page for arXiv paper 2603.20209: Children's Intelligence Tests Pose Challenges for MLLMs? KidGym: A 2D Grid-Based Reasoning Benc...

arXiv - AI · 4 min ·
[2603.20208] RedacBench: Can AI Erase Your Secrets?
Llms

[2603.20208] RedacBench: Can AI Erase Your Secrets?

Abstract page for arXiv paper 2603.20208: RedacBench: Can AI Erase Your Secrets?

arXiv - AI · 3 min ·
[2603.20206] Enhancing Safety of Large Language Models via Embedding Space Separation
Llms

[2603.20206] Enhancing Safety of Large Language Models via Embedding Space Separation

Abstract page for arXiv paper 2603.20206: Enhancing Safety of Large Language Models via Embedding Space Separation

arXiv - AI · 3 min ·
[2603.20204] Measuring Research Convergence in Interdisciplinary Teams Using Large Language Models and Graph Analytics
Llms

[2603.20204] Measuring Research Convergence in Interdisciplinary Teams Using Large Language Models and Graph Analytics

Abstract page for arXiv paper 2603.20204: Measuring Research Convergence in Interdisciplinary Teams Using Large Language Models and Graph...

arXiv - AI · 4 min ·
[2603.22179] MARCUS: An agentic, multimodal vision-language model for cardiac diagnosis and management
Llms

[2603.22179] MARCUS: An agentic, multimodal vision-language model for cardiac diagnosis and management

Abstract page for arXiv paper 2603.22179: MARCUS: An agentic, multimodal vision-language model for cardiac diagnosis and management

arXiv - AI · 4 min ·
[2603.22097] SpecTM: Spectral Targeted Masking for Trustworthy Foundation Models
Llms

[2603.22097] SpecTM: Spectral Targeted Masking for Trustworthy Foundation Models

Abstract page for arXiv paper 2603.22097: SpecTM: Spectral Targeted Masking for Trustworthy Foundation Models

arXiv - Machine Learning · 3 min ·
[2603.22096] GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning
Llms

[2603.22096] GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning

Abstract page for arXiv paper 2603.22096: GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning

arXiv - AI · 3 min ·
[2603.22083] A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP
Llms

[2603.22083] A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP

Abstract page for arXiv paper 2603.22083: A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP

arXiv - AI · 4 min ·
[2603.21854] Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models
Llms

[2603.21854] Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models

Abstract page for arXiv paper 2603.21854: Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language ...

arXiv - AI · 4 min ·
[2603.21745] The Presupposition Problem in Representation Genesis
Llms

[2603.21745] The Presupposition Problem in Representation Genesis

Abstract page for arXiv paper 2603.21745: The Presupposition Problem in Representation Genesis

arXiv - AI · 4 min ·
[2603.21728] EvoIdeator: Evolving Scientific Ideas through Checklist-Grounded Reinforcement Learning
Llms

[2603.21728] EvoIdeator: Evolving Scientific Ideas through Checklist-Grounded Reinforcement Learning

Abstract page for arXiv paper 2603.21728: EvoIdeator: Evolving Scientific Ideas through Checklist-Grounded Reinforcement Learning

arXiv - AI · 4 min ·
[2603.21725] CurvZO: Adaptive Curvature-Guided Sparse Zeroth-Order Optimization for Efficient LLM Fine-Tuning
Llms

[2603.21725] CurvZO: Adaptive Curvature-Guided Sparse Zeroth-Order Optimization for Efficient LLM Fine-Tuning

Abstract page for arXiv paper 2603.21725: CurvZO: Adaptive Curvature-Guided Sparse Zeroth-Order Optimization for Efficient LLM Fine-Tuning

arXiv - Machine Learning · 4 min ·
[2603.21708] Compensating Visual Insufficiency with Stratified Language Guidance for Long-Tail Class Incremental Learning
Llms

[2603.21708] Compensating Visual Insufficiency with Stratified Language Guidance for Long-Tail Class Incremental Learning

Abstract page for arXiv paper 2603.21708: Compensating Visual Insufficiency with Stratified Language Guidance for Long-Tail Class Increme...

arXiv - AI · 3 min ·
[2603.21693] Deterministic Hallucination Detection in Medical VQA via Confidence-Evidence Bayesian Gain
Llms

[2603.21693] Deterministic Hallucination Detection in Medical VQA via Confidence-Evidence Bayesian Gain

Abstract page for arXiv paper 2603.21693: Deterministic Hallucination Detection in Medical VQA via Confidence-Evidence Bayesian Gain

arXiv - AI · 4 min ·
[2603.21690] AI Token Futures Market: Commoditization of Compute and Derivatives Contract Design
Llms

[2603.21690] AI Token Futures Market: Commoditization of Compute and Derivatives Contract Design

Abstract page for arXiv paper 2603.21690: AI Token Futures Market: Commoditization of Compute and Derivatives Contract Design

arXiv - AI · 3 min ·
[2603.21636] Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks
Llms

[2603.21636] Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks

Abstract page for arXiv paper 2603.21636: Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confide...

arXiv - AI · 4 min ·
Previous Page 71 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime