Large Language Models

GPT, Claude, Gemini, and other LLMs


Top This Week

LLMs

Claude Opus 4.6 API at 40% below Anthropic pricing – try free before you pay anything

Hey everyone I've set up a self-hosted API gateway using [New-API](QuantumNous/new-ap) to manage and distribute Claude Opus 4.6 access ac...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

LLMs

Hackers Are Posting the Claude Code Leak With Bonus Malware | WIRED

Plus: The FBI says a recent hack of its wiretap tools poses a national security risk, attackers stole Cisco source code as part of an ong...

Wired - AI · 9 min · about 3 hours ago

LLMs

People anxious about deviating from what AI tells them to do?

My friend came over yesterday to dye her hair. She had asked ChatGPT for the 'correct' way to do it. Chat told her to dye the ends first,...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

All Content

LLMs

[2603.20229] Characterizing the ability of LLMs to recapitulate Americans' distributional responses to public opinion polling questions across political issues


arXiv - AI · 4 min · 11 days ago

LLMs

[2603.20225] The Arrival of AGI? When Expert Personas Exceed Expert Benchmarks


arXiv - AI · 4 min · 11 days ago

LLMs

[2603.20216] Locally Coherent Parallel Decoding in Diffusion Language Models


arXiv - Machine Learning · 3 min · 11 days ago

LLMs

[2603.20211] Exploring Teacher-Chatbot Interaction and Affect in Block-Based Programming


arXiv - AI · 3 min · 11 days ago

LLMs

[2603.20209] Children's Intelligence Tests Pose Challenges for MLLMs? KidGym: A 2D Grid-Based Reasoning Benchmark for MLLMs


arXiv - AI · 4 min · 11 days ago

LLMs

[2603.20208] RedacBench: Can AI Erase Your Secrets?


arXiv - AI · 3 min · 11 days ago

LLMs

[2603.20206] Enhancing Safety of Large Language Models via Embedding Space Separation


arXiv - AI · 3 min · 11 days ago

LLMs

[2603.20204] Measuring Research Convergence in Interdisciplinary Teams Using Large Language Models and Graph Analytics


arXiv - AI · 4 min · 11 days ago

LLMs

[2603.22179] MARCUS: An agentic, multimodal vision-language model for cardiac diagnosis and management


arXiv - AI · 4 min · 11 days ago

LLMs

[2603.22097] SpecTM: Spectral Targeted Masking for Trustworthy Foundation Models


arXiv - Machine Learning · 3 min · 11 days ago

LLMs

[2603.22096] GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning


arXiv - AI · 3 min · 11 days ago

LLMs

[2603.22083] A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP


arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21854] Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models


arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21745] The Presupposition Problem in Representation Genesis


arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21728] EvoIdeator: Evolving Scientific Ideas through Checklist-Grounded Reinforcement Learning


arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21725] CurvZO: Adaptive Curvature-Guided Sparse Zeroth-Order Optimization for Efficient LLM Fine-Tuning


arXiv - Machine Learning · 4 min · 11 days ago

LLMs

[2603.21708] Compensating Visual Insufficiency with Stratified Language Guidance for Long-Tail Class Incremental Learning


arXiv - AI · 3 min · 11 days ago

LLMs

[2603.21693] Deterministic Hallucination Detection in Medical VQA via Confidence-Evidence Bayesian Gain


arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21690] AI Token Futures Market: Commoditization of Compute and Derivatives Contract Design


arXiv - AI · 3 min · 11 days ago

LLMs

[2603.21636] Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks


arXiv - AI · 4 min · 11 days ago

