Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Is the Mirage Effect a bug, or is it Geometric Reconstruction in action? A framework for why VLMs perform better "hallucinating" than guessing, and what that may tell us about what's really inside these models

Last week, a team from Stanford and UCSF (Asadi, O'Sullivan, Fei-Fei Li, Euan Ashley et al.) dropped two companion papers. The first, MAR...

Reddit - Artificial Intelligence · 1 min ·
Paper Finds That Leading AI Chatbots Like ChatGPT and Claude Remain Incredibly Sycophantic, Resulting in Twisted Effects on Users

https://futurism.com/artificial-intelligence/paper-ai-chatbots-chatgpt-claude-sycophantic Your AI chatbot isn’t neutral. Trust its advice...

Reddit - Artificial Intelligence · 1 min ·
Claude Code leak exposes a Tamagotchi-style ‘pet’ and an always-on agent | The Verge

Anthropic says “human error” resulted in a leak that exposed Claude Code’s source code. The leaked code, which has since been copied to G...

The Verge - AI · 4 min ·

All Content

[2507.03156] The Impact of LLM-Assistants on Software Developer Productivity: A Systematic Review and Mapping Study

Abstract page for arXiv paper 2507.03156: The Impact of LLM-Assistants on Software Developer Productivity: A Systematic Review and Mappin...

arXiv - AI · 4 min ·
[2506.13925] Segmenting Visuals With Querying Words: Language Anchors For Semi-Supervised Image Segmentation

Abstract page for arXiv paper 2506.13925: Segmenting Visuals With Querying Words: Language Anchors For Semi-Supervised Image Segmentation

arXiv - AI · 4 min ·
[2506.11128] Theory-Grounded Evaluation of Human-Like Fallacy Patterns in LLM Reasoning

Abstract page for arXiv paper 2506.11128: Theory-Grounded Evaluation of Human-Like Fallacy Patterns in LLM Reasoning

arXiv - AI · 3 min ·
[2505.20730] Do LLMs Understand Collaborative Signals? Diagnosis and Repair

Abstract page for arXiv paper 2505.20730: Do LLMs Understand Collaborative Signals? Diagnosis and Repair

arXiv - Machine Learning · 4 min ·
[2504.14636] AlphaZero-Edu: Democratizing Access to AlphaZero

Abstract page for arXiv paper 2504.14636: AlphaZero-Edu: Democratizing Access to AlphaZero

arXiv - Machine Learning · 3 min ·
[2503.13401] Levels of Analysis for Large Language Models

Abstract page for arXiv paper 2503.13401: Levels of Analysis for Large Language Models

arXiv - AI · 3 min ·
[2502.11026] RLHF in an SFT Way: From Optimal Solution to Reward-Weighted Alignment

Abstract page for arXiv paper 2502.11026: RLHF in an SFT Way: From Optimal Solution to Reward-Weighted Alignment

arXiv - Machine Learning · 4 min ·
[2502.00618] DesCLIP: Robust Continual Learning via General Attribute Descriptions for VLM-Based Visual Recognition

Abstract page for arXiv paper 2502.00618: DesCLIP: Robust Continual Learning via General Attribute Descriptions for VLM-Based Visual Reco...

arXiv - AI · 4 min ·
[2501.02406] A Training-free Method for LLM Text Attribution

Abstract page for arXiv paper 2501.02406: A Training-free Method for LLM Text Attribution

arXiv - Machine Learning · 4 min ·
[2410.01591] Imaging foundation model for universal enhancement of non-ideal measurement CT

Abstract page for arXiv paper 2410.01591: Imaging foundation model for universal enhancement of non-ideal measurement CT

arXiv - AI · 4 min ·
[2402.01749] Towards Urban General Intelligence: A Review and Outlook of Urban Foundation Models

Abstract page for arXiv paper 2402.01749: Towards Urban General Intelligence: A Review and Outlook of Urban Foundation Models

arXiv - Machine Learning · 4 min ·
[2406.01914] HPE-CogVLM: Advancing Vision Language Models with a Head Pose Grounding Task

Abstract page for arXiv paper 2406.01914: HPE-CogVLM: Advancing Vision Language Models with a Head Pose Grounding Task

arXiv - AI · 4 min ·
[2603.18908] Secure Linear Alignment of Large Language Models

Abstract page for arXiv paper 2603.18908: Secure Linear Alignment of Large Language Models

arXiv - AI · 3 min ·
[2603.13239] Benchmarking Zero-Shot Reasoning Approaches for Error Detection in Solidity Smart Contracts

Abstract page for arXiv paper 2603.13239: Benchmarking Zero-Shot Reasoning Approaches for Error Detection in Solidity Smart Contracts

arXiv - AI · 4 min ·
[2603.11721] When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows

Abstract page for arXiv paper 2603.11721: When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows

arXiv - AI · 4 min ·
[2603.11679] LLMs can construct powerful representations and streamline sample-efficient supervised learning

Abstract page for arXiv paper 2603.11679: LLMs can construct powerful representations and streamline sample-efficient supervised learning

arXiv - AI · 4 min ·
[2603.09313] Curveball Steering: The Right Direction To Steer Isn't Always Linear

Abstract page for arXiv paper 2603.09313: Curveball Steering: The Right Direction To Steer Isn't Always Linear

arXiv - AI · 3 min ·
[2603.08388] A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation

Abstract page for arXiv paper 2603.08388: A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Gen...

arXiv - AI · 4 min ·
[2602.01297] RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis

Abstract page for arXiv paper 2602.01297: RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis

arXiv - AI · 4 min ·
[2602.01082] EvoOpt-LLM: Evolving industrial optimization models with large language models

Abstract page for arXiv paper 2602.01082: EvoOpt-LLM: Evolving industrial optimization models with large language models

arXiv - AI · 4 min ·