Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

AI Has Broken the Internet

So the web has been breaking a lot lately. Vercel is down. GitHub is down. Claude is down. Cloudflare is down. AWS is down. Everything is...

Reddit - Artificial Intelligence · 1 min ·
Llms

LLM agents can trigger real actions now. But what actually stops them from executing?

We ran into a simple but important issue while building agents with tool calling: the model can propose actions but nothing actually enfo...

Reddit - Artificial Intelligence · 1 min ·
Llms

Are LLMs a Dead End? (Investors Just Bet $1 Billion on “Yes”)

| AI Reality Check | Cal Newport Chapters 0:00 What is Yan LeCun Up To? 14:55 How is it possible that LeCun could be right about LLM’s be...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.09313] Curveball Steering: The Right Direction To Steer Isn't Always Linear
Llms

[2603.09313] Curveball Steering: The Right Direction To Steer Isn't Always Linear

Abstract page for arXiv paper 2603.09313: Curveball Steering: The Right Direction To Steer Isn't Always Linear

arXiv - AI · 3 min ·
[2603.08388] A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation
Llms

[2603.08388] A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation

Abstract page for arXiv paper 2603.08388: A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Gen...

arXiv - AI · 4 min ·
[2602.01297] RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis
Llms

[2602.01297] RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis

Abstract page for arXiv paper 2602.01297: RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis

arXiv - AI · 4 min ·
[2602.01082] EvoOpt-LLM: Evolving industrial optimization models with large language models
Llms

[2602.01082] EvoOpt-LLM: Evolving industrial optimization models with large language models

Abstract page for arXiv paper 2602.01082: EvoOpt-LLM: Evolving industrial optimization models with large language models

arXiv - AI · 4 min ·
[2601.10744] Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration
Llms

[2601.10744] Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration

Abstract page for arXiv paper 2601.10744: Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Fram...

arXiv - AI · 4 min ·
[2601.10679] Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models
Llms

[2601.10679] Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models

Abstract page for arXiv paper 2601.10679: Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoni...

arXiv - Machine Learning · 4 min ·
[2511.10065] RadHiera: Semantic Hierarchical Reinforcement Learning for Medical Report Generation
Llms

[2511.10065] RadHiera: Semantic Hierarchical Reinforcement Learning for Medical Report Generation

Abstract page for arXiv paper 2511.10065: RadHiera: Semantic Hierarchical Reinforcement Learning for Medical Report Generation

arXiv - AI · 4 min ·
[2511.06626] Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives
Llms

[2511.06626] Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives

Abstract page for arXiv paper 2511.06626: Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives

arXiv - AI · 4 min ·
[2512.14395] Massive Editing for Large Language Models Based on Dynamic Weight Generation
Llms

[2512.14395] Massive Editing for Large Language Models Based on Dynamic Weight Generation

Abstract page for arXiv paper 2512.14395: Massive Editing for Large Language Models Based on Dynamic Weight Generation

arXiv - AI · 4 min ·
[2511.16814] Stable diffusion models reveal a persisting human and AI gap in visual creativity
Llms

[2511.16814] Stable diffusion models reveal a persisting human and AI gap in visual creativity

Abstract page for arXiv paper 2511.16814: Stable diffusion models reveal a persisting human and AI gap in visual creativity

arXiv - AI · 4 min ·
[2511.03235] From Five Dimensions to Many: Large Language Models as Precise and Interpretable Psychological Profilers
Llms

[2511.03235] From Five Dimensions to Many: Large Language Models as Precise and Interpretable Psychological Profilers

Abstract page for arXiv paper 2511.03235: From Five Dimensions to Many: Large Language Models as Precise and Interpretable Psychological ...

arXiv - AI · 4 min ·
[2508.06931] Automated Formalization via Conceptual Retrieval-Augmented LLMs
Llms

[2508.06931] Automated Formalization via Conceptual Retrieval-Augmented LLMs

Abstract page for arXiv paper 2508.06931: Automated Formalization via Conceptual Retrieval-Augmented LLMs

arXiv - Machine Learning · 4 min ·
[2506.00835] SynPO: Synergizing Descriptiveness and Preference Optimization for Video Detailed Captioning
Llms

[2506.00835] SynPO: Synergizing Descriptiveness and Preference Optimization for Video Detailed Captioning

Abstract page for arXiv paper 2506.00835: SynPO: Synergizing Descriptiveness and Preference Optimization for Video Detailed Captioning

arXiv - AI · 4 min ·
[2505.23667] Formula-R1: Incentivizing LLM Reasoning over Complex Tables with Numerical Computation via Formula-Driven Reinforcement Learning
Llms

[2505.23667] Formula-R1: Incentivizing LLM Reasoning over Complex Tables with Numerical Computation via Formula-Driven Reinforcement Learning

Abstract page for arXiv paper 2505.23667: Formula-R1: Incentivizing LLM Reasoning over Complex Tables with Numerical Computation via Form...

arXiv - AI · 4 min ·
[2412.02868] PrecLLM: A Privacy-Preserving Framework for Efficient Clinical Annotation Extraction from Unstructured EHRs using Small-Scale LLMs
Llms

[2412.02868] PrecLLM: A Privacy-Preserving Framework for Efficient Clinical Annotation Extraction from Unstructured EHRs using Small-Scale LLMs

Abstract page for arXiv paper 2412.02868: PrecLLM: A Privacy-Preserving Framework for Efficient Clinical Annotation Extraction from Unstr...

arXiv - AI · 4 min ·
[2603.22281] ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model
Llms

[2603.22281] ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model

Abstract page for arXiv paper 2603.22281: ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model

arXiv - Machine Learning · 4 min ·
[2603.22279] 3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing
Llms

[2603.22279] 3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing

Abstract page for arXiv paper 2603.22279: 3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing

arXiv - AI · 3 min ·
[2603.22248] Confidence-Based Decoding is Provably Efficient for Diffusion Language Models
Llms

[2603.22248] Confidence-Based Decoding is Provably Efficient for Diffusion Language Models

Abstract page for arXiv paper 2603.22248: Confidence-Based Decoding is Provably Efficient for Diffusion Language Models

arXiv - Machine Learning · 4 min ·
[2603.22214] Evaluating the Reliability and Fidelity of Automated Judgment Systems of Large Language Models
Llms

[2603.22214] Evaluating the Reliability and Fidelity of Automated Judgment Systems of Large Language Models

Abstract page for arXiv paper 2603.22214: Evaluating the Reliability and Fidelity of Automated Judgment Systems of Large Language Models

arXiv - Machine Learning · 4 min ·
[2603.22213] SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection
Llms

[2603.22213] SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection

Abstract page for arXiv paper 2603.22213: SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection

arXiv - Machine Learning · 4 min ·
Previous Page 48 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime