Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Claude Code x n8n

Hi everyone, I’ve been exploring MCP and integrating tools like n8n with Claude Code, and I’m trying to understand how practical this rea...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

LLM comprehension question

Basically, does anyone else also get a really strange sense of lingering confusion and non-comprehension when an LLM explains a complex c...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Curated 550+ free AI tools useful for building projects (LLMs, APIs, local models, RAG, agents)

Over the last few days I was collecting free or low cost AI tools that are actually useful if you want to build stuff, not just try rando...

Reddit - Artificial Intelligence · 1 min · about 7 hours ago

All Content

[2505.19590] Learning to Reason without External Rewards

arXiv - Machine Learning · 3 min · about 1 month ago

[2506.13474] Language Agents for Hypothesis-driven Clinical Decision Making with Reinforcement Learning

arXiv - Machine Learning · 4 min · about 1 month ago

[2506.15498] SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling

arXiv - Machine Learning · 4 min · about 1 month ago

[2505.18116] NFT: Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

arXiv - Machine Learning · 4 min · about 1 month ago

[2506.10085] VITA: Zero-Shot Value Functions via Test-Time Adaptation of Vision-Language Models

arXiv - AI · 4 min · about 1 month ago

[2505.16122] Plan and Budget: Effective and Efficient Test-Time Scaling on Reasoning Large Language Models

arXiv - Machine Learning · 4 min · about 1 month ago

[2505.14042] Adversarially Pretrained Transformers May Be Universally Robust In-Context Learners

arXiv - Machine Learning · 4 min · about 1 month ago

[2506.08902] Intention-Conditioned Flow Occupancy Models

arXiv - Machine Learning · 4 min · about 1 month ago

[2506.06683] RoboPARA: Dual-Arm Robot Planning with Parallel Allocation and Recomposition Across Tasks

arXiv - AI · 4 min · about 1 month ago

[2506.03135] OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models

arXiv - AI · 4 min · about 1 month ago

[2506.02860] Tru-POMDP: Task Planning Under Uncertainty via Tree of Hypotheses and Open-Ended POMDPs

arXiv - AI · 3 min · about 1 month ago

[2505.11076] Addition is almost all you need: Compressing large language models with double binary factorization

arXiv - Machine Learning · 4 min · about 1 month ago

[2505.24298] AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

arXiv - Machine Learning · 4 min · about 1 month ago

[2505.21786] VeriTrail: Closed-Domain Hallucination Detection with Traceability

arXiv - AI · 3 min · about 1 month ago

[2504.03889] Identifying and Evaluating Inactive Heads in Pretrained LLMs

arXiv - Machine Learning · 4 min · about 1 month ago

[2505.20278] Characterizing Pattern Matching and Its Limits on Compositional Task Structures

arXiv - Machine Learning · 4 min · about 1 month ago

[2505.21413] RefTool: Reference-Guided Tool Creation for Knowledge-Intensive Reasoning

arXiv - AI · 4 min · about 1 month ago

[2505.21396] Augmenting Research Ideation with Data: An Empirical Investigation in Social Science

arXiv - AI · 4 min · about 1 month ago

[2503.08980] I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data?

arXiv - Machine Learning · 4 min · about 1 month ago

[2505.16056] Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models

arXiv - Machine Learning · 4 min · about 1 month ago
