Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Anthropic Restricts Claude Agent Access Amid AI Automation Boom in Crypto

AI Tools & Products · 7 min ·
Is cutting ‘please’ when talking to ChatGPT better for the planet? An expert explains

AI Tools & Products · 5 min ·
AI Desktop 98 lets you chat with Claude, ChatGPT, and Gemini through a Windows 98-inspired interface

AI Tools & Products · 3 min ·

All Content

[2603.19276] From Flat to Structural: Enhancing Automated Short Answer Grading with GraphRAG

arXiv - AI · 4 min ·
[2603.19275] Improving Automatic Summarization of Radiology Reports through Mid-Training of Large Language Models

arXiv - AI · 3 min ·
[2603.19274] CURE: A Multimodal Benchmark for Clinical Understanding and Retrieval Evaluation

arXiv - AI · 4 min ·
[2603.19273] LSR: Linguistic Safety Robustness Benchmark for Low-Resource West African Languages

arXiv - AI · 3 min ·
[2603.19271] A Human-Centered Workflow for Using Large Language Models in Content Analysis

arXiv - AI · 3 min ·
[2603.19268] Full-Stack Domain Enhancement for Combustion LLMs: Construction and Optimization

arXiv - AI · 3 min ·
[2603.19266] Probing to Refine: Reinforcement Distillation of LLMs via Explanatory Inversion

arXiv - Machine Learning · 4 min ·
[2603.19265] When the Pure Reasoner Meets the Impossible Object: Analytic vs. Synthetic Fine-Tuning and the Suppression of Genesis in Language Models

arXiv - AI · 4 min ·
[2603.19264] Generative Active Testing: Efficient LLM Evaluation via Proxy Task Adaptation

arXiv - AI · 3 min ·
[2603.19262] The α-Law of Observable Belief Revision in Large Language Model Inference

arXiv - AI · 4 min ·
[2603.19255] LARFT: Closing the Cognition-Action Gap for Length Instruction Following in Large Language Models

arXiv - AI · 4 min ·
[2603.19258] MAPLE: Metadata Augmented Private Language Evolution

arXiv - Machine Learning · 4 min ·
[2603.19252] GeoChallenge: A Multi-Answer Multiple-Choice Benchmark for Geometric Reasoning with Diagrams

arXiv - AI · 3 min ·
[2603.19253] A comprehensive study of LLM-based argument classification: from Llama through DeepSeek to GPT-5.2

arXiv - AI · 4 min ·
[2603.19236] L-PRISMA: An Extension of PRISMA in the Era of Generative Artificial Intelligence (GenAI)

arXiv - AI · 3 min ·
[2603.19247] When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models

arXiv - AI · 4 min ·
[2603.17765] Grounded Multimodal Retrieval-Augmented Drafting of Radiology Impressions Using Case-Based Similarity Search

arXiv - AI · 4 min ·
[2603.20170] Learning Dynamic Belief Graphs for Theory-of-mind Reasoning

arXiv - AI · 3 min ·
[2603.20101] Pitfalls in Evaluating Interpretability Agents

arXiv - AI · 4 min ·
[2603.20046] Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs

arXiv - AI · 4 min ·