Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

8 free AI courses from Anthropic’s Claude platform with certificates

AI News - General ·
Llms

Claude developer hosts Christian leaders for AI summit

AI Tools & Products ·
CoreWeave stock pops 11% on deal to power Anthropic's Claude
Llms

CoreWeave stock pops 11% on deal to power Anthropic's Claude

AI Tools & Products · 3 min ·

All Content

[2603.02193] Symbol-Equivariant Recurrent Reasoning Models
Llms

[2603.02193] Symbol-Equivariant Recurrent Reasoning Models

Abstract page for arXiv paper 2603.02193: Symbol-Equivariant Recurrent Reasoning Models

arXiv - Machine Learning · 3 min ·
[2603.02188] Multi-Head Low-Rank Attention
Llms

[2603.02188] Multi-Head Low-Rank Attention

Abstract page for arXiv paper 2603.02188: Multi-Head Low-Rank Attention

arXiv - Machine Learning · 3 min ·
[2603.01696] Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcement Learning
Llms

[2603.01696] Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcement Learning

Abstract page for arXiv paper 2603.01696: Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcem...

arXiv - AI · 4 min ·
[2603.01694] MVR: Multi-view Video Reward Shaping for Reinforcement Learning
Llms

[2603.01694] MVR: Multi-view Video Reward Shaping for Reinforcement Learning

Abstract page for arXiv paper 2603.01694: MVR: Multi-view Video Reward Shaping for Reinforcement Learning

arXiv - Machine Learning · 4 min ·
[2603.02112] Recursive Models for Long-Horizon Reasoning
Llms

[2603.02112] Recursive Models for Long-Horizon Reasoning

Abstract page for arXiv paper 2603.02112: Recursive Models for Long-Horizon Reasoning

arXiv - Machine Learning · 3 min ·
[2603.02092] Adam Converges Without Any Modification On Update Rules
Llms

[2603.02092] Adam Converges Without Any Modification On Update Rules

Abstract page for arXiv paper 2603.02092: Adam Converges Without Any Modification On Update Rules

arXiv - Machine Learning · 4 min ·
[2603.01683] Surgical Post-Training: Cutting Errors, Keeping Knowledge
Llms

[2603.01683] Surgical Post-Training: Cutting Errors, Keeping Knowledge

Abstract page for arXiv paper 2603.01683: Surgical Post-Training: Cutting Errors, Keeping Knowledge

arXiv - AI · 3 min ·
[2603.02091] Learning from Synthetic Data Improves Multi-hop Reasoning
Llms

[2603.02091] Learning from Synthetic Data Improves Multi-hop Reasoning

Abstract page for arXiv paper 2603.02091: Learning from Synthetic Data Improves Multi-hop Reasoning

arXiv - Machine Learning · 3 min ·
[2603.01651] LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence
Llms

[2603.01651] LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence

Abstract page for arXiv paper 2603.01651: LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence

arXiv - AI · 4 min ·
[2603.02045] Expanding LLM Agent Boundaries with Strategy-Guided Exploration
Llms

[2603.02045] Expanding LLM Agent Boundaries with Strategy-Guided Exploration

Abstract page for arXiv paper 2603.02045: Expanding LLM Agent Boundaries with Strategy-Guided Exploration

arXiv - Machine Learning · 4 min ·
[2603.01625] Measuring What VLMs Don't Say: Validation Metrics Hide Clinical Terminology Erasure in Radiology Report Generation
Llms

[2603.01625] Measuring What VLMs Don't Say: Validation Metrics Hide Clinical Terminology Erasure in Radiology Report Generation

Abstract page for arXiv paper 2603.01625: Measuring What VLMs Don't Say: Validation Metrics Hide Clinical Terminology Erasure in Radiolog...

arXiv - AI · 4 min ·
[2603.01574] DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual Entropy Lull Pattern
Llms

[2603.01574] DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual Entropy Lull Pattern

Abstract page for arXiv paper 2603.01574: DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual ...

arXiv - AI · 4 min ·
[2603.01550] Extracting Training Dialogue Data from Large Language Model based Task Bots
Llms

[2603.01550] Extracting Training Dialogue Data from Large Language Model based Task Bots

Abstract page for arXiv paper 2603.01550: Extracting Training Dialogue Data from Large Language Model based Task Bots

arXiv - AI · 4 min ·
[2603.01950] Semantic Similarity is a Spurious Measure of Comic Understanding: Lessons Learned from Hallucinations in a Benchmarking Experiment
Llms

[2603.01950] Semantic Similarity is a Spurious Measure of Comic Understanding: Lessons Learned from Hallucinations in a Benchmarking Experiment

Abstract page for arXiv paper 2603.01950: Semantic Similarity is a Spurious Measure of Comic Understanding: Lessons Learned from Hallucin...

arXiv - Machine Learning · 3 min ·
[2603.01499] Towards Privacy-Preserving LLM Inference via Collaborative Obfuscation (Technical Report)
Llms

[2603.01499] Towards Privacy-Preserving LLM Inference via Collaborative Obfuscation (Technical Report)

Abstract page for arXiv paper 2603.01499: Towards Privacy-Preserving LLM Inference via Collaborative Obfuscation (Technical Report)

arXiv - AI · 4 min ·
[2603.01494] Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision
Llms

[2603.01494] Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision

Abstract page for arXiv paper 2603.01494: Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision

arXiv - Machine Learning · 4 min ·
[2603.01907] Efficient RLVR Training via Weighted Mutual Information Data Selection
Llms

[2603.01907] Efficient RLVR Training via Weighted Mutual Information Data Selection

Abstract page for arXiv paper 2603.01907: Efficient RLVR Training via Weighted Mutual Information Data Selection

arXiv - Machine Learning · 4 min ·
[2603.01879] Diagnosing Generalization Failures from Representational Geometry Markers
Llms

[2603.01879] Diagnosing Generalization Failures from Representational Geometry Markers

Abstract page for arXiv paper 2603.01879: Diagnosing Generalization Failures from Representational Geometry Markers

arXiv - Machine Learning · 4 min ·
[2603.01455] From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents
Llms

[2603.01455] From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents

Abstract page for arXiv paper 2603.01455: From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottlene...

arXiv - AI · 4 min ·
[2603.01454] VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models
Llms

[2603.01454] VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models

Abstract page for arXiv paper 2603.01454: VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models

arXiv - AI · 3 min ·
Previous Page 167 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime