Large Language Models
GPT, Claude, Gemini, and other LLMs
Top This Week
All Content
[2603.02193] Symbol-Equivariant Recurrent Reasoning Models
Abstract page for arXiv paper 2603.02193: Symbol-Equivariant Recurrent Reasoning Models
[2603.02188] Multi-Head Low-Rank Attention
Abstract page for arXiv paper 2603.02188: Multi-Head Low-Rank Attention
[2603.01696] Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcement Learning
Abstract page for arXiv paper 2603.01696: Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcem...
[2603.01694] MVR: Multi-view Video Reward Shaping for Reinforcement Learning
Abstract page for arXiv paper 2603.01694: MVR: Multi-view Video Reward Shaping for Reinforcement Learning
[2603.02112] Recursive Models for Long-Horizon Reasoning
Abstract page for arXiv paper 2603.02112: Recursive Models for Long-Horizon Reasoning
[2603.02092] Adam Converges Without Any Modification On Update Rules
Abstract page for arXiv paper 2603.02092: Adam Converges Without Any Modification On Update Rules
[2603.01683] Surgical Post-Training: Cutting Errors, Keeping Knowledge
Abstract page for arXiv paper 2603.01683: Surgical Post-Training: Cutting Errors, Keeping Knowledge
[2603.02091] Learning from Synthetic Data Improves Multi-hop Reasoning
Abstract page for arXiv paper 2603.02091: Learning from Synthetic Data Improves Multi-hop Reasoning
[2603.01651] LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence
Abstract page for arXiv paper 2603.01651: LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence
[2603.02045] Expanding LLM Agent Boundaries with Strategy-Guided Exploration
Abstract page for arXiv paper 2603.02045: Expanding LLM Agent Boundaries with Strategy-Guided Exploration
[2603.01625] Measuring What VLMs Don't Say: Validation Metrics Hide Clinical Terminology Erasure in Radiology Report Generation
Abstract page for arXiv paper 2603.01625: Measuring What VLMs Don't Say: Validation Metrics Hide Clinical Terminology Erasure in Radiolog...
[2603.01574] DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual Entropy Lull Pattern
Abstract page for arXiv paper 2603.01574: DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual ...
[2603.01550] Extracting Training Dialogue Data from Large Language Model based Task Bots
Abstract page for arXiv paper 2603.01550: Extracting Training Dialogue Data from Large Language Model based Task Bots
[2603.01950] Semantic Similarity is a Spurious Measure of Comic Understanding: Lessons Learned from Hallucinations in a Benchmarking Experiment
Abstract page for arXiv paper 2603.01950: Semantic Similarity is a Spurious Measure of Comic Understanding: Lessons Learned from Hallucin...
[2603.01499] Towards Privacy-Preserving LLM Inference via Collaborative Obfuscation (Technical Report)
Abstract page for arXiv paper 2603.01499: Towards Privacy-Preserving LLM Inference via Collaborative Obfuscation (Technical Report)
[2603.01494] Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision
Abstract page for arXiv paper 2603.01494: Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision
[2603.01907] Efficient RLVR Training via Weighted Mutual Information Data Selection
Abstract page for arXiv paper 2603.01907: Efficient RLVR Training via Weighted Mutual Information Data Selection
[2603.01879] Diagnosing Generalization Failures from Representational Geometry Markers
Abstract page for arXiv paper 2603.01879: Diagnosing Generalization Failures from Representational Geometry Markers
[2603.01455] From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents
Abstract page for arXiv paper 2603.01455: From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottlene...
[2603.01454] VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models
Abstract page for arXiv paper 2603.01454: VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models
Related Topics
Stay updated with AI News
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime