Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

Moving Past "LLM Vibes" toward Structural Enforcement in AI Agents

We need to address the structural failure currently happening in the AI agent space: too many people are building a beautiful "pedestal" ...

Reddit - Artificial Intelligence · 1 min ·
Llms

My dream of a fully generative game is getting pretty close to possible now. I made a demo where you can prompt any spell and fight online.

Prompt any spell and use it in a 3D physics based world, powered by Gemini 3 Full multiplayer support for up to 6 players with VoIP All m...

Reddit - Artificial Intelligence · 1 min ·
Llms

Caliber: open-source community registry for AI agent config files (CLAUDE.md, .cursor/rules, GEMINI.md) — 888 stars

AI coding tools like Claude Code, Cursor, and Gemini CLI have created a new category of infrastructure: agent configuration files. Develo...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.02092] Adam Converges Without Any Modification On Update Rules
Llms

[2603.02092] Adam Converges Without Any Modification On Update Rules

Abstract page for arXiv paper 2603.02092: Adam Converges Without Any Modification On Update Rules

arXiv - Machine Learning · 4 min ·
[2603.01683] Surgical Post-Training: Cutting Errors, Keeping Knowledge
Llms

[2603.01683] Surgical Post-Training: Cutting Errors, Keeping Knowledge

Abstract page for arXiv paper 2603.01683: Surgical Post-Training: Cutting Errors, Keeping Knowledge

arXiv - AI · 3 min ·
[2603.02091] Learning from Synthetic Data Improves Multi-hop Reasoning
Llms

[2603.02091] Learning from Synthetic Data Improves Multi-hop Reasoning

Abstract page for arXiv paper 2603.02091: Learning from Synthetic Data Improves Multi-hop Reasoning

arXiv - Machine Learning · 3 min ·
[2603.01651] LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence
Llms

[2603.01651] LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence

Abstract page for arXiv paper 2603.01651: LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence

arXiv - AI · 4 min ·
[2603.02045] Expanding LLM Agent Boundaries with Strategy-Guided Exploration
Llms

[2603.02045] Expanding LLM Agent Boundaries with Strategy-Guided Exploration

Abstract page for arXiv paper 2603.02045: Expanding LLM Agent Boundaries with Strategy-Guided Exploration

arXiv - Machine Learning · 4 min ·
[2603.01625] Measuring What VLMs Don't Say: Validation Metrics Hide Clinical Terminology Erasure in Radiology Report Generation
Llms

[2603.01625] Measuring What VLMs Don't Say: Validation Metrics Hide Clinical Terminology Erasure in Radiology Report Generation

Abstract page for arXiv paper 2603.01625: Measuring What VLMs Don't Say: Validation Metrics Hide Clinical Terminology Erasure in Radiolog...

arXiv - AI · 4 min ·
[2603.01574] DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual Entropy Lull Pattern
Llms

[2603.01574] DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual Entropy Lull Pattern

Abstract page for arXiv paper 2603.01574: DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual ...

arXiv - AI · 4 min ·
[2603.01550] Extracting Training Dialogue Data from Large Language Model based Task Bots
Llms

[2603.01550] Extracting Training Dialogue Data from Large Language Model based Task Bots

Abstract page for arXiv paper 2603.01550: Extracting Training Dialogue Data from Large Language Model based Task Bots

arXiv - AI · 4 min ·
[2603.01950] Semantic Similarity is a Spurious Measure of Comic Understanding: Lessons Learned from Hallucinations in a Benchmarking Experiment
Llms

[2603.01950] Semantic Similarity is a Spurious Measure of Comic Understanding: Lessons Learned from Hallucinations in a Benchmarking Experiment

Abstract page for arXiv paper 2603.01950: Semantic Similarity is a Spurious Measure of Comic Understanding: Lessons Learned from Hallucin...

arXiv - Machine Learning · 3 min ·
[2603.01499] Towards Privacy-Preserving LLM Inference via Collaborative Obfuscation (Technical Report)
Llms

[2603.01499] Towards Privacy-Preserving LLM Inference via Collaborative Obfuscation (Technical Report)

Abstract page for arXiv paper 2603.01499: Towards Privacy-Preserving LLM Inference via Collaborative Obfuscation (Technical Report)

arXiv - AI · 4 min ·
[2603.01494] Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision
Llms

[2603.01494] Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision

Abstract page for arXiv paper 2603.01494: Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision

arXiv - Machine Learning · 4 min ·
[2603.01907] Efficient RLVR Training via Weighted Mutual Information Data Selection
Llms

[2603.01907] Efficient RLVR Training via Weighted Mutual Information Data Selection

Abstract page for arXiv paper 2603.01907: Efficient RLVR Training via Weighted Mutual Information Data Selection

arXiv - Machine Learning · 4 min ·
[2603.01879] Diagnosing Generalization Failures from Representational Geometry Markers
Llms

[2603.01879] Diagnosing Generalization Failures from Representational Geometry Markers

Abstract page for arXiv paper 2603.01879: Diagnosing Generalization Failures from Representational Geometry Markers

arXiv - Machine Learning · 4 min ·
[2603.01455] From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents
Llms

[2603.01455] From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents

Abstract page for arXiv paper 2603.01455: From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottlene...

arXiv - AI · 4 min ·
[2603.01454] VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models
Llms

[2603.01454] VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models

Abstract page for arXiv paper 2603.01454: VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models

arXiv - AI · 3 min ·
[2603.01438] Enhancing Persona Following at Decoding Time via Dynamic Importance Estimation for Role-Playing Agents
Llms

[2603.01438] Enhancing Persona Following at Decoding Time via Dynamic Importance Estimation for Role-Playing Agents

Abstract page for arXiv paper 2603.01438: Enhancing Persona Following at Decoding Time via Dynamic Importance Estimation for Role-Playing...

arXiv - AI · 4 min ·
[2603.01385] Toward Graph-Tokenizing Large Language Models with Reconstructive Graph Instruction Tuning
Llms

[2603.01385] Toward Graph-Tokenizing Large Language Models with Reconstructive Graph Instruction Tuning

Abstract page for arXiv paper 2603.01385: Toward Graph-Tokenizing Large Language Models with Reconstructive Graph Instruction Tuning

arXiv - AI · 4 min ·
[2603.01780] D3LM: A Discrete DNA Diffusion Language Model for Bidirectional DNA Understanding and Generation
Llms

[2603.01780] D3LM: A Discrete DNA Diffusion Language Model for Bidirectional DNA Understanding and Generation

Abstract page for arXiv paper 2603.01780: D3LM: A Discrete DNA Diffusion Language Model for Bidirectional DNA Understanding and Generation

arXiv - Machine Learning · 4 min ·
[2603.01761] Modular Memory is the Key to Continual Learning Agents
Llms

[2603.01761] Modular Memory is the Key to Continual Learning Agents

Abstract page for arXiv paper 2603.01761: Modular Memory is the Key to Continual Learning Agents

arXiv - Machine Learning · 4 min ·
[2603.01759] Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning
Llms

[2603.01759] Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning

Abstract page for arXiv paper 2603.01759: Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning

arXiv - Machine Learning · 4 min ·
Previous Page 313 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime