Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Moving Past "LLM Vibes" toward Structural Enforcement in AI Agents

We need to address the structural failure currently happening in the AI agent space: too many people are building a beautiful "pedestal" ...

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

My dream of a fully generative game is getting pretty close to possible now. I made a demo where you can prompt any spell and fight online.

Prompt any spell and use it in a 3D physics-based world, powered by Gemini 3. Full multiplayer support for up to 6 players with VoIP. All m...

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

Caliber: open-source community registry for AI agent config files (CLAUDE.md, .cursor/rules, GEMINI.md) — 888 stars

AI coding tools like Claude Code, Cursor, and Gemini CLI have created a new category of infrastructure: agent configuration files. Develo...

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

All Content

[2603.02092] Adam Converges Without Any Modification On Update Rules

Abstract page for arXiv paper 2603.02092: Adam Converges Without Any Modification On Update Rules

arXiv - Machine Learning · 4 min · 2 months ago

[2603.01683] Surgical Post-Training: Cutting Errors, Keeping Knowledge

Abstract page for arXiv paper 2603.01683: Surgical Post-Training: Cutting Errors, Keeping Knowledge

arXiv - AI · 3 min · 2 months ago

[2603.02091] Learning from Synthetic Data Improves Multi-hop Reasoning

Abstract page for arXiv paper 2603.02091: Learning from Synthetic Data Improves Multi-hop Reasoning

arXiv - Machine Learning · 3 min · 2 months ago

[2603.01651] LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence

Abstract page for arXiv paper 2603.01651: LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence

arXiv - AI · 4 min · 2 months ago

[2603.02045] Expanding LLM Agent Boundaries with Strategy-Guided Exploration

Abstract page for arXiv paper 2603.02045: Expanding LLM Agent Boundaries with Strategy-Guided Exploration

arXiv - Machine Learning · 4 min · 2 months ago

[2603.01625] Measuring What VLMs Don't Say: Validation Metrics Hide Clinical Terminology Erasure in Radiology Report Generation

Abstract page for arXiv paper 2603.01625: Measuring What VLMs Don't Say: Validation Metrics Hide Clinical Terminology Erasure in Radiolog...

arXiv - AI · 4 min · 2 months ago

[2603.01574] DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual Entropy Lull Pattern

Abstract page for arXiv paper 2603.01574: DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual ...

arXiv - AI · 4 min · 2 months ago

[2603.01550] Extracting Training Dialogue Data from Large Language Model based Task Bots

Abstract page for arXiv paper 2603.01550: Extracting Training Dialogue Data from Large Language Model based Task Bots

arXiv - AI · 4 min · 2 months ago

[2603.01950] Semantic Similarity is a Spurious Measure of Comic Understanding: Lessons Learned from Hallucinations in a Benchmarking Experiment

Abstract page for arXiv paper 2603.01950: Semantic Similarity is a Spurious Measure of Comic Understanding: Lessons Learned from Hallucin...

arXiv - Machine Learning · 3 min · 2 months ago

[2603.01499] Towards Privacy-Preserving LLM Inference via Collaborative Obfuscation (Technical Report)

Abstract page for arXiv paper 2603.01499: Towards Privacy-Preserving LLM Inference via Collaborative Obfuscation (Technical Report)

arXiv - AI · 4 min · 2 months ago

[2603.01494] Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision

Abstract page for arXiv paper 2603.01494: Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision

arXiv - Machine Learning · 4 min · 2 months ago

[2603.01907] Efficient RLVR Training via Weighted Mutual Information Data Selection

Abstract page for arXiv paper 2603.01907: Efficient RLVR Training via Weighted Mutual Information Data Selection

arXiv - Machine Learning · 4 min · 2 months ago

[2603.01879] Diagnosing Generalization Failures from Representational Geometry Markers

Abstract page for arXiv paper 2603.01879: Diagnosing Generalization Failures from Representational Geometry Markers

arXiv - Machine Learning · 4 min · 2 months ago

[2603.01455] From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents

Abstract page for arXiv paper 2603.01455: From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottlene...

arXiv - AI · 4 min · 2 months ago

[2603.01454] VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models

Abstract page for arXiv paper 2603.01454: VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models

arXiv - AI · 3 min · 2 months ago

[2603.01438] Enhancing Persona Following at Decoding Time via Dynamic Importance Estimation for Role-Playing Agents

Abstract page for arXiv paper 2603.01438: Enhancing Persona Following at Decoding Time via Dynamic Importance Estimation for Role-Playing...

arXiv - AI · 4 min · 2 months ago

[2603.01385] Toward Graph-Tokenizing Large Language Models with Reconstructive Graph Instruction Tuning

Abstract page for arXiv paper 2603.01385: Toward Graph-Tokenizing Large Language Models with Reconstructive Graph Instruction Tuning

arXiv - AI · 4 min · 2 months ago

[2603.01780] D3LM: A Discrete DNA Diffusion Language Model for Bidirectional DNA Understanding and Generation

Abstract page for arXiv paper 2603.01780: D3LM: A Discrete DNA Diffusion Language Model for Bidirectional DNA Understanding and Generation

arXiv - Machine Learning · 4 min · 2 months ago

[2603.01761] Modular Memory is the Key to Continual Learning Agents

Abstract page for arXiv paper 2603.01761: Modular Memory is the Key to Continual Learning Agents

arXiv - Machine Learning · 4 min · 2 months ago

[2603.01759] Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning

Abstract page for arXiv paper 2603.01759: Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning

arXiv - Machine Learning · 4 min · 2 months ago
