Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

Moving Past "LLM Vibes" toward Structural Enforcement in AI Agents

We need to address the structural failure currently happening in the AI agent space: too many people are building a beautiful "pedestal" ...

Reddit - Artificial Intelligence · 1 min · 19 minutes ago

Llms

My dream of a fully generative game is getting pretty close to possible now. I made a demo where you can prompt any spell and fight online.

Prompt any spell and use it in a 3D physics-based world, powered by Gemini 3. Full multiplayer support for up to 6 players with VoIP. All m...

Reddit - Artificial Intelligence · 1 min · 19 minutes ago

Llms

Caliber: open-source community registry for AI agent config files (CLAUDE.md, .cursor/rules, GEMINI.md) — 888 stars

AI coding tools like Claude Code, Cursor, and Gemini CLI have created a new category of infrastructure: agent configuration files. Develo...

Reddit - Artificial Intelligence · 1 min · 19 minutes ago

All Content

Llms

[2508.20729] Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision

Abstract page for arXiv paper 2508.20729: Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision

arXiv - AI · 4 min · 2 months ago

Llms

[2508.15030] Collab-REC: An LLM-based Agentic Framework for Balancing Recommendations in Tourism

Abstract page for arXiv paper 2508.15030: Collab-REC: An LLM-based Agentic Framework for Balancing Recommendations in Tourism

arXiv - AI · 3 min · 2 months ago

Llms

[2507.16145] SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validation in COPD Reporting

Abstract page for arXiv paper 2507.16145: SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validati...

arXiv - AI · 4 min · 2 months ago

Llms

[2506.24119] SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Abstract page for arXiv paper 2506.24119: SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforce...

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2603.01089] CARD: Towards Conditional Design of Multi-agent Topological Structures

Abstract page for arXiv paper 2603.01089: CARD: Towards Conditional Design of Multi-agent Topological Structures

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2506.00530] CityLens: Evaluating Large Vision-Language Models for Urban Socioeconomic Sensing

Abstract page for arXiv paper 2506.00530: CityLens: Evaluating Large Vision-Language Models for Urban Socioeconomic Sensing

arXiv - AI · 4 min · 2 months ago

Llms

[2505.12565] mCLM: A Modular Chemical Language Model that Generates Functional and Makeable Molecules

Abstract page for arXiv paper 2505.12565: mCLM: A Modular Chemical Language Model that Generates Functional and Makeable Molecules

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2505.19653] Token-Importance Guided Direct Preference Optimization

Abstract page for arXiv paper 2505.19653: Token-Importance Guided Direct Preference Optimization

arXiv - AI · 3 min · 2 months ago

Llms

[2504.18453] Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation

Abstract page for arXiv paper 2504.18453: Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Ge...

arXiv - AI · 4 min · 2 months ago

Llms

[2502.07644] SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models

Abstract page for arXiv paper 2502.07644: SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models

arXiv - AI · 4 min · 2 months ago

Llms

[2503.11832] Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated by Machine Unlearning

Abstract page for arXiv paper 2503.11832: Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated ...

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2603.00846] Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models

Abstract page for arXiv paper 2603.00846: Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2412.03772] A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices

Abstract page for arXiv paper 2412.03772: A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices

arXiv - AI · 4 min · 2 months ago

Llms

[2410.05669] ACPBench: Reasoning about Action, Change, and Planning

Abstract page for arXiv paper 2410.05669: ACPBench: Reasoning about Action, Change, and Planning

arXiv - AI · 4 min · 2 months ago

Llms

[2408.05233] Electric Vehicle User Charging Behavior Analysis Integrating Psychological and Environmental Factors: A Statistical-Driven LLM based Agent Approach

Abstract page for arXiv paper 2408.05233: Electric Vehicle User Charging Behavior Analysis Integrating Psychological and Environmental Fa...

arXiv - AI · 4 min · 2 months ago

Llms

[2603.00638] RAIE: Region-Aware Incremental Preference Editing with LoRA for LLM-based Recommendation

Abstract page for arXiv paper 2603.00638: RAIE: Region-Aware Incremental Preference Editing with LoRA for LLM-based Recommendation

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2603.02156] How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks

Abstract page for arXiv paper 2603.02156: How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks

arXiv - AI · 4 min · 2 months ago

Llms

[2603.02128] LLMs as Strategic Actors: Behavioral Alignment, Risk Calibration, and Argumentation Framing in Geopolitical Simulations

Abstract page for arXiv paper 2603.02128: LLMs as Strategic Actors: Behavioral Alignment, Risk Calibration, and Argumentation Framing in ...

arXiv - AI · 3 min · 2 months ago

Llms

[2603.00474] Wireless Power Control Based on Large Language Models

Abstract page for arXiv paper 2603.00474: Wireless Power Control Based on Large Language Models

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2603.00359] How Large Language Models Get Stuck: Early structure with persistent errors

Abstract page for arXiv paper 2603.00359: How Large Language Models Get Stuck: Early structure with persistent errors

arXiv - Machine Learning · 4 min · 2 months ago
