Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

Moving Past "LLM Vibes" toward Structural Enforcement in AI Agents

We need to address the structural failure currently happening in the AI agent space: too many people are building a beautiful "pedestal" ...

Reddit - Artificial Intelligence · 1 min ·
Llms

My dream of a fully generative game is getting pretty close to possible now. I made a demo where you can prompt any spell and fight online.

Prompt any spell and use it in a 3D physics based world, powered by Gemini 3 Full multiplayer support for up to 6 players with VoIP All m...

Reddit - Artificial Intelligence · 1 min ·
Llms

Caliber: open-source community registry for AI agent config files (CLAUDE.md, .cursor/rules, GEMINI.md) — 888 stars

AI coding tools like Claude Code, Cursor, and Gemini CLI have created a new category of infrastructure: agent configuration files. Develo...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2508.20729] Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision
Llms

[2508.20729] Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision

Abstract page for arXiv paper 2508.20729: Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision

arXiv - AI · 4 min ·
[2508.15030] Collab-REC: An LLM-based Agentic Framework for Balancing Recommendations in Tourism
Llms

[2508.15030] Collab-REC: An LLM-based Agentic Framework for Balancing Recommendations in Tourism

Abstract page for arXiv paper 2508.15030: Collab-REC: An LLM-based Agentic Framework for Balancing Recommendations in Tourism

arXiv - AI · 3 min ·
[2507.16145] SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validation in COPD Reporting
Llms

[2507.16145] SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validation in COPD Reporting

Abstract page for arXiv paper 2507.16145: SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validati...

arXiv - AI · 4 min ·
[2506.24119] SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
Llms

[2506.24119] SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Abstract page for arXiv paper 2506.24119: SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforce...

arXiv - Machine Learning · 4 min ·
[2603.01089] CARD: Towards Conditional Design of Multi-agent Topological Structures
Llms

[2603.01089] CARD: Towards Conditional Design of Multi-agent Topological Structures

Abstract page for arXiv paper 2603.01089: CARD: Towards Conditional Design of Multi-agent Topological Structures

arXiv - Machine Learning · 3 min ·
[2506.00530] CityLens: Evaluating Large Vision-Language Models for Urban Socioeconomic Sensing
Llms

[2506.00530] CityLens: Evaluating Large Vision-Language Models for Urban Socioeconomic Sensing

Abstract page for arXiv paper 2506.00530: CityLens: Evaluating Large Vision-Language Models for Urban Socioeconomic Sensing

arXiv - AI · 4 min ·
[2505.12565] mCLM: A Modular Chemical Language Model that Generates Functional and Makeable Molecules
Llms

[2505.12565] mCLM: A Modular Chemical Language Model that Generates Functional and Makeable Molecules

Abstract page for arXiv paper 2505.12565: mCLM: A Modular Chemical Language Model that Generates Functional and Makeable Molecules

arXiv - Machine Learning · 4 min ·
[2505.19653] Token-Importance Guided Direct Preference Optimization
Llms

[2505.19653] Token-Importance Guided Direct Preference Optimization

Abstract page for arXiv paper 2505.19653: Token-Importance Guided Direct Preference Optimization

arXiv - AI · 3 min ·
[2504.18453] Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation
Llms

[2504.18453] Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation

Abstract page for arXiv paper 2504.18453: Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Ge...

arXiv - AI · 4 min ·
[2502.07644] SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models
Llms

[2502.07644] SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models

Abstract page for arXiv paper 2502.07644: SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models

arXiv - AI · 4 min ·
[2503.11832] Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated by Machine Unlearning
Llms

[2503.11832] Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated by Machine Unlearning

Abstract page for arXiv paper 2503.11832: Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated ...

arXiv - Machine Learning · 4 min ·
[2603.00846] Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models
Llms

[2603.00846] Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models

Abstract page for arXiv paper 2603.00846: Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models

arXiv - Machine Learning · 3 min ·
[2412.03772] A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices
Llms

[2412.03772] A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices

Abstract page for arXiv paper 2412.03772: A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices

arXiv - AI · 4 min ·
[2410.05669] ACPBench: Reasoning about Action, Change, and Planning
Llms

[2410.05669] ACPBench: Reasoning about Action, Change, and Planning

Abstract page for arXiv paper 2410.05669: ACPBench: Reasoning about Action, Change, and Planning

arXiv - AI · 4 min ·
[2408.05233] Electric Vehicle User Charging Behavior Analysis Integrating Psychological and Environmental Factors: A Statistical-Driven LLM based Agent Approach
Llms

[2408.05233] Electric Vehicle User Charging Behavior Analysis Integrating Psychological and Environmental Factors: A Statistical-Driven LLM based Agent Approach

Abstract page for arXiv paper 2408.05233: Electric Vehicle User Charging Behavior Analysis Integrating Psychological and Environmental Fa...

arXiv - AI · 4 min ·
[2603.00638] RAIE: Region-Aware Incremental Preference Editing with LoRA for LLM-based Recommendation
Llms

[2603.00638] RAIE: Region-Aware Incremental Preference Editing with LoRA for LLM-based Recommendation

Abstract page for arXiv paper 2603.00638: RAIE: Region-Aware Incremental Preference Editing with LoRA for LLM-based Recommendation

arXiv - Machine Learning · 4 min ·
[2603.02156] How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks
Llms

[2603.02156] How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks

Abstract page for arXiv paper 2603.02156: How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks

arXiv - AI · 4 min ·
[2603.02128] LLMs as Strategic Actors: Behavioral Alignment, Risk Calibration, and Argumentation Framing in Geopolitical Simulations
Llms

[2603.02128] LLMs as Strategic Actors: Behavioral Alignment, Risk Calibration, and Argumentation Framing in Geopolitical Simulations

Abstract page for arXiv paper 2603.02128: LLMs as Strategic Actors: Behavioral Alignment, Risk Calibration, and Argumentation Framing in ...

arXiv - AI · 3 min ·
[2603.00474] Wireless Power Control Based on Large Language Models
Llms

[2603.00474] Wireless Power Control Based on Large Language Models

Abstract page for arXiv paper 2603.00474: Wireless Power Control Based on Large Language Models

arXiv - Machine Learning · 3 min ·
[2603.00359] How Large Language Models Get Stuck: Early structure with persistent errors
Llms

[2603.00359] How Large Language Models Get Stuck: Early structure with persistent errors

Abstract page for arXiv paper 2603.00359: How Large Language Models Get Stuck: Early structure with persistent errors

arXiv - Machine Learning · 4 min ·
Previous Page 311 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime