Large Language Models
GPT, Claude, Gemini, and other LLMs
Top This Week
All Content
[2603.01214] Reasoning Boosts Opinion Alignment in LLMs
Abstract page for arXiv paper 2603.01214: Reasoning Boosts Opinion Alignment in LLMs
[2509.12282] AISSISTANT: Human-AI Collaborative Review and Perspective Research Workflows in Data Science
Abstract page for arXiv paper 2509.12282: AISSISTANT: Human-AI Collaborative Review and Perspective Research Workflows in Data Science
[2603.01213] Can AI Agents Agree?
Abstract page for arXiv paper 2603.01213: Can AI Agents Agree?
[2509.03906] Toward Clinically Explainable AI for Medical Diagnosis: A Foundation Model with Human-Compatible Reasoning via Reinforcement Learning
Abstract page for arXiv paper 2509.03906: Toward Clinically Explainable AI for Medical Diagnosis: A Foundation Model with Human-Compatibl...
[2509.01938] EigenBench: A Comparative Behavioral Measure of Value Alignment
Abstract page for arXiv paper 2509.01938: EigenBench: A Comparative Behavioral Measure of Value Alignment
[2508.20729] Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision
Abstract page for arXiv paper 2508.20729: Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision
[2508.15030] Collab-REC: An LLM-based Agentic Framework for Balancing Recommendations in Tourism
Abstract page for arXiv paper 2508.15030: Collab-REC: An LLM-based Agentic Framework for Balancing Recommendations in Tourism
[2507.16145] SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validation in COPD Reporting
Abstract page for arXiv paper 2507.16145: SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validati...
[2506.24119] SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
Abstract page for arXiv paper 2506.24119: SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforce...
[2603.01089] CARD: Towards Conditional Design of Multi-agent Topological Structures
Abstract page for arXiv paper 2603.01089: CARD: Towards Conditional Design of Multi-agent Topological Structures
[2506.00530] CityLens: Evaluating Large Vision-Language Models for Urban Socioeconomic Sensing
Abstract page for arXiv paper 2506.00530: CityLens: Evaluating Large Vision-Language Models for Urban Socioeconomic Sensing
[2505.12565] mCLM: A Modular Chemical Language Model that Generates Functional and Makeable Molecules
Abstract page for arXiv paper 2505.12565: mCLM: A Modular Chemical Language Model that Generates Functional and Makeable Molecules
[2505.19653] Token-Importance Guided Direct Preference Optimization
Abstract page for arXiv paper 2505.19653: Token-Importance Guided Direct Preference Optimization
[2504.18453] Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation
Abstract page for arXiv paper 2504.18453: Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Ge...
[2502.07644] SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models
Abstract page for arXiv paper 2502.07644: SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models
[2503.11832] Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated by Machine Unlearning
Abstract page for arXiv paper 2503.11832: Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated ...
[2603.00846] Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models
Abstract page for arXiv paper 2603.00846: Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models
[2412.03772] A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices
Abstract page for arXiv paper 2412.03772: A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices
[2410.05669] ACPBench: Reasoning about Action, Change, and Planning
Abstract page for arXiv paper 2410.05669: ACPBench: Reasoning about Action, Change, and Planning
[2408.05233] Electric Vehicle User Charging Behavior Analysis Integrating Psychological and Environmental Factors: A Statistical-Driven LLM based Agent Approach
Abstract page for arXiv paper 2408.05233: Electric Vehicle User Charging Behavior Analysis Integrating Psychological and Environmental Fa...
Related Topics
Stay updated with AI News
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime