Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Llms

Claude developer hosts Christian leaders for AI summit

AI Tools & Products · 10 minutes ago

Llms

CoreWeave stock pops 11% on deal to power Anthropic's Claude

AI Tools & Products · 3 min · 10 minutes ago

Llms

I Trained for the Paris Marathon Using ChatGPT

AI Tools & Products · 1 min · 10 minutes ago

All Content

Llms

[2603.01214] Reasoning Boosts Opinion Alignment in LLMs

Abstract page for arXiv paper 2603.01214: Reasoning Boosts Opinion Alignment in LLMs

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2509.12282] AISSISTANT: Human-AI Collaborative Review and Perspective Research Workflows in Data Science

Abstract page for arXiv paper 2509.12282: AISSISTANT: Human-AI Collaborative Review and Perspective Research Workflows in Data Science

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.01213] Can AI Agents Agree?

Abstract page for arXiv paper 2603.01213: Can AI Agents Agree?

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2509.03906] Toward Clinically Explainable AI for Medical Diagnosis: A Foundation Model with Human-Compatible Reasoning via Reinforcement Learning

Abstract page for arXiv paper 2509.03906: Toward Clinically Explainable AI for Medical Diagnosis: A Foundation Model with Human-Compatibl...

arXiv - AI · 4 min · about 1 month ago

Llms

[2509.01938] EigenBench: A Comparative Behavioral Measure of Value Alignment

Abstract page for arXiv paper 2509.01938: EigenBench: A Comparative Behavioral Measure of Value Alignment

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2508.20729] Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision

Abstract page for arXiv paper 2508.20729: Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision

arXiv - AI · 4 min · about 1 month ago

Llms

[2508.15030] Collab-REC: An LLM-based Agentic Framework for Balancing Recommendations in Tourism

Abstract page for arXiv paper 2508.15030: Collab-REC: An LLM-based Agentic Framework for Balancing Recommendations in Tourism

arXiv - AI · 3 min · about 1 month ago

Llms

[2507.16145] SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validation in COPD Reporting

Abstract page for arXiv paper 2507.16145: SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validati...

arXiv - AI · 4 min · about 1 month ago

Llms

[2506.24119] SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Abstract page for arXiv paper 2506.24119: SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforce...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.01089] CARD: Towards Conditional Design of Multi-agent Topological Structures

Abstract page for arXiv paper 2603.01089: CARD: Towards Conditional Design of Multi-agent Topological Structures

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2506.00530] CityLens: Evaluating Large Vision-Language Models for Urban Socioeconomic Sensing

Abstract page for arXiv paper 2506.00530: CityLens: Evaluating Large Vision-Language Models for Urban Socioeconomic Sensing

arXiv - AI · 4 min · about 1 month ago

Llms

[2505.12565] mCLM: A Modular Chemical Language Model that Generates Functional and Makeable Molecules

Abstract page for arXiv paper 2505.12565: mCLM: A Modular Chemical Language Model that Generates Functional and Makeable Molecules

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2505.19653] Token-Importance Guided Direct Preference Optimization

Abstract page for arXiv paper 2505.19653: Token-Importance Guided Direct Preference Optimization

arXiv - AI · 3 min · about 1 month ago

Llms

[2504.18453] Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation

Abstract page for arXiv paper 2504.18453: Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Ge...

arXiv - AI · 4 min · about 1 month ago

Llms

[2502.07644] SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models

Abstract page for arXiv paper 2502.07644: SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models

arXiv - AI · 4 min · about 1 month ago

Llms

[2503.11832] Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated by Machine Unlearning

Abstract page for arXiv paper 2503.11832: Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.00846] Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models

Abstract page for arXiv paper 2603.00846: Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2412.03772] A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices

Abstract page for arXiv paper 2412.03772: A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices

arXiv - AI · 4 min · about 1 month ago

Llms

[2410.05669] ACPBench: Reasoning about Action, Change, and Planning

Abstract page for arXiv paper 2410.05669: ACPBench: Reasoning about Action, Change, and Planning

arXiv - AI · 4 min · about 1 month ago

Llms

[2408.05233] Electric Vehicle User Charging Behavior Analysis Integrating Psychological and Environmental Factors: A Statistical-Driven LLM based Agent Approach

Abstract page for arXiv paper 2408.05233: Electric Vehicle User Charging Behavior Analysis Integrating Psychological and Environmental Fa...

arXiv - AI · 4 min · about 1 month ago

Previous Page 165 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Claude developer hosts Christian leaders for AI summit

CoreWeave stock pops 11% on deal to power Anthropic's Claude

I Trained for the Paris Marathon Using ChatGPT

All Content

[2603.01214] Reasoning Boosts Opinion Alignment in LLMs

[2509.12282] AISSISTANT: Human-AI Collaborative Review and Perspective Research Workflows in Data Science

[2603.01213] Can AI Agents Agree?

[2509.03906] Toward Clinically Explainable AI for Medical Diagnosis: A Foundation Model with Human-Compatible Reasoning via Reinforcement Learning

[2509.01938] EigenBench: A Comparative Behavioral Measure of Value Alignment

[2508.20729] Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision

[2508.15030] Collab-REC: An LLM-based Agentic Framework for Balancing Recommendations in Tourism

[2507.16145] SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validation in COPD Reporting

[2506.24119] SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

[2603.01089] CARD: Towards Conditional Design of Multi-agent Topological Structures

[2506.00530] CityLens: Evaluating Large Vision-Language Models for Urban Socioeconomic Sensing

[2505.12565] mCLM: A Modular Chemical Language Model that Generates Functional and Makeable Molecules

[2505.19653] Token-Importance Guided Direct Preference Optimization

[2504.18453] Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation

[2502.07644] SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models

[2503.11832] Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated by Machine Unlearning

[2603.00846] Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models

[2412.03772] A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices

[2410.05669] ACPBench: Reasoning about Action, Change, and Planning

[2408.05233] Electric Vehicle User Charging Behavior Analysis Integrating Psychological and Environmental Factors: A Statistical-Driven LLM based Agent Approach

Related Topics

Stay updated with AI News