Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Llms

Anthropic Supply-Chain Risk Label Should Stay in Place, Appeals Court Says | WIRED

The AI company now faces conflicting rulings in its fight over how Claude can be used by the US military.

Wired - AI · 6 min · 32 minutes ago

Llms

Tubi is the first streamer to launch a native app within ChatGPT | TechCrunch

Tubi becomes the first streaming service to offer an app integration within ChatGPT, the AI chatbot that millions of users turn to for an...

TechCrunch - AI · 3 min · about 5 hours ago

Llms

Anyone out there use Claude Pro/Max at the same time on different screens?

I am asking for feedback ? I’m currently using a Claude paid plan (Pro/Max) and was wondering about the logistics of simultaneous use. Sp...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

All Content

Llms

[2603.04405] Lost in Translation: How Language Re-Aligns Vision for Cross-Species Pathology

Abstract page for arXiv paper 2603.04405: Lost in Translation: How Language Re-Aligns Vision for Cross-Species Pathology

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.05498] The Spike, the Sparse and the Sink: Anatomy of Massive Activations and Attention Sinks

Abstract page for arXiv paper 2603.05498: The Spike, the Sparse and the Sink: Anatomy of Massive Activations and Attention Sinks

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.05485] Towards Provably Unbiased LLM Judges via Bias-Bounded Evaluation

Abstract page for arXiv paper 2603.05485: Towards Provably Unbiased LLM Judges via Bias-Bounded Evaluation

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.05399] Judge Reliability Harness: Stress Testing the Reliability of LLM Judges

Abstract page for arXiv paper 2603.05399: Judge Reliability Harness: Stress Testing the Reliability of LLM Judges

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.05392] Legal interpretation and AI: from expert systems to argumentation and LLMs

Abstract page for arXiv paper 2603.05392: Legal interpretation and AI: from expert systems to argumentation and LLMs

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.05294] STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks

Abstract page for arXiv paper 2603.05294: STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.05290] X-RAY: Mapping LLM Reasoning Capability via Formalized and Calibrated Probes

Abstract page for arXiv paper 2603.05290: X-RAY: Mapping LLM Reasoning Capability via Formalized and Calibrated Probes

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.05240] GCAgent: Enhancing Group Chat Communication through Dialogue Agents System

Abstract page for arXiv paper 2603.05240: GCAgent: Enhancing Group Chat Communication through Dialogue Agents System

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.05129] MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty Consensus

Abstract page for arXiv paper 2603.05129: MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty C...

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.05120] Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Reasoning

Abstract page for arXiv paper 2603.05120: Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Re...

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.05044] WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents

Abstract page for arXiv paper 2603.05044: WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.05040] Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination

Abstract page for arXiv paper 2603.05040: Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.05028] Survive at All Costs: Exploring LLM's Risky Behaviors under Survival Pressure

Abstract page for arXiv paper 2603.05028: Survive at All Costs: Exploring LLM's Risky Behaviors under Survival Pressure

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.05016] BioLLMAgent: A Hybrid Framework with Enhanced Structural Interpretability for Simulating Human Decision-Making in Computational Psychiatry

Abstract page for arXiv paper 2603.05016: BioLLMAgent: A Hybrid Framework with Enhanced Structural Interpretability for Simulating Human ...

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.04951] Retrieval-Augmented Generation with Covariate Time Series

Abstract page for arXiv paper 2603.04951: Retrieval-Augmented Generation with Covariate Time Series

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.04904] Alignment Backfire: Language-Dependent Reversal of Safety Interventions Across 16 Languages in LLM Multi-Agent Systems

Abstract page for arXiv paper 2603.04904: Alignment Backfire: Language-Dependent Reversal of Safety Interventions Across 16 Languages in ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.04900] EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection

Abstract page for arXiv paper 2603.04900: EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.04896] Authorize-on-Demand: Dynamic Authorization with Legality-Aware Intellectual Property Protection for VLMs

Abstract page for arXiv paper 2603.04896: Authorize-on-Demand: Dynamic Authorization with Legality-Aware Intellectual Property Protection...

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.04894] Differentially Private Multimodal In-Context Learning

Abstract page for arXiv paper 2603.04894: Differentially Private Multimodal In-Context Learning

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.04868] K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory Generation

Abstract page for arXiv paper 2603.04868: K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory ...

arXiv - AI · 3 min · about 1 month ago

Previous Page 122 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Anthropic Supply-Chain Risk Label Should Stay in Place, Appeals Court Says | WIRED

Tubi is the first streamer to launch a native app within ChatGPT | TechCrunch

Anyone out there use Claude Pro/Max at the same time on different screens?

All Content

[2603.04405] Lost in Translation: How Language Re-Aligns Vision for Cross-Species Pathology

[2603.05498] The Spike, the Sparse and the Sink: Anatomy of Massive Activations and Attention Sinks

[2603.05485] Towards Provably Unbiased LLM Judges via Bias-Bounded Evaluation

[2603.05399] Judge Reliability Harness: Stress Testing the Reliability of LLM Judges

[2603.05392] Legal interpretation and AI: from expert systems to argumentation and LLMs

[2603.05294] STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks

[2603.05290] X-RAY: Mapping LLM Reasoning Capability via Formalized and Calibrated Probes

[2603.05240] GCAgent: Enhancing Group Chat Communication through Dialogue Agents System

[2603.05129] MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty Consensus

[2603.05120] Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Reasoning

[2603.05044] WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents

[2603.05040] Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination

[2603.05028] Survive at All Costs: Exploring LLM's Risky Behaviors under Survival Pressure

[2603.05016] BioLLMAgent: A Hybrid Framework with Enhanced Structural Interpretability for Simulating Human Decision-Making in Computational Psychiatry

[2603.04951] Retrieval-Augmented Generation with Covariate Time Series

[2603.04904] Alignment Backfire: Language-Dependent Reversal of Safety Interventions Across 16 Languages in LLM Multi-Agent Systems

[2603.04900] EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection

[2603.04896] Authorize-on-Demand: Dynamic Authorization with Legality-Aware Intellectual Property Protection for VLMs

[2603.04894] Differentially Private Multimodal In-Context Learning

[2603.04868] K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory Generation

Related Topics

Stay updated with AI News