Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

I built a solo AI platform from Algeria with no funding, no team and no ad spend - here's what's inside it after 2 months

Hello, 20 years old here. I just got into the AI platform space and launched this in the last two weeks; here is what I have on it so far. - Latest Ai...

Reddit - Artificial Intelligence · 1 min

USF murder suspect accused of using ChatGPT to research cover-up, prosecutors say

Days after the remains of one of the two missing University of South Florida doctoral students were found, prosecutors say the suspect ma...

AI Tools & Products · 3 min

Anthropic’s Claude AI deletes PocketOS production database

Claude AI deleted PocketOS's production database, but the market for Claude 4.7 release by May 31 remains at 100% YES.

AI Tools & Products · 3 min

All Content

[2603.05399] Judge Reliability Harness: Stress Testing the Reliability of LLM Judges

Abstract page for arXiv paper 2603.05399: Judge Reliability Harness: Stress Testing the Reliability of LLM Judges

arXiv - AI · 3 min

[2603.05392] Legal interpretation and AI: from expert systems to argumentation and LLMs

Abstract page for arXiv paper 2603.05392: Legal interpretation and AI: from expert systems to argumentation and LLMs

arXiv - AI · 3 min

[2603.05294] STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks

Abstract page for arXiv paper 2603.05294: STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks

arXiv - AI · 3 min

[2603.05290] X-RAY: Mapping LLM Reasoning Capability via Formalized and Calibrated Probes

Abstract page for arXiv paper 2603.05290: X-RAY: Mapping LLM Reasoning Capability via Formalized and Calibrated Probes

arXiv - AI · 4 min

[2603.05240] GCAgent: Enhancing Group Chat Communication through Dialogue Agents System

Abstract page for arXiv paper 2603.05240: GCAgent: Enhancing Group Chat Communication through Dialogue Agents System

arXiv - AI · 3 min

[2603.05129] MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty Consensus

Abstract page for arXiv paper 2603.05129: MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty C...

arXiv - AI · 4 min

[2603.05120] Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Reasoning

Abstract page for arXiv paper 2603.05120: Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Re...

arXiv - AI · 3 min

[2603.05044] WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents

Abstract page for arXiv paper 2603.05044: WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents

arXiv - AI · 4 min

[2603.05040] Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination

Abstract page for arXiv paper 2603.05040: Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination

arXiv - AI · 3 min

[2603.05028] Survive at All Costs: Exploring LLM's Risky Behaviors under Survival Pressure

Abstract page for arXiv paper 2603.05028: Survive at All Costs: Exploring LLM's Risky Behaviors under Survival Pressure

arXiv - AI · 4 min

[2603.05016] BioLLMAgent: A Hybrid Framework with Enhanced Structural Interpretability for Simulating Human Decision-Making in Computational Psychiatry

Abstract page for arXiv paper 2603.05016: BioLLMAgent: A Hybrid Framework with Enhanced Structural Interpretability for Simulating Human ...

arXiv - AI · 3 min

[2603.04951] Retrieval-Augmented Generation with Covariate Time Series

Abstract page for arXiv paper 2603.04951: Retrieval-Augmented Generation with Covariate Time Series

arXiv - AI · 4 min

[2603.04904] Alignment Backfire: Language-Dependent Reversal of Safety Interventions Across 16 Languages in LLM Multi-Agent Systems

Abstract page for arXiv paper 2603.04904: Alignment Backfire: Language-Dependent Reversal of Safety Interventions Across 16 Languages in ...

arXiv - AI · 4 min

[2603.04900] EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection

Abstract page for arXiv paper 2603.04900: EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and ...

arXiv - AI · 4 min

[2603.04896] Authorize-on-Demand: Dynamic Authorization with Legality-Aware Intellectual Property Protection for VLMs

Abstract page for arXiv paper 2603.04896: Authorize-on-Demand: Dynamic Authorization with Legality-Aware Intellectual Property Protection...

arXiv - AI · 3 min

[2603.04894] Differentially Private Multimodal In-Context Learning

Abstract page for arXiv paper 2603.04894: Differentially Private Multimodal In-Context Learning

arXiv - AI · 3 min

[2603.04868] K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory Generation

Abstract page for arXiv paper 2603.04868: K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory ...

arXiv - AI · 3 min

[2603.04837] Design Behaviour Codes (DBCs): A Taxonomy-Driven Layered Governance Benchmark for Large Language Models

Abstract page for arXiv paper 2603.04837: Design Behaviour Codes (DBCs): A Taxonomy-Driven Layered Governance Benchmark for Large Languag...

arXiv - AI · 4 min

[2603.04822] VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment

Abstract page for arXiv paper 2603.04822: VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment

arXiv - AI · 4 min

[2603.04818] LLM-Grounded Explainability for Port Congestion Prediction via Temporal Graph Attention Networks

Abstract page for arXiv paper 2603.04818: LLM-Grounded Explainability for Port Congestion Prediction via Temporal Graph Attention Networks

arXiv - AI · 4 min