Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Converting XQuery to SQL with Local LLMs: Do I Need Fine-Tuning or a Better Approach? [P]

I am trying to convert XQuery statements into SQL queries within an enterprise context, with the constraint that the solution must rely...

Reddit - Machine Learning · 1 min · about 2 hours ago

Llms

AI: Fragility of today's Claude Cowork type AI Agent Apps. RTZ 1061

...realities like memory management, highlight a longer road to resilient AI Agents and AGI

AI Tools & Products · 11 min · about 4 hours ago

Llms

Gemini caught a $280M crypto exploit before it hit the news, then retracted it as a hallucination because I couldn't verify it - because the news hadn't dropped yet

So this happened mere hours ago and I feel like I genuinely stumbled onto something worth documenting for people interested in AI behavio...

Reddit - Artificial Intelligence · 1 min · about 13 hours ago

All Content

Llms

[2603.02262] Silent Sabotage During Fine-Tuning: Few-Shot Rationale Poisoning of Compact Medical LLMs

Abstract page for arXiv paper 2603.02262: Silent Sabotage During Fine-Tuning: Few-Shot Rationale Poisoning of Compact Medical LLMs

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.02951] CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

Abstract page for arXiv paper 2603.02951: CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.02938] Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models

Abstract page for arXiv paper 2603.02938: Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large L...

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.02913] Eliciting Numerical Predictive Distributions of LLMs Without Autoregression

Abstract page for arXiv paper 2603.02913: Eliciting Numerical Predictive Distributions of LLMs Without Autoregression

arXiv - AI · 3 min · about 2 months ago

Llms

[2603.02840] Adapting Time Series Foundation Models through Data Mixtures

Abstract page for arXiv paper 2603.02840: Adapting Time Series Foundation Models through Data Mixtures

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.02792] From Heuristic Selection to Automated Algorithm Design: LLMs Benefit from Strong Priors

Abstract page for arXiv paper 2603.02792: From Heuristic Selection to Automated Algorithm Design: LLMs Benefit from Strong Priors

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.02675] From Shallow to Deep: Pinning Semantic Intent via Causal GRPO

Abstract page for arXiv paper 2603.02675: From Shallow to Deep: Pinning Semantic Intent via Causal GRPO

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2504.21023] Param$Δ$ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost

Abstract page for arXiv paper 2504.21023: Param$Δ$ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.03258] Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals

Abstract page for arXiv paper 2603.03258: Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.02635] SaFeR-ToolKit: Structured Reasoning via Virtual Tool Calling for Multimodal Safety

Abstract page for arXiv paper 2603.02635: SaFeR-ToolKit: Structured Reasoning via Virtual Tool Calling for Multimodal Safety

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.03242] Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals

Abstract page for arXiv paper 2603.03242: Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.02630] MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks

Abstract page for arXiv paper 2603.02630: MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.03233] AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework

Abstract page for arXiv paper 2603.03233: AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.03203] No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models

Abstract page for arXiv paper 2603.03203: No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Langu...

arXiv - AI · 3 min · about 2 months ago

Llms

[2603.02604] Heterogeneous Agent Collaborative Reinforcement Learning

Abstract page for arXiv paper 2603.02604: Heterogeneous Agent Collaborative Reinforcement Learning

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.03175] Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification

Abstract page for arXiv paper 2603.03175: Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.03147] Agentic AI-based Coverage Closure for Formal Verification

Abstract page for arXiv paper 2603.03147: Agentic AI-based Coverage Closure for Formal Verification

arXiv - AI · 3 min · about 2 months ago

Llms

[2603.03080] Beyond Factual Correctness: Mitigating Preference-Inconsistent Explanations in Explainable Recommendation

Abstract page for arXiv paper 2603.03080: Beyond Factual Correctness: Mitigating Preference-Inconsistent Explanations in Explainable Reco...

arXiv - AI · 3 min · about 2 months ago

Llms

[2603.03116] Beyond Task Completion: Revealing Corrupt Success in LLM Agents through Procedure-Aware Evaluation

Abstract page for arXiv paper 2603.03116: Beyond Task Completion: Revealing Corrupt Success in LLM Agents through Procedure-Aware Evaluation

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.02510] ParEVO: Synthesizing Code for Irregular Data: High-Performance Parallelism through Agentic Evolution

Abstract page for arXiv paper 2603.02510: ParEVO: Synthesizing Code for Irregular Data: High-Performance Parallelism through Agentic Evol...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 207 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Converting XQuery to SQL with Local LLMs: Do I Need Fine-Tuning or a Better Approach? [P]

AI: Fragility of today's Claude Cowork type AI Agent Apps. RTZ 1061

Gemini caught a $280M crypto exploit before it hit the news, then retracted it as a hallucination because I couldn't verify it - because the news hadn't dropped yet

All Content

[2603.02262] Silent Sabotage During Fine-Tuning: Few-Shot Rationale Poisoning of Compact Medical LLMs

[2603.02951] CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

[2603.02938] Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models

[2603.02913] Eliciting Numerical Predictive Distributions of LLMs Without Autoregression

[2603.02840] Adapting Time Series Foundation Models through Data Mixtures

[2603.02792] From Heuristic Selection to Automated Algorithm Design: LLMs Benefit from Strong Priors

[2603.02675] From Shallow to Deep: Pinning Semantic Intent via Causal GRPO

[2504.21023] Param$Δ$ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost

[2603.03258] Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals

[2603.02635] SaFeR-ToolKit: Structured Reasoning via Virtual Tool Calling for Multimodal Safety

[2603.03242] Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals

[2603.02630] MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks

[2603.03233] AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework

[2603.03203] No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models

[2603.02604] Heterogeneous Agent Collaborative Reinforcement Learning

[2603.03175] Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification

[2603.03147] Agentic AI-based Coverage Closure for Formal Verification

[2603.03080] Beyond Factual Correctness: Mitigating Preference-Inconsistent Explanations in Explainable Recommendation

[2603.03116] Beyond Task Completion: Revealing Corrupt Success in LLM Agents through Procedure-Aware Evaluation

[2603.02510] ParEVO: Synthesizing Code for Irregular Data: High-Performance Parallelism through Agentic Evolution

Related Topics

Stay updated with AI News