Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

LLMs

How do you use AI Agents for EDA/Data Analysis and getting it ready for ML model training? [D]

Like in manual workflow I would study the given data by using various functions like pd.info() and all column wise, remove null, outliers...

Reddit - Machine Learning · 1 min
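The manual workflow this post describes (inspect columns with `pd.info()`, drop nulls, remove outliers) can be sketched in pandas. This is a minimal illustration on a made-up toy DataFrame — the column names are hypothetical, and the 1.5×IQR rule is just one common outlier heuristic, not necessarily the poster's:

```python
import pandas as pd

# Toy frame standing in for the poster's dataset (columns are made up):
# one null age and one obviously implausible age of 120.
df = pd.DataFrame({
    "age": [25.0, 30, 28, 27, None, 120],
    "income": [40_000, 52_000, 48_000, 50_000, 45_000, 55_000],
})

# Column-wise overview: dtypes and non-null counts, as in the post.
df.info()

# Remove rows containing nulls.
df = df.dropna()

# Remove outliers with the common 1.5*IQR rule on the "age" column.
q1, q3 = df["age"].quantile([0.25, 0.75])
iqr = q3 - q1
df = df[df["age"].between(q1 - 1.5 * iqr, q3 + 1.5 * iqr)]
```

After these steps the null row and the age-120 outlier are gone, leaving a frame ready for feature engineering and model training.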
LLMs

I built a router that automatically sends your AI tasks to the most appropriate model to handle them at low cost - 9,200 tasks in, $21 saved at $0.14 actual cost

The observation that started this: most of what people use AI for every day - summarising, drafting, classifying, extracting etc doesn't ...

Reddit - Artificial Intelligence · 1 min
LLMs

Anthropic just analyzed 1 million Claude conversations. 6% of people were asking Claude whether to quit their jobs, who to date, and if they should move countries.

They published the full research yesterday. Here's what shocked me: The breakdown of what people actually ask Claude for guidance on: Hea...

Reddit - Artificial Intelligence · 1 min

All Content

LLMs

[2603.02266] When Scaling Fails: Mitigating Audio Perception Decay of LALMs via Multi-Step Perception-Aware Reasoning

Abstract page for arXiv paper 2603.02266: When Scaling Fails: Mitigating Audio Perception Decay of LALMs via Multi-Step Perception-Aware Reasoning

arXiv - AI · 4 min
LLMs

[2603.02262] Silent Sabotage During Fine-Tuning: Few-Shot Rationale Poisoning of Compact Medical LLMs

Abstract page for arXiv paper 2603.02262: Silent Sabotage During Fine-Tuning: Few-Shot Rationale Poisoning of Compact Medical LLMs

arXiv - Machine Learning · 3 min
LLMs

[2603.02951] CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

Abstract page for arXiv paper 2603.02951: CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

arXiv - Machine Learning · 4 min
LLMs

[2603.02938] Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models

Abstract page for arXiv paper 2603.02938: Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models

arXiv - AI · 4 min
LLMs

[2603.02913] Eliciting Numerical Predictive Distributions of LLMs Without Autoregression

Abstract page for arXiv paper 2603.02913: Eliciting Numerical Predictive Distributions of LLMs Without Autoregression

arXiv - AI · 3 min
LLMs

[2603.02840] Adapting Time Series Foundation Models through Data Mixtures

Abstract page for arXiv paper 2603.02840: Adapting Time Series Foundation Models through Data Mixtures

arXiv - Machine Learning · 4 min
LLMs

[2603.02792] From Heuristic Selection to Automated Algorithm Design: LLMs Benefit from Strong Priors

Abstract page for arXiv paper 2603.02792: From Heuristic Selection to Automated Algorithm Design: LLMs Benefit from Strong Priors

arXiv - Machine Learning · 3 min
LLMs

[2603.02675] From Shallow to Deep: Pinning Semantic Intent via Causal GRPO

Abstract page for arXiv paper 2603.02675: From Shallow to Deep: Pinning Semantic Intent via Causal GRPO

arXiv - Machine Learning · 3 min
LLMs

[2504.21023] Param$Δ$ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost

Abstract page for arXiv paper 2504.21023: Param$Δ$ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost

arXiv - AI · 4 min
LLMs

[2603.03258] Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals

Abstract page for arXiv paper 2603.03258: Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals

arXiv - AI · 4 min
LLMs

[2603.02635] SaFeR-ToolKit: Structured Reasoning via Virtual Tool Calling for Multimodal Safety

Abstract page for arXiv paper 2603.02635: SaFeR-ToolKit: Structured Reasoning via Virtual Tool Calling for Multimodal Safety

arXiv - Machine Learning · 4 min
LLMs

[2603.03242] Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals

Abstract page for arXiv paper 2603.03242: Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals

arXiv - AI · 4 min
LLMs

[2603.02630] MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks

Abstract page for arXiv paper 2603.02630: MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks

arXiv - AI · 4 min
LLMs

[2603.03233] AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework

Abstract page for arXiv paper 2603.03233: AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework

arXiv - AI · 4 min
LLMs

[2603.03203] No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models

Abstract page for arXiv paper 2603.03203: No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models

arXiv - AI · 3 min
LLMs

[2603.02604] Heterogeneous Agent Collaborative Reinforcement Learning

Abstract page for arXiv paper 2603.02604: Heterogeneous Agent Collaborative Reinforcement Learning

arXiv - Machine Learning · 3 min
LLMs

[2603.03175] Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification

Abstract page for arXiv paper 2603.03175: Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification

arXiv - AI · 4 min
LLMs

[2603.03147] Agentic AI-based Coverage Closure for Formal Verification

Abstract page for arXiv paper 2603.03147: Agentic AI-based Coverage Closure for Formal Verification

arXiv - AI · 3 min
LLMs

[2603.03080] Beyond Factual Correctness: Mitigating Preference-Inconsistent Explanations in Explainable Recommendation

Abstract page for arXiv paper 2603.03080: Beyond Factual Correctness: Mitigating Preference-Inconsistent Explanations in Explainable Recommendation

arXiv - AI · 3 min
LLMs

[2603.03116] Beyond Task Completion: Revealing Corrupt Success in LLM Agents through Procedure-Aware Evaluation

Abstract page for arXiv paper 2603.03116: Beyond Task Completion: Revealing Corrupt Success in LLM Agents through Procedure-Aware Evaluation

arXiv - AI · 4 min