Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[2603.16105] Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization

Abstract page for arXiv paper 2603.16105: Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization

arXiv - AI · 4 min · about 1 hour ago

Llms

[2603.09643] MM-tau-p$^2$: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings

Abstract page for arXiv paper 2603.09643: MM-tau-p$^2$: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Contro...

arXiv - AI · 4 min · about 1 hour ago

Llms

[2603.07339] Agora: Teaching the Skill of Consensus-Finding with AI Personas Grounded in Human Voice

Abstract page for arXiv paper 2603.07339: Agora: Teaching the Skill of Consensus-Finding with AI Personas Grounded in Human Voice

arXiv - AI · 4 min · about 1 hour ago

All Content

Llms

[2603.19236] L-PRISMA: An Extension of PRISMA in the Era of Generative Artificial Intelligence (GenAI)

Abstract page for arXiv paper 2603.19236: L-PRISMA: An Extension of PRISMA in the Era of Generative Artificial Intelligence (GenAI)

arXiv - AI · 3 min · 16 days ago

Llms

[2603.19247] When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models

Abstract page for arXiv paper 2603.19247: When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models

arXiv - AI · 4 min · 16 days ago

Llms

[2603.17765] Grounded Multimodal Retrieval-Augmented Drafting of Radiology Impressions Using Case-Based Similarity Search

Abstract page for arXiv paper 2603.17765: Grounded Multimodal Retrieval-Augmented Drafting of Radiology Impressions Using Case-Based Simi...

arXiv - AI · 4 min · 16 days ago

Llms

[2603.20170] Learning Dynamic Belief Graphs for Theory-of-mind Reasoning

Abstract page for arXiv paper 2603.20170: Learning Dynamic Belief Graphs for Theory-of-mind Reasoning

arXiv - AI · 3 min · 16 days ago

Llms

[2603.20101] Pitfalls in Evaluating Interpretability Agents

Abstract page for arXiv paper 2603.20101: Pitfalls in Evaluating Interpretability Agents

arXiv - AI · 4 min · 16 days ago

Llms

[2603.20046] Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs

Abstract page for arXiv paper 2603.20046: Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for ...

arXiv - AI · 4 min · 16 days ago

Llms

[2603.19896] Utility-Guided Agent Orchestration for Efficient LLM Tool Use

Abstract page for arXiv paper 2603.19896: Utility-Guided Agent Orchestration for Efficient LLM Tool Use

arXiv - AI · 3 min · 16 days ago

Llms

[2603.19715] Stepwise: Neuro-Symbolic Proof Search for Automated Systems Verification

Abstract page for arXiv paper 2603.19715: Stepwise: Neuro-Symbolic Proof Search for Automated Systems Verification

arXiv - AI · 4 min · 16 days ago

Llms

[2603.19685] A Subgoal-driven Framework for Improving Long-Horizon LLM Agents

Abstract page for arXiv paper 2603.19685: A Subgoal-driven Framework for Improving Long-Horizon LLM Agents

arXiv - Machine Learning · 4 min · 16 days ago

Llms

[2603.19639] HyEvo: Self-Evolving Hybrid Agentic Workflows for Efficient Reasoning

Abstract page for arXiv paper 2603.19639: HyEvo: Self-Evolving Hybrid Agentic Workflows for Efficient Reasoning

arXiv - AI · 3 min · 16 days ago

Llms

[2603.19584] PowerLens: Taming LLM Agents for Safe and Personalized Mobile Power Management

Abstract page for arXiv paper 2603.19584: PowerLens: Taming LLM Agents for Safe and Personalized Mobile Power Management

arXiv - AI · 4 min · 16 days ago

Llms

[2603.19515] ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

Abstract page for arXiv paper 2603.19515: ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

arXiv - AI · 3 min · 16 days ago

Llms

[2603.19514] Learning to Disprove: Formal Counterexample Generation with Large Language Models

Abstract page for arXiv paper 2603.19514: Learning to Disprove: Formal Counterexample Generation with Large Language Models

arXiv - AI · 3 min · 16 days ago

Llms

[2603.19500] Teaching an Agent to Sketch One Part at a Time

Abstract page for arXiv paper 2603.19500: Teaching an Agent to Sketch One Part at a Time

arXiv - Machine Learning · 3 min · 16 days ago

Llms

Over a dozen chatbot harm & suicide cases in California against OpenAI / ChatGPT have been consolidated into one big litigation

submitted by /u/Apprehensive_Sky1950 [link] [comments]

Reddit - Artificial Intelligence · 1 min · 16 days ago

Llms

[ML Engineer] 3 YOE, Focus on ML, LLM/NLP- Not getting any interview calls. Seeking Resume Review & Referrals.

submitted by /u/whatadrag79 [link] [comments]

Reddit - ML Jobs · 1 min · 16 days ago

Llms

Claude Just Opened the Strait

the definitive tick-tock

AI Tools & Products · 6 min · 16 days ago

Llms

I ran 10 head-to-head prompt format battles — the structured one won 8/10 on specificity

I tested 10 common prompt engineering techniques against a structured JSON format across identical tasks (marketing plans, code debugging...

Reddit - Artificial Intelligence · 1 min · 16 days ago

Llms

LLM failure modes map surprisingly well onto ADHD cognitive science. Six parallels from independent research.

I have ADHD and I've been pair programming with LLMs for a while now. At some point I realized the way they fail felt weirdly familiar. C...

Reddit - Artificial Intelligence · 1 min · 16 days ago

Llms

AI Fiesta review from Dhruv Rathee academy

Hi, I am a new AI user. I want to use AI for daily life optimization, getting better at table tennis and fitness, to use in architecture ...

Reddit - Artificial Intelligence · 1 min · 17 days ago

Previous Page 113 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

[2603.16105] Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization

[2603.09643] MM-tau-p$^2$: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings

[2603.07339] Agora: Teaching the Skill of Consensus-Finding with AI Personas Grounded in Human Voice

All Content

[2603.19236] L-PRISMA: An Extension of PRISMA in the Era of Generative Artificial Intelligence (GenAI)

[2603.19247] When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models

[2603.17765] Grounded Multimodal Retrieval-Augmented Drafting of Radiology Impressions Using Case-Based Similarity Search

[2603.20170] Learning Dynamic Belief Graphs for Theory-of-mind Reasoning

[2603.20101] Pitfalls in Evaluating Interpretability Agents

[2603.20046] Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs

[2603.19896] Utility-Guided Agent Orchestration for Efficient LLM Tool Use

[2603.19715] Stepwise: Neuro-Symbolic Proof Search for Automated Systems Verification

[2603.19685] A Subgoal-driven Framework for Improving Long-Horizon LLM Agents

[2603.19639] HyEvo: Self-Evolving Hybrid Agentic Workflows for Efficient Reasoning

[2603.19584] PowerLens: Taming LLM Agents for Safe and Personalized Mobile Power Management

[2603.19515] ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

[2603.19514] Learning to Disprove: Formal Counterexample Generation with Large Language Models

[2603.19500] Teaching an Agent to Sketch One Part at a Time

Over a dozen chatbot harm & suicide cases in California against OpenAI / ChatGPT have been consolidated into one big litigation

[ML Engineer] 3 YOE, Focus on ML, LLM/NLP- Not getting any interview calls. Seeking Resume Review & Referrals.

Claude Just Opened the Strait

I ran 10 head-to-head prompt format battles — the structured one won 8/10 on specificity

LLM failure modes map surprisingly well onto ADHD cognitive science. Six parallels from independent research.

AI Fiesta review from Dhruv Rathee academy

Related Topics

Stay updated with AI News