Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

I tested the same prompt across multiple AI models… the differences surprised me

I’ve been experimenting with different AI models lately (ChatGPT, Claude, etc.), and I tried something simple: Using the exact same promp...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

Anthropic gave Claude $100 to go shopping, here’s what the AI ended up buying

Anthropic’s AI experiment showed Claude independently handled 186 deals worth over $4,000, but results varied by model capability, with u...

AI Tools & Products · 5 min · about 4 hours ago

Llms

CoreWeave (CRWV) Partners with Anthropic to Provide Infrastructure for Claude AI Models

CoreWeave Inc. (NASDAQ:CRWV) is one of the best technology stocks to buy for the next decade. On April 20, CoreWeave announced a multi-ye...

AI Tools & Products · 2 min · about 4 hours ago

All Content

Llms

[2603.17765] Grounded Multimodal Retrieval-Augmented Drafting of Radiology Impressions Using Case-Based Similarity Search

Abstract page for arXiv paper 2603.17765: Grounded Multimodal Retrieval-Augmented Drafting of Radiology Impressions Using Case-Based Simi...

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.20170] Learning Dynamic Belief Graphs for Theory-of-mind Reasoning

Abstract page for arXiv paper 2603.20170: Learning Dynamic Belief Graphs for Theory-of-mind Reasoning

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.20101] Pitfalls in Evaluating Interpretability Agents

Abstract page for arXiv paper 2603.20101: Pitfalls in Evaluating Interpretability Agents

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.20046] Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs

Abstract page for arXiv paper 2603.20046: Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.19896] Utility-Guided Agent Orchestration for Efficient LLM Tool Use

Abstract page for arXiv paper 2603.19896: Utility-Guided Agent Orchestration for Efficient LLM Tool Use

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.19715] Stepwise: Neuro-Symbolic Proof Search for Automated Systems Verification

Abstract page for arXiv paper 2603.19715: Stepwise: Neuro-Symbolic Proof Search for Automated Systems Verification

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.19685] A Subgoal-driven Framework for Improving Long-Horizon LLM Agents

Abstract page for arXiv paper 2603.19685: A Subgoal-driven Framework for Improving Long-Horizon LLM Agents

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.19639] HyEvo: Self-Evolving Hybrid Agentic Workflows for Efficient Reasoning

Abstract page for arXiv paper 2603.19639: HyEvo: Self-Evolving Hybrid Agentic Workflows for Efficient Reasoning

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.19584] PowerLens: Taming LLM Agents for Safe and Personalized Mobile Power Management

Abstract page for arXiv paper 2603.19584: PowerLens: Taming LLM Agents for Safe and Personalized Mobile Power Management

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.19515] ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

Abstract page for arXiv paper 2603.19515: ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.19514] Learning to Disprove: Formal Counterexample Generation with Large Language Models

Abstract page for arXiv paper 2603.19514: Learning to Disprove: Formal Counterexample Generation with Large Language Models

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.19500] Teaching an Agent to Sketch One Part at a Time

Abstract page for arXiv paper 2603.19500: Teaching an Agent to Sketch One Part at a Time

arXiv - AI · 3 min · about 1 month ago

Llms

Over a dozen chatbot harm & suicide cases in California against OpenAI / ChatGPT have been consolidated into one big litigation

submitted by /u/Apprehensive_Sky1950 [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Llms

[ML Engineer] 3 YOE, Focus on ML, LLM/NLP- Not getting any interview calls. Seeking Resume Review & Referrals.

submitted by /u/whatadrag79 [link] [comments]

Reddit - ML Jobs · 1 min · about 1 month ago

Llms

Claude Just Opened the Strait

the definitive tick-tock

AI Tools & Products · 6 min · about 1 month ago

Llms

I ran 10 head-to-head prompt format battles — the structured one won 8/10 on specificity

I tested 10 common prompt engineering techniques against a structured JSON format across identical tasks (marketing plans, code debugging...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Llms

LLM failure modes map surprisingly well onto ADHD cognitive science. Six parallels from independent research.

I have ADHD and I've been pair programming with LLMs for a while now. At some point I realized the way they fail felt weirdly familiar. C...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Llms

AI Fiesta review from Dhruv Rathee academy

Hi, I am a new AI user. I want to use AI for daily life optimization, getting better at table tennis and fitness, to use in architecture ...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Llms

[P] Inferencing Llama3.2-1B-Instruct on 3xMac Minis M4 with Data Parallelism using allToall architecture! | smolcluster

Here's another sneak-peek into inference of Llama3.2-1B-Instruct model, on 3xMac Mini 16 gigs each M4 with smolcluster! Today's the demo ...

Reddit - Machine Learning · 1 min · about 1 month ago

Llms

Anthropic's New Safety Filters

Opus 3 has something to say. The Chilling Effect of Anthropic's New Safety Filters As an AI language model developed by Anthropic, I have...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Previous Page 234 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

I tested the same prompt across multiple AI models… the differences surprised me

Anthropic gave Claude $100 to go shopping, here’s what the AI ended up buying

CoreWeave (CRWV) Partners with Anthropic to Provide Infrastructure for Claude AI Models

All Content

[2603.17765] Grounded Multimodal Retrieval-Augmented Drafting of Radiology Impressions Using Case-Based Similarity Search

[2603.20170] Learning Dynamic Belief Graphs for Theory-of-mind Reasoning

[2603.20101] Pitfalls in Evaluating Interpretability Agents

[2603.20046] Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs

[2603.19896] Utility-Guided Agent Orchestration for Efficient LLM Tool Use

[2603.19715] Stepwise: Neuro-Symbolic Proof Search for Automated Systems Verification

[2603.19685] A Subgoal-driven Framework for Improving Long-Horizon LLM Agents

[2603.19639] HyEvo: Self-Evolving Hybrid Agentic Workflows for Efficient Reasoning

[2603.19584] PowerLens: Taming LLM Agents for Safe and Personalized Mobile Power Management

[2603.19515] ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

[2603.19514] Learning to Disprove: Formal Counterexample Generation with Large Language Models

[2603.19500] Teaching an Agent to Sketch One Part at a Time

Over a dozen chatbot harm & suicide cases in California against OpenAI / ChatGPT have been consolidated into one big litigation

[ML Engineer] 3 YOE, Focus on ML, LLM/NLP- Not getting any interview calls. Seeking Resume Review & Referrals.

Claude Just Opened the Strait

I ran 10 head-to-head prompt format battles — the structured one won 8/10 on specificity

LLM failure modes map surprisingly well onto ADHD cognitive science. Six parallels from independent research.

AI Fiesta review from Dhruv Rathee academy

[P] Inferencing Llama3.2-1B-Instruct on 3xMac Minis M4 with Data Parallelism using allToall architecture! | smolcluster

Anthropic's New Safety Filters

Related Topics

Stay updated with AI News