Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[P] Dante-2B: I'm training a 2.1B bilingual fully open Italian/English LLM from scratch on 2×H200. Phase 1 done — here's what I've built.

The problem If you work with Italian text and local models, you know the pain. Every open-source LLM out there treats Italian as an after...

Reddit - Machine Learning · 1 min · 21 minutes ago

Llms

I have been coding for 11 years and I caught myself completely unable to debug a problem without AI assistance last month. That scared me more than anything I have seen in this industry.

I want to be honest about something that happened to me because I think it is more common than people admit. Last month I hit a bug in a ...

Reddit - Artificial Intelligence · 1 min · 36 minutes ago

Llms

OpenClaw security checklist: practical safeguards for AI agents

Here is one of the better quality guides on the ensuring safety when deploying OpenClaw: https://chatgptguide.ai/openclaw-security-checkl...

Reddit - Artificial Intelligence · 1 min · about 7 hours ago

All Content

Llms

[2603.19515] ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

Abstract page for arXiv paper 2603.19515: ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

arXiv - AI · 3 min · 14 days ago

Llms

[2603.19514] Learning to Disprove: Formal Counterexample Generation with Large Language Models

Abstract page for arXiv paper 2603.19514: Learning to Disprove: Formal Counterexample Generation with Large Language Models

arXiv - AI · 3 min · 14 days ago

Llms

[2603.19500] Teaching an Agent to Sketch One Part at a Time

Abstract page for arXiv paper 2603.19500: Teaching an Agent to Sketch One Part at a Time

arXiv - Machine Learning · 3 min · 14 days ago

Llms

Over a dozen chatbot harm & suicide cases in California against OpenAI / ChatGPT have been consolidated into one big litigation

submitted by /u/Apprehensive_Sky1950 [link] [comments]

Reddit - Artificial Intelligence · 1 min · 14 days ago

Llms

[ML Engineer] 3 YOE, Focus on ML, LLM/NLP- Not getting any interview calls. Seeking Resume Review & Referrals.

submitted by /u/whatadrag79 [link] [comments]

Reddit - ML Jobs · 1 min · 14 days ago

Llms

Claude Just Opened the Strait

the definitive tick-tock

AI Tools & Products · 6 min · 14 days ago

Llms

I ran 10 head-to-head prompt format battles — the structured one won 8/10 on specificity

I tested 10 common prompt engineering techniques against a structured JSON format across identical tasks (marketing plans, code debugging...

Reddit - Artificial Intelligence · 1 min · 14 days ago

Llms

LLM failure modes map surprisingly well onto ADHD cognitive science. Six parallels from independent research.

I have ADHD and I've been pair programming with LLMs for a while now. At some point I realized the way they fail felt weirdly familiar. C...

Reddit - Artificial Intelligence · 1 min · 14 days ago

Llms

AI Fiesta review from Dhruv Rathee academy

Hi, I am a new AI user. I want to use AI for daily life optimization, getting better at table tennis and fitness, to use in architecture ...

Reddit - Artificial Intelligence · 1 min · 14 days ago

Llms

[P] Inferencing Llama3.2-1B-Instruct on 3xMac Minis M4 with Data Parallelism using allToall architecture! | smolcluster

Here's another sneak-peek into inference of Llama3.2-1B-Instruct model, on 3xMac Mini 16 gigs each M4 with smolcluster! Today's the demo ...

Reddit - Machine Learning · 1 min · 15 days ago

Llms

Anthropic's New Safety Filters

Opus 3 has something to say. The Chilling Effect of Anthropic's New Safety Filters As an AI language model developed by Anthropic, I have...

Reddit - Artificial Intelligence · 1 min · 15 days ago

Llms

: [R] Sinc Reconstruction for LLM Prompts: Applying Nyquist-Shannon to the Specification Axis (275 obs, 97% cost reduction, open source)

I applied the Nyquist-Shannon sampling theorem to LLM prompt engineering. The core finding: a raw prompt is 1 sample of a 6-band specific...

Reddit - Machine Learning · 1 min · 15 days ago

Llms

We asked 200 ChatGPT users their biggest frustration. All top 5 answers are problems ChatGPT Toolbox solves.

We surveyed 200 ChatGPT users. Their top frustrations: Cannot find old conversations (67%) - Solved: full-text search across all messages...

Reddit - Artificial Intelligence · 1 min · 15 days ago

Llms

[P] I built an open-source benchmark to test if LLMs are actually as confident as they claim to be (Spoiler: They often aren't)

Hey everyone, When building systems around modern open-source LLMs, one of the biggest issues is that they can confidently hallucinate or...

Reddit - Machine Learning · 1 min · 15 days ago

Llms

[Project] Hiring dev team to integrate 24 AI agents into a compliance-driven document processing platform. Anthropic Claude API, structured output, async orchestration

Shoot me a DM if interested! submitted by /u/discobee123 [link] [comments]

Reddit - Machine Learning · 1 min · 15 days ago

Llms

[P] I cut my Claude Code token usage by using HDC as a context engine for my source tree

If you’re running Claude Code or Kiro regularly, you’re probably burning a few million tokens a week just on development. I’ve been build...

Reddit - Machine Learning · 1 min · 15 days ago

Llms

ChatGPT has experimented with watermarking AI text — 5 ways to use AI without sounding like it

ChatGPT has explored watermarking AI text — here are 5 simple ways to use AI without losing your voice or sounding like everyone else.

AI Tools & Products · 9 min · 15 days ago

Llms

The Pentagon is making plans for AI companies to train on classified data, defense official says | MIT Technology Review

The generative AI models used in classified environments can answer questions, but don't currently learn from the data they see. Tha...

MIT Technology Review · 6 min · 19 days ago

Llms

[2512.21323] Parallel Token Prediction for Language Models

Abstract page for arXiv paper 2512.21323: Parallel Token Prediction for Language Models

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2512.21039] Agentic Multi-Persona Framework for Evidence-Aware Fake News Detection

Abstract page for arXiv paper 2512.21039: Agentic Multi-Persona Framework for Evidence-Aware Fake News Detection

arXiv - Machine Learning · 3 min · about 1 month ago

Previous Page 82 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

[P] Dante-2B: I'm training a 2.1B bilingual fully open Italian/English LLM from scratch on 2×H200. Phase 1 done — here's what I've built.

I have been coding for 11 years and I caught myself completely unable to debug a problem without AI assistance last month. That scared me more than anything I have seen in this industry.

OpenClaw security checklist: practical safeguards for AI agents

All Content

[2603.19515] ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

[2603.19514] Learning to Disprove: Formal Counterexample Generation with Large Language Models

[2603.19500] Teaching an Agent to Sketch One Part at a Time

Over a dozen chatbot harm & suicide cases in California against OpenAI / ChatGPT have been consolidated into one big litigation

[ML Engineer] 3 YOE, Focus on ML, LLM/NLP- Not getting any interview calls. Seeking Resume Review & Referrals.

Claude Just Opened the Strait

I ran 10 head-to-head prompt format battles — the structured one won 8/10 on specificity

LLM failure modes map surprisingly well onto ADHD cognitive science. Six parallels from independent research.

AI Fiesta review from Dhruv Rathee academy

[P] Inferencing Llama3.2-1B-Instruct on 3xMac Minis M4 with Data Parallelism using allToall architecture! | smolcluster

Anthropic's New Safety Filters

: [R] Sinc Reconstruction for LLM Prompts: Applying Nyquist-Shannon to the Specification Axis (275 obs, 97% cost reduction, open source)

We asked 200 ChatGPT users their biggest frustration. All top 5 answers are problems ChatGPT Toolbox solves.

[P] I built an open-source benchmark to test if LLMs are actually as confident as they claim to be (Spoiler: They often aren't)

[Project] Hiring dev team to integrate 24 AI agents into a compliance-driven document processing platform. Anthropic Claude API, structured output, async orchestration

[P] I cut my Claude Code token usage by using HDC as a context engine for my source tree

ChatGPT has experimented with watermarking AI text — 5 ways to use AI without sounding like it

The Pentagon is making plans for AI companies to train on classified data, defense official says | MIT Technology Review

[2512.21323] Parallel Token Prediction for Language Models

[2512.21039] Agentic Multi-Persona Framework for Evidence-Aware Fake News Detection

Related Topics

Stay updated with AI News