The loss curve said tie. The judges said otherwise. Seeking replication for an early LLM training result [R]
TL;DR - I've written two novel functions that shape the training signal for LLMs. In early tests, people preferred responses from models trained with my functions ~59.9% of the time, but I'm just one guy with one GPU, and I'm hoping someone with more resources can prove me right or wrong.

The functions:

- Per-token gain: each token's loss gets scaled by how surprising it is. Confident, correct tokens coast, surprising ones get amplified, and the average comes out unchanged, so the total gradient budget is preserved.
- Pe...
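To make the per-token gain idea concrete, here is a minimal sketch of how that kind of reweighting could look in PyTorch. This is my own illustration of the description above, not the author's code: I'm assuming the weight is the token's surprisal (its own NLL), detached from the graph, and renormalized so the weights average to 1, which is one way to keep the total gradient budget unchanged.

```python
import torch
import torch.nn.functional as F

def per_token_gain_loss(logits, targets, ignore_index=-100):
    """Cross-entropy where each token's loss is scaled by its own surprisal,
    with weights renormalized to mean 1 so the overall loss scale is preserved.
    (Hypothetical sketch of the 'per-token gain' described in the post.)"""
    # Per-token negative log-likelihood (surprisal), no reduction yet.
    nll = F.cross_entropy(
        logits.view(-1, logits.size(-1)),
        targets.view(-1),
        ignore_index=ignore_index,
        reduction="none",
    )
    mask = (targets.view(-1) != ignore_index).float()
    num_valid = mask.sum().clamp(min=1)

    # Weight = detached surprisal, so the weighting itself gets no gradient,
    # normalized so the mean weight over valid tokens is 1.
    weights = nll.detach() * mask
    weights = weights / (weights.sum() / num_valid + 1e-8)

    # Confident (low-surprisal) tokens get weight < 1, surprising tokens > 1,
    # while the average loss magnitude stays roughly unchanged.
    return (weights * nll * mask).sum() / num_valid
```

Whether the original scheme detaches the weight, clips it, or uses a different normalizer is unclear from the truncated TL;DR; this is just the simplest reading of "scaled by how surprising it is, average unchanged."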