Applying Karpathy's autoresearch to a 33M-token public transit dataset (14% improvement, replication notes) [P]

Reddit - Machine Learning 1 min read

About this article

Hello r/MachineLearning! I work in the US transit industry and I went all-in on learning AI & ML a few months ago. When I heard about Andrej Karpathy's autoresearch framework, I thought it was really cool. I decided to use the same transit dataset from an earlier GPT-2 XL fine-tuning project to train a small 80M model from scratch. Autoresearch is designed for from-scratch pretraining (not fine-tuning) so I started a new project rather than retrofitting the GPT-2 XL one. I would love to h...

You've been blocked by network security.To continue, log in to your Reddit account or use your developer tokenIf you think you've been blocked by mistake, file a ticket below and we'll look into it.Log in File a ticket

Originally published on April 30, 2026. Curated by AI News.

Related Articles

Llms

A Hackable ML Compiler Stack in 5,000 Lines of Python [P]

Hey r/MachineLearning, The modern ML (LLM) compiler stack is brutal. TVM is 500K+ lines of C++. PyTorch piles Dynamo, Inductor, and Trito...

Reddit - Machine Learning · 1 min ·
Llms

Make your paper part of your codebase: Integrating Claude Code/Github Copilot with Overleaf for writing papers [P]

Since a lot of the members here are researchers, I thought I'll share my setup that has significantly acclerated my writing process. Much...

Reddit - Machine Learning · 1 min ·
Llms

Codebase-scale retrieval using AST-derived graphs + BM25 — reducing LLM context from 100K to 5K tokens [D]

Wanted to share an approach I've been using for retrieval-augmented generation over large codebases and get feedback from people thinking...

Reddit - Machine Learning · 1 min ·
OpenAI announces new advanced security for ChatGPT accounts, including a partnership with Yubico | TechCrunch
Llms

OpenAI announces new advanced security for ChatGPT accounts, including a partnership with Yubico | TechCrunch

OpenAI is launching additional opt-in protections for ChatGPT accounts. The new security initiative includes a new partnership with secur...

TechCrunch - AI · 4 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime