Mythos just obliterated SWE-bench with a 93.9% score. The era of the solo mega-corp is actually here.
About this article
The new SWE-bench numbers for Mythos just dropped, and the gap between it and the current best is terrifying. SWE-bench Verified: Mythos: 93.9% Opus 4.6: 80.8% SWE-bench Pro: Mythos: 77.8% Opus 4.6: 53.4% That Pro score is a nearly 25% jump in autonomous coding. Factor in the rumors around Project Glasswing giving it deep architectural understanding, and the barrier between a prompt and a fully deployed product is basically gone. Imagine what you will be able to build when Mythos drop...
You've been blocked by network security.To continue, log in to your Reddit account or use your developer tokenIf you think you've been blocked by mistake, file a ticket below and we'll look into it.Log in File a ticket