I finally sat down and did the math on my Cloud LLM bills… and I’m moving almost everything to a 4090. [R]
About this article
I used to be all-in on cloud APIs. For any side project, I’d just grab an OpenAI or Anthropic key and not think twice. It was convenient. No worrying about VRAM, super fast responses, and I could spin something up in minutes. But that “pay-as-you-go” comfort slowly turned into real pain. Last month one of my small RAG tools that I built for a few friends racked up $120 in API costs. Then an experimental agent I left running in a loop hit $450. That was the moment I opened a spreadsheet and re...
You've been blocked by network security.To continue, log in to your Reddit account or use your developer tokenIf you think you've been blocked by mistake, file a ticket below and we'll look into it.Log in File a ticket