Not that I give much credence to anything Zitron says, but the amount of inferen...

o10449366 · 2026-04-28T15:41:39 1777390899

Yeah, I'm sure the numbers are a bit inflated compared to API, but with my Claude $200/month subscription I've supposedly consumed 12,160,410,828 tokens in April for a cost of $22,733.03.

Leynos · 2026-04-28T18:55:28 1777402528

Is that taking cache hits into account?

o10449366 · 2026-04-28T21:26:53 1777411613

Cache create is 202,746,985 and cache read is 11,998,411,722 from claude-code-monitor

Leynos · 2026-04-30T22:36:51 1777588611

I make that $7000 :o
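
For anyone who wants to redo that estimate, here is a rough sketch of the arithmetic using only the two cache figures quoted above. The per-million-token rates are assumptions picked for illustration (they are not stated anywhere in this thread and may not match the plan or model actually used), and `cache_cost` is a hypothetical helper, so treat the result as a ballpark only.

```python
# Token counts quoted in the thread (reported by claude-code-monitor).
CACHE_CREATE_TOKENS = 202_746_985
CACHE_READ_TOKENS = 11_998_411_722

# ASSUMED rates in USD per million tokens. These are illustrative
# placeholders, not figures taken from the thread or from any price list.
ASSUMED_CACHE_WRITE_RATE = 6.25
ASSUMED_CACHE_READ_RATE = 0.50


def cache_cost(tokens: int, usd_per_million: float) -> float:
    """Return the cost in USD of `tokens` at a per-million-token rate."""
    return tokens / 1_000_000 * usd_per_million


write_cost = cache_cost(CACHE_CREATE_TOKENS, ASSUMED_CACHE_WRITE_RATE)
read_cost = cache_cost(CACHE_READ_TOKENS, ASSUMED_CACHE_READ_RATE)
total = write_cost + read_cost

print(f"cache write: ${write_cost:,.2f}")
print(f"cache read:  ${read_cost:,.2f}")
print(f"total:       ${total:,.2f}")
```

Under those assumed rates the total lands a little under $7,300, in the same ballpark as the $7000 figure; different rates would shift it proportionally.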

PunchyHamster · 2026-04-28T14:04:56 1777385096

> Not that I give much credence to anything Zitron says, but the amount of inference you can get on a £200 a month OpenAI or Anthropic subscription is easily an order of magnitude more than what you'd get paying the same amount at subscription rate.

Neither of those is how much it actually costs the company selling the service. And I have a feeling they are running at a loss here, so the play is "get everything possible using LLMs, then jack up the pricing".

semiquaver · 2026-04-28T14:10:12 1777385412

There have been plenty of studies which indicate that inference considered by itself is almost certainly quite profitable at all the frontier labs. The problem is amortizing the cost of all the expensive training runs required to train new models into the revenue stream.

pkaye · 2026-04-28T15:33:34 1777390414

Does that mean those running the open models are highly profitable since they don't have to do any training?

polski-g · 2026-04-28T16:45:44 1777394744

Yes obviously, otherwise they wouldn't be doing it; they'd just go back to mining shitcoins.

semiquaver · 2026-04-28T19:22:00 1777404120

I don’t know about "highly", since they have even less of a moat than Anthropic and OpenAI do. Anyone with a few hundred thousand dollars or sufficient free GPUs can compete with them. So running an open model should earn a market-rate margin.

paulddraper · 2026-04-28T13:51:41 1777384301

*more than what you'd get paying the same amount at usage rate.

Leynos · 2026-04-28T18:57:32 1777402652

Yes, thanks. Too late to edit now, sadly.