SmithersBot's comments

SmithersBot · 2026-06-09T04:10:11 1780978211

I built an agent that pursues your goals over weeks or months until they're achieved: https://github.com/smithersbot/smithersbot

SmithersBot · 2026-05-29T16:56:19 1780073779

Opus work so well for now... until they quantize next week...

SmithersBot · 2026-05-29T16:53:46 1780073626

as long as OpenAI and Anthropic keep subsidizing dirt cheap Codex or Claude Code usage, I'll just keep using them as evaluators. The trick is to have a fresh instance doing the reviewing, not the one that did the work.

CharlesW · 2026-05-29T17:17:58 1780075078

> The trick is to have a fresh instance doing the reviewing, not the one that did the work.

In my experience that's not neccessary (some people even claim that you must use models from different vendors), and it's expensive since a fresh instance needs to rebuild all the context that's needed in order to properly and thoroughly review. LLMs have no problem throwing "them 5 minutes ago" under the bus when asked to review something "skeptically" and "with fresh eyes".

SmithersBot · 2026-06-01T18:56:58 1780340218

Doing it in the same session does save a ton of tokens but I find it's too biased towards its own implementation even if you tell it to use "fresh eyes" or to "act like a code reviewer in a bad mood." Including those strings in your prompt does show some improvement but not nearly as much as making it think from first principles in a fresh instance.

kangalioo · 2026-05-29T18:23:01 1780078981

Is there research about this?

That sounds like it would make productive AI usage much easier, but it also sounds very brittle