
Only if the LLM knows the inputs connected to particular outputs. Pre-digital-era or classified material might not be available, nor informal discussions with other experts.

Most importantly, negative but unused signals might not be available if the text does not mention them.



Challenge: provide a single example where the LLM can only provide the output and not the steps (in a text-only scenario)?


An LLM can always output steps, but that doesn’t mean they are true; LLMs are great at making up bullshit.

When the “how many ‘r’ in ‘strawberry’” question was all the rage, you could definitely get LLMs to explain the steps of counting, too. It was still wrong.
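
For reference, the correct answer is trivially checkable by machine; here is a minimal Python sketch of the letter-by-letter counting steps the models were being asked to explain (the word and target letter are just the example from this thread):

    # Count occurrences of 'r' in 'strawberry' by walking the word
    # one letter at a time, i.e. the "steps" the LLMs were asked to show.
    word = "strawberry"
    target = "r"

    count = 0
    for index, letter in enumerate(word):
        if letter == target:
            count += 1
            print(f"position {index}: '{letter}' -> count is now {count}")

    print(f"total '{target}' in '{word}': {count}")  # prints 3

The point being: the procedure is deterministic and yields 3, yet the models would narrate plausible-looking steps and still land on the wrong total.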


Can you provide a single example, now with GPT 5.4 Thinking, where it makes things up in its steps? Let’s try to reproduce it.


I’m pretty sure you can think of one yourself; I’m not going to play this game. Now it’s 5.4 Thinking, before that it was 5.3, before that 5.2, 5.1, 5, before that it was 4… At every stage there’s someone saying “oh, the previous model doesn’t matter, the current one is where it’s at”. And when it’s shown that the current model can’t do something, there’s always some other excuse. It’s a profoundly bad-faith argument, the very definition of moving the goalposts.

I do have a number of examples to give you, but I no longer share those online so they aren’t caught and gamed. Now I share them strictly in person.


Caught and gamed? What do you mean?


He means that if the problem becomes known, the AI companies will hack in a workaround rather than solving the problem by making the model more intelligent. Given that they have been caught cheating in that way in the past, I can't blame the GP for not sharing his tests.


Ok so no example.



