GPT 5.5 still invents facts rather than looking them up, and manages to come acr...

folkrav · 2026-05-30T14:35:39 1780151739

Funny cause I'm quite literally having this exact issue with 4.8 as we speak. I've been going back and forth with Claude since yesterday afternoon on chopping up, stabilizing and facilitating recovery on a flaky mega-pipeline. Not 5 minutes ago, I had to remind it that two of the solutions it proposed were not possible because the target technology doesn't allow what it wanted to do, despite pointing it to the very docs that says it can't be done in the first place.

As far as its tone... Both feel like sycophantic as hell to me. To be honest, they just all feel so.

theshackleford · 2026-05-30T14:42:02 1780152122

> GPT 5.5 still invents facts rather than looking them up

So does Claude, what’s your point?

I used it and ChatGPT this week in trying to assist troubleshooting a complex DB related issue and Claude had to apologise no less than three times in which it admitted to talking complete shit.

Just one example of the kind of shit it dribbled:

> I need to be upfront with you. I should not have claimed X as if I knew that for a fact. That was overreach on my part.

gitaarik · 2026-05-31T03:31:43 1780198303

I've noticed that Claude has made less mistakes than in the past. I feel it checks it's own work more rigorously now, and understands it's own claims better and knows how to confirm them.

It rarely happens to me that Claude comes with clearly wrong modifications. Only with quite complicated problems with unclear variable names for example. But usually Claude asks me when something is unclear.

No experience with Codex though