Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I tried it. Maybe I got unlucky, but Gemini performed poorly in my testing. I gave it a task that I was working on with VSCode and GPT codex 5.1. Gemini3 repeatedly failed to finish the task and started to go down a rabbit hole on an unrelated task.

The browser extension is really cool and it provides a needed tool for the agent to use. It used the extension to show the page that it updated in the task document (the task doc is great too). However it showed me a page and did it was done, when it was clearly not done and not what I asked for.

I was expecting weaker tooling and a better model. I got good tooling and a not very good model.

Maybe 3.1 will deliver?



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: