it failed on xdg-open queries and Chrome CDP questions... couldn't even keep the OS consistent in the response, even when prompted. I only see more evidence that these LLMs have trouble with accuracy and correctness
It also happily auto-completes in the search input for non-development tasks, and it's not like developers don't ask questions outside of programming