Here's someone else testing models on a daily logic puzzle (Clues by Sam): https...

thanhhaimai · 2025-12-11T20:30:17 1765485017

This link doesn't have Gemini 3 performance on it. Do you have an updated link with the new models?

dezgeg · 2025-12-12T06:30:49 1765521049

I've also tried Gemini 3 on Clues by Sam and it does really well; I haven't seen it make a single mistake, even on the Hard and Tricky ones. I haven't run it on too many puzzles, though.

crapple8430 · 2025-12-11T20:39:37 1765485577

GPT 5 Pro is a good 10x more expensive, so it's an apples-to-oranges comparison.