> To hit 87 percent on the original ARC-AGI test, o3 spent roughly 14 minutes per puzzle and, by my calculations, may have required hundreds of thousands of dollars in computing and electricity
> the bot came up with more than 1,000 possible answers per grid before selecting a final submission.
Let's all hope two things:
1) AGI is not near
2) Nobody ever allows it to have physical presence
2b) should that happen, expect the one idiot ask it to make "as many paperclips as possible" :)
I would be willing to bed we’ll see something rambling natural intelligence created in this century, but I think this brute force approach shows that we’re just not there yet.
> the bot came up with more than 1,000 possible answers per grid before selecting a final submission.
Yeah, AGI is right around the corner… /s