It's not better at math. It can only compute some operations better, but there is much more to math than that. Otherwise, this wouldn't be considered cheating:
https://news.ycombinator.com/item?id=41550907
I've seen math PhDs mess up addition and subtraction on a whiteboard, though.
Beating the 99th-percentile human at any subject should not be difficult when LLM training is equivalent to living thousands of lifetimes, each spent reading and nearly memorizing every book ever written on every university subject.
The fact that it only just barely beats humans feels hollow to me.
For those who've seen it, imagine if at the end of Groundhog Day everyone in the crowd went, "Wow, he's slightly better than average at piano!"
It seems plausible that OpenAI's most recent model is better at math and googling than the average human.