I'll accept Meta's frontier AI demise if they're in their current position a year from now. People killed Google prematurely too (remember Bard?), because we severely underestimate the catch-up power bought with ungodly piles of cash.
It's insane numbers like that that give me some concern for a bubble. Not because AI hits some dead end, but due to a plateau that shifts from aggressive investment to passive-but-steady improvement.
Maverick and Scout were not great, even with post-training, in my experience, and then several Chinese models at multiple sizes made them kind of irrelevant (dots, Qwen, MiniMax).
If anything this helps Meta: another model to inspect/learn from/tweak etc. generally helps anyone making models
Part of the secret sauce since O1 has been access to the real reasoning traces, not the summaries.
If you even glance at the model card you'll see this was trained on the same CoT RL pipeline as O3, and it shows in using the model: this is the most coherent and structured CoT of any open model so far.
Having full access to a model trained on that pipeline is valuable to anyone doing post-training, even if it's just to observe, but especially if you use it as cold start data for your own training.
There are definitely some shills all over HN now... But even aside from that, the sheer novelty aspect (+less robotic ethical alignment) of it is enough for many
Can't her husband just stand in front of her, behind the laptop webcam? There are often ridiculously simple real-world workarounds for these complex device security processes.
No one else is supposed to be in the same room as the test taker, for the obvious reason that they should receive no help on the test in any way.
Some households -- especially if you have small children, or live in a small house without the luxury of separate rooms, noisy neighborhood, etc. -- may pose a challenge.
But outside those scenarios the candidate should know to dedicate one room to themselves for the duration of the test and preferably keep it locked from inside and inform others in the house not to disturb them for those 2 or 3 hours.
I am surprised to see this simple requirement -- that there should be no other person in the video frame (which will be audited for it, both manually and through automatic processing) -- is considered draconian? How? Are test takers expecting to take tests in rooms where anyone else can casually walk in, move around, etc?
To be sure, there are other quirks, like no bathroom breaks, no glancing away from the screen, no mouthing the words as you read, no covering your face, or sometimes no resting your chin on your hand as you think, etc., that can all become very tedious and stressful, sure.
They don't actually disqualify you; most of these places (like https://www.proctoru.com or similar) just report to the exam administrator what they saw/noticed.
Yep, in medical school that's one of the first things we learn. In theory it is also best to measure in both arms (as well as one leg if you suspect a certain diagnosis)
Incredible that a drug discovered >100 years ago can still be used to price gouge Americans... Thankfully now there seems to be some semblance of rationality?
The executive branch had to threaten Big Pharma to make this happen. The FTC is similarly pursuing pharma companies that improperly submitted patents to the FDA’s Orange Book to extract more profits.
It's not the same drug. You can get the same insulin produced 100 years ago (well you can't because it was extracted from animal pancreases) for cheap, but guess what? Nobody wants it.