No they’re not. Maybe you mean to say they don’t tell the whole story or have their limitations, which has always been the case.
>>my fairly basic python benchmark
I suspect your definition of “basic” may not be consensus. Gpt-5 thinking is a strong model for basic coding and it’d be interesting to see a simple python task it reliably fails at.
they are not meaningless, but when you work a lot with LLMs and know them VERY well, then a few varied, complex prompts tell you all you need to know about things like EQ, sycophancy, and creative writing.
I like to compare them using chathub using the same prompts
Gemini still calls me "the architect" in half of the prompts. It's very cringe.
I wonder if under the covers it uses your word choices to infer your Myers-Briggs personality type and you are INTJ so it calls you "The Architect"?? Crazy thought but conceivable...
It’s very different to get a “vibe check” for a model than to get an actual robust idea of how it works and what it can or can’t do.
This exact thing is why people strongly claimed that GPT-5 Thinking was strictly worse than o3 on release, only for people to change their minds later when they’ve had more time to use it and learn its strengths and weaknesses. It takes time for people to really get to grips with a new model, not just a few prompt comparisons where luck and prompt selection will play a big role.
I get that one can perhaps have an intuition about these things, but doesn't this seem like a somewhat flawed attitude to have all things considered? That is, saying something to the effect of "well I know its not too sycophantic, no measurement needed, I have some special prompts of my own and it passed with flying colors!" just sounds a little suspect on first pass, even if its not like totally unbelievable I guess.
For some people having a child fundamentally changes their definition of love and their belief in how it’s possible to relate to other human beings.
This is mostly disconnected from true/false values. The facts haven’t really changed. Yet it can be so powerful and rewarding few would trade it regardless of the risks/pain/hardships that can come with it.
Experiences can be profound and change perspectives in a way that’s so rewarding and wholistic it’s really impossible to describe.
Comparing such experiences to math doesn’t really work, apples and oranges.
On the networking side. M4 max does have thunderbolt 5, 80gbps advertised.
Would ip over TB not allow for significantly faster interconnects when clustering Macs?
The DGX Spark, Ascent GX10, and related machines have no relation to NVIDIA Grace Blackwell GB200. The chip they are based on is called GB10, and is architecturally very different from NVIDIA's datacenter solutions, in addition to being vastly smaller and less powerful. They don't have anything resembling the Grace CPU NVIDIA used in Grace Hopper and Grace Blackwell datacenter products. The CPU portion of GB10 is a Mediatek phone chip's CPU complex that metastasized, not NVIDIA's datacenter CPU cut down.
Mediatek is in the picture because NVIDIA outsourced everything in GB10 but the GPU to Mediatek. GB10 is two chiplets, and the larger one is from Mediatek. Yes, Mediatek uses off the shelf ARM CPU core IP, but they still had to make a lot of decisions about how to implement it: how many cores, what cluster and cache arrangements, none of which resemble NVIDIA's Grace CPU.
Thanks for the clarification. I was surprised to learn it is not a single chip; thought they did something akin to Apple Silicon integrating some ARM cores on their main chip. Kind of disappointing: they basically asked MediaTek to build a CPU with an NV-Link I/O.
The big picture is probably that GB10 is destined to show up in laptops, but NVIDIA couldn't be bothered to do all the boring work of building the rest of the SoC and Mediatek was the cheapest and easiest partner available. It'll eventually be followed by an Intel SoC with NVIDIA providing the GPU chiplet, but in the meantime the Mediatek CPU solution is good enough.
From NVIDIA's perspective, they need an answer to the growing segment of SoCs with decent sized GPUs and unified memory; their existing solutions at the far end of a PCIe link with a small pool of their own memory just can't work for some important use cases, and providing GPU chiplets to be integrated into other SoCs is how they avoid losing ground in these markets without the expense of building their own full consumer hardware platform and going to war with all of Apple, Intel, AMD, Qualcomm.
Having a solid product that solves a problem well can be orthogonal to how well a codebase lends itself to readability, learning curve, and efficiently ramping up new developers on a project.
Just because you succeed at one says nothing about other practical and important metrics.
The proper way to read it is to understand the problem and its pros and cons.
Without going long in the speculation, the situation likely was: there's only one guy who really can deliver this because of his knowledge, cv and experience and we need it.
And at that point your choice is having a solution or not.
Thanks! It should work on mobile, you can hide the right panel using the button at the bottom of the screen. That said, performance on mobile is pretty limited because of hardware limitations.
tldr: Seeing what happens internally in an LLM lets you reconstruct the original prompt exactly.
Maybe not surprising if you logged all internal activity, but it can be done from only a single snapshot of hidden activations from the standard forward pass.
Merely disagreeing with a guy like Rovelli on physics feels like hubris. :)
But agree with you in this case. Animals perceive the flow of time because we have memory and prediction abilities. This gives us a psychological arrow that aligns with the thermodynamic arrow.
Concerning the concept of time, it is clear that the generalized second law is related in some manner to the question of the "direction" of time. It is sometimes asserted that the second law explains the distinction between past and future, or that the future may be defined as the direction of increasing entropy. This assertion says that the second law is more fundamental than the distinction between past and future.
It seems to the author that the above assertion is wrong. It was seen in Section 5.2 that the generalized second law is derived from the distinction between past and future; hence the distinction between past and future is more fundamental than the second law.
The following statement seems to be the most fundamental physical assertion which can be made regarding past and future: we can classify all instants t into two categories; the first category contains those instants about which experimental data is (or could be) known, and the second category contains those instants about which no experimental data is known. The first category is conventionally called the "past" and the second is called the "future". The instants may be labeled with real numbers running from -∞ to +∞, in such a way that the "past" instants constitute a set of the form (-∞ < t < t0), and the "future" instants constitute the set (t0 < t < +∞).
The choice of the positive direction as the future is purely a convention.
According to Sections 5.2 and 5.4, irreversibility and the generalized second law are derivable from the existence of the above two categories of in-stants: an "information-gathering category" (the past), and a "predictive category" (the future). The existence of these two categories seems to be a fundamental feature of nature, not explainable in terms of the second law or in terms of any other physical law.
Well, I am puzzled that Rovelli is not aware about the great debate in modern philosophy about time series A versus time series B that was started in 1908 by a paper on unreality of time by John McTaggart. A consequence of that paper is that perception of the time flow cannot be reduced to a physical process like entropy increase etc. So far nobody was able to disprove that.
And to further clarify, observer independence is referring to the frame itself not that humans imagine the world into being in the solipsistic sense many believe.
I say this because just a few days ago on this forum someone was asserting that without humans the earth would not exist, that human observation instantiates the earth and the earth did not exist before human consciousness.
What do you mean? That according to relational quantum mechanics absolute reality doesn't exist? But absence of absolute reality doesn't imply absence of objective reality. And it's according to relational quantum mechanics, not according to mathematical formalism of QM.
Hadn’t heard that. How many AAA vfx studios have left Maya for Blender?
reply