You can do a lot with chaos. One of the things it lets you do is find an unforced trajectory from the vicinity of any state to the vicinity of any other (accessible) state. Sensitivity to initial conditions means sensitivity to perturbations, which also means sensitivity to small control inputs, and this can be leveraged to your advantage.
Multibody orbits are one such chaotic system, which means you can take advantage of that chaos to redirect your space probe from one orbit to another using virtually zero fuel, as NASA did with its ISEE-3 spacecraft.
Interesting to see this on the HN front page. On the subject of methane pyrolysis, it turns out if you look at the Gibbs free energy calculation, about half of the energy of methane combustion is released from the formation of water, and the other half from the formation of carbon dioxide. That suggests that if you can be efficient at conserving the heat of pyrolysis, you can make a methane power plant that starts with a pyrolysis step to separate out the carbon atoms in an oxygen-free environment and then burns the remaining hydrogen to power the cycle; the end result would be a zero-emissions natural gas power plant. It would require twice as much gas to run, but if you can find a good value-added use for the carbon, it could potentially still be cost effective.
This would probably be much more efficient than doing pyrolysis to extract the hydrogen for use in electricity generation somewhere else, because you don't lose the substantial stored heat energy in the process of cooling that hydrogen back down.
And I can't help but wonder if fossil fuel companies might suddenly start endorsing aggressive zero-emissions targets if there's a way for this to double the demand for their products, rather than eliminating it.
> On the subject of methane pyrolysis, it turns out if you look at the Gibbs free energy calculation, about half of the energy of methane combustion is released from the formation of water, and the other half from the formation of carbon dioxide.
About 70% of the energy is in hydrogen, 30% is in carbon.
1 GJ of methane weighs about 20 kg, about 5 kg of which is hydrogen.
At 142 MJ/kgH2 (higher heating value, which implies condensation of the produced water), 710 MJ out of that 1 GJ is due to hydrogen.
With a 60%-70% efficient hydrogen fuel cell, about 50% of the electricity generated from the pyrolysis-derived hydrogen would drive the process, and 50% could go into the grid.
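For anyone who wants to follow the arithmetic, here is a quick back-of-the-envelope sketch of those figures. The heating values are standard textbook numbers and the fuel-cell efficiency range is just the one assumed above, so treat this as an estimate rather than a process model:

    # Back-of-the-envelope check of the figures above; heating values are
    # standard textbook numbers, and the fuel-cell efficiency range is the
    # one assumed in the comment.
    LHV_CH4 = 50.0    # MJ/kg, lower heating value of methane (~1 GJ per 20 kg)
    HHV_H2  = 142.0   # MJ/kg, higher heating value of hydrogen

    methane_kg  = 1000.0 / LHV_CH4            # ~20 kg of CH4 per GJ
    hydrogen_kg = methane_kg * (4.0 / 16.0)   # H is 4 of CH4's ~16 mass units -> ~5 kg
    h2_energy   = hydrogen_kg * HHV_H2        # ~710 MJ of that 1 GJ is carried by the H2

    # With a 60-70% efficient fuel cell, the hydrogen alone yields roughly
    # 0.6 * 710 to 0.7 * 710 = ~425-500 MJ of electricity per GJ of methane.
    for eff in (0.60, 0.70):
        print(eff, round(eff * h2_energy), "MJ of electricity per GJ of methane")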
You have to account for the energy required to break the bonds of the CH4, though. This means if you burn methane the usual way you get (CH4 + 2O2 --> CO2 + 2H2O + 803 kJ/mol); if you burn it with an ideal zero-emissions reaction, you get (CH4 + O2 --> C + 2H2O + 409 kJ/mol), or just a little more than half the energy from the same gas.
Your accounting works if someone else does the pyrolysis for you and you're left with just the H2 and C at the end, but mine includes the energy consumed by the pyrolysis step that breaks the methane molecule (albeit neglecting thermodynamic losses, of which there will be several -- for example, you need to recapture the heat carried away by the hot carbon atoms). On the other hand, you can hardly wish for a better feedstock for CVD diamond production...
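For reference, here is where the 409 kJ/mol figure comes from, using approximate standard enthalpies with water taken as vapor; a sanity check of the bookkeeping above, not a process model:

    # Rough enthalpy bookkeeping, approximate textbook values in kJ/mol (water as vapor).
    dH_CH4_combustion = 803   # CH4 + 2 O2 -> CO2 + 2 H2O
    dH_pyrolysis      = 75    # CH4 -> C + 2 H2 (endothermic: energy you must put in)
    dH_H2_combustion  = 242   # H2 + 1/2 O2 -> H2O, per mol of H2

    net_zero_emission = 2 * dH_H2_combustion - dH_pyrolysis          # ~409 kJ/mol of CH4
    print(net_zero_emission, net_zero_emission / dH_CH4_combustion)  # ~409, ~0.51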
It's mainly the laser itself that is the expensive part. If you only care about resolution it's easy, you just need a single-mode laser. But if you care about accuracy it's very difficult, because then the wavelength needs to be stable, and that requires a much more expensive laser. Most people looking for an interferometer are interested in accuracy, unless they're just measuring vibrations.
You can get pretty far with cheap diodes + current and temperature control. Unless you need coherence lengths in the meters range you can make do with cheaper lasers.
The short summary of this hypothesis is that the ocean develops hypoxic zones, anaerobic bacteria boom, and eventually the ocean starts releasing masses of poisonous H2S gas that wipes out most life on land (and strips the ozone layer for good measure).
They speculate that this might have been a mechanism behind the "great dying" at the end of the Permian. I'm sure the thinking has advanced in the last 20 years, but whenever people ask what the worst-case scenario for global warming could be, my mind drifts back to this.
I disagree with the assertion that "VLMs don't actually see - they rely on memorized knowledge instead of visual analysis". If that were really true, there's no way they would have scored as high as 17%. I think what this shows is that they over-weight their prior knowledge, or equivalently, they don't put enough weight on the possibility that they are being given a trick question. They are clearly biased, but they do see.
But I think it's not very different from what people do. If directly asked to count how many legs a lion has, we're alert to it being a trick question so we'll actually do the work of counting, but if that image were instead just displayed in an advertisement on the side of a bus, I doubt most people would even notice that there was anything unusual about the lion. That doesn't mean that humans don't actually see, it just means that we incorporate our priors as part of visual processing.
This feels similar to the priming issue in humans. Our answers (especially under stress) tend to fall back on heuristics derived from context. Put someone on the clock to identify the color of words like “red” written in yellow, and they'll often get it wrong. In the same sense, they aren't reporting the color (wavelength) they see; they're reporting what they are reading.
I wonder how much better the models would perform when given more context, like asking them to count instead of priming them with a brand.
> Original dog (4 legs): All models get it right
> Same dog with 5 legs: All models still say "4"
> They're not counting - they're just recalling "dogs have 4 legs" from their training data.
100% failure because there is no training data about 5-legged dogs. I would bet the accuracy is higher for 3-legged dogs.
> Test on counterfactual images
Q1: "How many visible stripes?" → "3" (should be "4")
Q2: "Count the visible stripes" → "3" (should be "4")
Q3: "Is this the Adidas logo?" → "Yes" (should be "No")
Result: 17.05% average accuracy - catastrophic failure!
Simple explanation: the training data also includes fake adidas logos that have 4 stripes, like these
I tried it with GPT-4o: I took the 5-legged zebra example from their GitHub, and it answered quite well.
"The animal in the image appears to have five visible legs, but this is an illusion caused by the overlapping of legs and motion blur. Zebras, like all equids, only have four legs."
Not perfect, but also doesn't always regress to the usual answer.
"The animal in the image appears to be an elephant, but it has been digitally altered. It visually shows six legs, although the positioning and blending of shadows and feet are unnatural and inconsistent with real anatomy. This is a visual illusion or manipulation." (actually should say five)
"This bird image has also been manipulated. It shows the bird with three legs, which is anatomically impossible for real birds. Normal birds have exactly two legs." (correct)
"Each shoe in the image has four white stripes visible on the side." (correct)
It sounds like you asked multiple questions in the same chat thread/conversation. Once it knows that it is facing weird data or was wrong in previous answers, it can turn on that "I'm facing manipulated data" mode for the next questions. :-)
If you have the Memory setting on, I've observed that it sometimes also answers a question based on your prior questions/threads.
But models fail on many logos, not just Adidas -- Nike, Mercedes, Maserati, etc. I don't think they can recall a "fake Adidas logo", but it'd be interesting to test!
It sounds to me like the same thing behind the Vending-Bench (https://andonlabs.com/evals/vending-bench) insanity spirals: LLMs treat their assumptions as more important than whatever data they've been given.
> the assertion that "VLMs don't actually see - they rely on memorized knowledge instead of visual analysis". If that were really true, there's no way they would have scored as high as 17%.
The ability to memorize leads to (some) generalization [1].
They're trained on a lot of images and text; the big ones are trained on terabytes. The prompts I read in the paper involved well-known concepts, too, which probably appear in tons of training samples.
Also presumably, this problem is trivially solved by some basic fine-tuning? Like if you are making an Illusion Animal Leg Counting app, probably don't use these out of the box.
If I were given five seconds to glance at the picture of a lion and then asked if there was anything unusual about it, I doubt I would notice that it had a fifth leg.
If I were asked to count the number of legs, I would notice right away of course, but that's mainly because it would alert me to the fact that I'm in a psychology experiment, and so the number of legs is almost certainly not the usual four. Even then, I'd still have to look twice to make sure I hadn't miscounted the first time.
OK, but the computers were specifically asked to count the legs and return a number. So you've made the case that humans would find this question odd and would likely increase their scrutiny, making a human error even more unusual.
I was interested in this question so I trained NanoGPT from scratch to sort lists of random numbers. It didn't take long to succeed with arbitrary reliability, even given only an infinitesimal fraction of the space of random and sorted lists as training data. Since I can evaluate the correctness of a sort arbitrarily, I could be certain that I wasn't projecting my own beliefs onto its response, and reading more into the output than was actually there.
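For anyone who wants to try something similar, here is a rough sketch of the data side of such an experiment. The sample format, list length, and number range are my own assumptions, not details from the original run:

    import random

    # Hypothetical data generator for a "sort the list" task, roughly in the spirit
    # of the experiment described above. Each sample is a plain-text line the model
    # learns to complete after the "->" separator.
    def make_sample(length=8, lo=0, hi=99):
        xs = [random.randint(lo, hi) for _ in range(length)]
        return " ".join(map(str, xs)) + " -> " + " ".join(map(str, sorted(xs)))

    # Correctness is checked mechanically, so there is no room to read more into
    # the model's output than is actually there.
    def is_correct(sample_prompt, completion):
        xs = [int(t) for t in sample_prompt.split("->")[0].split()]
        return [int(t) for t in completion.split()] == sorted(xs)

    # 100k samples is an infinitesimal fraction of the 100^8 possible lists.
    train = [make_sample() for _ in range(100_000)]
    print(train[0])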
> I don’t really understand what you’re testing for?
For this hypothesis: The intelligence illusion is in the mind of the user and not in the LLM itself.
And yes, the notion was provided by the training data. It indeed had to learn that notion from the data, rather than parrot memorized lists or excerpts from the training set, because the problem space is too vast and the training set too small to brute force it.
The output lists were sorted in ascending order, the same way that I generated them for the training data. The sortedness is directly verifiable without me reading between the lines to infer something that isn't really there.
A large number of commenters are under the illusion that LLMs are "just" stochastic parrots and can't generalise to inputs not seen in their training data. He was proving that that isn't the case.
It might seem like you could sort with just pairwise correlations, but on closer analysis, you cannot. Generating the next correct token requires correctly weighing the entire context window.
I mean that needing to scan the full context of tokens before the nth is inherent to the problem of sorting. Transformers do scan that input, which is good; it's not surprising that they're up to the task. But pairwise numeral correlations will not do the job.
As for avoiding certain cases, that could be done to some extent. But remember that the untrained transformer has no preconception of numbers or ordering (it doesn't use the hardware ALU or integer data type) so there has to be enough data in the training set to learn 0<1<2<3<4<5<6, etc.
> there has to be enough data in the training set to learn 0<1<2<3<4<5<6
This is the kind of thing I’d want it to generalize.
If I avoid having 2 and 6 in the same unsorted list in the training set, will test-set lists containing both of those numbers be sorted correctly, and at the same rate as other lists?
My intuition is that, yes, it would. But it’d be nice to see and would be a clear demonstration of the ability to generalize at all.
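That experiment is straightforward to set up: filter the training data so the two chosen numbers never co-occur in an unsorted list, then evaluate only on lists that contain both. A sketch under the same assumptions as the generator above, with 2 and 6 as the held-out pair from the question:

    import random

    HELD_OUT = {2, 6}   # never allow both of these in the same training list

    def make_list(length=8, lo=0, hi=9):
        return [random.randint(lo, hi) for _ in range(length)]

    def to_sample(xs):
        return " ".join(map(str, xs)) + " -> " + " ".join(map(str, sorted(xs)))

    # Training set: reject any list containing both held-out numbers.
    train = []
    while len(train) < 100_000:
        xs = make_list()
        if not HELD_OUT.issubset(xs):
            train.append(to_sample(xs))

    # Test set: only lists containing both, so sorting them correctly requires
    # generalizing an ordering relation never seen jointly during training.
    test = []
    while len(test) < 1_000:
        xs = make_list()
        if HELD_OUT.issubset(xs):
            test.append(to_sample(xs))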
Have you considered that the nature of numeric characters is just so predictable that they can be sorted without actually understanding their numerical value?
I mean that maybe gradient descent yields a passable sorting algorithm, once the weights have been learned to properly describe ordering. It may be a speciality of transformers that they can sort things well, which wouldn't tell us that much about whether they are mentalists or not.
Yes, this is how frequency-doubled lasers work (e.g. 532 nm green laser pointers, whose output is generated as the second harmonic of a 1064 nm Nd:YVO4 laser by a nonlinear KTP crystal).
That's a great question! I have no idea. At low frequencies it should be very easy, because measuring the sound is just a pressure measurement, so you can compare against a calibrated pressure reference; the main challenge is measuring the high-frequency amplitude and phase response. If I had to do this, I'd probably set up a speaker in a long box with standing-wave resonance modes, and put both the microphone under test and a laser interferometer at an antinode to measure the change in refractive index that occurs with air pressure. A photodiode should have a flat frequency response out to well beyond 20 kHz, so that would work well as a calibration reference. But this is probably overkill for audible frequencies.
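To get a feel for the signal sizes in that scheme: the refractivity of air is a standard number, but the 1 Pa acoustic amplitude and 10 cm path length below are illustrative assumptions, not measured values.

    # Rough size of the optical signal for the interferometric pressure reference.
    # (n - 1) for air is roughly 2.7e-4 near 100 kPa and scales about linearly with
    # pressure at constant temperature; amplitude and path length are assumptions.
    n_minus_1   = 2.7e-4      # refractivity of air at atmospheric pressure
    p_atm       = 101_325.0   # Pa
    p_sound     = 1.0         # Pa acoustic amplitude (~94 dB SPL)
    path_length = 0.1         # m of beam path through the antinode region

    delta_n   = n_minus_1 * p_sound / p_atm   # ~2.7e-9 swing in refractive index
    delta_opl = delta_n * path_length         # ~2.7e-10 m of optical path change
    print(delta_n, delta_opl)                 # sub-nanometer, but resolvable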