To be fair, even if they released detailed instructions and datasets on how to rebuild llama (considering that there's some randomness in the process), you still probably wouldn't be able to build it; who has the resources? And if you had the resources, you _still_ probably wouldn't _want_ to rebuild it yourself, since it seems awfully expensive when you could instead spend those resources elsewhere.
Fair point about the license; people have different definitions of what "open source" means.
That's true of ordinary software for most people too. How many people actually build Linux or Chromium from source? Building Chromium takes more RAM and HD space than most people even have. Yet the world gets immense value from the few who do. I wouldn't want to live in a world where WebKit and Chromium were closed source. You can run a Chromium fork without having to build it yourself. And compute costs will come down over time.
> Building Chromium takes more RAM and HD space than most people even have.
According to [1], it takes 16GB of RAM and ~180GB of disk space. Most people have that much. It does take several hours without a many-core machine though.
I would bet that, overall, most people have those 4GB RAM, 32GB eMMC laptops from Walmart, etc. If you limit things to developers/gamers/enthusiasts, you'd probably be right.
Linux and Chromium seem to be at the edge of the current scale of "ordinary" open-source software. I think perhaps one should also take into account how much money would be needed to build the thing in a reasonable time.
Building Chromium sounds awful, but I'm not sure I'd really need to buy another computer for that. If I did, I'm sure I wouldn't need to spend billions on it, most probably not even millions.
For LLaMa I definitely don't have the computer to build it, and I definitely don't have the money to buy the computer. Even if I won the lottery tomorrow, I'm pretty sure I still wouldn't have enough money to buy the hardware. Even if I had enough money to buy the hardware, I'm still not sure I could actually get it in a reasonable time, since Nvidia may be backlogged for a while. Even if I already had all the hardware, I probably wouldn't want to retrain llama. And even if I wanted to retrain it, the process would probably take weeks if not months at best.
Like, I think it's one of those things where a difference in magnitude creates a difference in kind; one can't quite meaningfully compare LLaMa with the Calculator app that Ubuntu ships with.
It's probably worth playing around with different prompts and different board positions.
For context, this [1] is the board position the model is being prompted on.
There may be more than one weird thing about this experiment; for example, giving instructions to the non-instruction-tuned variants may be counterproductive.
More importantly, let's say you just give the model the truncated PGN: does this look like a position where White is a grandmaster-level player? I don't think so. Even if the model understood chess really well, it's going to try to predict the most probable move given the position at hand. If the model thinks that White is a bad player, and the model is good at understanding chess, it's going to predict bad moves as the more likely ones, because that better predicts what is most likely to happen here.
Apparently I can find some games between very strong players that start like that [1], so my hypothesis that the model may just be predicting bad moves on purpose seems wobbly, although having Stockfish at its lowest level play as the supposedly very strong opponent may still be throwing the model off somewhat. In the charts, the first few moves the model makes seem decent, if I'm interpreting them right, and after a few of those things seem to start going wrong.
Either way it's worth repeating the experiment imo, tweaking some of these variables (prompt guidance, Stockfish strength, starting position, the names of the supposed players, etc.).
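For example, one cheap way to probe the "who does the model think is playing" variable would be to vary the PGN headers while keeping the moves fixed, roughly like the sketch below. `completeWithModel` is a hypothetical stand-in for whatever completion API the experiment actually uses, and the player names/Elos are just placeholders.

```typescript
// Sketch only: probe whether the implied strength of the players changes the
// predicted continuation. The PGN tags (White, Black, WhiteElo, BlackElo) are
// standard; the completion helper below is assumed, not a real library call.

declare function completeWithModel(prompt: string): Promise<string>;

interface PromptVariant {
  white: string;
  black: string;
  whiteElo: number;
  blackElo: number;
}

// Build a PGN prefix: headers plus the movetext so far, ending right before
// the move we want the model to predict.
function buildPrompt(variant: PromptVariant, movesSoFar: string): string {
  return [
    `[Event "Example"]`,
    `[White "${variant.white}"]`,
    `[Black "${variant.black}"]`,
    `[WhiteElo "${variant.whiteElo}"]`,
    `[BlackElo "${variant.blackElo}"]`,
    '',
    movesSoFar,
  ].join('\n');
}

const variants: PromptVariant[] = [
  { white: 'Carlsen, Magnus', black: 'Caruana, Fabiano', whiteElo: 2850, blackElo: 2800 },
  { white: 'Anonymous', black: 'Anonymous', whiteElo: 1200, blackElo: 1200 },
];

// Same moves, different implied strength: compare the first predicted move.
const movesSoFar = '1. e4 e5 2. Nf3 Nc6 3. Bb5 a6 4.';

for (const variant of variants) {
  completeWithModel(buildPrompt(variant, movesSoFar)).then((continuation) => {
    console.log(`${variant.white} vs ${variant.black}:`, continuation.trim().split(/\s+/)[0]);
  });
}
```

If the predicted move differs meaningfully between the two variants, that would at least suggest the implied strength of the players is shaping the predictions.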
Interesting thought: the LLM isn't trying to win, it's trying to produce data like the input data. It's quite rare for a very strong player to play a very weak one. If you feed it lots of weak moves, it'll best replicate the training data by following with weak moves.
The experiment started from the first move of a game, and played each game fully. The position you linked was just an example of the format used to feed the game state to the model for each move.
What would "winning" or "losing" even mean if all of this were against a single move?
Yes, exactly: at some point I asked to maintain it and kinda redid it. Now I kinda consider it "done", as in "maybe some more work will be put into it, but by and large I don't think it's going to change in the future".
Civet has so many quality-of-life improvements! It's good that it exists as a sort of playground for ideas that could maybe be adopted by JS itself in the future, kinda like how it went with CoffeeScript.
This seems a bit different from the kind of signals frameworks have, where dependencies are tracked automatically (no dependency array) and you can sort of chain things automatically: for example, you can have an effect that depends on 3 memos that depend on 4 signals or whatever else, and you never see stale values.
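To make that concrete, here's a minimal sketch of how the automatic tracking can work (just an illustration, not flimsy or any real framework's code, and it skips the cleanup, deduplication, and glitch-free scheduling real implementations handle): reading a signal while an effect is running subscribes that effect, so memos and effects can chain without any dependency arrays.

```typescript
// Toy reactivity: reading a signal inside a running effect subscribes that
// effect automatically, so dependencies never have to be listed by hand.

type EffectFn = () => void;

let currentEffect: EffectFn | null = null;

function createSignal<T>(value: T): [() => T, (next: T) => void] {
  const subscribers = new Set<EffectFn>();
  const get = () => {
    if (currentEffect) subscribers.add(currentEffect); // automatic tracking
    return value;
  };
  const set = (next: T) => {
    value = next;
    for (const effect of [...subscribers]) effect(); // re-run dependents
  };
  return [get, set];
}

function createEffect(fn: EffectFn): void {
  const run = () => {
    const previous = currentEffect;
    currentEffect = run; // anything read inside fn subscribes `run`
    try {
      fn();
    } finally {
      currentEffect = previous;
    }
  };
  run();
}

// A memo is just a signal kept fresh by an effect, so effects can depend on
// memos that depend on signals, arbitrarily deep.
function createMemo<T>(fn: () => T): () => T {
  const [get, set] = createSignal<T>(undefined as unknown as T);
  createEffect(() => set(fn()));
  return get;
}

// Usage: an effect depending on a memo depending on two signals.
const [a, setA] = createSignal(1);
const [b, setB] = createSignal(2);
const sum = createMemo(() => a() + b());
createEffect(() => console.log('sum is', sum())); // logs "sum is 3"
setA(10); // logs "sum is 12"
setB(5);  // logs "sum is 15"
```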
If you want to look a bit deeper into this I had written another sort of toy implementation that much more closely resembles what the frameworks are actually doing: https://github.com/fabiospampinato/flimsy