LLMs don't work the way you think. In order to be useful, a model would have to ...

baijum · 2025-06-07T10:40:44 1749292844

That's a very fair and critical point. You're right that we can't change the fundamental, probabilistic nature of LLMs themselves.

But that makes me wonder if the goal should be reframed. Instead of trying to eliminate errors, what if we could change their nature?

The interesting hypothesis to explore, then, is whether a language's grammar can be designed to make an LLM's probabilistic errors fail loudly as obvious syntactic errors, rather than failing silently as subtle, hard-to-spot semantic bugs.

For instance, if a language demands extreme explicitness and has no default behaviors, an LLM's failure to generate the required explicit token becomes a simple compile-time error, not a runtime surprise.
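To make that concrete, here is a minimal sketch of the idea in Python. The declaration syntax (`let <mut|const> <name>: <type> = <value>`) and the `parse_decl` helper are hypothetical, invented purely for illustration. The only point is that an omitted required token is rejected at parse time instead of being silently filled in with a default.

```python
import re

# Hypothetical toy grammar: every declaration must spell out mutability and type.
#   let <mut|const> <name>: <int|float> = <number>
# There is deliberately no shorthand form and no default for any part.
DECL = re.compile(r"let (mut|const) ([a-z_]\w*): (int|float) = (-?\d+(?:\.\d+)?)")


def parse_decl(line: str) -> dict:
    """Parse one declaration; raise SyntaxError if any required token is missing."""
    match = DECL.fullmatch(line.strip())
    if match is None:
        # A dropped token fails loudly here instead of picking a default.
        raise SyntaxError(
            f"expected 'let <mut|const> <name>: <type> = <value>', got {line!r}"
        )
    mutability, name, type_name, value = match.groups()
    # The value is kept as raw text; type checking is out of scope for this sketch.
    return {
        "name": name,
        "mutable": mutability == "mut",
        "type": type_name,
        "value": value,
    }
```

With this grammar, `parse_decl("let const retries: int = 3")` succeeds, while `parse_decl("let retries = 3")` raises `SyntaxError`, because the mutability and type tokens were left out. In a more permissive language the second form would typically be accepted with implicit defaults, which is exactly the kind of silent behavior the hypothesis is trying to avoid.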

So while we can't "fix" the LLM's core, maybe we can design a grammar that acts as a much safer "harness" for its output.

dtagames · 2025-06-07T13:47:41 1749304061

I would say we have this language already, too. It's machine code or its cousin, assembler. The processor instructions (machine code) that all software ultimately reduces to are very explicit and have no default values.

The problem is that people don't like writing assembler, which is how we got Fortran in the first place.

The fundamental issue, then, is with the human language side of things, not the programming language side. The LLM is useful because it understands regular English, like "What is the difference between 'let' and 'const' in JS?", which is not something that can be expressed in a programming language.

To get the useful feature we want, natural language understanding, we have to accept the unreliable and predictive nature of the entire technique.

FloatArtifact · 2025-06-09T01:55:26 1749434126

What I've always been confused about is why we can't train LLMs to code without them ever seeing source code.

If it understands human language well enough, it should be able to take the logic laid out in the documentation, map it to symbols, and construct code.

dtagames · 2025-06-09T15:39:03 1749483543

We have this already. You can ask Cursor to go read the doc on syntax it may not have ever seen and write something that conforms. I used this recently to support a new feature in Lit which I'd never seen before and I doubt is in the training set much, if at all.

You can also describe your own app's syntax, architecture, function signatures, etc. in markdown files or just in chat and Cursor will write code that conforms to your desired syntax, which definitely doesn't exist in the training set.

baijum · 2025-06-17T13:53:48 1750168428

This project could be one option for new languages: https://genlm.org/genlm-control/
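For anyone unfamiliar with the general idea behind constrained generation, here is a rough toy sketch in Python. It does not use genlm-control's API. The vocabulary, the fixed-shape "grammar", and the random stand-in for a model are all hypothetical. It only shows the basic mechanism: at each step, tokens the grammar forbids are removed before sampling, so the output is well-formed by construction.

```python
import random

# Hypothetical toy vocabulary and grammar, for illustration only.
VOCAB = ["let", "mut", "const", "x", ":", "int", "=", "1", "<eos>"]

# The "grammar" is just the set of legal tokens at each position.
SHAPE = [{"let"}, {"mut", "const"}, {"x"}, {":"}, {"int"}, {"="}, {"1"}, {"<eos>"}]


def allowed_next(prefix):
    """Return the tokens the toy grammar permits after `prefix`."""
    if len(prefix) >= len(SHAPE):
        return set()
    return SHAPE[len(prefix)]


def toy_model(prefix, rng):
    """Stand-in for an LLM: assigns an arbitrary positive weight to every token."""
    return {tok: 0.01 + rng.random() for tok in VOCAB}


def constrained_sample(rng):
    """Sample token by token, masking out anything the grammar disallows."""
    out = []
    while True:
        weights = toy_model(out, rng)
        legal = [tok for tok in VOCAB if tok in allowed_next(out)]
        # Sample only among legal tokens, in proportion to the model's weights.
        tok = rng.choices(legal, weights=[weights[t] for t in legal])[0]
        if tok == "<eos>":
            return out
        out.append(tok)
```

Calling `constrained_sample(random.Random(0))` always yields a sequence of the form `let <mut|const> x : int = 1`, even though the stand-in model is proposing tokens at random. The model's preferences only decide between `mut` and `const`, the one place the grammar leaves a choice.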

FloatArtifact · 2025-06-09T19:58:27 1749499107

Yes, but that's not how they're primarily trained.