
But for it to be 100% trustworthy, you'd have to express correctness criteria for those simple tasks as formal statements.


My intuition is that a regular LLM is better at coming up with a correct task description from a fuzzy description than it is at actually solving tasks.


There are a lot of automated proof checkers out there. Presumably you would just run any solution from an AI through one of those.
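To make the two points above concrete: a sketch in Lean 4 of what "expressing correctness criteria as formal statements" looks like for a simple task (sorting). The names `llmSort` and `llmSort_correct` are hypothetical; the point is that the theorem statement is the spec, and a proof checker accepts the solution only once the `sorry` is replaced by an actual proof.

```lean
-- Hypothetical solution an LLM might produce: insertion sort on Nat lists.
def llmSort : List Nat → List Nat
  | [] => []
  | x :: xs => insert x (llmSort xs)
where
  insert (x : Nat) : List Nat → List Nat
    | [] => [x]
    | y :: ys => if x ≤ y then x :: y :: ys else y :: insert x ys

-- The formal correctness criterion: the output is pairwise ordered
-- and is a permutation of the input. This statement, not the code,
-- is what a human must get right.
theorem llmSort_correct (xs : List Nat) :
    (llmSort xs).Pairwise (· ≤ ·) ∧ (llmSort xs).Perm xs := by
  sorry  -- the checker rejects the development until this proof is filled in
```

Writing `llmSort` is the part an LLM might automate; writing `llmSort_correct` faithfully is exactly the "formal statement" burden the parent comment describes.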


And most applied maths doesn't seem to worry much about proofs. Practitioners have techniques that either work pretty well or blow up.


Bridge collapses are a form of proof validation.





