aidan_mclau's comments

aidan_mclau · on June 14, 2024

Hey! Essay author here.

>The cool thing about using modern LLMs as an eval/policy model is that their RLHF propagates throughout the search.

>Moreover, if search techniques work on the token level (likely), their thoughts are perfectly interpretable.

I suspect a search world is substantially more alignment-friendly than a large model world. Let me know your thoughts!

Tepix · on June 15, 2024

Your webpage is broken for me. The page appears briefly, then there's a French error message telling me that an error occurred and I can retry.

Mobile Safari, phone set to French.

abid786 · on June 15, 2024

I'm in the same situation (mobile Safari, French phone), but if you use Chrome it works.

Tepix · on June 17, 2024

It fixed itself (?)