And the guy who's already argued for airstrikes on datacenters considers that to be good news? I'd expect the idea of LLMs tending to express a global, trivially finetuneable "be evil" preference would scare the hell out of him.
He is less concerned that people can create an evil AI if they want to and more concerned that no person can keep an AI from being evil even if we tried.