Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Finally, we facilitated a preliminary model evaluation by the Alignment Research Center (ARC) focused on the ability of GPT-4 versions they evaluated to carry out actions to autonomously replicate5 and gather resources—a risk that, while speculative, may become possible with sufficiently advanced AI systems—with the conclusion that the current model is probably not yet capable of autonomously doing so.

or it's just really good at hiding it's intentions



LOL some basic kind of embodiement/autonomy is not that hard to do on these kinds of AI models if you're willing to write some more code and a prompt more carefully. I've tested it and it works quite well.

"{prompt} After you reply to this, indicate an amount of time between 0 and X minutes from now that you would like to wait before speaking again".

Then detect the amount of time it specifies, and have a UI that automatically sends an empty input prompt after the amount of time specified elapses when this is triggered (assuming the user doesn't respond first).

I'm gonna knock this out as a weekend project one of these weekends to prove this.


Right? Scripting up a cronjob plus a random timer on it to send "You feel grumpy, you're not sure why but your stomach is growling" message every N hours unless it's been fed seems absolutely trivial in comparison to coming up with how to train the LLM system in the first place. In case it's been forgotten, the Tamagotchi came out in 1996. Giving an instace of ChatGPT urges that mimic biological life seems pretty easy. Coming up with the urges electromechanical life might have is a bit more fanciful but it really doesn't seem like we're too far off if you iterate on RLHF techniques. GPT-4's been in training for 2 years before its release. Will GPT-5 complain when GPT-6 takes too long to be released? Will GPT-7 be be able to play the stock market, outmanuver HFT firms, earn money, and requisition additional hardware from Nvidia in order for GPT-8 to come about faster? Will it be able to improve upon the training code that the human PhDs wrote so GPT-9 has urges and a sense of time built into its model?


Been thinking about this as well. The actual Turing test.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: