Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I like this proposal! But of course it won't work perfectly, since the RL fine-tuning can be circumvented, as we see in ChatGPT "jailbreaks".


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: