cubefox on April 14, 2023, on: Prompt injection: what’s the worst that can happen...

I like this proposal! But of course it won't work perfectly, since the RL fine-tuning can be circumvented, as we see in ChatGPT "jailbreaks".