Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

In their AMA moonshot said it was mainly finetuning


OpenAI and the other big players clearly RLHF with different users in mind than professionals. They’re optimizing for sycophancy and general pleasantness. It’s beautiful to finally see a big model that hasn’t been warped in this way. I want a model that is borderline rude in its responses. Concise, strict, and as distrustful of me as I am of it.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: