In their AMA, Moonshot said it was mainly fine-tuning.

teaearlgraycold · 2025-12-14T19:38:27 1765741107

OpenAI and the other big players clearly apply RLHF with users other than professionals in mind. They’re optimizing for sycophancy and general pleasantness. It’s beautiful to finally see a big model that hasn’t been warped in this way. I want a model that is borderline rude in its responses: concise, strict, and as distrustful of me as I am of it.