Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Depends on the training? If there was eg RLHF then those connections are stronger and more likely; that's a difference (but not a category difference).


Yes, but I thought we're talking about category difference.

Proper RLHF surely boosts "predicted next token until it couldn't" to feel more like "actually recalled".




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: