Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

People should be training model sizes that fit-and-fill consumer GPUs, ie:

2x 24G - for dual GPU ~ 28B model

1x 24G ~ 14B model

etc.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: