
I read that as it runs in data centers (H100 GPUs) or high-end desktops/laptops (Strix Halo?).


I'm running it on a ROG Flow Z13 (128GB, Strix Halo) and getting 50 tok/s for the 20B model and 12 tok/s for the 120B model. I'd say it's pretty usable.


Excellent! I have a Framework Desktop with 128GB on preorder—really looking forward to getting it.



