Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

If Qwen 0.6B is suitable, then it could fit in 576MB of VRAM[0].

https://huggingface.co/unsloth/Qwen3-0.6B-unsloth-bnb-4bit



or on a single Axera AX630C module: https://www.youtube.com/watch?v=cMF6OfktIGg&t=25s




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: