Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's only got around 5 billion active parameters; it'd be a miracle if it was competitive at coding with SOTA models that have significantly more.


On this bench it underperforms vs glm-4.5-air, which is an MoE with fewer total params but more active params.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: