
I mentioned this in my post ("given nonlinear activations"), but you're right to point out its importance. The composition of linear functions is always linear. Full stop. However many linear layers you stack, they collapse into a single linear map, so you need at least one nonlinear activation between them to represent anything beyond that.
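To make that concrete, here is a rough sketch in plain Python. The weights `W1` and `W2`, the input `x`, and the helper names are made up for illustration, and biases are left out to keep the maps strictly linear. It checks that two stacked linear layers give the same output as the single matrix `W2 @ W1`, and that putting a ReLU between them breaks a property every linear map has, namely f(x) + f(-x) = 0.

```python
def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def matmul(A, B):
    """Matrix product A @ B for list-of-rows matrices."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def relu(v):
    """Elementwise max(0, v)."""
    return [max(0.0, vi) for vi in v]

# Hypothetical weights and input, chosen only so the arithmetic is easy to follow.
W1 = [[1.0, -1.0], [0.5, 2.0]]   # first layer, 2 -> 2
W2 = [[1.0, 1.0]]                # second layer, 2 -> 1
x = [1.0, 2.0]

# Two stacked linear layers equal one linear layer with weights W2 @ W1.
stacked = matvec(W2, matvec(W1, x))
collapsed = matvec(matmul(W2, W1), x)
linear_gap = max(abs(a - b) for a, b in zip(stacked, collapsed))
print(linear_gap)       # 0.0

# With a ReLU in between, the network is no longer linear:
# a linear map f always satisfies f(x) + f(-x) = 0, and this one does not.
def net(v):
    return matvec(W2, relu(matvec(W1, v)))

neg_x = [-xi for xi in x]
additivity_gap = max(abs(a + b) for a, b in zip(net(x), net(neg_x)))
print(additivity_gap)   # 5.5
```

The same collapse happens with biases included, except the stacked layers then reduce to a single affine map rather than a strictly linear one.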

