
What's their trick for doing voice recognition on something so small, when something like OpenAI's Whisper requires torch, which is almost a gig in size?


Context, context, and context!! Then deep learning tricks and an efficient implementation.

[1] https://picovoice.ai/blog/end-to-end-intent-inference-from-s...


There are Whisper TFLite ports with a model size of about 40 MB, and TFLite itself is about 3 MB. So nowhere near a gig. https://github.com/usefulsensors/openai-whisper


Thank you!



