Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
CrayEye now supports local/FOSS models – multimodal vision multitool (github.com/alexdredmon)
1 point by anais9 on May 20, 2024 | hide | past | favorite | 1 comment


I (or more accurately A.I.) made the FOSS mobile app https://www.crayeye.com to make it easier to experiment with multimodal vision prompts augmented by device data (e.g. location, date/time).

While this tool still uses GPT-4v / GPT-4o as its default, it now supports configuring custom engines (via OpenAPI spec) which can point to any API/model - this has been tested using Llava (and Bakllava) running locally via Ollama.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: