Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Would you pay for a tool that executes LLM code to verify it?
2 points by ZOdex 2 days ago | hide | past | favorite | discuss
m building a wrapper that queries GPT-4, Claude, and Gemini, then executes their code in a sandbox to catch hallucinations.

Is the latency (30s) worth the certainty? Or do you prefer speed?

I'm running manual tests for people today if anyone wants to try it.





Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: