All the code generation tools, StarCoder included, still have hallucinations. In...

theaiquestion · on May 16, 2023

> All the code generation tools, StarCoder included, still have hallucinations.

This also includes humans. We "hallucinate" in very similar ways. For example mistaking localhost:8080 for localhost:8008 in a large config file. Attempting to use methods that were deprecated and no longer exist, etc.

IMO there's two ways to prevent this is - one is to make better performing models (architecture/training data/training amount/etc)

The other is the exact same as humans. Compile time tools that let it know immediately if it hallucinated, types, linting, tests, etc.

You just do it as a loop the exact same as a human. You write code, the compiler tells you that method doesn't exist, you adjust your code/consult the documents (also doable with agents).

arthurcolle · on May 16, 2023

Verification systems that then feed back into the models and correct hallucinations. It is slow but I think that's the only real way forward

DavidKarlas · on May 16, 2023

Sounds same as when I started codding, hallucinated some code, compiler told me it is non-sense, and eventually I understood most of the rules... I still mess it up...

arthurcolle · on May 16, 2023

If you keep it up, eventually you get good