Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

A big part of the problem with these scraping operations is how poorly implemented they are. They can get a lot cheaper gains by simply cleaning up how they operate, to not redundantly fetch the same documents hundreds of times, and so on.

Regardless of how they solve the challenges, creating an incentive to be efficient is a victory in itself. GPUs aren't cheap either, especially not if you're renting them via a browser farm.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: