"Train adversarially robust image model" is not a long task imo | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		big-chungus4 61 days ago \| parent \| context \| favorite \| on: Measuring AI Ability to Complete Long Tasks "Train adversarially robust image model" is not a long task imo

leecommamichael 60 days ago [–]

I read their citations (which are actually the same authors of this paper) and they also define using Python's built-in web server to "build a web server" as a long task.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact