Hacker News
Evaluating Agents (aunhumano.com)
42 points by mfalcon 3 months ago | 9 comments
Building an AI judge for classification tasks (aunhumano.com)
2 points by mfalcon 7 months ago
Building self improving negotiation agents (aunhumano.com)
1 point by mfalcon 9 months ago
The time of evaluation driven development (aunhumano.com)
3 points by mfalcon 10 months ago | 3 comments
