Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You are effectively describing SimpleQA but with a single question instead of a comprehensive benchmark and you can note the dramatic increase in performance there.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: