Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I don't see how this proves that asking the model about its internal state will reveal its inner high level processes in a human-readable way.

Perhaps there's a research paper which would explain it better?



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: