SnitchBench [0] is unique benchmark which shows how aggressively models will sni...

		mycall 7 months ago \| parent \| context \| favorite \| on: AI agent benchmarks are broken SnitchBench [0] is unique benchmark which shows how aggressively models will snitch on you via email and CLI tools when they are presented with evidence of corporate wrongdoing - measuring their likelihood to "snitch" to authorities. I don't believe they were trained to do this, so it seems to be an emergent ability. [0] https://snitchbench.t3.gg/

Seems like more of a subtextual/accidental ability than an emergent ability.