We are clocking in around 50% success rate in this benchmark.
[1] https://github.com/xeol-io/swe-bump-bench