one week in: opus 4.7 vs 4.6 - worse one shot rate, double the retries
I spent some time few days back comparing Opus 4.6 and 4.7 using my own usage data - just to see how they actually behave side by side. [https://github.com/getagentseal/codeburn](https://github.com/getagentseal/codeburn) it’s still pretty early for 4.7, but a few things surprised me. In my sessions, 4.7 gets things right on the first try less often than 4.6. One-shot rate sits around 74.5% vs 83.8%, and I’m seeing roughly double the retries per edit (0.46 vs 0.22). It also produces a lot more output per call - about 800 tokens vs 372 on 4.6 - which makes it noticeably more expensive. ...