tobycrisford 🔸 comments on GiveWell’s AI red-teaming limitations aren’t a model problem — they’re an architecture problem

tobycrisford 🔸 31 Mar 2026 19:14 UTC
1 point
0 ∶ 0
It’s really cool that you’ve done this and released the code!
Am I understanding right that the givewell baseline you’re trying to beat used GPT, while your approach uses Claude? How can you be sure that the improvements aren’t due to the model choice, rather than the architecture?
- Tsondo 8 Apr 2026 17:48 UTC
  1 point
  0 ∶ 0
  Parent
  If you read my blog post, I go into detail about why this is not a model issue. It’s about how you frame the question much more than what the model contains. For this purpose any decent model would have had the same result. The main benefit that Claude gives is direct in terminal code writing and execution.