Ben_West🔸 comments on Reflections on Anthropic and EA

Ben_West🔸 11 May 2026 0:29 UTC
12 points
3 ∶ 0
> People seem surprised and bewildered when AI folks defect away from AI safety towards capabilities. People trust that as AI companies grow, those gaining power and money from shares will not be adversely influenced by that power and money.
fwiw I don’t actually know many examples of this, and the ones I hear cited often seem uncompelling to me. E.g.:
- Greg Brockman doesn’t seem like a true believer in OpenAI’s nonprofit mission who got corrupted but rather someone who went into it wanting to make a profit
- Mechanize’s founders don’t seem like EAs who got corrupted by AI money but rather EAs with unusual moral and empirical views which result in them thinking that the best course of action is the exact opposite of what most EAs think
(Counterexamples appreciated, though!)
- Marcus Abramovitch 🔸 11 May 2026 8:00 UTC
  9 points
  1 ∶ 0
  I think he would include a lot of people who work at Anthropic on, for example, pre-training, some of whom went through MATS or a similar program.
  - Ben_West🔸 11 May 2026 18:26 UTC
    6 points
    1 ∶ 0
    Thanks! I only know a handful of people in this category, but for what it’s worth, they again seem like people who were predisposed to think that working on pretraining would be okay, rather than people who were “corrupted.”
    E.g., I recently talked to someone who told me that their main takeaway from a safety fellowship was realizing that they didn’t fit in because they actually weren’t worried about existential risk in the same way that the other attendees were.
- calebp 11 May 2026 23:17 UTC
  6 points
  1 ∶ 0
  Hmm, I think if smart EA/Rat types get “corrupted” in general, they’ll present as thoughtful people with reasons that are hard to dismiss quickly when questioned by EAs. I get the vague sense that your evidence bar for “corruption” is going to be too high to be useful in most worlds where there’s a lot of corruption.
  
  (That’s not to say that EAs/Rats/etc. who join labs or start wildly profitable companies speeding up AI progress have been “corrupted”; I just think that if they were, it would present pretty similarly to how it has, and it would be hard to get lots of easy-to-share evidence.)