Huh. If I had a bright idea for AI Safety, I'd share it and expect to get status/credit for doing so.
The idea of hiding any bright alignment research ideas I came up with didn't occur to me.
I'm under the impression that, because of common-sense morals (i.e. I wouldn't deliberately sabotage things for the chance to play hero), selfishly motivated EAs like me don't behave particularly differently in common scenarios.
There are scenarios where my selfishness would show, but they're very, very narrow states of the world and unlikely to materialise (highly contrived and confined to thought-experiment land). In the real world, I don't expect it to be relevant. Ditto for concerns about superrational behaviour: the kind of superrational coordination that's possible for purely altruistically motivated EAs but isn't possible with me is behaviour I don't expect to actually manifest in the real world.
Yeah, the example above of choosing not to get promoted or not to receive funding is a more realistic scenario.
I agree these situations are somewhat rare in practice.
Re. AI Safety, my point was that these situations are especially rare there (among people who agree it's a problem, which is a matter of states of knowledge anyway, not of goals).
Thanks for this post, I think it's a good discussion.