Pardon me if this is an obvious reference around here, but what is the source for the "much higher than 50%" risk? My prior is that such percentages are too high to be taken seriously as a rational prediction, but precisely for that reason I'd be interested in having that prior challenged and updating on it.
Last I heard, Nate Soares (at MIRI) had an all-things-considered probability of around 80%, and Evan Hubinger recently gave ~80% as well. Nate's reasoning is here, and he would probably also endorse this list of challenges.
I think you don't have to hold any crazy beliefs to put the probability above 50%, just:

- higher confidence that the core arguments are correct, such that you think there are concrete problems that probably need to be solved to avoid AI takeover;
- a prior that is not overwhelmingly low, despite some previously feared mechanisms for catastrophe, like overpopulation and nuclear war, turning out to be avoidable. The world is allowed to kill you.
- the observation that not much progress has been made on the problem so far, and the belief that this will not massively speed up as we get closer to AGI.

Believing there are multiple independent core problems we don't have traction on, or that some problems are likely to require serial time or multiple attempts that we won't have, can drive this probability higher; the toy calculation below shows how quickly independent requirements compound.
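To make the compounding concrete, here is a minimal sketch (my illustration, not from any of the cited sources): if avoiding takeover requires solving several independent problems, modest per-problem optimism still multiplies out to a high overall risk. The probability 0.6 and the problem counts below are made-up placeholders, not anyone's actual estimates.

```python
# Toy model (illustrative only): suppose avoiding AI takeover requires
# solving k independent core problems, and each gets solved with
# probability p. Then P(avoid takeover) = p**k, so P(doom) = 1 - p**k.

def p_doom(p_solve_each: float, num_problems: int) -> float:
    """P(catastrophe) when every one of `num_problems` independent
    problems must be solved, each with probability `p_solve_each`."""
    return 1.0 - p_solve_each ** num_problems

for k in (1, 2, 3):
    print(f"{k} required problem(s): P(doom) = {p_doom(0.6, k):.2f}")
# 1 required problem(s): P(doom) = 0.40
# 2 required problem(s): P(doom) = 0.64
# 3 required problem(s): P(doom) = 0.78
```

With these made-up numbers, even 60% odds of solving each of three independent problems already lands near the ~80% figures quoted above: success is conjunctive, so ruin is disjunctive.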
See e.g. Yudkowsky's "AGI Ruin: A List of Lethalities". I think at this point Yudkowsky is far from alone in giving it >50% probability, though I expect that view is far less common in academia and among machine learning (capabilities) researchers.
Adding Nate Soares's "AGI ruin scenarios are likely [...]" as another relevant reference.