My estimates aren’t low; I think there’s very roughly a 40% chance we’ll die because of AI this century. But here are some reasons why it isn’t higher:
Creating a species vastly more intelligent than yourself seems highly unusual; nothing like it has ever happened before, so there need to be very good arguments for why it’s possible.
One species completely wiping out all other species is also very unusual, so there need to be very good arguments for why that would happen.
Perhaps AGI won’t be utility-maximizing; LLMs don’t seem to be very maximizing. If it has a good model of the world, maybe it’ll understand what we want and just give us that.
Perhaps we’ll convince the world to slow down AI capabilities research and solve alignment in time.
There are good counterarguments to these, which is why my p(doom) is so high, but they still add to my uncertainty.
In response to point 2: if human civilization continued to develop indefinitely without regard for other species, wouldn’t nearly all other species go extinct, except perhaps a select few?
Other species are instrumentally very useful to humans: they provide ecosystem services, food, and sources of material (including genetic material).
On the AI side, it seems possible that a powerful misaligned AGI would find ecosystems and/or biological materials valuable, or that it would be cheaper to use humans for some tasks than machines. I think these factors would raise the odds that some humans (or human-adjacent engineered beings) survive in worlds dominated by such an AGI.
“Without regard for other species” is doing a lot of work there.
That a misaligned AGI would keep humans around because they’re instrumentally useful seems pretty unlikely to me.