MichaelA🔸 comments on Some thoughts on Toby Ord’s existential risk estimates

MichaelA🔸 Apr 17, 2020, 3:57 AM
2 points
As for what I meant about making the exact same argument in the past: I was just saying that we’ve discovered various risks that don’t have common mini-versions, which at one point were unknown and then became known. Your argument basically rules out discovering such things ever again. Had we listened to your argument before learning about AI, for example, we would have concluded that AI was impossible, or that somehow AIs which have the means and motive to kill 10% of people are more likely than AIs which pose existential threats.
Hmm. I’m not sure I’m understanding you correctly. But I’ll respond to what I think you’re saying.
Firstly, the risk of natural pandemics, of which the Spanish Flu was a strong example, did have “common mini-versions”. In fact, Wikipedia says the Black Death was the “most fatal pandemic recorded in human history”. So I really don’t think we’d have ruled out the Spanish Flu happening by using the sort of argument I’m discussing (which I’m not sure I’d call “my argument”). We’d have seen it as unlikely in any given year, and that would’ve been correct. But I imagine we’d have given it approximately the correct ex ante probability.
Secondly, even nuclear weapons, which I assume is what your reference to 1944 is about, seem like they could fit neatly in this sort of argument. It’s a new weapon, but weapons and wars existed for a long time. And the first nukes really couldn’t have killed everyone. And then we gradually had more nukes, more test explosions, more Cold War events, as we got closer and closer to it being possible for 100% of people to die from it. And we haven’t had 100% die. So it again seems like we wouldn’t have ruled out what ended up happening.
Likewise, AI can arguably be seen as a continuation of past technological, intellectual, scientific, etc. progress in various ways. Of course, various trends might change in shape, speed up, etc. But so far they do seem to have mostly done so somewhat gradually, such that none of the developments would’ve been “ruled out” by expecting the future to look roughly similar to the past or the past+extrapolation. (I’m not an expert on this, but I think this is roughly the conclusion AI Impacts is arriving at based on their research.)
Perhaps a key point is that we indeed shouldn’t say “The future will be exactly like the past.” But instead “The future seems likely to typically be fairly well modelled as a rough extrapolation of some macro trends. But there’ll be black swans sometimes. And we can’t totally rule out totally surprising things, especially if we do very new things.”
This is essentially me trying to lay out a certain way of looking at things. It’s not necessarily the one I strongly adopt. (I actually hadn’t thought about this viewpoint much before, so I’ve found trying to lay it out/defend it here interesting.)
In fact, as I said, I (at least sort-of) disagreed with Tobias’ original comment, and I’m very concerned about existential risks. And I think a key point is that new technologies and actions can change the distributions we’re drawing from, in ways that we don’t understand. I’m just saying it still seems quite plausible to me (and probably likely, though not guaranteed) that we’d see a 5-25%-style catastrophe from a particular type of risk before a 100% catastrophe from it. And I think history seems consistent with that, and that that idea probably would’ve done fairly well in the past.
(Also, as I noted in my other comment, I haven’t yet seen very strong evidence or arguments against the idea that “somehow AIs which have the means and motive to kill 10% of people are more likely than AIs which pose existential threats”, or more specifically, that AI might result in that, whether or not it has the “motive” to do that. It seems to me the jury is still out. So I don’t think I’d use the fact that an argument reaches that conclusion as a point against that argument.)
- kokotajlod Apr 17, 2020, 3:53 PM
  1 point
  Likewise, AI can arguably be seen as a continuation of past technological, intellectual, scientific, etc. progress in various ways. Of course, various trends might change in shape, speed up, etc. But so far they do seem to have mostly done so somewhat gradually, such that none of the developments would’ve been “ruled out” by expecting the future to look roughly similar to the past or the past+extrapolation. (I’m not an expert on this, but I think this is roughly the conclusion AI Impacts is arriving at based on their research.)
  I agree with all this and don’t think it significantly undermines anything I said.
  I think the community has indeed developed more diverse views over the years, but I still think the original take (as seen in Bostrom’s Superintelligence) is the closest to the truth. The fact that the community has gotten more diverse can be easily explained as the result of it growing a lot bigger and having a lot more time to think. (Having a lot more time to think means more scenarios can be considered, more distinctions made, etc. More time for disagreements to arise and more time for those disagreements to seem like big deals when really they are fairly minor; the important things are mostly agreed on but not discussed anymore.) Or maybe you are right and this is evidence that Bostrom is wrong. Idk. But currently I think it is weak evidence, given the above.
- kokotajlod Apr 17, 2020, 3:40 PM
  1 point
  Yeah, in retrospect I really shouldn’t have picked nukes and natural pandemics as my two examples. Natural pandemics do have common mini-versions, and nukes, well, the jury is still out on that one. (I think it could go either way. I think that nukes maybe can kill everyone, because the people who survive the initial blasts might die from various other causes, e.g. civilizational collapse or nuclear winter. But insofar as we think that isn’t plausible, then yeah, killing 10% is way more likely than killing 100%. (I’m assuming we count killing 99% as killing 10% here?))
  I think AI, climate change tail risks, physics risks, grey goo, etc. would be better examples for me to talk about.
  - MichaelA🔸 Apr 18, 2020, 2:17 AM
    3 points
    With nukes, I do share the view that they could plausibly kill everyone. If there’s a nuclear war, followed by nuclear winter, and everyone dies during that winter, rather than most people dying and then the rest succumbing 10 years later from something else or never recovering, I’d consider that nuclear war causing 100% deaths.
    My point was instead that that really couldn’t have happened in 1945. So there was one nuke, and a couple of explosions, and gradually more nukes and test explosions, etc., before there was a present risk of 100% of people dying from this source. So we did see something like “mini-versions” (Hiroshima and Nagasaki, test explosions, the Cuban Missile Crisis) before we saw 100% (which indeed, we still haven’t and hopefully won’t).
    With climate change, we’re already seeing mini-versions. I do think it’s plausible that there could be a relatively sudden jump due to amplifying feedback loops. But “relatively sudden” might mean over months or years or something like that. And it wouldn’t be a total bolt from the blue in any case: the damage is already accruing and increasing, and likely would continue to do so in the lead-up to such tail risks.
    AI, physics risks, and nanotech are all plausible cases where there’d be a sudden jump. And I’m very concerned about AI and somewhat about nanotech. But note that we don’t actually have clear evidence that those things could cause such sudden jumps. I obviously don’t think we should wait for such evidence, because if it came we’d be dead. But it just seems worth remembering that before using “Hypothesis X predicts no sudden jump in destruction from Y” as an argument against hypothesis X.
    Also, as I mentioned in my other comment, I’m now thinking maybe the best way to look at this is that specific arguments in the case of AI, physics risks, and nanotech update us away from the generally useful prior that we’ll see small versions before extreme versions of the same things.