This seems false, since the construction of AGI is probably an event we can influence. Having aligned AGI should reduce other x-risks permanently.
It could reduce other x-risks, but the hypothesis that it would lower all x-risks to almost zero for the rest of time seems like wishful thinking.
One of the interesting calculations from the paper: if the value of one century is v, and the current risk of extinction each century is 20%, and you invent an AGI that permanently lowers this by half to 10% for the rest of time… you would only increase the expected value of the future from 4*v to 9*v. Definitely a good result, but pretty far from the astronomical result you might expect.
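To spell out where those numbers come from (my reconstruction, assuming you get value v for each century you survive and the per-century extinction risk r stays constant forever):

$$\mathbb{E}[\text{value}] = \sum_{n=1}^{\infty} v\,(1-r)^{n} = v\,\frac{1-r}{r},$$

which gives 0.8/0.2 · v = 4v at r = 20% and 0.9/0.1 · v = 9v at r = 10%.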
What is a plausible source of x-risk that is 10% per century for the rest of time? It seems pretty likely to me that not long after reaching technological maturity, future civilization would reduce x-risk per century to a much lower level, because you could build a surveillance/defense system against all known x-risks, and not have to worry about new technology coming along and surprising you.
It seems that to get a constant 10% per century risk, you’d need some kind of existential threat for which there is no defense (maybe vacuum collapse), or for which the defense is so costly that the public goods problem prevents it from being built (e.g., no single star system can afford it on its own). But the likelihood of such a threat existing in our universe doesn’t seem that high to me (maybe 20%?), which I think upper bounds the long term x-risk.
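Spelled out as an inequality (my reading of the bound, not something argued in the paper): if no indefensible threat exists, a technologically mature civilization can push the per-century risk to near zero, so

$$P(\text{long-run extinction from recurring risks}) \lesssim P(\text{indefensible threat exists}) \approx 0.2.$$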
Curious how your model differs from this.
What does technological maturity mean?
“the attainment of capabilities affording a level of economic productivity and control over nature close to the maximum that could feasibly be achieved.” (Nick Bostrom (2013) ‘Existential risk prevention as global priority’, Global Policy, vol. 4, no. 1, p. 19.)
It would depend on whether the risk from AGI is a one-time risk that goes away once humans figure out alignment, or whether keeping AGI aligned is an ongoing effort.
Alignment may be impossible. As a SWE working in AI, I don’t know of any plausible method for the kind of alignment discussed here.
Risk mitigation is possible, in that we can stack together serial steps that must all fail for the AGI to escape and do meaningful damage, as well as countermeasures (pre-constructed weapons and detection institutions) ready to respond when this happens.
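As a rough sketch of that layered-defense point (an illustration with made-up independence assumptions, not a claim about any particular setup): if an escape attempt must get past n independent safeguards, each failing with probability $p_i$, then per attempt

$$P(\text{escape}) = \prod_{i=1}^{n} p_i,$$

which shrinks fast as you add layers but never reaches zero.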
But the risk remains nonzero and recurring. Each century there is some chance that the AGIs escape human control, and that risk remains as long as “humans” are significantly stupider and less rational than AGI. I don’t know whether “humans augment themselves so much to compete that they are not remotely humanlike” counts as the extinction of humanity or not.