Thanks for the question! I cross-posted it here; Nate Soares replies:
For sure. Itâs tricky to wipe out humanity entirely without optimizing for that in particularânuclear war, climate change, and extremely bad natural pandemics look to me like theyâre at most global catastrophes, rather than existential threats. It might in fact be easier to wipe out humanity by enginering a pandemic thatâs specifically optimized for this task (than it is to develop AGI), but we donât see vast resources flowing into humanity-killing-virus projects, the way that we see vast resources flowing into AGI projects. By my accounting, most other x-risks look like wild tail risks (what if thereâs a large, competent, state-funded successfully-secretive death-cult???), whereas the AI x-risk is what happens by default, on the mainline (humanity is storming ahead towards AGI as fast as they can, pouring billions of dollars into it per year, and by default what happens when they succeed is that they accidentally unleash an optimizer that optimizes for our extinction, as a convergent instrumental subgoal of whatever rando thing itâs optimizing).
I responded hereâcross-posting here for convenience:
Hi, Iâm the user who asked this question. Thank you for responding!
I see your point about how an AGI would intentionally destroy humanity versus engineered bugs that only wipe us out âby accidentâ, but thatâs conditional on the AGI having âdestroy humanityâ as a subgoal. Most likely, a typical AGI will have some mundane, neutral-to-benevolent goal like âmaximize profit by running this steel factory and selling steelâ. Maybe the AGI can achieve that by taking over an iron mine somewhere, or taking over a country (or the world) and enslaving its citizens, or even wiping out humanity. In general, my guess is that the AGI will try to do the least costly/ârisky thing needed to achieve its goal (maximizing profit), and (setting aside that if all of humanity were extinct, the AGI would have no one to sell steel to) wiping out humanity is the most expensive of these options and the AGI would likely get itself destroyed while trying to do that. So I think that âenslave a large portion of humanity and export cheap steel at a hefty profitâ is a subgoal that this AGI would likely have, but destroying humanity is not.
It depends on the use caseâa misaligned AGI in charge of the U.S. Armed Forces could end up starting a nuclear warâbut given how careful the U.S. government has been about avoiding nuclear war, I think theyâd insist on an AGI being very aligned with their interests before putting it in charge of something so high stakes.
Also, I suspect that some militaries (like North Koreaâs) might be developing bioweapons and spending 1 to 100% as much on it annually as OpenAI and DeepMind spend on AGI; we just donât know about it.
Based on your AGI-bioweapon analogy, I suspect that AGI is a greater hazard than bioweapons, but not by quite as much as your argument implies. While few well-resourced actors are interested in using bioweapons, a whoâs who of corporations, states, and NGOs will be interested in using AGI. And AGIs can adopt dangerous subgoals for a wide range of goals (especially resource extraction), whereas bioweapons can basically only kill large groups of people.
Toby Ordâs definition of an existential catastrophe is âanything that destroys humanityâs longterm potential.â The worry is that misaligned AGI which vastly exceeds humanityâs power would be basically in control of what happens with humans, just as humans are, currently, basically in control of what happens with chimpanzees. It doesnât need to kill all of us in order for this to be a very, very bad outcome.
E.g. the enslavement by the steel-loving AGI you describe sounds like an existential catastrophe, if that AGI is sufficiently superhuman. You describe a âlarge portion of humanityâ enslaved in this scenario, implying a small portion remain free â but I donât think this would happen. Humans with meaningful freedom are a threat to the steel-loverâs goals (e.g. they could build a rival AGI) so it would be instrumentally important to remove that freedom.
Do you believe that AGI poses a greater existential risk than other proposed x-risk hazards, such as engineered pandemics? Why or why not?
Thanks for the question! I cross-posted it here; Nate Soares replies:
I responded hereâcross-posting here for convenience:
Toby Ordâs definition of an existential catastrophe is âanything that destroys humanityâs longterm potential.â The worry is that misaligned AGI which vastly exceeds humanityâs power would be basically in control of what happens with humans, just as humans are, currently, basically in control of what happens with chimpanzees. It doesnât need to kill all of us in order for this to be a very, very bad outcome.
E.g. the enslavement by the steel-loving AGI you describe sounds like an existential catastrophe, if that AGI is sufficiently superhuman. You describe a âlarge portion of humanityâ enslaved in this scenario, implying a small portion remain free â but I donât think this would happen. Humans with meaningful freedom are a threat to the steel-loverâs goals (e.g. they could build a rival AGI) so it would be instrumentally important to remove that freedom.