hardware progress has slowed down considerably on various measures
I don’t think this matters, as per the next point about there already being enough compute for doom [Edit: I’ve relegated the “nowhere near close to the physical limits to computation” sentence to a footnote and added Magnus’ reference on slowdown to it].
That actors can afford to create this next generation of AIs does not imply that those AIs will in turn lead to a hard takeoff in capabilities. From my perspective at least, that seems like an unargued assumption here.
I think the burden of proof here needs to shift to those willing to gamble on the safety of 100x larger systems. All I’m really saying here is that the risk is way too high for comfort (given the jumps in capabilities we’ve seen so far going from GPT-3 -> GPT-3.5 -> GPT-4).
[Meta: would appreciate separate points being made in separate comments]. Will look into your links re data and respond later.
from a perspective concerned with the reduction of s-risks, one could argue that talking politely to, and working with, leading AI companies is in fact the most responsible thing to do, and that taking a less cooperative stance is unduly risky and irresponsible.
I’m not sure what you are saying here? Do you think there is a risk of AI companies deliberately causing s-risks (e.g. releasing a basilisk) if we don’t play nice!? They may be crazy in a sense of being reckless with the fate of billions of people’s lives, but I don’t think they are that crazy (in a sense of being sadistically malicious and spiteful toward their opponents)!
I’m not sure what you are saying here? Do you think there is a risk of AI companies deliberately causing s-risks (e.g. releasing a basilisk) if we don’t play nice!?
No, I didn’t mean anything like that (although such crazy unlikely risks might also be marginally better reduced through cooperation with these actors). I was simply suggesting that cooperation could be a more effective way to reduce risks of worst-case outcomes that might occur in the absence of cooperative work to prevent them, i.e. work of the directional kind gestured at in my other comment (e.g. because ensuring the inclusion of certain measures to avoid worst-case outcomes has higher EV than does work to slow down AI). Again, I’m not saying that this is definitely the case, but it could well be. It’s fairly unclear, in my view.
Ok. I don’t put much weight on s-risks being a likely outcome. Far more likely seems to be just that the solar system (and beyond) will be arranged in some (to us) arbitrary way, and all carbon-based life will be lost as collateral damage.
Although I guess if you are looking a bit nearer term, then s-risk from misuse could be quite high. But I don’t think any of the major players (OpenAI, DeepMind, Anthropic) are really working on preventing misuse as part of their strategy at all (their core AI Alignment work is on aligning the AIs, rather than the humans using them!). So actually, this is just another reason to shut it all down.
Thanks for your reply, Greg :)
That is what I did not find adequately justified or argued for in the post.
I suspect that a different framing might be more realistic and more apt from our perspective. In terms of helpful actions we can take, I more see the choice before us as one between trying to slow down development vs. trying to steer future development in better (or less bad) directions conditional on the current pace of development continuing (of course, one could dedicate resources to both, but one would still need to prioritize between them). Both of those choices (as well as graded allocations between them) seem to come with a lot of risks, and they both strike me as gambles with potentially serious downsides. I don’t think there’s really a “safe” choice here.
All I’m really saying here is that the risk is way too high for comfort
I’d agree with that, but that seems different from saying that a fast software-driven takeoff is the most likely scenario, or that trying to slow down development is the most important or effective thing to do (e.g. compared to the alternative option mentioned above).
both strike me as gambles with potentially serious downsides.
What are the downsides from slowing down? Things like not curing diseases and ageing? Not eliminating wild animal suffering? I address that here: “it’s a rather depressing thought. We may be far closer to the Dune universe than the Culture one (the worry driving a future Butlerian Jihad will be the advancement of AGI algorithms to the point of individual laptops and phones being able to end the world). For those who may worry about the loss of the “glorious transhumanist future”, and in particular, radical life extension and cryonic reanimation (I’m in favour of these things), I think there is some consolation in thinking that if a really strong taboo emerges around AGI, to the point of stopping all algorithm advancement, we can still achieve these ends using standard supercomputers, bioinformatics and human scientists. I hope so.”
To be clear, I’ll also say that it’s far too late to only steer future development better. For that, Alignment needs to be 10 years ahead of where it is now!
a fast software-driven takeoff is the most likely scenario
I don’t think you need to believe this to want to be slamming on the brakes now. As mentioned in the OP, is the prospect of mere imminent global catastrophe not enough?
I’d again prefer to frame the issue as “what are the downsides from spending marginal resources on efforts to slow down?” I think the main downside, from this marginal perspective, is opportunity costs in terms of other efforts to reduce future risks, e.g. trying to implement “fail-safe measures”/”separation from hyperexistential risk” in case a slowdown is insufficiently likely to be successful. There are various ideas that one could try to implement.
In other words, a serious downside of betting chiefly on efforts to slow down over these alternative options could be that these s-risks/hyperexistential risks would end up being significantly greater in counterfactual terms (again, not saying this is clearly the case, but, FWIW, I doubt that efforts to slow down are among the most effective ways to reduce risks like these).
a fast software-driven takeoff is the most likely scenario
I don’t think you need to believe this to want to be slamming on the brakes now.
Didn’t mean to say that that’s a necessary condition for wanting to slow down. But again, I still think it’s highly unclear whether efforts that push for slower progress are more beneficial than alternative efforts.
I think it’s a very hard sell to try and get people to sacrifice themselves (and the whole world) for the sake of preventing “fates worse than death”. At that point most people would probably just be pretty nihilistic. It also feels like it’s not far off basically just giving up hope: the future is, at best, non-existence for sentient life; but we should still focus our efforts on avoiding hell. Nope. We should be doing all we can now to avoid having to face such a predicament! Global moratorium on AGI, now.
I think it’s a very hard sell to try and get people to sacrifice themselves (and the whole world) for the sake of preventing “fates worse than death”.
I’m not talking about people sacrificing themselves or the whole world. Even if we were to adopt a purely survivalist perspective, I think it’s still far from obvious that trying to slow things down is more effective than is focusing on other aims. After all, the space of alternative aims that one could focus on is vast, and trying to slow things down comes with non-trivial risks of its own (e.g. risks of backlash from tech-accelerationists). Again, I’m not saying it’s clear; I’m saying that it seems to me unclear either way.
We should be doing all we can now to avoid having to face such a predicament!
But, as I see it, what’s at issue is precisely what the best way is to avoid such a predicament, i.e. how best to navigate given our current, all-too-risky situation.
FWIW, I think that a lot of the discussion around this issue appears strongly fear-driven, to such an extent that it seems to get in the way of sober and helpful analysis. This is, to be sure, extremely understandable. But I also suspect that it is not the optimal way to figure out how to best achieve our aims, nor an effective way to persuade readers on this forum. Likewise, I suspect that rallying calls along the lines of “Global moratorium on AGI, now” might generally be received less well than would, say, a deeper analysis of the reasons for and against attempts to institute that policy.
I feel like I’m one of the main characters in the film Don’t Look Up here.
the space of alternative aims that one could focus on is vast
Please can you name 10? The way I see it is: either alignment is solved in time with business as usual[1], or we Pause to allow time for alignment to be solved (or establish its impossibility). It is not a complicated situation. No need to be worrying about “fates worse than death” at this juncture.
[1] Seems highly unlikely, but please say if you think there are promising solutions here.