I want to register that my perspective on medium-term[1] AI existential risk (shortened to AIXR from now on) has changed quite a lot this year. Currently, I'd describe it as moving from "Deep Uncertainty" to "risk is low in absolute terms, but high enough to be concerned about". At the moment I'd say my estimates are moving closer toward the Superforecasters in the recent XPT report (though I'm still Deeply Uncertain on this issue, to the extent that I don't think the probability calculus is that meaningful to apply).
Some points around this change:
I'm not sure it's meaningful to cleanly distinguish AIXR from other anthropogenic x-risks, especially since negative consequences of AI may plausibly increase other x-risks (e.g. nuclear war, biosecurity, climate change, etc.).
I think in practice the most likely risks from AI would come from the deployment of powerful systems that have catastrophic consequences and are then rolled back. I'm thinking of Bing "Sydney" here as the canonical empirical case.[2] I just don't believe we're going to get no warning shots.
Similarly, most negative projections of AI don't take into account negative social reaction and systematic human response to these events. These projections either assume we'll get no warning shots and then get exterminated ("sharp left turn") or that humanity is doomed not to co-operate ("Moloch"). I think the evidence suggests instead that societies and governments will react against increasing AI capability if they view it negatively, rather than simply stand by and watch it happen, which is what many AIXR arguments seem to assume, or at least imply to me.
I think the AIXR community underestimated the importance of AI governance and of engaging with politics. Instead of politicians and the public "not getting it", they in fact seem to have "got it". The strategy of only letting people vetted as smart enough think about AIXR seems to have been flawed; it's the same kind of thinking that led to the belief that a "pivotal act" strategy was viable.
In fact, I've been pleasantly surprised by how welcoming both politicians and the public have been to the Overton Window being opened. It turns out the median voter in a liberal democracy doesn't like "let large corporations create powerful models, with little understanding of their consequences, without significant oversight". I think people's expectations of positive co-ordination on AI issues should have gone up this year, and correspondingly your AIXR estimate should go down, unless you think relative alignment progress has declined even more.
While there have been an awful lot of terrible arguments against AIXR raised this year, some have been new and valuable to me. Some examples:
As titotal has written, I expect "early AGI" to be a "buggy mess". More specifically, I doubt it could one-shot "take over the world" unless you grant it god-like capability by assumption, rather than "more capable than humans in some important ways, but not invincible".
Progress via "stack moar layers lol" will plausibly slow down, or at least run into some severe issues.[3] The current hype-wave seems to just be following this as far as it will go with compute and data, rather than exploring alternative architectures that could be much more efficient.
I'm still unimpressed by how frontier systems perform on tests such as the ARC Challenge, which test creative hypothesis generation and testing in a few-shot setting (and on hidden data) without "ground truth", as opposed to training on trillions-upon-trillions of training examples with masks and true labels. (I've sketched a toy example of this kind of task below this list.)
Related to the above, I view creativity and explanation as critical to science and the progress of science, so I'm not sure what "automation of science" would actually look like. It makes me very sceptical of claims like Davidson's, with large numbers like "1000x capability" (edit: it's actually 1000x in a year! Is that a one-time increase or perpetual explosive growth? Either way it seems way too strong a claim).[4]
Progress in AI Safety has empirically been correlated, at least weakly, with increases in capability. I didn't agree with everything in Nora's recent post, but I think she's right in her assessment of a fully theoretical approach to progress (such as MIRI's strategy, as far as I can tell).
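To make the ARC point above concrete, here's a made-up toy task in the spirit of the ARC format (a handful of demonstration input/output grids plus a held-out test input). It's purely illustrative and not an actual ARC puzzle: the whole "training set" for the rule is two examples, so the solver has to generate and check a hypothesis rather than fit to millions of labelled instances.

```python
# Toy illustration of an ARC-style few-shot task (not a real ARC puzzle).
# The solver sees two demonstration pairs and must induce the transformation,
# then apply it to a test input for which it never sees a label.

task = {
    "train": [
        {"input": [[1, 0], [0, 0]], "output": [[0, 1], [0, 0]]},
        {"input": [[0, 0], [2, 0]], "output": [[0, 0], [0, 2]]},
    ],
    "test": {"input": [[3, 0], [0, 0]]},  # intended answer: [[0, 3], [0, 0]]
}

def candidate_rule(grid):
    """One hypothesis: mirror each row left-to-right."""
    return [list(reversed(row)) for row in grid]

# Check the hypothesis against the few demonstrations available...
assert all(candidate_rule(pair["input"]) == pair["output"] for pair in task["train"])

# ...and only then commit to an answer on the held-out test input.
print(candidate_rule(task["test"]["input"]))  # [[0, 3], [0, 0]]
```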
In summary, AIXR doesn't seem to come out particularly strongly to me on the I-T-N framework relative to other existential and catastrophic risks. It may be "more important", but it seems similarly intractable in the way these global problems generally are, and it's definitely not as neglected any more (in terms of funding, where does it stand relative to nuclear or bio risk, for example? It would be interesting to know).
However, I still think x-risk and catastrophic risk from AI this century are unjustifiably high. I just don't think it's plausible to hold a pDoom of ~>90% on current evidence unless you have private insight.
I think the main existential/catastrophic issues around AI in the foreseeable future revolve around political institutions and great-power conflict, rather than humans being wiped out agentically (either deceptively by a malicious power-seeker or unintentionally by an idiot-savant).
Anyway, these thoughts aren't fully worked out; I'm still exploring what I think on this issue, but I wanted to register where my current thinking is at in case it helps others in the community.
Say on a ~50-year timescale, or out to the end of the century.
Clarification: I'm not saying "Sydney" had catastrophic consequences, but that a future system could be released in a similar way due to internal corporate pressures, and that system could then act negatively in the real world in a way its developers did not expect.
Btw, xuan's Twitter is one of the best accounts I know of for insights into the state of AI. Sometimes I agree, sometimes I don't, but her takes are always legit.
What does it even mean to say 1,000x capability, especially in terms of science? Is there a number here that Davidson is tracking? When would he view it as falsified?
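For what it's worth, here's my own back-of-the-envelope arithmetic (not something taken from Davidson's report) on what "1,000x in a year" would imply if it described a smooth growth rate rather than a one-off jump:

```python
# Rough arithmetic on the "1,000x in a year" figure, assuming (perhaps wrongly)
# that it means smooth exponential growth over the year rather than a single jump.
import math

factor_per_year = 1000
monthly_factor = factor_per_year ** (1 / 12)                          # ~1.78x per month
doubling_time_days = 365 * math.log(2) / math.log(factor_per_year)    # ~37 days
two_year_factor = factor_per_year ** 2                                # 1,000,000x if sustained

print(f"{monthly_factor:.2f}x per month")
print(f"doubling every {doubling_time_days:.0f} days")
print(f"{two_year_factor:,}x after two years if the rate persisted")
```

Which is why the one-time vs. perpetual distinction matters: sustained, the rate compounds to a million-fold increase within two years, whereas a one-off jump is a much weaker (though still very strong) claim.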
This seems like a very sensible and down-to-earth analysis to me, and I'm a bit sad I can't seem to bookmark it.
Thanks :) I might do an actual post at the end of the year. In the meantime I just wanted to get my ideas out there, as I find it incredibly difficult to actually finish any of the many Forum drafts I have.
Do the post :)
I agree this feels like plenty for a post to me, but we all have different thresholds, I guess!
"AI may plausibly increase other x-risks (e.g. nuclear war, biosecurity, climate change, etc.)"
I'm extremely surprised to see climate change listed here. Could you explain?
Honestly, I just wrote a list of potential x-risks to give a rough reference class. It wasn't meant to be a specific claim, just examples for the quick take!
I guess climate change might be less of an existential risk in and of itself (per Halstead), but there might be interplays between risks that increase their combined risk (I think Ord talks about this in The Precipice). I'm also sympathetic to Luke Kemp's view that we should really just care about overall x-risk, regardless of cause area, as extinction by any means would be as bad for humanity's potential.[1]
I think it's plausible to consider x-risk from AI higher than that from climate change over the rest of this century, but my position at the moment is that this looks more like 5% vs 1%, or 1% vs 0.01%, than 90% vs 0.001%. As I said, though, I'm not sure trying to put precise probability estimates on this is that useful.
I definitely accept the general point that it'd be good to be more specific with this language in a front-page post, though.
Though not necessarily as bad for those alive at the time; some extinctions may well be a lot worse than others there.
My point is that even though AI is responsible for some amount of carbon emissions, I'm struggling to find a scenario where it's a major driver of global warming, as AI can also help provide solutions here.
(Oh, my point wasn't that climate change couldn't be an x-risk, though that has been disputed, but more that I don't see the pathway for AI to exacerbate climate change.)
I would take the proposal to be AI -> growth -> climate change, or other negative side effects of growth.
"It makes me very sceptical of claims like Davidson's, with large numbers like '1000x capability' (edit: it's actually 1000x in a year! Is that a one-time increase or perpetual explosive growth? Either way it seems way too strong a claim)."
I was wondering why he said that, since I've read his report before and that didn't come up at all. I suppose a few scattered recollections I have are:
Tom would probably suggest you play around with the takeoffspeeds playground to gain a better intuition (I couldn't find anything 1,000x-in-a-year-related at all, though).
Capabilities takeoff speed ≠ impact takeoff speed (Tom: "overall I expect impact takeoff speed to be slower than capabilities takeoff, with the important exception that AI's impact might mostly happen pretty suddenly after we have superhuman AI").