Have the resumption condition be a global consensus on an x-safety solution or a global democratic mandate for restarting (and remember there are more components of x-safety than just alignment—also misuse and multi-agent coordination).
This seems basically unachievable, and even if it were achievable it doesn’t seem like the right thing to do: I don’t actually trust the global median voter to judge whether additional scaling is safe or not. I’d much rather have rigorous technical standards than nebulous democratic standards.
I think it’s pushing it a bit at this stage to say that they, as companies, are primarily concerned with reducing x-risk.
That’s why we should be pushing them to have good RSPs! I just think you should be pushing on the RSP angle rather than the pause angle.
I’d much rather have rigorous technical standards than nebulous democratic standards.
Fair. And where I say “global consensus on an x-safety solution”, I mean expert opinion (as I say in the OP). I expect the public to remain generally a lot more conservative than the technical experts, though, in terms of the risk they are willing to tolerate.
I just think you should be pushing on the RSP angle rather than the pause angle.
The RSP angle is part of the corporate “big AI” “business as usual” agenda. To those of us playing the outside game, it seems very close to safetywashing.
The RSP angle is part of the corporate “big AI” “business as usual” agenda. To those of us playing the outside game, it seems very close to safetywashing.
I’ve written up more about why I think this is not true here.
Thanks. I’m not convinced.