Greg_Colbourn comments on Timelines are short, p(doom) is high: a global stop to frontier AI development until x-safety consensus is our only reasonable hope

Greg_Colbourn 13 Oct 2023 8:48 UTC
0 points
2 ∶ 2
I mention Responsible Scaling!
Responsible Scaling policies are deeply flawed; it’s basically an oxymoron when AGI is so close. The danger is already apparent enough (see above) to stop scaling now. The same applies to conditional pauses, trying to predict when we will get dangerous AI, or notice “near-dangerous” AI.
EDIT to add: I’m interested in a response from evhub (or anyone else) to the points raised against Responsible Scaling (see links for more details).
- evhub 13 Oct 2023 18:15 UTC
  10 points
  3 ∶ 0
  Parent
  I guess I’m not really sure what your objection is to Responsible Scaling Policies? I see that there’s a bunch of links, but I don’t really see a consistent position being staked out by the various sources you’ve linked to. Do you want to describe what your objection is?
  
  I guess the closest there is “the danger is already apparent enough” which, while true, doesn’t really seem like an objection. I agree that the danger is apparent, but I don’t think that advocating for a pause is a very good way to address that danger.
  - Greg_Colbourn 13 Oct 2023 21:10 UTC
    4 points
    0 ∶ 0
    Parent
    The consistent position is that further scaling is reckless at this stage; it can’t be done in a “responsible” way, unless you think subjecting the world to a 10-25% risk of extinction is a responsible thing to be doing!
    
    What is a better way of addressing the danger? Waiting for it to get more intense and more apparent by scaling further!? Waiting until a disaster actually happens? Actually pausing, or stopping (and setting an example), rather than just advocating for a pause?
    - evhub 13 Oct 2023 22:01 UTC
      7 points
      1 ∶ 1
      Parent
      Perhaps the crux is related to how dangerous you think current models are? I’m quite confident that we have at least a couple additional orders of magnitude of scaling before the world ends, so I’m not too worried about stopping training of current models, or even next-generation models. But I do start to get worried with next-next-generation models.
      
      So, in my view, the key is to make sure that we have a well-enforced Responsible Scaling Policy (RSP) regime that is capable of preventing scaling unless hard safety metrics are met (I favor understanding-based evals for this) before the next two scaling generations. That means we need to get good RSPs into law with solid enforcement behind them and—at least in very short timeline worlds—that needs to happen in the next few years. By far the best way to make that happen, in my opinion, is to pressure labs to put out good RSPs now that governments can build on.
      - Greg_Colbourn 13 Oct 2023 22:44 UTC
        0 points
        1 ∶ 4
        Parent
        I don’t think the current models are dangerous, but perhaps they could be if used for long enough on improving AI. A couple of orders of magnitude (or a couple of generations) is only a couple of years! This is soon enough to be pushing as hard as we can for a pause right now!
        
        Why try and take it right down to the wire with RSPs? Why over-complicate things? The stakes couldn’t be bigger (extinction). It’s super reckless to not just be saying “It seems quite likely we’re getting to world-ending models in 2-5 years. Let’s not keep going any longer. Let’s just stop now.” The tradeoff [edit: for Anthropic] for a few tens of $Bs of extra profit really doesn’t seem worth it!
        evhub 13 Oct 2023 23:00 UTC
        17 points
        2 ∶ 0
        Parent
        
        This is soon enough to be pushing as hard as we can for a pause right now!
        
        I mean, yes, obviously we should be doing everything we can right now. I just think that a RSP-gated pause is the right way to do a pause. I’m not even sure what it would mean to do a pause without an RSP-like resumption condition.
        
        Why try and take it right down to the wire with RSPs?
        
        Because it’s more likely to succeed. RSPs provides very clear and legible risk-based criteria that are much more plausibly things that you could actually get a government to agree to.
        
        The tradeoff for a few tens of $Bs of extra profit really doesn’t seem worth it!
        
        This seems extremely disingenuous and bad faith. That’s obviously not the tradeoff and it confuses me why you would even claim that. Surely you know that I am not Sam Altman or Dario Amodei or whatever.
        
        The actual tradeoff is the probability of success. If I thought e.g. just advocating for a six month pause right now was more effective at reducing existential risk, I would do it.
        Greg_Colbourn 13 Oct 2023 23:30 UTC
        6 points
        1 ∶ 1
        Parent
        I’m not even sure what it would mean to do a pause without an RSP-like resumption condition.
        Have the resumption condition be a global consensus on an x-safety solution or a global democratic mandate for restarting (and remember there are more components of x-safety than just alignment—also misuse and multi-agent coordination).
        much more plausibly things that you could actually get a government to agree to.
        I think if governments actually properly appreciated the risks, they could agree to an unconditional pause.
        This seems extremely disingenuous and bad faith. That’s obviously not the tradeoff and it confuses me why you would even claim that. Surely you know that I am not Sam Altman or Dario Amodei or whatever.
        Sorry. I’m looking at it at the company level. Please don’t take my critiques as being directed at you personally. What is in it for Anthropic and OpenAI and DeepMind to keep going with scaling? Money and power, right? I think it’s pushing it a bit at this stage to say that they, as companies, are primarily concerned with reducing x-risk. If they were they would’ve stopped scaling already. Forget the (suicide) race. Set an example to everyone and just stop!
        evhub 13 Oct 2023 23:37 UTC
        4 points
        1 ∶ 1
        Parent
        
        Have the resumption condition be a global consensus on an x-safety solution or a global democratic mandate for restarting (and remember there are more components of x-safety than just alignment—also misuse and multi-agent coordination).
        
        This seems basically unachievable and even if it was achievable it doesn’t even seem like the right thing to do—I don’t actually trust the global median voter to judge whether additional scaling is safe or not. I’d much rather have rigorous technical standards than nebulous democratic standards.
        
        I think it’s pushing it a bit at this stage to say that they, as companies, are primarily concerned with reducing x-risk.
        
        That’s why we should be pushing them to have good RSPs! I just think you should be pushing on the RSP angle rather than the pause angle.
        What links here?
        Greg_Colbourn's comment on Timelines are short, p(doom) is high: a global stop to frontier AI development until x-safety consensus is our only reasonable hope by Greg_Colbourn (14 Oct 2023 9:40 UTC; 2 points)
        Greg_Colbourn 14 Oct 2023 9:40 UTC
        2 points
        1 ∶ 1
        Parent
        I’d much rather have rigorous technical standards then nebulous democratic standards.
        Fair. And where I say “global consensus on an x-safety”, I mean expert opinion (as I say in the OP). I expect the public to remain generally a lot more conservative than the technical experts though, in terms of risk they are willing to tolerate.
        I just think you should be pushing on the RSP angle rather than the pause angle.
        The RSP angle is part of the corporate “big AI” “business as usual” agenda. To those of us playing the outside game it seems very close to safetywashing.
        evhub 14 Oct 2023 17:41 UTC
        1 point
        0 ∶ 1
        Parent
        
        The RSP angle is part of the corporate “big AI” “business as usual” agenda. To those of us playing the outside game it seems very close to safetywashing.
        
        I’ve written up more about why I think this is not true here.
        Greg_Colbourn 15 Oct 2023 12:01 UTC
        2 points
        0 ∶ 0
        Parent
        Thanks. I’m not convinced.
- Greg_Colbourn 13 Oct 2023 12:58 UTC
  0 points
  2 ∶ 1
  Parent
  Why are people downvoting my reply without comment, and upvoting evhub’s comment? It’s the most upvoted comment, even though he clearly didn’t even ctrl-F for “Responsible Scaling” / notice that I’d addressed it in the OP!