Huh, no, I almost entirely agree with this post, as I noted in my prior comment. I cited this much earlier: “More generally, I think I basically endorse the views here (which discusses the questions of when you should cede power etc.).”
I do think unaligned AI would be morally valuable. (I said in an earlier comment that unaligned AI which takes over might capture 10-30% of the value. That’s a lot of value.)
That said, I’m not perfectly happy with unaligned AI. I’d prefer we try to align AIs, just as Paul Christiano does.
I think we’ve probably been talking past each other. I thought the whole argument here was “how much value do we lose if (presumably misaligned) AI takes over?”, and that you were arguing “not much; caring about this seems like overly fixating on humanity” while I was arguing “(presumably misaligned) AIs which take over probably result in substantially less value”. That now seems incorrect, and perhaps we only have minor quantitative disagreements?
I think it probably would have helped if you had been more quantitative here. Exactly how much of the value?
I thought the whole argument here was “how much value do we lose if (presumably misaligned) AI takes over”
I think the key question here is: compared to what? My position is that we lose a lot of potential value both from delaying AI and from having unaligned AI, but neither is a crazy-high reduction. In other words, they’re pretty comparable in terms of lost value.
Ranking the options in rough order (taking up your offer to be quantitative):
Aligned AIs built tomorrow: 100% of the value from my perspective
Aligned AIs built in 100 years: 50% of the value
Unaligned AIs built tomorrow: 15% of the value
Unaligned AIs built in 100 years: 25% of the value
Note that I haven’t thought about these exact numbers much; treat them as rough orderings rather than precise estimates.
What drives this huge drop from 100% to 50%? A naive utility calculation would put the delayed-but-aligned case very close to 100%. (Or do you mean “aligned AIs built in 100 years, conditional on humanity still existing at that point”, which would fold in extinction risk before 2123?)
I attempted to explain the basic intuitions behind my judgment in this thread. Unfortunately, it seems I did a poor job. For the full explanation, you’ll have to wait until I write a post, if I ever get around to doing that.
The simple, short, and imprecise explanation is: I don’t value humanity as a species nearly as much as I value the people who currently exist, (something like) our current communities and relationships, our present values, and the existence of sentient and sapient life having positive experiences. Much of that will be gone after 100 years.