Sure, it's easy to dismiss the value of unaligned AIs if you compare against some idealistic baseline; but I'm asking you to compare against a realistic baseline, i.e. actual human nature.
I haven't read your entire post about this, but I understand you believe that if we created aligned AI, it would get essentially "current" human values, rather than e.g. some improved / more enlightened iteration of human values. If instead you believed the latter, that would set a significantly higher bar for unaligned AI, right?
That's right: if I thought human values would improve greatly in the face of enormous wealth and advanced technology, I'd definitely be open to seeing humans as special and extra valuable from a total utilitarian perspective. Note, though, that many of the routes through which values could improve in the future would apply to unaligned AIs too. So, for example, I'd need to believe that humans would be more likely to reflect, and more likely to do the right type of reflection, relative to the unaligned baseline. In other words, it's not sufficient to argue that humans would reflect a little bit; that wouldn't really persuade me at all.