The challenge isn’t figuring out some complicated, nuanced utility function that “represents human values”; the challenge is getting AIs to do what it says on the tin—to reliably do whatever a human operator tells them to do.
IMO, this implies we need to design AI systems so that they satisfice rather than maximize: perform a requested task at a requested performance level but no better, and with a requested probability but no more likely.
Like an SLA (service level agreement)!
Not exactly. A typical SLA only contains a lower bound, which would still allow for maximization. The program for a satisficer in the sense I meant would state that the AI system really aims to do no better than requested. So, for example, quantilizers would not qualify, since they might still (by chance) choose the action that maximizes return.
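To make the distinction concrete, here is a minimal sketch in Python. Everything in it is hypothetical for illustration (the toy action set, the `expected_return` estimate, and the function names are mine, not from any proposal above): the satisficer targets the requested performance level exactly, while the quantilizer keeps the maximizing action in its sampling pool and so can still pick it by chance.

```python
import random

def satisficing_choice(actions, expected_return, target):
    """Pick the action whose expected return is closest to the requested
    target level, aiming neither above nor below it."""
    return min(actions, key=lambda a: abs(expected_return(a) - target))

def quantilizer_choice(actions, expected_return, q=0.1):
    """Sample uniformly from the top q-fraction of actions by expected
    return. The true maximizer stays in the sampling pool, so it can
    still be chosen by chance; this is why a quantilizer does not
    qualify as a satisficer in the sense above."""
    ranked = sorted(actions, key=expected_return, reverse=True)
    top = ranked[:max(1, int(len(ranked) * q))]
    return random.choice(top)

if __name__ == "__main__":
    actions = list(range(11))   # toy action set: 0..10
    ret = lambda a: a           # toy expected-return estimate
    print(satisficing_choice(actions, ret, target=7))  # always 7, never 10
    print(quantilizer_choice(actions, ret, q=0.3))     # 8, 9, or 10 at random
```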