The closest thing that comes to mind is Critch’s work on multi-stakeholder alignment, e.g. What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs).
Here are a couple of other links that come to mind:
https://arxiv.org/abs/2008.02275
https://www.brookings.edu/research/aligned-with-whom-direct-and-social-goals-for-ai-systems/