Will Aldred comments on Cooperative AI: Three things that confused me as a beginner (and my current understanding)

Will Aldred 16 Apr 2024 23:10 UTC
2 points
0 ∶ 0
Thanks, I found this post helpful, especially the diagram.
What (if any) is the overlap of cooperative AI […] and AI safety?
One thing I’ve thought about a little is the possiblility of there being a tension wherein making AIs more cooperative in certain ways might raise the chance that advanced collusion between AIs breaks an alignment scheme that would otherwise work.^[1]
1. ^
  I’ve not written anything up on this and likely never will; I figure here is as good a place as any to leave a quick comment pointing to the potential problem, appreciating that it’s but a small piece in the overall landscape and probably not the problem of highest priority.
- C Tilli 26 Apr 2024 12:29 UTC
  3 points
  0 ∶ 0
  Parent
  Thanks—yes I agree, and study of collusion is often included into the scope of cooperative AI (e.g. methods for detecting and preventing collusion between AI models is among the priority areas of our current grant call at Cooperative AI Foundation).
  - Will Aldred 26 Apr 2024 21:28 UTC
    3 points
    0 ∶ 0
    Parent
    Oh, interesting, thanks for the link—I didn’t realize this was already an area of research. (I brought up my collusion idea with a couple of CLR researchers before and it seemed new to them, which I guess made me think that the idea wasn’t already being discussed.)