PeterMcCluskey comments on Community Polls on Alignment Controversies

PeterMcCluskey 18 Jun 2026 18:24 UTC
1 point
0 ∶ 0
Partially aligned transformative AIs are likely to be stable under reflection
Work on corrigibility has provided a decent outline of how to do this. My response is heavily dependent on weak guesses as to how diligent AI companies will be at incorporating the best ideas.