JackM comments on Nobody’s on the ball on AGI alignment

JackM 17 Apr 2023 23:20 UTC
2 points
0 ∶ 0
Thanks. I watched Robert Miles’ video which was very helpful. Especially the part where he explains why an AI might want to act in accordance with its base objective in a training environment only to then pursue its mesa objective in the real world.
I’m quite uncertain at this point, but I have a vague feeling that Russell’s second principle (The machine is initially uncertain about what those preferences are) is very important here. It is a vague feeling though...