Thanks. I watched Robert Miles’ video which was very helpful. Especially the part where he explains why an AI might want to act in accordance with its base objective in a training environment only to then pursue its mesa objective in the real world.
I’m quite uncertain at this point, but I have a vague feeling that Russell’s second principle (The machine is initially uncertain about what those preferences are) is very important here. It is a vague feeling though...
Thanks. I watched Robert Miles’ video which was very helpful. Especially the part where he explains why an AI might want to act in accordance with its base objective in a training environment only to then pursue its mesa objective in the real world.
I’m quite uncertain at this point, but I have a vague feeling that Russell’s second principle (The machine is initially uncertain about what those preferences are) is very important here. It is a vague feeling though...