oh54321 comments on All AGI Safety questions welcome (especially basic ones) [~monthly thread]

oh54321 6 Nov 2022 21:53 UTC
1 point
Thanks! I think most of this made sense to me. I’m a bit fuzzy on the fourth bullet. Also, I’m still confused about why a model would even develop a goal other than maximizing its reward function, even if it’s theoretically able to pursue one.