Yes, my example and the paperclip one both seem like a classic case of outer misalignment /β reward misspecification.
Yes, my example and the paperclip one both seem like a classic case of outer misalignment /β reward misspecification.