Hi, regarding this part:
And those short-term preferences can themselves backfire, because the humans will stick around in protected bubbles, and they can be attacked.
I’m not 100% sure I understand; could you elaborate a little? Is the idea that the human overseer’s values could include punishing some out-group, or something else along those lines?
Hi, just saw this thread. I’m curious: what types of mechanisms do you think could lead to a net-negative world?