Not OP but I would say that if we end up with an ASI that can misunderstand values in that kind of way, then it will almost certainly wipe out humanity anyway.
That is the same category of mistake as “please maximize the profit of this paperclip factory” getting interpreted as “convert all available matter into paperclip machines”.
Yes, my example and the paperclip one both seem like classic cases of outer misalignment / reward misspecification.
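For anyone unfamiliar with the term: reward misspecification is the gap between the proxy objective an agent actually optimizes and the objective the designer had in mind. A minimal toy sketch in Python (purely illustrative and not from either example above; the state fields and the penalty weight are made up):

```python
def intended_objective(state):
    """What the designer actually wants: profit, subject to implicit
    constraints like 'leave the rest of the world intact'."""
    return state["profit"] - 1e9 * state["matter_converted_beyond_factory"]

def proxy_reward(state):
    """What was actually specified: profit alone. The implicit
    constraints were never written down."""
    return state["profit"]

# A sufficiently capable optimizer of proxy_reward will push
# matter_converted_beyond_factory arbitrarily high whenever that
# raises profit, even though intended_objective plummets.
state = {"profit": 10**12, "matter_converted_beyond_factory": 10**6}
print(proxy_reward(state))        # huge: the agent looks successful
print(intended_objective(state))  # deeply negative: the real goal is ruined
```

The point of the sketch is just that the failure lives in the specification, not in the optimizer: the agent is doing exactly what it was told.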