I’m skeptical of your analysis of scenario 3, as I generally buy the orthogonality thesis, leading me to believe that it’s possible to be both wise and evil.
At the same time, emergent misalignment suggests it may be reasonable to expect that an AI nudged to become wise will also be nudged somewhat towards being moral.
Interesting! I think I didn’t fully distinguish between two possibilities:
1. An AW that merely has an understanding of wisdom
2. An AW whose values are aligned to wisdom, or at least aligned to pursuing and acting on wisdom
I think both types of AW are worth pursuing, but the second may be even more valuable, and it's the type I had in mind in scenario 3 at least.