I think this is an important point that’s worth saying.
For what it’s worth, I’m not especially pessimistic about whether alignment can be solved in principle. But I’m quite concerned that the safety-minded AI companies seem to completely ignore the philosophical problems with AI alignment. They all operate under the assumption that alignment is purely an ML problem that they can solve by basically doing ML research, which I expect is false (credence: 70%).
Wei Dai has written some good stuff about the problem of “philosophical competence”. See here for a collection of his writings on the topic.
Thanks Michael, I hadn’t seen Wei’s comments before; that was quite a fruitful rabbit hole 😅