I think the hardness of alignment/safety is difficult to compare with the steam engine / Apollo / P vs NP, because most of the difficulty, if it is there, would come from alignment having a very different inherent structure, one that doesn’t allow for iterative solutions. I think that’s what you’re getting at here too.
See also:
Worlds where iterative design fails
Creating worlds where iterative alignment succeeds