Hi harfe, thanks for this helpful clarification.
I’d agree that A, B, and C all seem hard, that B is harder than A, and that B is harder than C.
Where we disagree is that I suspect that C is harder than A, for basic game-theoretic reasons I mentioned in the original post.
I’m also not confident that C is a whole lot easier than B—I’m not sure that alignment with individual humans will actually give us all that much help in doing alignment with complicated groups of humans.
But I need to think further about this and do some more reading!