Thanks for thoughtfully engaging with this topic! I’ve spent a lot of time exploring arguments that alignment is hard, and am also unconvinced. I’m particularly skeptical about deceptive alignment, which is closely related to your point b. I’m clearly not the right person to explain why people think the problem is hard, but I think it’s good to share alternative perspectives.
If you’re interested in more skeptical arguments, there’s a forum tag and a lesswrong tag. I particularly like Quintin Pope’s posts on the topic.
Thanks for thoughtfully engaging with this topic! I’ve spent a lot of time exploring arguments that alignment is hard, and am also unconvinced. I’m particularly skeptical about deceptive alignment, which is closely related to your point b. I’m clearly not the right person to explain why people think the problem is hard, but I think it’s good to share alternative perspectives.
If you’re interested in more skeptical arguments, there’s a forum tag and a lesswrong tag. I particularly like Quintin Pope’s posts on the topic.