If you haven’t already, I’d recommend reading Richard Ngo’s AGI Safety From First Principles, which I think is an unusually rigorous treatment of the issue.
I had it bookmarked, but not looked at it yet. Thanks for the recommendation!
Also check out the AGI Safety Fundamentals Alignment Curriculum and corresponding Google doc. The Intro to ML Safety material might also be of interest.
If you haven’t already, I’d recommend reading Richard Ngo’s AGI Safety From First Principles, which I think is an unusually rigorous treatment of the issue.
I had it bookmarked, but not looked at it yet. Thanks for the recommendation!
Also check out the AGI Safety Fundamentals Alignment Curriculum and corresponding Google doc. The Intro to ML Safety material might also be of interest.