Thanks for this post!

There are lots of calls for individuals with views around AI risk to engage with each other and understand the reasoning behind fundamental disagreements.
I strongly share this view, and have therefore quite appreciated this project by AI Impacts. A lot of resources are going into this field, which I’m broadly very supportive of, but it does seem worth gaining a clearer understanding of precisely why that investment is warranted, and of which approaches should get which portions of those resources.
One other post that I personally found really useful for understanding the various views, the underlying assumptions, and their interconnections was Clarifying some key hypotheses in AI alignment (coauthored by Rohin). I’ve also collected here ~30 works I found that strongly relate to this goal (I plan to update that collection over time, and have now added this post to it as well).
And I’m currently working on a post with a very similar objective, but for longtermist/x-risk strategies more broadly. Hopefully that’ll be out soon.