(3) many techniques that are supposed to improve capabilities also improve alignment.
Take OpenAI’s Superalignment approach. It involves “building a roughly human-level automated alignment researcher” then “using vast amounts of compute to scale efforts, and iteratively align superintelligence”.
AI capabilities is central to the alignment approach because us humans are way too limited to achieve alignment ourselves.
I think you missed out:
Take OpenAI’s Superalignment approach. It involves “building a roughly human-level automated alignment researcher” then “using vast amounts of compute to scale efforts, and iteratively align superintelligence”.
AI capabilities is central to the alignment approach because us humans are way too limited to achieve alignment ourselves.