JackM answers Alignment & Capabilities: What’s the difference?

JackM 1 Sep 2023 0:40 UTC
5 points
1 ∶ 2
I think you missed out:
(3) many techniques that are supposed to improve capabilities also improve alignment.
Take OpenAI’s Superalignment approach. It involves “building a roughly human-level automated alignment researcher” then “using vast amounts of compute to scale efforts, and iteratively align superintelligence”.
AI capabilities is central to the alignment approach because us humans are way too limited to achieve alignment ourselves.