Red team: Certain types of prosaic AI alignment (e.g., arguably InstructGPT) promote the illusion of safety without genuinely reducing existential risk from AI, or are capabilities research disguised as safety research. (A claim I've heard from EleutherAI, rather indelicately phrased, and one I'd like to see investigated.)