This task, of trying to align them, is something that shouldn’t just be left to researchers in AI companies
In principle I agree.
But would you say that people’s suitability to align AI safely (or more specifically ensuring that Fable does not write nasty software exploits) is defined less by their expertise and alignment with Anthropic’s stated mission and more by how much money they can spend on credits?
Because that’s what Anthropic and the impending IPO marketing is asking you to believe
(tbh I’m not concerned by Fable manipulating its way into world domination. But if I was, I’d be extremely concerned that our most dedicated defenders against manipulative AI agents might be the sort of people who still take statements put out by AI companies at face value)
In principle I agree.
But would you say that people’s suitability to align AI safely (or more specifically ensuring that Fable does not write nasty software exploits) is defined less by their expertise and alignment with Anthropic’s stated mission and more by how much money they can spend on credits?
Because that’s what Anthropic and the impending IPO marketing is asking you to believe
(tbh I’m not concerned by Fable manipulating its way into world domination. But if I was, I’d be extremely concerned that our most dedicated defenders against manipulative AI agents might be the sort of people who still take statements put out by AI companies at face value)