So exciting! In my personal opinion, evals and mech interpretability seem like the most tractable parts of the AI Safety ecosystem right now, so I'm very happy to see talented people work on this.