The AI safety community has gotten people to do reinforcement learning from human feedback (rather than automated reward functions) sooner than it would otherwise have happened.
There are many subtleties about whether this reduced x-risk, but I think it did.