Related: Zvi has just written a pretty in-depth piece on this new Superalignment team, which I recommend. Here’s the opening:
This is a real and meaningful commitment of serious firepower. You love to see it. The announcement, dedication of resources and focus on the problem are all great. Especially the stated willingness to learn and modify the approach along the way.
The problem is that I remain deeply, deeply skeptical of the alignment plan. I don’t see how the plan makes the hard parts of the problem easier rather than harder.
I will begin with a close reading of the announcement and my own take on the plan on offer, then go through the reactions of others, including my take on Leike’s other statements about OpenAI’s alignment plan.
Related: Zvi has just written a pretty in-depth piece on this new Superalignment team, which I recommend. Here’s the opening: