I agree “having people on the inside” seems useful. At the same time, it’s hard for me to imagine what an “aligned” researcher could have done at the Manhattan Project to lower nuclear risk. That’s not meant as a total dismissal, it’s just not very clear to me.
> Safety-conscious researchers and engineers have done an incredible work setting up safety teams in OpenAI and DeepMind.
I don’t know much about what successes here have looked like, I agree this is a relevant and important case study.
> I think ostracizing them would be a huge error. My other comments better reflect my current feelings here.
I agree “having people on the inside” seems useful. At the same time, it’s hard for me to imagine what an “aligned” researcher could have done at the Manhattan Project to lower nuclear risk. That’s not meant as a total dismissal, it’s just not very clear to me.
> Safety-conscious researchers and engineers have done an incredible work setting up safety teams in OpenAI and DeepMind.
I don’t know much about what successes here have looked like, I agree this is a relevant and important case study.
> I think ostracizing them would be a huge error.
My other comments better reflect my current feelings here.