Greg_Colbourn ⏸️ comments on The possibility of an indefinite AI pause

Greg_Colbourn ⏸️ 23 Sep 2023 7:17 UTC
4 points
1 ∶ 0
I think that as systems get more capable, we will see a large increase in our alignment efforts and monitoring of AI systems, even without any further intervention from longtermists.
Maybe so. But I can’t really see mechanistic interpretability being solved to a sufficient degree to detect a situationally aware AI playing the training game, in time to avert doom. Not without a long pause first at least!