oh54321 comments on All AGI Safety questions welcome (especially basic ones) [~monthly thread]

oh54321 9 Nov 2022 16:37 UTC
1 point
0 ∶ 0
Ok, cool, that’s helpful to know. Is your intuition that these examples will definitely occur and we just haven’t seen them yet (due to model size or something like this)? If so, why?
- Greg_Colbourn ⏸️ 9 Nov 2022 17:04 UTC
  2 points
  0 ∶ 0
  My intuition is that they will occur, hopefully before it’s too late (though, given incentives for deception etc., we may not see them before it’s too late). More here: Evaluating LM power-seeking.