Note: I’m not an expert on this stuff.
My understanding from the political science literature is that ideas around “punctuated equilibrium” and “critical junctures” are a somewhat well-supported theory about policy-making. The rough model looks something like this (from Wikipedia):
Antecedent Conditions ––> Cleavage or Shock ––> Critical Juncture
––> Aftermath ––> Legacy
My impression is that “warning shots” fit this framework somewhat well (though it’s probably better to talk in terms of “shocks” or “windows of opportunities”; see this comment). We already saw this in the case of ChatGPT, which spurred a flurry of policy-making activity (even though not much ended up sticking). (NB: I don’t think the Turing test example or alignment faking paper are shocks in this sense.)
Optimally taking advantage of these windows still requires lots of groundwork. I am unsure what this groundwork ideally looks like.
Thank you! Fixed.