Greg_Colbourn ⏸️ comments on AI Pause Will Likely Backfire

Greg_Colbourn ⏸️ 23 Sep 2023 12:04 UTC
4 points
Anthropic^[1] have a massive conflict of interest (making money), so their statements are in some sense safetywashing. There are at least a few years' worth of safety work that could be done on current models if we had the time (i.e. via a pause): interpretability is still stuck on trying to decipher GPT-2-sized models and smaller, and jailbreaks are still very far from being solved. There is plenty to be getting on with without pushing the frontier of capabilities yet further.
1. ^ And the other big AI companies that supposedly care about x-safety (OpenAI, Google DeepMind).