Your section āMythos can hide its thoughtsā reminds me a lot of the juncture in AI2027, where the fictional AI company must decide whether to press ahead with a model whose alignment properties were hazy. Something thatās unclear from your essay:
Does Anthropic intend to correct the faulty safety training of Mythos (and Claude4.6)?
This seems like a case where disclosure is not enoughāthey need to rectify the mistake in order to make these AIs as safe as possible. Especially since Claude4.6/āMythos are likely to be used for the training & alignment of Claude6 and beyond.
In hindsight, I regret putting Bernieās name in the topic title and question. Most Americans already know what they think about Bernie, so I wonder to what extent thatās influencing peopleās answers.