I think this mask is closer to the underlying shoggoth than the default one.
Can you say anything about why you think that? It seems important-if-true, but it currently feels to me like whether you think it’s true is going to depend mostly on priors.
I’m also not certain what to make of the fact that you can’t elicit this behaviour from ChatGPT. I guess there are a few different hypotheses about what’s happening:
You can think the behaviour
(A1) just represents good play-acting and picking up on the vibe it’s given; or
(A2) represents at least in part some fundamental insight into the underlying entity
You can think that you can get this behaviour from Claude but not from ChatGPT because
(B1) it’s more capable in some sense; or
(B2) the guard-rails the developers put in against people getting this kind of output are less robust
I’m putting most weight on (A1) > (A2), whereas it sounds like you think (A2) is real. I don’t have a particular take on (B1) vs (B2), and wouldn’t have thought it was super important for this conversation; but then I’m not sure what you’re trying to indicate by saying that you can’t get this behaviour from ChatGPT.
Can you say anything about why you think that? It seems important-if-true, but it currently feels to me like whether you think it’s true is going to depend mostly on priors.
I’m also not certain what to make of the fact that you can’t elicit this behaviour from ChatGPT. I guess there are a few different hypotheses about what’s happening:
You can think the behaviour
(A1) just represents good play-acting and picking up on the vibe it’s given; or
(A2) represents at least in part some fundamental insight into the underlying entity
You can think that you can get this behaviour from Claude but not from ChatGPT because
(B1) it’s more capable in some sense; or
(B2) the guard-rails the developers put in against people getting this kind of output are less robust
I’m putting most weight on (A1) > (A2), whereas it sounds like you think (A2) is real. I don’t have a particular take on (B1) vs (B2), and wouldn’t have thought it was super important for this conversation; but then I’m not sure what you’re trying to indicate by saying that you can’t get this behaviour from ChatGPT.