That mindspace is large and AIs are really weird.
What specific confirmatory evidence are you thinking of?
Conversations people have with un-RLHF’d models.
That mindspace is large and AIs are really weird.
What specific confirmatory evidence are you thinking of?
Conversations people have with un-RLHF’d models.