as long as you imitate someone aligned then it doesn’t pose much safety risk.
Also, this kind of imitation doesn’t result in the model taking superhumanly clever actions, even if you imitate someone unaligned.
Also, this kind of imitation doesn’t result in the model taking superhumanly clever actions, even if you imitate someone unaligned.