Thanks for the post, David!
Based on this analysis, deceptive alignment is less than 1% likely for prosaic TAI.
Why less than 1 % instead of e.g. 0.1 % or 10 %?
Thanks for the post, David!
Why less than 1 % instead of e.g. 0.1 % or 10 %?