Vasco Grilo🔸 comments on Deceptive Alignment is <1% Likely by Default

Vasco Grilo🔸 4 Apr 2024 14:21 UTC
4 points
0 ∶ 0
Thanks for the post, David!
Based on this analysis, deceptive alignment is less than 1% likely for prosaic TAI.
Why less than 1 % instead of e.g. 0.1 % or 10 %?