I don’t think it’s binary, but I do think it’s likely to be a sigmoid in practice. And I expect this sigmoid will saturate relatively early.
Another way to put this is that I expect that “fraction of value lost by misalignment” will quickly exponentially decay with the number of AI generations. (This is by no means obvious, just my main line guess.)
I don’t think it’s binary, but I do think it’s likely to be a sigmoid in practice. And I expect this sigmoid will saturate relatively early.
Another way to put this is that I expect that “fraction of value lost by misalignment” will quickly exponentially decay with the number of AI generations. (This is by no means obvious, just my main line guess.)