I don’t think it’s binary, but I do think it’s likely to be a sigmoid in practice. And I expect this sigmoid will saturate relatively early.
Another way to put this is that I expect “fraction of value lost by misalignment” to decay exponentially, and quickly, with the number of AI generations. (This is by no means obvious, just my mainline guess.)
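
To make the link between the two framings concrete, here is a minimal sketch. It assumes the sigmoid is a standard logistic curve in the number of AI generations; the comment doesn't commit to a functional form, and `steepness` and `midpoint` are hypothetical parameters chosen purely for illustration. Under that assumption, the shortfall from saturation (the fraction of value lost) shrinks by a roughly constant factor per generation once past the midpoint, which is just exponential decay.

```python
import math

def value_lost(generation, steepness=1.0, midpoint=0.0):
    """Fraction of value lost after `generation` AI generations.

    Assumes (hypothetically) that the fraction of value retained is a
    logistic sigmoid, so the fraction lost is 1 - sigmoid. It is written
    in closed form here to avoid floating-point cancellation.
    """
    return 1.0 / (1.0 + math.exp(steepness * (generation - midpoint)))

# Fraction lost at generations 0..7 under the illustrative parameters.
losses = [value_lost(n) for n in range(8)]

# Ratio of successive losses; for a logistic curve this approaches
# exp(-steepness), i.e. a constant per-generation decay factor.
ratios = [later / earlier for earlier, later in zip(losses, losses[1:])]

for n, loss in enumerate(losses):
    print(f"generation {n}: fraction lost = {loss:.4f}")
print("per-generation decay factors:", [round(r, 3) for r in ratios])
```

A larger `steepness` corresponds to the sigmoid saturating earlier and the loss decaying faster. None of this says what the real parameters are; it only shows that "sigmoid that saturates early" and "loss decays exponentially with generations" are the same picture under the logistic assumption.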