Elityre comments on My current thoughts on MIRI’s “highly reliable agent design” work

Elityre 13 Nov 2018 19:08 UTC
5 points
0 ∶ 0
(Eli’s personal notes, mostly for his own understanding. Feel free to respond if you want.)
1. It seems pretty likely that early advanced AI systems won’t be understandable in terms of HRAD’s formalisms, in which case HRAD won’t be useful as a description of how these systems should reason and make decisions.
My current guess is that the finalized HRAD formalisms would be general enough that they will provide meaningful insight into early advanced AI systems (even supposing that the development of those early systems is not influenced by HRAD ideas), in much the same way that Pearlean causality and Bayes nets gives (a little) insight into what neural nets are doing.