Only glanced at one or two sections, but the "goal realism is anti-Darwinian" section seems possibly irrelevant to the argument to me. When you first introduce "goal realism," it seems to be the view that goals are actual internal things somehow "written down" in the brain/neural net/other physical mind, so that you could modify the bit of the system where the goal is written down and get different behaviour, rather than there being nothing at all that is the representation of the AI's goals because "goals" are just behavioral dispositions. But the view you're criticizing in the "goal realism is anti-Darwinian" section is the view that there is always a precise fact of the matter about what exactly is being represented at a particular point in time, rather than several different equally good candidates for what is represented. But I can think of representations as physically real vehicles (say, that some combination of neuron firings is the representation of flies/black dots that causes frogs to snap at them) without thinking it is completely determinate what (flies or black dots) is represented by those neuron firings. Determinacy of what a representation represents is not guaranteed just by the fact that a representation exists.
EDIT: Also, is Olah-style interpretability work presuming "representation realism"? Does it provide evidence for it? Evidence for realism about goals specifically? If not, why not?