The “meta-problem of consciousness” is “What is the exact chain of events in the brain that leads people to self-report that they’re conscious?”. The idea is (1) This is not a philosophy question, it’s a mundane neuroscience / CogSci question, yet (2) Answering this question would certainly be a big step towards understanding consciousness itself, and moreover (3) This kind of algorithm-level analysis seems to me to be essential for drawing conclusions about the consciousness of different algorithms, like those of animal brains and AIs.
(For example, a complete accounting of the chain of events that leads me to self-report “I am wearing a wristwatch” involves, among other things, a description of the fact that I am in fact wearing a wristwatch, and of what a wristwatch is. By the same token, a complete accounting of the chain of events that leads me to self-report “I am conscious” ought to involve the fact that I am conscious, and what consciousness is, if indeed consciousness is anything at all. Unless you believe in p-zombies I guess, and likewise believe that your own personal experience of being conscious has no causal connection whatsoever to the words that you say when you talk about your conscious experience, which seems rather ludicrous to me, although to be fair there are reasonable people who believe that.)
My impression is that the meta-problem of consciousness is rather neglected in neuroscience / CogSci, although I think Graziano is heading in the right direction. For example, Dehaene has a whole book about consciousness, and nowhere in that book will you see a sentence that ends “...and then the brain emits motor commands to speak the words ‘I just don’t get it, why does being human feel like anything at all?’”, or anything remotely like that. I don’t see anything like that from QRI either, although someone can correct me if I missed it. (Graziano does have sentences like that.)
Ditto with the “meta-problem of suffering”, incidentally. (Is that even a term? You know what I mean.) It’s not obvious, but when I wrote this post I was mainly trying to work towards a theory of the meta-problem of suffering, as a path to understand what suffering is and how to tell whether future AIs will be suffering. I think that particular post was wrong in some details, but hopefully you can see the kind of thing I’m talking about. Conveniently, there’s a lot of overlap between solving the meta-problem of suffering and understanding brain motivational systems more generally, which I think may be directly relevant and important for AI Alignment.
wrt QRI’s take on the causal importance of consciousness—yes, it is one of the core problems that are being addressed.
Perhaps see: Breaking Down the Problem of Consciousness, and Raising the Table Stakes for Successful Theories of Consciousness.
wrt the meta-problem, see: Qualia Formalism in the Water Supply: Reflections on The Science of Consciousness 2018
I did not know about the meta-problem of consciousness before. I will have to think about this, thank you!
In Principia Qualia (pp. 65-66), Mike Johnson posits:
What is happening when we talk about our qualia?
If ‘downward causation’ isn’t real, then how are our qualia causing us to act? I suggest that we should look for solutions which describe why we have the sensory illusion of qualia having causal power, without actually adding another causal entity to the universe.
I believe this is much more feasible than it seems if we carefully examine the exact sense in which language is ‘about’ qualia. Instead of a direct representational interpretation, I offer we should instead think of language’s ‘aboutness’ as a function of systematic correlations between two things related to qualia: the brain’s logical state (i.e., connectome-level neural activity), particularly those logical states relevant to its self-model, and the brain’s microphysical state (i.e., what the quarks which constitute the brain are doing).
In short, our brain has evolved to be able to fairly accurately report its internal computational states (since it was adaptive to be able to coordinate such states with others), and these computational states are highly correlated with the microphysical states of the substrate the brain’s computations run on (the actual source of qualia). However, these computational states and microphysical states are not identical. Thus, we would need to be open to the possibility that certain interventions could cause a change in a system’s physical substrate (which generates its qualia) without causing a change in its computational level (which generates its qualia reports). We’ve evolved toward having our qualia, and our reports about our qualia, being synchronized – but in contexts where there hasn’t been an adaptive pressure to accurately report our qualia, we shouldn’t expect these to be synchronized ‘for free’.
The details of precisely how our reports of qualia, and our ground-truth qualia, might diverge will greatly depend on what the actual physical substrate of consciousness is. What is clear from this, however, is that transplanting the brain to a new substrate – e.g., emulating a human brain as software, on a traditional von Neumann architecture computer – would likely produce qualia very different from the original, even if the high-level behavioral dynamics which generate its qualia reports were faithfully replicated. Copying qualia reports will likely not copy qualia.
I realize this notion that we could (at least in theory) be mistaken about what qualia we report & remember having is difficult to swallow. I would just say that although it may seem far-fetched, I think it’s a necessary implication of all theories of qualia that don’t resort to anti-scientific mysticism or significantly contradict what we know of physical laws.
Back to the question: why do we have the illusion that qualia have causal power?
In short, I’d argue that the brain is a complex, chaotic, coalition-based dynamic system with well defined attractors and a high level of criticality (low activation energy needed to switch between attractors) that has an internal model of self-as-agent, yet can’t predict itself. And I think any conscious system with these dynamics will have the quale of free will, and have the phenomenological illusion that its qualia have causal power.
And although it would be perfectly feasible for there to exist conscious systems which don’t have the quale of free will, it’s plausible that this quale will be relatively common across most evolved organisms. (Brembs 2011) argues that the sort of dynamical unpredictability which leads to the illusion of free will tends to be adaptive, both as a search strategy for hidden resources and as a game-theoretic advantage against predators, prey, and conspecifics: “[p]redictability can never be an evolutionarily stable strategy.”
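To make the quoted picture concrete (“well defined attractors”, “low activation energy needed to switch between attractors”, a self-model that “can’t predict itself”), here is a minimal toy sketch. This is my own illustration, not something from Principia Qualia: a noisy double-well system whose naive self-model never predicts its own attractor switches.

```python
# Toy sketch (illustrative only): two attractors (x near +1 and x near -1),
# a low barrier between them, and a crude "self-model" that always predicts
# the system will stay in its current attractor.
import random

def simulate(steps=2000, barrier=0.5, noise=0.25, dt=0.1, seed=0):
    rng = random.Random(seed)
    x = 1.0                                  # start in the right-hand attractor
    unpredicted_switches = 0
    correct = 0
    for _ in range(steps):
        predicted_sign = 1 if x > 0 else -1  # self-model: "I'll stay where I am"
        # Double-well drift toward +1 or -1 (strength set by `barrier`), plus
        # noise large enough, relative to the barrier, to kick the state over.
        x += dt * barrier * (x - x ** 3) + noise * rng.gauss(0.0, 1.0)
        actual_sign = 1 if x > 0 else -1
        if actual_sign == predicted_sign:
            correct += 1
        else:
            unpredicted_switches += 1        # a switch the self-model failed to foresee
    return unpredicted_switches, correct / steps

print(simulate())
```

Even though nothing here is uncaused (it is all drift plus noise), the internal predictor is repeatedly surprised by its own switches, which is the flavor of “can’t predict itself” the quoted passage appeals to.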
OK, if I understand correctly, the report suggests that qualia may diverge from qualia reports—like, some intervention could change the former without changing the latter. This just seems really weird to me. Like, how could we possibly know that?
Let’s say I put on a helmet with a button, and when you press the button, my qualia radically change, but my qualia reports stay the same. Alice points to me and says “his qualia were synchronized with his qualia reports, but pressing the button messed that up”. Then Bob points to me and says “his qualia were out-of-sync with his qualia reports, but when you pressed the button, you fixed it”. How can we tell who’s right?
And meanwhile here I am, wearing this helmet, looking at both of them, and saying “Umm, hey Alice & Bob, I’m standing right here, and I’m telling you, I swear, I feel exactly the same. This helmet does nothing whatsoever to my qualia. Trust me! I promise!” And of course Alice & Bob give me a look like I’m a complete moron, and they yell at me in synchrony “...You mean, ‘does nothing whatsoever to my qualia reports’!!”
How can we decide who’s right? Me, Alice, or Bob? Isn’t it fundamentally impossible?? If every human’s qualia reports are wildly out of sync with their qualia, and always have been for all of history, how could we tell?
Sorry if I’m misunderstanding or if this is in the report somewhere.
(I have not read the report in question)
There are some examples of situations/interventions where I’m reasonably confident that the intervention changes qualia reports more than it changes qualia.
The first that jumps to mind is meditation: in the relatively small number of studies I’ve seen, meditation dramatically changes how people think they perceive time (time feels slower, a minute feels longer, etc.), but without noticeable effects on things like reaction speed, cognitive processing of various tasks, etc.
This to me is moderate evidence that the subjective experience of the subjective experience of time has changed, but not (or at least not as much) the actual subjective experience of time.
Anecdotally, I hear similar reports for recreational drug use (time feels slower but reaction speed doesn’t go up...if anything it goes down).
This is relevant to altruists because (under many consequentialist ethical theories) extending subjective experience of time for pleasurable experiences seems like a clear win, but the case for extending the subjective experience of the subjective experience of time is much weaker.
Interesting… I guess I would have assumed that, if someone says their subjective experience of time has changed, then their time-related qualia have changed, kinda by definition. If meanwhile their reaction time hasn’t changed, well, that’s interesting but I’m not sure I care… (I’m not really sure of the definitions here.)
Let me put it a different way. Suppose we simulate Bob’s experiences on a computer. From a utilitarian lens, if you can run Bob on a computational substrate that goes 100x faster, there’s a strong theoretical case that FastBob is 100x as valuable per minute run (or 100x as disvaluable if Bob’s suffering). But if you trick simulatedBob into thinking that he’s 100x faster (or if you otherwise distort the output channel so the channel lies to you about the speed), then it’s much harder to argue that FakeFastBob is indeed 100x faster/more valuable.
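A minimal toy sketch of that intuition, with made-up numbers and under the (assumed) premise that moral value scales with genuine subjective experience per wall-clock minute rather than with what the output channel reports:

```python
# Illustrative only: value = wall-clock time x genuine subjective speedup x valence.
def moral_value(wall_clock_minutes, true_subjective_speedup, valence_per_subjective_minute):
    return wall_clock_minutes * true_subjective_speedup * valence_per_subjective_minute

bob           = moral_value(1, 1,   1)  # baseline Bob: 1 unit of value per minute
fast_bob      = moral_value(1, 100, 1)  # substrate genuinely runs 100x faster: 100 units
fake_fast_bob = moral_value(1, 1,   1)  # only the *report* claims 100x: still 1 unit

print(bob, fast_bob, fake_fast_bob)     # 1 100 1
```

Under that premise, distorting the report channel changes nothing in the third case; only a genuine speedup of the substrate changes the value.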
Oh, I think I see.
If someone declares that it feels like time is passing slower for them (now that they’re enlightened or whatever), I would accept that as a sincere description of some aspect of their experience. And insofar as qualia exist, I would say that their qualia have changed somehow. But it wouldn’t even occur to me to conclude that this person’s time is now more valuable per second in a utilitarian calculus, in proportion to how much they say their time slowed down, or that the change in their qualia is exactly literally time-stretching.
I treat descriptions of subjective experience as a kind of perception, in the same category as someone describing what they’re seeing or hearing. If someone sincerely tells me they saw a UFO last night, well that’s their lived experience and I respect that, but no they didn’t. By the same token, if someone says their experience of time has slowed down, I would accept that something in their consciously-accessible brain has changed, and the way they perceive that change is as they describe, but it wouldn’t even cross my mind that the actual change in their brain is similar to that description.
As for inter-person utilitarian calculus and utility monsters, beats me, everything about that is confusing to me, and way above my pay grade :-P
Right, I guess the higher-level thing I’m getting at is that while introspective access is arguably the best tool we have to access subjective experience in ourselves right now, and stated experiences are arguably the best tool we have to see it in others (well, at least humans), we shouldn’t treat stated experiences as identical to subjective experience.
To go with the perception/UFO example, if someone (who believes themself to be truthful) reports seeing a UFO, and it later turns out that they “saw” a UFO because their friend pulled a prank on them, or because this was an optical illusion, then I feel relatively comfortable in saying that they actually had the subjective experience of seeing a UFO. So while external reality did not actually have a UFO, this was an accurate qualia report.
In contrast, if their memory later undergoes falsification, and they misremembered seeing a bird (which at the time they believed to be a bird) as seeing a UFO, then they only had the subjective experience of remembering seeing a UFO, not the actual subjective experience of seeing a UFO.
Some other examples:
1. If I were to undergo surgery, I would pay more money for a painkiller that numbs my present experience of pain than I would pay for a painkiller that removes my memory of pain (and associated trauma etc.), though I would pay nonzero dollars for the latter. This is because my memory of pain is an experience of an experience, not identical with the original experience itself.
2. Many children with congenital anosmia (being born without a sense of smell) act as if they have a sense of smell until tested. While I think it’s reasonable to say that they have some smell-adjacent qualia/subjective experiences, I’d be surprised if they hallucinated qualia identical to the experiences of people with a sense of smell, and it would be inaccurate to say that their subjective experience of smell is the same as that of people who have the objective ability to smell.
Thanks!
I think you’re emphasizing that qualia reports don’t always correspond exactly to qualia and can’t always be taken at face value, and I’m emphasizing that it’s incoherent to say that qualia exist but there’s absolutely no causal connection whatsoever going from an experienced quale to a sincere qualia report. Both of those can be true!
The first is like saying: “if someone says ‘I see a rock’, we shouldn’t immediately conclude that there was a rock in this person’s field of view. It’s a hypothesis we should consider, but not proven.” That’s totally true.
The second is like disputing the claim: “If you describe the complete chain of events leading to someone reporting ‘I see a rock’, nowhere in that chain of events is there ever an actual rock (with photons bouncing off it), not for anyone ever—oh and there are in fact rocks in the world, and when people talk about rocks they’re describing them correctly, it’s just that they came to have knowledge of rocks through some path that had nothing to do with the existence of actual rocks.” That’s what I would disagree with.
So if you have a complete and correct description of the chain of events that leads someone to say they have qualia, and nowhere in that description is anything that looks just like our intuitive notion of qualia, I think the correct conclusion is “there is nothing in the world that looks just like our intuitive notion of qualia”, not “there’s a thing in the world that’s just like our intuitive notion of qualia, but it’s causally disconnected from our talking about it”.
(I do in fact think “there’s nothing in the world that looks just like our intuitive notion of qualia”. I think this is an area where our perceptions are not neutrally and accurately conveying what’s going on; more like our perception of an optical illusion than our perception of a rock.)
Hi, sorry for the very delayed reply. One thing I didn’t mention in the chain of comments above is that I think it’s more plausible that there are interventions that change qualia reports without much changing (morally important) qualia than the reverse: interventions that change important qualia without changing qualia reports. And I gave examples of changing qualia reports without (much) changing qualia, whereas the linked report talks more about changing qualia without substantively changing qualia reports.
I can conceive of examples where interventions change qualia but not qualia reports (e.g., painkillers for extreme pain that humans naturally forget/round down anyway), but they seem more like edge cases than the examples I gave.
I agree that there are both interventions that change qualia reports without much changing (morally important) qualia and interventions that change qualia without much changing qualia reports, and that we should keep both these possibilities in mind when evaluating interventions.