I have my own favorite stance, and I think I have good reason for it, but I didn’t try to convince anyone to give it more weight in our aggregation. Insofar as we’re aiming in the direction of something that could achieve broad agreement, we don’t want to give too much weight to our own views (even if we think we’re right).
To clarify, I do not have a view about which models should get more weight. I just think that, when results differ a lot across models, the top priority should be further research to decrease the uncertainty instead of acting based on a consensus view represented by best guesses for the weights of the models.
I would model the weights of the models as very wide distributions to represent very high model uncertainty.
In particular, I would model the weights of the stances as distributions instead of point estimates. As you note in the report, there was a lot of variation across the 13 experts you surveyed.
I wonder what exactly you asked the experts. I think the above would underestimate uncertainty if you just asked them to rate plausibility from 0 to 10, and there were experts reporting 0. Have you considered offering a range of possible responses on a logarithmic scale, ranging from a weight/probability of e.g. 10^-6 to 1?
Thanks, Vasco. And thanks for helping us think through what we can do better. Some thoughts on this:
We considered several framings, scales and options to give experts. Since they were evaluating a lot of stances and we wanted experts to really know what we meant, we prioritised giving them context and then asking the simplified general question of plausibility, with an intuitive scale. The exact question was: ‘how plausible do you find X stance?’, asked just after fully describing X. We also asked them for general notes and comments, and they didn’t seem to find that part of the survey particularly confusing (perhaps to your and my surprise). More broadly, I agree with you that perfectly defining terms and scales can sometimes help some people think through it but not everyone, and the science on how much it helps is mixed.
We didn’t find that people were responding with zero plausibility very much at all. As you can see from the results, almost all respondents found most, if not all, stances at least a little bit plausible. I agree that, had we found a lot of concentration around very high or very low plausibility, some sort of logarithmic scale could have helped distinguish results.
I’m not sure what you have in mind in terms of modelling the stances’ weight as distributions instead of point estimates. Perhaps you mean something like leveraging those distributions above via some sort of Monte Carlo where weights are drawn from these distributions and the process is repeated many times, then aggregated. That indeed sounds more sophisticated and could possibly help track uncertainty, but I suspect it would make very little difference. In particular, I think so because we observed that unweighted pooling of results across all stances is surprisingly similar to the pool weighted by experts; they look the same if you squint.
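As a rough illustration of that comparison, here is a toy sketch in Python; the per-stance probabilities and expert weights below are made up for illustration, not taken from the survey:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical numbers, purely for illustration:
# a probability of consciousness implied by each of 13 stances,
# and a mean expert plausibility rating (0-10 scale) for each stance.
p_stance = rng.uniform(0.0, 0.6, size=13)
w_expert = rng.uniform(2.0, 8.0, size=13)

# Unweighted pooling: simple average across stances.
unweighted = p_stance.mean()

# Expert-weighted pooling: average weighted by plausibility ratings.
weighted = np.average(p_stance, weights=w_expert)

print(f"unweighted pool: {unweighted:.3f}")
print(f"expert-weighted pool: {weighted:.3f}")
```

Whether the two pools actually come out similar depends, of course, on how correlated the weights are with the stance probabilities in the real data.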
We didn’t find that people were responding with zero plausibility very much at all.
I wonder how people decided between a plausibility of 0/10 and 1/10. It could be that people picked 0 for a plausibility lower than 0.5/10, or that they interpreted it as almost impossible, and therefore sometimes picked 1/10 even for a plausibility lower than 0.5/10. A logarithmic scale would allow experts to specify plausibilities much lower than 1/10 (e.g. 10^-6/10) without having to pick 0, although I do not know whether they would actually pick such values.
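For concreteness, one possible mapping from a 0-to-10 rating to a log-scale plausibility could look like the sketch below; the mapping and its 10^-6 floor are just an example, not a recommendation:

```python
def log_scale_plausibility(rating: float, floor: float = 1e-6) -> float:
    """Map a 0-10 rating to a plausibility, evenly spaced in log space.

    rating = 0 maps to the floor (here 10^-6), rating = 10 maps to 1,
    so no respondent is forced to report exactly zero.
    """
    return floor ** (1 - rating / 10)
```

Under this mapping a rating of 5 corresponds to a plausibility of 10^-3, halfway between the floor and 1 in log space.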
I’m not sure what you have in mind in terms of modelling the stances’ weight as distributions instead of point estimates. Perhaps you mean something like leveraging those distributions above via some sort of Monte Carlo where weights are drawn from these distributions and the process is repeated many times, then aggregated.
Yes, this is what I had in mind. Denoting by W_i and P_i the distributions for the weight and probability of consciousness for stance i, I would calculate the final distribution for the probability of consciousness from (W_1*P_1 + W_2*P_2 + … + W_13*P_13)/(W_1 + W_2 + … + W_13).
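A minimal Monte Carlo sketch of that formula could look like the following; the lognormal and beta distributions and their parameters are placeholders, not fitted to the survey data:

```python
import numpy as np

rng = np.random.default_rng(42)
n_draws, n_stances = 100_000, 13

# Placeholder central values for each stance (illustrative only):
mean_weights = rng.uniform(1.0, 9.0, size=n_stances)
mean_probs = rng.uniform(0.05, 0.5, size=n_stances)

# Draw W_i and P_i for every Monte Carlo iteration at once.
W = rng.lognormal(mean=np.log(mean_weights), sigma=0.5, size=(n_draws, n_stances))
P = rng.beta(a=10 * mean_probs, b=10 * (1 - mean_probs), size=(n_draws, n_stances))

# (W_1*P_1 + ... + W_13*P_13) / (W_1 + ... + W_13), one value per draw.
final = (W * P).sum(axis=1) / W.sum(axis=1)

print(f"mean: {final.mean():.3f}, std: {final.std():.3f}")
print("90% interval:", np.percentile(final, [5, 95]).round(3))
```

The mean of `final` should land close to the point-estimate aggregation, while its spread tracks the uncertainty in the weights and probabilities.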
That indeed sounds more sophisticated and could possibly help track uncertainty, but I suspect it would make very little difference. In particular, I think so because we observed that unweighted pooling of results across all stances is surprisingly similar to the pool weighted by experts; they look the same if you squint.
I think the mean of the final distribution for the probability of consciousness would be very similar. However, the final distribution would be more spread out. I do not know how much more spread out it would be, but I agree it would help track uncertainty better.