kaarel comments on ‘Dissolving’ AI Risk – Parameter Uncertainty in AI Future Forecasting

kaarel Oct 19, 2022, 9:36 AM
4 points
2 ∶ 0
If we actually take these to be the probabilities that we live in various kinds of worlds, then it’s just a law of conditional probability that the overall probability is the arithmetic mean of the individual probabilities, not the geometric mean, I believe.

I could imagine ways to philosophically justify taking the geometric mean here anyway, e.g. by arguing that our synthetic samples are drawn from a large community of synthetic experts that is an accurate extrapolation of actual experts, and that it’s a good idea to take the geometric mean of forecasts. [EDIT 1: I’m guessing this should be done with odds instead of probabilities though, and that this would bump the answer upward.] [EDIT 2: In fact, it was done with odds already.] But the former seems implausible given that the products of the numbers given by individual experts tend to be larger (in particular, larger mean) than the ones found for the synthetic community – this suggests that experts are giving correlated answers to the different questions. Perhaps we should think that some sort of idealized experts would answer these subquestions independently, but with the same distribution as the empirical one in this sample? It’s not clear to me that this is the case. In any case, if there is good reason to take the geometric mean here, I think the analysis could greatly benefit from presenting a clear justification of this, as the answer depends on this up to close to an order of magnitude.

I read your footnote 9 on this question regarding the geometric mean vs the arithmetic mean, and found it confusing. If the picture given in the first paragraph of my comment is indeed what you have in mind, then shouldn’t the Brier-score-maximizing prediction still be the arithmetic mean (as that is the all-things-considered probability)? I don’t see how the geometric mean would come into play.
- kaarel Oct 19, 2022, 9:46 AM
  3 points
  0 ∶ 0
  Parent
  [EDIT 1: The following is wrong re the 1.6% number, because that was the geometric mean of odds, not the geometric mean of probabilities as I assumed here.]
  
  By the way, as the number of samples you take goes to infinity, I think the geometric mean of the sampled probabilities converges (in probability) to a limit which has a simple form in terms of the data. (After taking the log, this should just be a consequence of the law of large numbers.) Namely, it converges to the geometric mean of all the products of numbers from individual predictions! So instead of getting 1.6% from the sampling, I think you could have multiplied 42 numbers you calculated to find the number this 1.6% would converge to in the limit as the number of samples goes to infinity. I.e., what I have in mind are the numbers that were averaged to get the 18.7% number below. [EDIT 2: That’s not quite true, because the 18.7% was not the average of the products, but instead the product of the averages.]
  
  Could you compute this number? Or feel free to let me know if I’m missing something. I’m also happy to elaborate further on the argument for convergence I have in mind.
  - Froolow Oct 19, 2022, 10:59 AM
    3 points
    0 ∶ 0
    Parent
    I’m not completely sure I understand your request. The screenshot below is the Excel file with the survey results in. Column U is the product of columns N to S. You’d like the geometric mean of odds of column U? This is 0.023, which is approximately 2.3%. This isn’t quite the same as the estimate in my model, I think because there is some missing survey data which isn’t carried over into the model
    - kaarel Oct 19, 2022, 7:04 PM
      2 points
      0 ∶ 0
      Parent
      Thanks! That’s indeed the quantity I was interested in, modulo me incorrectly thinking that you computed the geometric mean of probabilities and not odds.
      
      Given that you used odds when computing the geometric mean, I retract my earlier claim that there is such a simple closed-form limit as the number of samples goes to infinity. Thanks for the clarification!
    - kaarel Oct 20, 2022, 1:41 AM
      1 point
      0 ∶ 0
      Parent
      Here is another claim along similar lines: in the limit as the number of samples goes to infinity, I think the arithmetic mean of your sampled probabilities (currently reported as 9.65%) should converge (in probability) to the product of the arithmetic means of the probabilities respondents gave for each subquestion. So at least for finding this probability, I think one need not have done any sampling.
      
      If you’d like to test this claim, you could recompute the numbers in the first column below with the arithmetic mean of the probabilities replacing the geometric mean of the odds, and find what the 18.7% product becomes.
      - Froolow Oct 20, 2022, 9:27 AM
        3 points
        0 ∶ 0
        Parent
        Hope I’ve understood you right! I’ve taken the arithmetic mean of all columns and then computed the product of those arithmetic means. I end up with 9.74%. Again, I think this is slightly different from my model’s estimate of the value because the survey has some missing data which doesn’t occur in the synthetic distribution of the model
        kaarel Oct 20, 2022, 9:39 AM
        1 point
        0 ∶ 0
        Parent
        Thanks, this is great!
- Froolow Oct 19, 2022, 10:49 AM
  2 points
  0 ∶ 0
  Parent
  Just a small clarification point I didn’t make clear enough in the essay—my geometric mean is always the geometric mean of odds, converted back into probability because it makes it easier to interpret for most readers. So 1.6% is genuinely the geometric mean of odds, but I take your point that the geometric mean of odds might not be the best summary statistic to use in the first place. To reiterate though, my main argument is that point estimates are misleading in this case regardless of which point estimate you use, and distributions of ex post risk are important to consider.
  I’m really sorry, I don’t know what I meant by the reference to Briar Scores either—I’ll change that footnote until I can figure out what I was trying to say.
  - kaarel Oct 20, 2022, 2:20 AM
    1 point
    0 ∶ 0
    Parent
    I can buy that it is sometimes useful to think about x-risk in terms of a partition of the worlds we could be in, the probability of each part in the partition, and the probability of x-risk in each part. For this to be useful in decision-making, I think we’d want the partition to sort of “carve reality at its joints” in a way that’s relevant to the decisions we’d like to make. I’m generally unconvinced that the partition given here achieves this.
    
    My best attempt at trying to grok the partition here is that worlds are grouped according to something like the “intrinsic difficulty” of alignment, with the remaining uncertainty being over our actions to tackle alignment. But I don’t see a good reason to think that the calculation methodology used in the post would give us such a partition. Perhaps there is another natural way to interpret the partition given, but I don’t see it.
    
    For a more concrete argument against this distribution of probabilities capturing something useful, let’s consider the following two respondents. The first respondent is certain about the “intrinsic difficulty” of alignment, thinking we just have a probability of 50% of surviving. Maybe this first respondent is certain that our survival is determined by an actual coinflip happening in 2040, or whatever. The other respondent thinks there is a 50% chance we are in a world in which alignment is super easy, in which we have a 99% chance of survival, and a 50% chance we are in a world in which alignment is super hard, in which we have a 1% chance of survival. Both respondents will answer 50% when we ask them what their p(doom) is, but they clearly have very different views about the probability distribution on the “intrinsic difficulty” of alignment.
    
    Now, insofar as the above makes sense, it’s probably accurate to say that most respondents’ views on most of the surveyed questions are a lot like respondent 2, with a lot of uncertainty about the “intrinsic difficulty” involved, or whatever the relevant parameter is that the analysis hopes to partition according to. However, the methodology used would give the same results if the people we surveyed were all like respondent 1 and if the people we surveyed were all like respondent 2. (In fact, my vague intuition is that the best attempt to philosophically ground the methodology would assume that everyone is like respondent 1.) This seems strange, because as far as I can intuitively capture what the distribution over probabilities is hoping to achieve, it seems that it should be very different in the two cases. Namely, if everyone is like respondent 1, the distribution should be much more concentrated on certain kinds of worlds than if everyone is like respondent 2.
    
    Note that the question about the usefulness of the partition is distinct from whether one can partition the worlds into groups with the given conditional probabilities of x-risk. If I think a coin lands heads in 50% of the worlds, the math lets me partition all the possible worlds into 50% where the coin has a 0% probability of landing heads, and 50% where the coin has a 100% probability of landing heads. Alternatively, the math also lets me partition all possible worlds into 50% where the coin has 50% probability of landing heads, and 50% where the coin has 50% probability of landing heads. What I’m doubting is that either distribution would be helpful here, and that the distribution given in the post is helpful for understanding x-risk.
    What links here?
    What are your cruxes for imprecise probabilities / decision rules? by Anthony DiGiovanni (LessWrong; Jul 31, 2024, 3:42 PM; 36 points)