I think there is a very realistic chance one of the results of this survey has been quite significantly misreported. Specifically, the responses to the question about the slow/moderate/rapid progress scenarios.
Error #1, which I raised here, was that the probabilities were reported without qualification, when what should have been reported was the probability that the scenario would be the one that best matches reality. To their immense credit, the Forecasting Research Institute said they would correct this in a future version of the report. I thank them greatly for that.
Error #2, which I'm not yet 100% sure is in fact an error, so let's call it Possible Error #2, is that these don't seem to be probabilities at all. (I originally raised this possible error here.)
Respondents are asked to predict, in December 2030, "what percent of LEAP panelists will choose" each scenario; they are not asked for a probability. This implies that if they think there's, say, a 51% chance that 30% of LEAP panelists will choose the slow scenario, they should respond by saying 30% will choose the slow scenario. If they think there's a 99% chance that 30% of LEAP panelists will choose the slow scenario, they should also respond by saying 30% will choose the slow scenario. In either case, the number in their answer is exactly the same, despite a 48-point difference in the probability they assign to this outcome. The report says that 30% is the probability respondents assign to the slow scenario, but it's not clear that the respondents' probability is 30%.
The Forecasting Research Institute only asks for the predicted "vote share" for each scenario and not the estimated probabilities behind those vote share predictions. It doesn't seem possible to derive the respondents' probability estimates from the vote share predictions alone. By analogy, if FiveThirtyEight's 2020 election forecast predicts that Joe Biden will win a 55% share of the national vote, this doesn't tell you what probability the model assigns to Biden winning the election (whether it's, say, 70%, 80%, or 90%). The model's probability is certainly not 55%. To know the model's probability or guess at it, you would need information other than just the predicted vote share.
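To make the analogy concrete, here is a small illustrative sketch. The numbers are my own hypothetical stand-ins, not FiveThirtyEight's actual model or anything from the report: two forecasts that report the identical 55% predicted vote share can imply very different win probabilities, depending on how much uncertainty sits around that point prediction.

```python
import random

# Hypothetical illustration only: two forecasts both predict a 55% vote share,
# but they differ in how uncertain they are about that share. The probability
# of winning (share > 50%) depends on that uncertainty, not just on the point
# prediction itself.

def win_probability(mean_share, spread, trials=100_000):
    """Estimate P(vote share > 50%) when the share is modeled as normally
    distributed around mean_share with standard deviation spread."""
    wins = sum(1 for _ in range(trials) if random.gauss(mean_share, spread) > 0.50)
    return wins / trials

# Both forecasts report the same 55% predicted vote share...
low_uncertainty = win_probability(mean_share=0.55, spread=0.02)
high_uncertainty = win_probability(mean_share=0.55, spread=0.08)

# ...but the implied win probabilities are very different (roughly 0.99 vs. 0.73).
print(f"Low-uncertainty forecast:  P(win) ~ {low_uncertainty:.2f}")
print(f"High-uncertainty forecast: P(win) ~ {high_uncertainty:.2f}")
```

The same point applies to the survey question: a respondent's predicted panel vote share for a scenario, taken on its own, does not pin down the probability they assign to that scenario.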
So it seems plausible to me, although not yet certain, that this claim about the probability respondents assign to each scenario (with or without the "best matching" qualifier) is incorrect, because the respondents were not asked about probabilities in the first place. If there is a way to derive probabilities from what the respondents were asked, I don't know what it is.
[Edited on Nov. 21, 2025 at 1:05 PM Eastern to add: titotal apparently agrees.]
Update: the Forecasting Research Institute has changed the language in the report in response to this critique! (It seems like titotal played an important role in this. Thank you, titotal.)
On page 32, the report now gives the survey results in the same intersubjective resolution/metaprediction wording the question was asked in, rather than as an unqualified probability:
By 2030, the average expert thinks that 23% of LEAP panelists will say the state of AI most closely mirrors an ("rapid") AI progress scenario that matches some of these claims.
This is awesome! I'm very happy to see this. Thanks to the Forecasting Research Institute for making this change.
I see this EA Forum post has also been updated in the same way, so thanks for that as well. Great to see.
Update #2: titotal has published a full breakdown of the error involving the intersubjective resolution/metaprediction framing of the survey question. It's a great post that explains the error very well. Many thanks to titotal for taking the time to write the post and for talking to the Forecasting Research Institute about this. Thanks again to the Forecasting Research Institute for revising the report and this post.