One of the key takeaways in the body of the text, which perhaps I should have brought out more in the summary, is that the GiveWell model is basically as reliable as highly professionalised bodies like pharma companies have figured out how to make a cost-effectiveness model. A small number of minor errors is unexceptional for a model of this complexity; even models submitted to pharma regulators, with several million dollars of development behind them, contain errors of this kind.
I would say that while the errors are uninteresting and unexceptional, the unusual model design decisions are worth commenting on. The GiveWell team are admirably transparent with their model, and anybody who wants to review it can access almost everything at the click of a button (some assumptions are gated to GiveWell staff, but these aren't central). Given this, it is remarkable that the EA community didn't manage to surface anyone who knew enough about models to flag to GiveWell that there were design optimisations to be made; the essay above is not really arcane modelling lore, but rather something anyone with a few years' experience in pharma HEOR could have told you. Is this because there are too few quantitative actors in the EA space? Is it because they don't think their contributions would be valued, so they don't speak up? Is it because criticism of GiveWell makes you unemployable in EA spaces, so it is heavily disincentivised? Etc etc. That is to say, I think asking why GiveWell missed the improvements is missing the important point, which is that everyone missed these improvements, so there are probably changes that can be made to expert knowledge synthesis in EA right across the board.
Just to add that I think outreach efforts like the Red Team contest are a really good way of doing this; I wouldn't have heard about the EA Forum had it not been for the plug Scott Alexander gave the contest on Astral Codex Ten (which I read mostly for the stuff on prediction markets).
Ah sorry, I think I might have confused the issue a bit with my footnote. I think I’ve managed to conflate two issues in your mind.
The first is exactly as you say: any intervention worth doing has some effects which are easy to model and some which are difficult (maybe impossible) to model. What GiveWell has done here is completely reasonable: model what it can, then make assumptions about how important the other factors, like track record, are in comparison to the main cost-effectiveness results.
The second issue is the more subtle one that I was driving at. Imagine you are going to buy a new car, and your friend (who knows about cars) says that modern cars are 10x more fuel efficient than the car you currently drive. Speaking very roughly, there are two strategies you could pick from to choose your next car:
1. Completely ignore your friend, and pick the car with the best MPG regardless of any other feature. This would be a good strategy if literally all you care about is fuel efficiency, but a bad strategy otherwise (because it is unlikely the most fuel-efficient car is also the most comfortable to drive, especially if fuel efficiency and comfort trade off against each other).
2. Treat your friend as having offered a useful rule of thumb, and so carry around an idea of what 'good' fuel efficiency looks like. This is a good strategy if cars aren't really directly comparable along a straightforward scale: a Ford F-150 isn't 'better' or 'worse' than a Prius, it is just a different kind of thing.
Both GiveWell (implicitly) and I in my fertility days (explicitly) argued that QALYs are like cars: you can end up in a situation where you are generating different kinds of QALYs, and your best bet is to compare them with a rule of thumb like GiveWell's 10x multiplier. However, I don't think GiveWell is correct to make this assumption about charities. There is in fact a single measure, like MPG, which we want to ruthlessly optimise, and therefore we do actually want to pit the F-150 and the Prius directly against each other.
However, my point in the essay is that GiveWell don't actually have to choose: they can build their model as if they are in the first world and directly compare charities against each other, and then make their final decision as though they are in the second world, where different charities offer different profiles of benefit on top of their cost-effectiveness. This is pretty much the commonsense way of choosing a car too: you would look at MPG and compare cars directly on that measure, but you might then consider other factors. It would be weird to lump all cars together in your head as 'better than 10x my previous efficiency' or 'worse than 10x my previous efficiency'.
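To make the distinction concrete, here is a minimal sketch in Python of the two decision procedures: bucketing charities by whether they clear a 10x-cash-transfers threshold, versus ranking them directly on modelled cost-effectiveness and only then layering on the harder-to-model considerations. The charity names and multipliers are invented purely for illustration and are not GiveWell's actual figures.

```python
# Illustrative sketch only: the charities and multipliers below are invented,
# not GiveWell's actual estimates.

charities = {
    # name: modelled cost-effectiveness, as a multiple of cash transfers
    "Charity A": 18.0,
    "Charity B": 11.0,
    "Charity C": 7.0,
}

THRESHOLD = 10.0  # the '10x cash transfers' rule of thumb

# Strategy 2 (rule of thumb): every charity clearing the bar looks the same.
buckets = {name: ("above 10x" if x >= THRESHOLD else "below 10x")
           for name, x in charities.items()}
print(buckets)
# {'Charity A': 'above 10x', 'Charity B': 'above 10x', 'Charity C': 'below 10x'}

# Strategy 1 (direct comparison): rank on the modelled number first,
# then weigh qualitative factors (track record, funding gaps, etc.)
# as a separate judgement call on top of the ranking.
for name, x in sorted(charities.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {x:.0f}x cash transfers")
```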