This is unquestionably the strongest argument against the SDO method as it applies to AI Risk, and therefore the biggest limitation of the essay. There is a really good chance that many of the parameters in the Carlsmith Model are correlated in real life (since basically everything is correlated with everything else by some mechanism), so the important question is whether they are independent enough that what I’ve got here is still plausible. I offer some thoughts on the issue in Section 5.1.
To the best of my knowledge, there is no work making a very strong theoretical claim that any particular element of the Carlsmith Model will be strongly correlated with any other element. I have seen people suggest mechanisms with the implicit claim that if AI is more revolutionary than we expect then there will be correlation between our incentive to deploy it, our desire to expose it to high-impact inputs and our inability to stop it once it tries to disempower us—but I’m pretty confident the validity check in Section 4.3.3 demonstrates that correlation between some parameters doesn’t fundamentally alter conclusions about distributions, although it would alter the exact point estimates reached.
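The distributional point can be sketched with a toy Monte Carlo simulation. Everything here is an assumed stand-in, not the essay's actual model or survey data: six step probabilities are drawn from a lognormal spread of estimates and multiplied together, Carlsmith-style, with a Gaussian copula inducing correlation between the first two steps. The sketch shows that correlation changes the exact numbers but the product remains spread over several orders of magnitude either way.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000

def draw_steps(rho):
    """Draw six step probabilities per simulation; the first two share
    correlation rho via a Gaussian copula, the rest are independent.
    (Illustrative structure only, not the Carlsmith Model's real inputs.)"""
    z = rng.standard_normal((n, 6))
    z[:, 1] = rho * z[:, 0] + np.sqrt(1 - rho**2) * z[:, 1]
    # Map normals to probabilities centred near 10% and spread over
    # roughly two orders of magnitude, capped at 1.
    return np.clip(10 ** (-1 + 0.5 * z), 0, 1)

for rho in (0.0, 0.6):
    p = draw_steps(rho).prod(axis=1)          # P(catastrophe) per draw
    lo, med, hi = np.percentile(p, [5, 50, 95])
    print(f"rho={rho}: median={med:.2e}, 5th-95th percentile spans "
          f"{np.log10(hi / lo):.1f} orders of magnitude")
```

Under these made-up inputs the correlated case widens the spread a little and moves the percentiles, but the qualitative conclusion about the shape of the distribution is the same.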
Practically, I don’t think there is strong evidence that people’s parameters are correlated across estimates to a degree that will significantly alter results. Below is the correlation matrix for the Full Survey estimates with p<0.05 highlighted in green. Obviously I’m once again leaning on the assumption that a survey of AI Risk estimates is a good proxy for the actual AI Risk, which I think is another weakness of the essay.
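For anyone who wants to reproduce this kind of table, here is a sketch of how a correlation matrix with p-values can be computed. The column names and data are hypothetical placeholders, not the Full Survey's actual variables; the p-value uses a simple permutation test so that only numpy is needed.

```python
import numpy as np

rng = np.random.default_rng(1)
n_resp = 120
data = rng.uniform(0, 1, size=(n_resp, 4))   # stand-in survey answers
names = ["incentive", "misalign", "exposure", "disempower"]  # made up

def pearson_p(x, y, n_perm=2000):
    """Pearson r plus a permutation-test p-value (numpy only)."""
    r = np.corrcoef(x, y)[0, 1]
    perm = np.array([np.corrcoef(rng.permutation(x), y)[0, 1]
                     for _ in range(n_perm)])
    p = np.mean(np.abs(perm) >= abs(r))
    return r, p

for i in range(4):
    for j in range(i + 1, 4):
        r, p = pearson_p(data[:, i], data[:, j])
        flag = " *" if p < 0.05 else ""   # '*' plays the role of the green highlight
        print(f"{names[i]:>10} vs {names[j]:<10} r={r:+.2f} p={p:.3f}{flag}")
```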
This doesn’t spark any major concerns for me—there is more correlation than would be expected by chance, but it seems to be mostly contained within the ‘Alignment turns out to be easy’ step, and as discussed above the mechanism still functions if one or two steps are removed because they are indistinguishable from preceding steps. The fact that there is more positive than negative correlation is some evidence of the ‘general factor of optimism’ you describe (because the ‘optimistic’ view is that we won’t deploy AI until we know it is safe, so we’d expect negative correlation on this factor in the table). Overall I think my assumption of independence is reasonable in the sense that the results are likely to be robust to the sorts of correlations I have empirically observed and seen theoretically accounted for. However, I do agree with you that if there is a critical flaw in the essay it is likely to be found here.
I don’t quite follow the logic by which you conclude that if estimates are correlated then the simple mean is preferred—my exploration of the problem suggests that if estimates are correlated to a degree significant enough to affect my overall conclusion, then you stop being able to use conventional statistics at all and have to do something fancy like microsimulation. Anecdata: in the specific example you give, my intuition is that 0.4% really is a better summary of our knowledge, since otherwise we round off Aida’s position to ‘approximately 1%’, which is several orders of magnitude incorrect. Although, as I say above, in the situation you describe both summary estimates are misleading in different ways and we should look at the distribution—which is the key point I was trying to make in the essay.
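The rounding-off effect can be shown with made-up numbers (three forecasters clustered near 1% and a hypothetical ‘Aida’ far lower; these are not the actual figures from the example under discussion). The arithmetic mean effectively discards the low estimate, while the geometric mean is sensitive to it:

```python
import math

# Hypothetical estimates: three forecasters near 1%, one far lower.
estimates = [0.01, 0.012, 0.008, 0.00001]

arith = sum(estimates) / len(estimates)
geo = math.exp(sum(math.log(p) for p in estimates) / len(estimates))

print(f"arithmetic mean: {arith:.4%}")   # dominated by the ~1% cluster
print(f"geometric mean:  {geo:.4%}")     # pulled toward the low outlier
```

With these inputs the arithmetic mean lands near 0.75%, essentially erasing the dissenting estimate, while the geometric mean lands well below it—which is the intuition behind preferring a summary that respects orders of magnitude (or better, looking at the whole distribution).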
I researched this fairly extensively a few years ago, and it is a true (but maybe misleading, depending on context) claim.
The usual source for this claim is the Sentience Institute, although if you go to a huge amount of effort to check government records by hand you get basically the same number so I’m not worried that the source is somewhat biased. They get the 99% number by using USDA data on the size of farms, and then defining any farm over a certain size as a ‘factory’ farm. This makes sense to me, and is how I’d approach the definitional problem unless I was shown extremely compelling evidence of a farm processing eg 5000 pigs a year using traditional ‘mom and pop’ techniques.
The reason the claim might be misleading is that it is using ‘meat’ as a shorthand for ‘meat animals’ rather than eg ‘carcass weight’. Because the vast majority of farmed animals are chickens, and chickens are overwhelmingly factory farmed when farmed, the result of the Sentience Institute methodology is that it appears the overwhelming percentage of farmed animals are factory farmed. In fact, by carcass weight it is ‘only’ about 90% of meat which is factory farmed.
This could in theory drop a bit lower if you hold that factory farming is not all that morally relevant for the roughly 2⁄3 of a cow’s life spent on pasture, and were therefore very exacting with your definitions (ie maybe for the sake of argument we would say something like “85% of meat-by-weight is farmed in a way that would be extremely distressing for the animal” rather than “99% of meat is factory farmed”). But it is hard to get very much lower than this, because pigs and chickens are almost exclusively raised in cramped factory conditions and also make up a great deal of the meat we eat.
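The gap between the headcount and carcass-weight figures can be sanity checked with a back-of-envelope calculation. The numbers below are rough illustrative approximations of US slaughter counts, typical carcass weights, and factory-farmed shares—not the Sentience Institute’s actual inputs:

```python
# Rough, illustrative figures: (animals per year, carcass kg, factory share).
animals = {
    "chickens": (9.0e9,    2.0, 0.999),
    "pigs":     (1.25e8,  90.0, 0.98),
    "cattle":   (3.3e7,  350.0, 0.70),
}

total_head = sum(c for c, _, _ in animals.values())
ff_head = sum(c * f for c, _, f in animals.values())
total_kg = sum(c * w for c, w, _ in animals.values())
ff_kg = sum(c * w * f for c, w, f in animals.values())

print(f"factory farmed by headcount: {ff_head / total_head:.1%}")
print(f"factory farmed by weight:    {ff_kg / total_kg:.1%}")
```

Because chickens dominate by headcount but weigh little individually, the by-animal share comes out near 99% while the by-weight share falls to roughly 90%—exactly the pattern described above.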