Prioritising animal welfare over global health and development?
Summary
Corporate campaigns for chicken welfare increase wellbeing way more cost-effectively than the best global health and development (GHD) interventions.
In addition, the effects on farmed animals of such interventions can at least influence prioritisation within GHD, and those on wild animals might determine whether they are beneficial or harmful.
I encourage Charity Entrepreneurship (CE), Founders Pledge (FP), GiveWell (GW), Open Philanthropy (OP) and Rethink Priorities (RP) to:
Increase their support of animal welfare interventions relative to those of GHD (at the margin).
Account for effects on animals in the cost-effectiveness analyses of GHD interventions.
Corporate campaigns for chicken welfare increase nearterm wellbeing way more cost-effectively than GiveWellâs top charities
Corporate campaigns for chicken welfare are considered one of the most effective animal welfare interventions. A key supporter of these is The Humane League (THL), which is one of the 3 top charities of Animal Charity Evaluators.
I calculated the cost-effectiveness of corporate campaigns for broiler welfare in human-years per dollar from the product between:
Chicken-years affected per dollar, which I set to 15 as estimated here by Saulius Simcikas. Note Saulius estimates broiler and cage-free campaigns affect 41 chicken-years per dollar, 2.73 (= 41â15) times as much as the broiler campaigns on which I am relying.
Improvement in welfare as a fraction of the median welfare range when broilers go from a conventional to a reformed scenario[1], assuming:
The time broilers experience each level of pain defined here (search for âdefinitionsâ) in a conventional and reformed scenario is given by these data (search for âpain-tracksâ) from the Welfare Footprint Project (WFP).
The welfare range is symmetric around the neutral point[2], and excruciating pain corresponds to the worst possible experience.
Excruciating pain is 1 k times as bad as disabling pain[3].
Disabling pain is 100 times as bad as hurtful pain.
Hurtful pain is 10 times as bad as annoying pain.
The lifespan of broilers is:
For both broilers and a random human, 8 h each day is spent sleeping, i.e. 1â3 (= 8â24) of the time.
For broilers, the welfare from positive experiences per time awake is symmetric of that of hurtful pain[4].
Median welfare range of chickens, which I set to RPâs median estimate of 0.332.
Reciprocal of the intensity of the mean human experience, which I obtained supposing humans:
Sleep 8 h each day, and have a neutral experience during that time.
Being awake is as good as hurtful pain is bad. This means being awake with hurtful pain is neutral, thus accounting for positive experiences.
I computed the cost-effectiveness in the same metric for the lowest cost to save a life among GWâs top charities from the ratio between:
Life expectancy at birth in Africa in 2021, which was 61.7 years according to these data from OWID.
Lowest cost to save a life of 3.5 k$ (from Helen Keller International), as stated by GW here.
The results are in the tables below. The data and calculations are here (see tab âCost-effectivenessâ).
Intensity of the mean experience as a fraction of the median welfare range | ||
---|---|---|
Broiler in a conventional scenario | Broiler in a reformed scenario | Human |
-2.58*10^-5 | -4.76*10^-6 | 3.33*10^-6 |
Broiler in a conventional scenario relative to a human | Broiler in a reformed scenario relative to a human | Broiler in a conventional scenario relative to a reformed scenario |
-7.75 | -1.42 | 5.45 |
Intensity of the mean experience as a fraction of that of the mean human experience | |
---|---|
Broiler in a conventional scenario | Broiler in a reformed scenario |
-2.57 | -0.472 |
Improvement in chicken welfare per time when broilers go from a conventional to a reformed scenario as a fraction of... | |
---|---|
The median welfare range of chickens | The intensity of the mean human experience |
2.11*10^-5 | 2.10 |
Cost-effectiveness (human-years per dollar) | |
---|---|
Corporate campaigns for broiler welfare | 31.5 |
Lowest cost to save a life among GWâs top charities | 0.0176 |
Corporate campaigns for broiler welfare relative to lowest cost to save a life among GWâs top charities | 1.79 k |
According to my results, corporate campaigns for broiler welfare are 1.79 k times as effective as the lowest cost to save a life among GWâs top charities. I am not surprised. Here I got a ratio 6.48 (= 11.6/â1.79) times as high, essentially because I used a moral weight 7.26 (= 2.41/â0.332) times as high as RPâs median welfare range (which I used above). This was not available at the time, but I trust it much more than my previous estimate, so I think the lower ratio of 1.79 k is more accurate.
To get a ratio of 1:
Everything else equal, the median welfare range of chickens (relative to humans) would have to be 1.85*10^-4 (= 0.332/â(1.79*10^3)), which is 92.5 % (= 1.85/â2.00) the one I guessed here for nematodes. I do not see this being possible.
Assuming broiler welfare is worth zero outside hedonism, this would have to be given a weight of 0.0559 % (= 1/â(1.79*10^3)). This is very much against what Bob Fischer says here. âEven if hedonic goods and bads (i.e., pleasures and pains) arenât all of welfare, theyâre a lot of it. So, probably, the choice of a theory of welfare will only have a modest (less than 10x [i.e. at least 10 % weight for hedonism]) impact on the differences we estimate between humansâ and nonhumansâ welfare rangesâ.
So the takeaway to me is that corporate campaigns for chicken welfare increase nearterm wellbeing robustly more cost-effectively than GWâs top charities, which are plausibly among the best GHD interventions.
Effects of global health and development interventions on animals are neglected and unclear
GHD interventions decrease mortality or increase economic growth. These tend to increase the consumption of farmed animals (see meat-eater problem), or impact net forest area, thus changing the number of animals. To illustrate, I show in the next sections the effects on animals of GWâs top charities may at least influence which countries they should target, or even determine whether they are beneficial or harmful. Nonetheless, these considerations have not been researched by GW.
Farmed animals
The table below contains the relative reduction in the cost-effectiveness of saving lives due to increased consumption of poultry caused by saving lives in each of the countries targeted by GWâs top charities analysed here. I have focussed on poultry because I think there is especially good data from WFP on the conditions of chickens. I got the estimates from the product between:
Absolute value of the intensity of the mean experience of broilers in a reformed scenario as a fraction of the median welfare range of chickens relative to the intensity of the mean human experience, which I estimated to be â1.42.
Median welfare range of chickens, which I set to RPâs median estimate of 0.332.
Production of poultry per capita in 2019 in each country as a fraction of the global one to the power of 1.5.
I computed the fraction from these and these data from Our World in Data (OWID).
1.5 instead of 1 such that each doubling of poultry consumption per capita makes the conditions of farmed chickens 1.41 (= 2^0.5) times as bad. This is a very rough approximation, as I expect the lives of farmed chickens to be positive for low poultry consumption per capita, and eventually become negative as it increases, which will arguably happen. From these data from OWID, the population of chickens in Africa increased 2.98 % (= (1.81/â1.20)^(1/â(2014 â 2000)) â 1) per year between 2000 and 2014.
The data and calculations are here (see tab âPoultryâ).
Country | Consumption of poultry per capita in 2020 as a fraction of the global one (%) | Relative reduction in the cost-effectiveness of saving lives due to poultry (%) |
---|---|---|
Mean of the countries below | 13.1 | 2.67 |
Burkina Faso | 12.4 | 2.06 |
Cameroon | 18.9 | 3.89 |
Chad | 2.34 | 0.169 |
Cote dâIvoire | 16.0 | 3.01 |
Democratic Republic of Congo | 0.659 | 0.0253 |
Guinea | 5.51 | 0.612 |
Kenya | 7.83 | 1.03 |
Mali | 16.0 | 3.03 |
Mozambique | 22.4 | 4.99 |
Niger | 4.86 | 0.506 |
Nigeria | 6.72 | 0.82 |
South Sudan | 30.6 | 7.99 |
Togo | 30.3 | 7.86 |
Uganda | 9.27 | 1.33 |
World | 100 | 47.2 |
These results suggest accounting for poultry does not matter much for GHD interventions. Among the countries targeted by GWâs top charities, the relative reduction in the cost-effectiveness of saving lives ranges from 0.0253 % for the Democratic Republic of Congo to 7.99 % for South Sudan.
Nevertheless, I believe the results above underestimate the reduction in cost-effectiveness, because:
I have not accounted for other farmed animals. From my estimates here, the negative utility of farmed chickens is only 30.6 % (= 1.42/â4.64) of that of all farmed animals globally. This suggests accounting for all farmed animals would lead to a reduction in cost-effectiveness for the mean country of 8.72 % (= 0.0267/â0.306), which is not negligible. So accounting for the effects of GHD interventions on farmed animals may lead to targeting different countries.
I have used the current consumption of poultry per capita, but this, as well as that of other farmed animals, will tend to increase with economic growth. I estimated the badness of the experiences of all farmed animals alive is 4.64 times the goodness of the experiences of all humans alive, which suggests saving a random human life results in a nearterm increase in suffering.
On the other hand, greater economic growth may be associated with moral circle expansion, and lead to technological innovations that can increase the welfare of farmed animals, or make alternatives more convenient, cheaper and tastier[5]. An additional major uncertainty is the welfare range of chickens. I have used RPâs median estimate, but the 5th and 95th percentile are 0.602 % (= 0.002/â0.332) and 2.61 (= 0.869/â0.332) times as large. Furthermore, as Julian Jamison noted, assuming disabling pain is 10 (instead of 100) times as bad as hurtful pain leads to broilers in a conventional scenario having positive lives[6].
Overall, I am quite uncertain about the magnitude of the effect on farmed animals, but think it may well lead to at least different prioritisation within GHD interventions. So I believe it should be integrated in cost-effectiveness analyses of GHD interventions. This will involve further research, for instance, on forecasting how prevalent will factory-farming become in low-income countries.
Wild animals
The table below contains the absolute value of the relative variation in the cost-effectiveness of saving lives due to changes in the population of wild terrestrial arthropods caused by increased deforestation. I do not know whether the variation corresponds to an increase or decrease, as I am quite uncertain about whether wild arthropods have good or bad lives (see this preprint from Heather Browning and Walter Weit). I got the estimates from the product between:
Decrease in forest area per capita in 2015, which I computed from these and these data from OWID. As a 1st approximation, I assume net change is forest area is directly proportional to population.
Decrease in density of terrestrial arthropods due to deforestation, which I estimated to be 280 M/âha following this.
Intensity of the mean experience of wild terrestrial arthropods as a fraction of that of humans, which I estimated to be 0.200 % here (see 4th column of table).
The data and calculations are here (see tab âWild terrestrial arthropodsâ).
Country | Decrease in forest area per capita in 2015 (m^2) | Decrease in the number of wild terrestrial arthropods per capita in 2015 | Absolute value of the relative variation in the cost-effectiveness of saving lives due to wild terrestrial arthropods |
---|---|---|---|
Mean of the countries below | 20.5 | 574 k | 1.15 k |
Cameroon | 24.3 | 681 k | 1.36 k |
Mali | 0 | 0 | 0 |
Mozambique | 89.1 | 2.50 M | 4.99 k |
Niger | 6.17 | 173 k | 346 |
Nigeria | 8.88 | 249 k | 497 |
Togo | 3.96 | 111 k | 222 |
Uganda | 11.0 | 308 k | 616 |
World | 6.93 | 194 k | 388 |
The results suggest the increase in human welfare from GWâs top charities saving lives is much smaller than the increase/âdecrease in that of wild terrestrial arthropods, since the absolute values of the relative variation in cost-effectiveness are much higher than 1. Nonetheless, these are quite uncertain because they are (in my model) directly proportional to the welfare range of silkworms. I have used RPâs median estimate, but the 5th and 95th percentile are 0 (= 0â0.002) and 36.5 (= 0.073/â0.002) times as large.
All in all, I can see the impact on wild animals being anything from negligible to all that matters in the nearterm. So, as for farmed animals, I think more research is needed. For example, on forecasting net change in forest area in low-income countries.
Note the impact on wild animals may also be the major driver of the overall nearterm effect of interventions which aim to improve the welfare of farmed animals[7]. For example, corporate campaigns for chicken welfare will tend to make chicken and eggs more expensive, which can lead to an increase in the consumption of beef, and therefore more deforestation, thus decreasing the population of wild terrestrial arthropods. Nevertheless, I think the positive/ânegative impact on wild animals is much larger for interventions which focus on reducing the consumption of farmed animals (like ones around abolitionism), instead of improving their living conditions.
Regarding the impact of human diet on animal welfare (of both farmed and wild animals), Michael St. Jules suggested Matheny 2005, this and these posts from Brian Tomasik, this post from Carl Shulman, and Fischer 2018.
Miscellaneous thoughts on organisations aligned with effective altruism
As far as I can tell, organisations aligned with effective altruism do not consider the effects of GHD interventions on animals. Below is some brief additional discussion, by alphabetical order of organisation.
Charity Entrepreneurship
CE seemingly has strong reasons to account for effects on animals. According to CEâs weighted animal welfare index, the âtotal welfare score (with evidence)â of:
âFF [factory-farmed] broiler chickenâ is â1.75 (= â56/â32) times that of a âhuman in a low middle-income countryâ, which is 1.01 times the value of â1.73 I got for broilers in a reformed scenario in my early cost-effectiveness analysis (see 1st table).
âWild bug[s]â is â1.31 (= â42/â32) times that of a âhuman in a low middle-income countryâ, which is 656 times the value of 0.200 % I used in my early estimation of the effects on animals.
These suggest the impacts of GHD interventions will be similar to what I estimated for farmed animals, and 3 orders of magnitude as large for wild animals.
Founders Pledge
As part of FPâs prioritisation, Stephen Clare and Aidan Goth published 3 years ago this analysis[8] comparing the cost-effectiveness of THL and Against Malaria Foundation (AMF), which is one of GWâs top charities. According to its Guesstimate model, the cost-effectiveness of THL is 852 (= 23â0.027) times that of AMF, which (considering the uncertainty involved) is pretty close to the ratio of 1.79 k I got in my early cost-effectiveness analysis.
Stephen and Aidan highlighted the moral weight of chickens relative to humans as a major uncertainty. However, this has meanwhile been narrowed down thanks to RPâs (great!) moral weight project. Maybe FP has not focussed much on animal welfare[9] due to other considerations, such as not having a fund for it (see FPâs funds).
GiveWell
GW determines the value of consumption and saving lives as a function of age based on surveys of its team, donors and beneficiaries (see here). I think it would make some sense to include questions about the importance of animals in such surveys. Nonetheless, I think it would be much better to combine RPâs median welfare ranges with empirical evidence about how further away from the neutral point (as a fraction of the median range) is the mean experience of animals. Something like what I did, but way more in-depth!
I believe it would be hard for people to come up with good estimates describing the importance of animals in surveys. As Bob Fischer commented here:
The upshot of Jasonâs post on whatâs wrong with the âholisticâ approach to moral weight assignments, my post about theories of welfare, and my post about the appropriate response to animal-friendly results is something like this: you should basically ignore your priors re: animalsâ welfare ranges as theyâre probably (a) not really about welfare ranges, (b) uncalibrated, and (c) objectionably biased.
Welfare ranges are not the sole determinant of the importance of animals, but they are a key input. So trusting our priors regarding them will imply coming up with an inaccurate assessment of how much consideration we should give to animals. Moreover, I suppose GWâs team, donors and beneficiaries would not naturally be open to the possibility of defining moral weights as a function of the country, but that arguably makes sense given consumption of animals and deforestation vary across countries (and so do effects on animals). Alas, the moral weight of saving a life can even be negative under some circumstances (although killing people is still bad!).
Additionally, for the sake of transparency, it would be good if GW described in their website how they think about effects on animals. 8 months ago, I asked GW for feedback on this post related to the meat-eater problem. I was told my message was passed to the research team, but I have not heard back.
Open Philanthropy
From OPâs global health and wellbeing cause prioritisation framework:
When it comes to other outcomes like farm animal welfare or the far future [not so far if you think existential risk in the next 100 years is around 1â6], we practice worldview diversification instead of trying to have a single unified framework for cost-effectiveness analysis.
I think diversification makes sense in general, but the details matter. There is a (somewhat remote) sense in which a fossil fuel company is practising worldview diversification if it is decreasing its own emissions while increasing extraction of fossil fuels such that it overall contributes to global warming. However, if the goal really is mitigating global warming, it makes sense to focus on the overall contribution of the company to it.
Saving lives increases the nearterm welfare of humans, but it decreases that of farmed animals, and has unclear effects on wild animals. I think effects on farmed animals are sufficiently clear to be integrated into cost-effectiveness analyses, and that we should invest more resources into understanding those on wild animals (relative to global health and wellbeing interventions, at the margin).
Related to learning more, I am glad OP has supported RPâs moral weight project. At the same time, I wonder whether it should have happened before it directed hundreds of millions of dollars towards GHD interventions. Not only because of their effects on animals, but owing to animal welfare interventions increasing wellbeing way more cost-effectively, as I showed in my early cost-effectiveness analysis. This is in agreement with OPâs post on worldview diversification[10]:
If you value chicken life-years equally to human life-years, this implies that corporate campaigns do about 10,000x as much good per dollar as top charities. If you believe that chickens do not suffer in a morally relevant way, this implies that corporate campaigns do no good.[3]
One could, of course, value chickens while valuing humans more. If one values humans 10-100x as much, this still implies that corporate campaigns are a far better use of funds (100-1,000x). If one values humans astronomically more, this still implies that top charities are a far better use of funds. It seems unlikely that the ratio would be in the precise, narrow range needed for these two uses of funds to have similar cost-effectiveness.
The value of chickens depends on how much weight one gives to hedonism, about which Alexander Berger (OPâs co-CEO) writes[11]:
We think that most plausible arguments for hedonism end up being arguments for the dominance of farm animal welfare. We seem to put a lot of weight on those arguments relative to you, and farm animal welfare is OP GHWâs biggest area of giving after GiveWell recommendations. If we updated toward more weight on hedonism, we think the correct implication would be even more work on FAW, rather than work on human mental health.
In the same comment, Alexander mentions:
We [OP] think it is a mistake to collapse worldviews in the sense that we use them to popular debates in philosophy, and we definitely donât aim to be exhaustive across worldviews that have many philosophical adherents. We see proliferation of worldviews as costly for the standard intellectual reason that they inhibit optimization, as well as carrying substantial practical costs, so we think the bar for putting money behind an additional worldview is significantly higher than you seem to think. But we havenât done a good job articulating and exploring what we do mean and how that interacts with the case for worldview diversification (which itself remains undertheorized). We appreciate the push on this and are planning to do more thinking and writing on it in the future.
If OPâs worldviews are not supposed to correspond to popular debates in philosophy, and having more is costly, should the ones of nearterm animal and human welfare be unified? I agree worldview diversification âremains undertheorizedâ.
I asked Alexander and Lewis Bollard at the end of January whether they thought this analysis about the effects of terrestrial arthropods on the cost-effectiveness of GiveWellâs top charities was any relevant, but I have not heard back.
Rethink Priorities
RPâs Worldview Investigations Team seems perfectly positioned to study how to account for the effects on animals of GHD interventions, and figure out what the greater cost-effectiveness of corporate campaigns to increase wellbeing implies.
I asked here whether RPâs GHD team was considering addressing effects on animals in their work, but I have not heard back (and was downvoted). I had also contacted RP about the post on terrestrial arthropods at the end of January, and was told my message was forwarded to the GHD team, but I have not heard back either.
Complex cluelessness should not be ignored
I do not think it is fair to ignore the effects on animals because they look like a crucial consideration. We are in a case of complex cluelessness, not one of simple cluelessness where very uncertain effects can be ignored based on evidential symmetry. Me looking now to the right might ultimately create a storm somewhere, but just as well prevent it, so we can ignore these considerations. In contrast, increasing population size will robustly lead to greater consumption of food, which has certain impacts on farmed and wild animals.
I agree that, mathematically, E(âoverall effectâ) > 0 if:
âOverall effectâ = ânearterm effect on humansâ + ânearterm effect on animalsâ + âlongterm effectâ.
E(ânearterm effect on humansâ) > 0.
E(ânearterm effect on animalsâ) = k_1 E(ânearterm effect on humansâ).
E(âlongterm effectâ) = k_2 E(ânearterm effect on humansâ).
k_1 + k_2 = 0.
That being said, setting k_1 + k_2 to 0 seems unfair under complex cluelessness. One could just as well say k_1 + k_2 = â1, in which case E(âoverall effectâ) = 0. Since I am not confident |k_1 + k_2| << 1, I am not confident either about the sign of E(âoverall effectâ), nor about whether GWâs top charities are beneficial or harmful.
Let me try to illustrate how I think about this with an example (originally commented here). Imagine the following:
Nearterm effects on humans are equal to 1 in expectation.
This estimate is very resilient, i.e. it will not change much in response to new evidence.
Other effects (on animals and in the longterm) are â1 k with 50 % likelihood, and 1 k with 50 % likelihood, so they are equal to 0 in expectation.
These estimates are not resilient, and, in response to new evidence, there is a 50 % chance the other effects will be negative in expectation, and 50 % chance they will be positive in expectation.
However, it is very unlikely that the other effects will in expectation be between â1 and 1, i.e. they will most likely dominate the expected nearterm effects.
What do you think is a better description of our situation?
The expected overall effect is 1 (= 1 + 0) in expectation. This is positive, so the intervention is robustly good.
The overall effect is â999 (= 1 â 1 k) with 50 % likelihood, and 1,001 (= 1 + 1 k) with 50 % likelihood. This means the expected value is positive. However, given the lack of resilience of the other effects, we have little idea whether it will continue to be positive, or turn out negative in response to new evidence. So we should not act as if the intervention is robustly good. Instead, it would be good to investigate the other effects further, especially because we have not even tried any hard to do that in the past.
Am I uncertain about the value of killing people too?
No, killing people is bad! Not saving lives has drastically different consequences from killing people, which is much more anti-cooperative. For what it is worth, I think I am much more against killing than the median citizen. For example, I suspect most people would be in favour of militarily supporting Ukraine even if it was known that it increased the number of people killed in the Russo-Ukrainian War, whereas I would tend to prefer whatever prevented the most war deaths.
However, for the same reasons I am not confident about whether saving lives is good or bad, I do not know whether a random person dying (without being killed) is beneficial or harmful.
I do not know whether saving lives is good longterm
One can argue saving lifes is robustly good longterm (k_2 >> k_1) based on the capability approach to human welfare, despite nearterm effects on humans plus animals being unclear. I am sympathetic to this argument, but think it is too general. There are obvious benefits of being able to live a long and healthy life, but I also worry about humans having the capability of factory-farming animals whose lives are pretty bad. Note the title of the post is âthe capability approach to human welfareâ (emphasis mine). Interestingly, I have recently listened to Martha Nussbaum on the Clearer Thinking podcast, and it looks like her book Justice for Animals: Our Collective Responsibility attempts to extend the capability approach to non-human animals.
In the same way it is better to focus on differential progress over economic growth, I would rather increase good capabilities over all capabilities, and it is unclear to me what is the net effect of increasing population at the margin. There are many indirect longterm effects. The answer may vary too, depending on factors like year, country and age.
I believe saving lives would more easily be good if there were much fewer humans, because in that case it would decrease the risk from extinction, which is good given my presumption that the expected value of the future is positive. I am open to the possibility that saving lives is a good proxy for longterm value for the current population too, but this is not obvious to me. I think it warrants empirical investigation, for example, into impacts on democracy levels. This in particular seems to be a neglected topic. From Kono 2009 (emphasis mine):
Although many people have argued that foreign aid props up dictators [and so might GHD interventions?], few have claimed that it props up democrats, and no one has systematically examined whether either assertion is empirically true. We argue, and find, that aid has both effects. Over the long run [what matters most?], sustained aid flows promote autocratic survival because autocrats can stockpile this aid for use in times of crisis. Each disbursement of aid, however, has a larger impact on democratic survival because democrats have fewer alternative resources to fall back on.
In addition, I tend to think it would be a surprising and suspicious convergence if saving lives as cost-effectively as possible was the best way to improve the longterm future. I would expect metrics more closely related to existential risk to be better. For example:
For climate change, greenhouse gas emissions (in the worst worlds[12]).
For nuclear war, number of nuclear warheads.
For catastrophic pandemics, cost of sequencing a full human genome.
For artificial intelligence (AI), global corporate investment in AI.
Additionally, it is worth keeping in mind longtermist interventions can save lives quite cost-effectively too. For example:
The cost-effectiveness of 3.95 bp/âG$ I estimated here for longtermism and catastrophic risk prevention (for method 3 with truncation) naively corresponds to saving a life for 316 $ (= 1/â(3.95*10^-4*8)), which is 11.1 (= 3500â316) times as cost-effective as the lowest cost among GWâs top charities (from Helen Keller International).
Joel Tan estimated lobbying for arsenal limitation is 5 k times as cost-effective as GWâs top charities. âThe headline cost-effectiveness will almost certainly fall if this cause area is subjected to deeper researchâ. âThat said, results are robust, insofar as the low-confidence tractability estimates can drop by three whole magnitudes and still leave the intervention to be comfortably more cost-effective than GiveWell[âs top charities]â.
Note these interventions would look even more cost-effective after accounting for their effect on the far future.
What would I like to see?
Thinking at the margin, I would say scope-sensitive ethics imply prioritising animal welfare over global health and development. I think the scale of the welfare of farmed animals and wild terrestrial arthropods is 4.64 and 253 k times as large as that of humans, so accounting for them seems crucial a priori.
So I encourage organisations, especially the ones I discussed above aligned with effective altruism, to:
Increase their support of animal welfare interventions relative to those of GHD (at the margin).
Account for effects on animals in the cost-effectiveness analyses of GHD interventions.
Acknowledgements
Thanks to Jeff Kaufman, Michael St. Jules, and Sanjay Joshi for feedback on the draft.
- ^
- ^
This assumption influences the improvement in welfare as a fraction of the median welfare range, but not the cost-effectiveness of corporate campaigns for broiler welfare in human-years per dollar. For example, if welfare could range from something as good as disabling pain is bad to excruciating pain, the welfare range would become 50.05 % (= (1 + 1 k)/â(2 k)) as large. Consequently, the improvement in welfare as a fraction of the median welfare range would become 1.998 (= 1â0.5005) times as large, but so would the intensity of the mean human experience. As a result, the cost-effectiveness in human-years per dollar would remain the same, since it is directly proportional to the improvement in welfare as a fraction of the median welfare range, and to the reciprocal of the intensity of the mean human experience.
- ^
- ^
This assumption affects the (signed) intensity of the mean experience of broilers, but not the improvement in their welfare when they go from a conventional to a reformed scenario, because the lifespan of broilers and value of them being alive is the same in both scenarios. As a consequence, the assumption does not impact the cost-effectiveness of corporate campaigns for broiler welfare.
- ^
Thanks to Sanjay Joshi for noting this point.
- ^
The intensity of the mean experience as a fraction of the median welfare range would be 8.24 %, instead of â777 %.
- ^
Thanks to Michael St. Jules for noting this point. I had thought about it, but had not written it down, possibly due to motivated reasoning.
- ^
If I recall correctly, the one which got me thinking about comparisons between animal welfare and GHD interventions!
- ^
Their only report on animal welfare was published in November 2020.
- ^
Thanks to Michael for noting these points.
- ^
Thanks to Michael for letting me know about Alexanderâs comment.
- ^
See section âClimate damage is increasing non-linearlyâ in this report from FP.
- Open Phil Should AlloÂcate Most NeartÂerÂmist FundÂing to AnÂiÂmal Welfare by 19 Nov 2023 17:00 UTC; 502 points) (
- 20 Nov 2023 1:32 UTC; 186 points) 's comment on Open Phil Should AlloÂcate Most NeartÂerÂmist FundÂing to AnÂiÂmal Welfare by (
- Do you think deÂcreasÂing the conÂsumpÂtion of anÂiÂmals is good/âbad? Think again? by 27 May 2023 8:22 UTC; 89 points) (
- EvÂiÂdence of effecÂtiveÂness and transÂparency of a few effecÂtive givÂing organisations by 1 Jul 2023 8:10 UTC; 60 points) (
- Famine deaths due to the cliÂmatic effects of nuÂclear war by 14 Oct 2023 12:05 UTC; 40 points) (
- My quick thoughts on donatÂing to EA Fundsâ Global Health and DevelÂopÂment Fund and what it should do by 15 Dec 2023 9:08 UTC; 33 points) (
- Cost-effecÂtiveÂness of School Plates by 25 May 2024 9:01 UTC; 33 points) (
- 20 Nov 2023 14:57 UTC; 28 points) 's comment on Open Phil Should AlloÂcate Most NeartÂerÂmist FundÂing to AnÂiÂmal Welfare by (
- BadÂness of eatÂing farmed anÂiÂmals in terms of smokÂing cigarettes by 22 Jul 2023 8:45 UTC; 26 points) (
- 22 Nov 2023 15:55 UTC; 20 points) 's comment on Open Phil Should AlloÂcate Most NeartÂerÂmist FundÂing to AnÂiÂmal Welfare by (
- 13 Jul 2023 14:26 UTC; 18 points) 's comment on ElecÂtric Shrimp StunÂning: a PoÂtenÂtial High-ImÂpact DonaÂtion Opportunity by (
- 28 Sep 2023 15:08 UTC; 13 points) 's comment on WeighÂing AnÂiÂmal Worth by (
- Helping anÂiÂmals or savÂing huÂman lives in high inÂcome counÂtries is arÂguably betÂter than savÂing huÂman lives in low inÂcome counÂtries? by 21 Mar 2024 9:05 UTC; 12 points) (
- Marginal value (or lack thereof) of voting by 11 Mar 2024 9:01 UTC; 7 points) (
- 22 Nov 2023 15:09 UTC; 6 points) 's comment on Open Phil Should AlloÂcate Most NeartÂerÂmist FundÂing to AnÂiÂmal Welfare by (
- 31 Oct 2023 11:24 UTC; 3 points) 's comment on InÂterÂmeÂdiÂate ReÂport on Abrupt SunÂlight ReÂducÂtion Scenarios by (
- 12 Feb 2024 7:43 UTC; 2 points) 's comment on EA FundsâA simÂple analÂyÂsis of grants by (
- 4 Jun 2024 15:59 UTC; 2 points) 's comment on Cost-effecÂtiveÂness of School Plates by (
- 6 May 2024 22:30 UTC; NIL points) 's comment on Founders Pledgeâs CliÂmate Change Fund might be more cost-effecÂtive than GiveWellâs top charÂiÂties, but it is much less cost-effecÂtive than corÂpoÂrate camÂpaigns for chicken welfare? by (
- 19 Nov 2023 8:29 UTC; 0 points) 's comment on EcoÂnomics of AnÂiÂmal Welfare: Call for Abstracts by (
Hey Vasco,
Love the post; I think it is super valuable to have these sorts of important conversations, directly thinking about cross-cause comparison. Itâs worth noting that CE does consider cross-cause effects in all the interventions we consider/ârecommend, including possible animal effects and WAS effects. Despite this, CE does not come to the same conclusion as this post; here are a couple of notes on why:
Strength of evidence discounting: CEAs are not all equal when they are based on very different strengths of evidence, and I think we weight this factor a lot heavier. Itâs quite common for the impact of any given intervention to regress fairly heavily as more research/âwork is put into it. We have found this in CEâs, GWâs and other EAsâ research. This can be seen in even more depth in the GiveWell and EA forum writings on deworming and how to deal with speculative effects that possibly have very high upsides. For example, I would expect a five-hour CEA to be constantly off (almost always in a positive direction) compared to a 50-hour CEA. A calculation made at two different levels of rigor should not be directly compared. (This does not mean shorter-form CEAs are not worth doing, but I think we have to take their cons and likely regressions a lot more seriously than this post currently does.) This discounting should be even more heavily applied to flow-through effects, as the evidence for them is way lighter than the direct effects. We tend to use something akin to the weighted quantitative modeling used here.
Marginal funding and reliability in effects: Hereâs a good example of how a CEA can regress really quickly; GiveWell typically does CEAs on marginal donations made, whereas many other CEAsâincluding the one you use from Sauliusâdo not consider marginal funding. I currently think that the marginal dollar to corporate campaigns is way less impactful when compared to the average dollar of spending pre-2018. This can affect a CEA quite drastically. Another example is the funding of numerous animal interventions through corporate campaigns, which have become the âhitâ of the animal movement. However, these campaigns often are seen as cost-effectiveness without clear before hand knowledge of the impact an additional dollar of funding would have accomplished. It is a bit like measuring CEâs cost effectiveness by looking at the top charity we incubated and assuming future charities will be equal to that. Variance is a real pain, and itâs not even clear if other corporate campaigns will be equally cost-effective to cage-free. On the other hand, top GW charities have this built in; they are not estimating the average EV of AMFâs top three historical campaigns, they are estimating the impact of marginal average future funding.
Variable animal effects dependent on intervention: You touch on this, but I think there is an important point missed. The effects on animals vary quite a lot, depending on the intervention. Interventions that primarily affect mortality in Africa, for instance, end up looking like how you describe. But morbidity-focused interventions, mental health focused interventions, and family planning interventions are all significantly less affected by this consideration. Same goes for any intervention that operates in contexts where there is lower meat consumption (such as in India). I think if you remodeled this for an organization like Fortify Health (Iron fortification in India), it would result in rather different outcomes.
If you combine these factors and look at a marginal dollar to FH vs a marginal dollar to THL (both of them with similarly rigorous CEAs and flow-through effects that are discounted based on certainty), I think the outcomes would be different enough to change your endline conclusion.
The non-epistemic difference I have is to do with ecosystem limitations, and is more specific to CE itself vs. general EA organizations. When we launch a charity, we need 1) founders 2) ideas, and 3) funding. Each of these are fairly cause area limited (and I think limiting factors are often more important than total scale). For example, if we aimed to found 10 animal charities a year (vs 10 charities across all the cause areas we currently focus on) I do not think the weakest two would be anywhere near as impactful as the top two, and only a small minority of them would get long-term funding. In fact, with animal charities making up around a third of those we have launched, I think we already run close to those limitations. This means that even if we thought that animal charities were more impactful than human ones on average, the difference would have to be pretty large for us to think that adding a 9th or 10th animal charity into the animal ecosystem would be more impactful than adding the first or second human-focused charity. I expect a version of this consideration can apply to other actors too. In general, I believe that given the current ecosystem, more than ~three-five charities founded per year within a given area would start to result in cannibalization between charities.
Thanks again for the consideration of this; I do think people should do a lot more cross-cause thinking, and I expect there are some really neglected areas that have significant intercausal impact.
Hi Joey,
Thank you so much for taking the time explain your reasons in great detail! I broadly agree with all the points you make.
Could you elaborate on how CE does this? Among the 9 CEâs health reports of 2023, I only found 3 instances of the word âanimalâ. Here (emphasis mine):
Here (emphasis mine):
Here (emphasis mine):
Only the 1st of these refers to animal welfare, and has very little detail.
Saulius commented that (emphasis mine):
So cost-effectiveness used to be higher, but Sauliusâ updated estimate of 65 years of chicken life per dollar is 4.33 (= 65â15) times as high as the one I used in my BOTEC. If the 2019-2020 average cost-effectiveness is also about 4.33 times as high as the current marginal cost-effectiveness, my BOTEC will not be too off. I did not easily find estimates for the marginal cost-effectiveness. Kieran Greig (from RP) surveyed groups working on corporate campaigns globally, and told me roughly 1 year ago that:
Are there any quantitative analyses of the marginal cost-effectiveness?
Great point! It crossed my mind, but I ended up not including it.
I agree this tends to be the case, but I am not sure how much. For example, I have the impression RPâs median welfare ranges are higher than what most people expected a priori. In general, it seems hard to know how much to adjust estimates, and I guess it would be better to invest more resources (at the margin) into decreasing our incertainty.
Further details are confidential:
- âI apologize that I canât share too much specifically as I promised organizations that those results would be confidentialâ.
Hi Vasco -
Itâs great that youâre so passionate about this, but I find it extremely surprising that youâre willing to draw such strong conclusions based on such weak evidence and ad hoc assumptions. For instance if I change your assumption that debilitating pain is 100x as bad as hurtful pain, and instead assume that it is only 10x as bad (and donât change anything else), your calculations imply that even under the conventional scenario broiler chickens have net positive lives (and hence presumably that we should be eating as many of them as possible and donating to advocacy groups that promote chicken consumption, at least given total utilitarianism).
Are you so certain that it is 100x as bad, even within an order of magnitude? If so why? I did read both of the links in your fn 2 but found them unconvincing for your claims, e.g. the first one only discusses the logarithmic nature of the scale but nothing about specific magnitudes. From the second one we learn that multiple voluntary human activities (tattoos, ânaturalâ childbirth, perhaps eating hot peppers) fall into the disabling category⊠which suggests maybe itâs not necessarily so horrific after all.
Thatâs just one assumption amongst many. And yet your âtakeaway is that corporate campaigns for chicken welfare increase nearterm wellbeing robustly [emphasis mine] more cost-effectively than GWâs top charitiesâ.
Julian
I donât think this assumption Vasco made is reasonable, and it substantially overestimates the pleasure conventional broilers are likely to experience:
There are a few issues with this:
While in disabling pain, they shouldnât be experiencing any pleasure, by WFPâs definition, so you should subtract the time spent in disabling pain.
While suffering from behavioural deprivation, they probably shouldnât experience any significant pleasure, either, so you should subtract that time, too. Theyâre suffering from behavioural deprivation because theyâre prevented from being active. They probably donât find just sitting around very pleasurable, although it could be mildly pleasurable (annoying pain intensity). (WFPâs estimates assume they donât suffer from behavioural deprivation while eating.)
It would be surprising if all of their leftover time was pleasurable of intensity similar to hurtful pain, rather than just annoying pain. Per day on average, they spend around 3.2 hours eating, 0.25 to 3 hours foraging/âexploring (in the 3rd week of life and after, but more in the first two weeks) and at most 0.25 hours dustbathing, based on WFPâs estimates. Some of this could be pleasurable of intensity similar to hurtful pain and generously all of it could be. They donât generally have enough space to play, and even if they did, theyâd probably do it during their foraging hours already accounted for. The rest of the time is presumably basically inactive, e.g. resting, which could be pleasurable, but it seems unlikely to have intensity (much) greater than annoying pain, so should probably roughly match annoying pain. This inactive time could also easily be unpleasant instead, too, because of the high stocking densities (from social stress, heat, feces, air quality).
Iâd expect that Vasco overestimated the amount of pleasure they experience at least 2 times with this assumption. We get at least around 2x too much just from point 3, assuming the inactive hours arenât very pleasurable.
Note that some pains WFP estimated overlap in time, so you donât want to double or triple subtract times spent in pain, and this makes actually calculating the time left for pleasure trickier. Even WFPâs pain estimates attributed to lameness is made up of 3 types of pains that may overlap in time for the most severe cases of lameness: the direct pain of the condition in the legs, hunger from not eating enough because itâs painful to get food, and thirst for the same reason. Also, pains are probably subadditive, but WFP treated them as additive, so may have overestimated pain this way.
The math is easier for egg-laying hens in conventional cages because the only particularly pleasurable activities they might engage in are eating, around 2-4 hours/âday. They donât get to dustbathe, forage, explore, walk around or even stretch their wings in conventional cages. Even in furnished cages, they quickly run out of litter to forage.
Thanks for the input Michaelâyour estimates seem reasonable /â defensible to me. On the other hand, it also seems reasonable /â defensible to argue that time spent just sitting around is fairly highly pleasurable for chickens (relative to their maximum): many humans prefer doing nothing to active foraging (NB Iâm being serious), and chickens (like all prey) are evolved to be wary of predators and at risk of dying at any moment. My sense is that the default welfare state for all living beings is nontrivially positive (we see this in human survey data, and it makes sense evolutionarily), so a chicken that is both alive and not at risk of being eaten or starving might be in very good shape in chicken terms. I simply donât know, which leads to...
However the broader point, which all three of us seem to agree on, is that all of these estimates are wildly uncertain and should be taken with many large grains of salt and (imo) not used to draw any firm conclusions about what should happen (except that we can agree less pain is better than more pain). Reasonable people can and do disagree about what itâs like to be a chicken in captivity.
I appreciate you pointing out these possibilities. You might indeed be right, and I think itâs a position new evidence could end up supporting. However, I donât think you or really anyone would be warranted in believing the average broiler welfare overall to be positive in expectation if they were well-informed about their conditions and the current state of evidence. Maybe we should just withhold judgement. However, using Welfare Footprint Projectâs analysis, and being, like them, careful in the attribution of welfare states and more careful the more intense, there would be more expected pain than expected pleasure.
I do think itâs plausible the default (e.g. most common) welfare state for wild red jungle fowls, i.e. the chickenâs wild progenitor and counterpart, is positive, or at least that positive is more common than negative. I might even lean somewhat towards that, but it depends on how common the threat of predators is and how long-lasting the negative effects of predator exposure are. But this and comparisons to humans (which ones?) are quite weak priors from which to conclude broilers frequently experience pleasure of intensity similar to hurtful pain just from sitting/âresting, and there are multiple reasons to be skeptical or even expect negative welfare instead. Conventionally farmed chickens are in very unnatural, monotonous and limiting environments, often have painful and limiting health conditions and face multiple chronic stressors their wild counterparts donât face. Their environments are especially not conducive to high baseline moods or much good to attend to when theyâre not active, and they also contain substantial bad.[1]
On comparisons to wild animals, as foragers, I donât imagine red jungle fowls would often be at risk of starvation in the wild, and in fact a decent share of broilers (or hours of broiler life) suffer from hunger and thirst, according to WFP: broiler breeders in particular are chronically hungry and food-deprived, and other severely lame broilers also seem to suffer significantly from hunger. I agree that the absence of predators should make a difference (although Iâm very uncertain about how much). A condition-informed survey of expert opinion found the welfare of conventional broilers below the cutoff for âacceptable welfareâ (although this doesnât imply net negative in particular) and far below the welfare for nature, which was the second highest rated after only backyard flocks. Ratings of nature had relatively high variance, with 3 of the 27 experts even putting it below the acceptability cutoff.
Also, if sitting around is pleasurable at hurtful intensity, and disabling pain is only 10x as intense as hurtful pain, things like fresh large bone breaks (e.g. leg in humans, keel in chickens), the pain of birth without painkillers or anaesthesia, panic attacks, the part of a tattoo experience where it felt âLike someone slicing into my leg with a hot, sharp live wireâ (assuming they are disabling and not excruciating) would only be about 10x as bad as just sitting around is good per minute. I personally find that counterintuitive. Maybe you donât, but itâs worth pointing out what the conjunction of views youâre defending implies.
Maybe just sitting is comfortable, but it could be uncomfortable due to poor litter quality (e.g. ammonia buildup) and contact burns/âdermatitis, leg pain or heat (although I think the most intense of these are largely already accounted for by WFP). Maybe watching other chickens is interesting, but it could be stressful, given high stocking densities and social dominance. Maybe their inactive non-highly pained moods are based on some kind of mean of their active and pained welfares, like if you have fun often and donât suffer often, youâll still be in a good mood when youâre not having fun, and if youâre in pain often, but donât have much fun, youâll be in a bad mood even when youâre not in much pain.
Thanks for commenting, Michael!
I agree there are a few issues with this. However (I have added what follows in a footnote):
If the WFP is capturing most of the painful experiences (weighted by intensity), and pleasurable experiences are negligible, then my assumption will not influence the cost-effectiveness of corporate campaigns. It can potentially change whether chickens have good or bad lives, and therefore impact whether consuming less animals is good/âbad, but I think this is pretty unclear anyway for other reasons (e.g. effects on wild animals).
If I assume all the time not classified by the WFP is neutral, I get the lives of broilers in a conventional and reformed scenario are, per unit time, 3.08 and 1.07 times as bad as human lives are good. So the lives of broilers in a conventional and reformed scenario would become worse by a factor of 1.19 (= 3.08/â2.58) and 1.86 (= 1.07/â0.574).
Great points, Julian!
I assumed disabling (not debilitating) pain is 100 times as bad as hurtful pain, but my 90 % confidence interval would be something like 10 to 1 k. As a result, I would not be too surprised if broilers in conventional scenarions had positive lives.
Because of this, I am decently open to the possibility that we should be eating more/âless factory-farmed animals (including chickens).
Supposing hurtful, disabling and excruciating pain are each as bad as annoying pain (instead of 10, 1 k and 1 M times as bad, as I guessed), the cost-effectiveness of corporate campaigns for broiler welfare would still be 40.5 times that of the lowest cost to save a human life. In other words, for corporate campaigns for broiler welfare to be as effective as GWâs top charities, one would have to assume, for example:
Hurtful pain 1 order of magnitude (OOM) less bad.
Disabling pain 3 OOMs less bad.
Excruciating pain 6 OOMs less bad.
A median welfare range of chickens 2.5 % (= 1â40.5) as high as RPâs best guess.
Combinations like this seem sufficiently unlikely for one to say âcorporate campaigns for chicken welfare increase nearterm [emphasis mine] wellbeing robustly more cost-effectively than GWâs top charitiesâ. If we include indirect longterm effects, I still guess corporate campaigns to be more effective, but not robustly so.
Thanks for the quick and constructive reply!
(and yes apologies for the typo: I meant âdisablingâ not âdebilitatingâ)
I admit Iâm still unconvinced by several of the assumptions and still believe that they require a bit more discussion /â support /â defense; e.g. in addition to the ones above, the claim that welfare is symmetric around the neutral point or (as discussed elsewhere in the comments) that their welfare range is 0.33 that of humans. Iâm also sympathetic to the comment that was somewhat skeptical regarding the expected marginal impact of best-guess future advocacy.
However I agree you may well be right that for a broad range of values, improving animal welfare (even if already positive, which I was too focused on) is more cost-effective than GW top charities, and this is an important point. I personally would find it even more informative and convincing to see some illustrative sets of parameter values along the lines of your âto get a ratio of 1â exercise (which I thought was a nice touch). What is the most plausible combination of values leading to a ratio of 1? However I realize itâs not fair to ask you to do thatâyouâve already put in a lot of useful work here.
As you acknowledge, an extremely broad set of parameters will lead to the conclusion that we should be eating more chickens rather than fewer. Of course the Humane League doesnât see it that way, and in general I would find animal-welfare advocates much more compelling if they didnât seem to always also push for veganism; imho it makes them sound ideological rather than evidence-driven. [You wonât be surprised to hear that Iâm not veganâhowever I would happily vote for more humane animal farming regulations (if there were a sufficiently high probability of being pivotal...), and Iâm open to being convinced to donate to charities that focus solely on improving conditions.]
Likewise you say you yourself are open to the outcome that we should be eating more factory-farmed animals rather than fewer, which I appreciate. [although I note that in your post you refer without caveats to the ânegative utility of farmed chickensâ] Given that as weâve seen many plausible assumptions in your model would lead to such a conclusion, would you suggest that your framework implies that anyone believing something like those values (as I do) should in fact eat chickens and actively encourage all their friends to do so? I ask this not simply to play devilâs advocate (esp since I sincerely believe in that position myself: everyone please eat more chickens!) but to continue to stress-test a bit on how seriously the model is meant to be taken with respect to any concrete conclusions.
What is the case for eating more chickens?
If farmed chickens plausibly have overall net positive lives (per the discussion above), and if youâre something like a total utilitarian, then you should want more of them to exist; hence eat more in order to at least weakly increase demand /â production.
Alternately, if you think itâs very difficult to know for sure whether chickens have net positive lives or not, and you enjoy the taste of chicken, then thatâs another case for eating more of them.
Thanks for the question, mlovic, and welcome to the EA forum! Thanks for clarifying, Julian.
This might be true, but not necessarily so. I strongly endorse expected total hedonistic utilitarianism (classical utilitarianism), but would not be confident about increasing the consumption of factory-farmed animals even if they had good lives (although my best guess is that chickens do not):
Conditional on factory-farmed animals having good lives, one should arguably guess wild animals also have good lives. Consequently, since the scale of the welfare of wild animals is much larger than that of factory-farmed animals, one should do what increases the welfare of wild animals. So, because animal foods require much more land than plant-based foods, one would still want to decrease the consumption of factory-farmed animals.
Even if one thinks wild animals have bad lives, or is mostly agnostic about it, the increase in the welfare of factory-farmed animals may be outweighted by other negative effects. For example, I think enslaving people with good lives would be bad today[1] under classical utilitarianism, as it would erode impartiality by implicitly attributing a lower welfare range than justified to the enslaved. Likewise for factory-farmed animals, although less so.
If one is not confident about factory-farmed and wild animals having good/âbad lives (I am not), and thinks this is a crucial consideration due to not giving major weight to the 2nd bullet above, one should focus on learning more about that question (e.g. Welfare Footprint Projectâs research), or improving their lives (e.g. corporate campaigns for chicken welfare).
And also in the past, but less bad, holding the quality of life of the enslaved constant.
Thanks for the constructive reply too!
I assumed the welfare range is symmetric around the neutral point, but this does not impact the cost-effectiveness of corporate campaigns in human-years per dollar. To illustrate, if I had supposed the welfare range goes from excruciating pain to something as good as hurtful pain is bad, the welfare range would become about 0.5 times as wide (in reality, a little over 0.5 times as wide). Consequently:
The improvement in chicken welfare (when broilers go from a conventional to a reformed scenario) as a fraction of the median welfare range of chickens would become 2 (= 1â0.5) times as large.
The intensity of the mean human experience as a fraction of the median welfare range of humans would become 2 times as large.
The cost-effectiveness of corporate campaigns in human-years per dollar is directly proportional to the ratio between the above, so it would not change.
I agree:
I still think this is not a major issue, but I can see there is some margin for reasonable disagreement. I might do a Monte Carlo simulation modelling everything as distributions one of these days.
FWIW, in a previous analysis, I estimated corporate campaigns can be anything between 4.36 % to 34.1 k times as effective as GWâs top charities (5th to 95th percentiles). The reason for my 95th percentile being roughly 1 M times my 5th percentile is me having used a moral weight distribution with 95th percentile about 1 M times as large as the 5th percentile. In contrast, RPâs 95th percentile is only 434 (= 0.869/â0.002) times RPâs 5th percentile, and my uncertainty about the intensity of the various types of pains is similar, which means the 5th and 95th percentile of the cost-effectiveness ratio can be guesstimated from 0.602 % and 2.61 times the median ratio. So, if I were to do a Monte Carlo simulation, I guess I would conclude corporate campaigns are something between 10 to 4 k times as effective as GWâs top charities (around 2.5 OOMs of uncertainty, as the median welfare range). Maybe this goes down to 1 to 400 times if one is quite pessimistic about marginal cost-effectiveness.
I sympathise with animal-welfare advocates pushing for veganism because the most common objections to it are pretty weak (e.g. animals do not feel pain, animals are less intelligent/âpowerful, and the lives of factory-farmed animals are roughly as good as those of free range animals).
To be honest, I had not realised it was so easy to get positive lives for chickens (I seem to remember that I played with the numbers in the Sheet, but I think I was focussing on the cost-effectiveness ratio). I have added to the post the following:
I think 100 is significantly more reasonable than 10, but thanks for noting this!
Since I am quite uncertain about whether consuming more animals is good/âbad[1], I would probably focus on:
Informing people about what is involved (most people overestimate the welfare of factory-farmed animals, and might not want to continue eating them if they have low welfare, even if it is positive, and maybe it is good to have norms against creating beings with low positive welfare, even if the total view is right).
Pushing people towards eating factory-farmed animals with better lives (e.g. broilers in reformed scenarios instead of conventional ones), as opposed to promoting veganism.
Actually, I have a draft about this. If you like, comments are welcome!
Thanks againâall very constructive /â helpful. Iâve updated some of my beliefs (partly toward the scale of this issue, as you intended, but also toward current factory farming not being as bad as I would have guessed⊠although I admit most people probably know less about conditions than I did), and I hope you have as well.
The only place I wanted to specifically respond is to your comment that you âsympathise with animal-welfare advocates pushing for veganism because the most common objections to it are pretty weakââthis doesnât make sense to me. We should only advocate for positions where the strongest objections are weak, not where the most common objections (which might be terrible ones) are weak. Again, tbh, it sounds more ideological than evidence- or logic-based.
I took a quick look at your linked doc and it looks good (to me): there is truly a lot of uncertainty about both basic direct outcomes (do conventional factory chickens have net positive or negative lives?) and indirect ones (what is the impact on wild animals and what is their welfare? how does it affect [human] economic and moral growth?). I would also add that while we can be sure that there is some positive elasticity between âone person stops eating chickensâ to âfuture chicken production is lower in expectationâ we donât currently have any idea what that number is (Iâve looked into this somewhat carefully), so thatâs another huge level of uncertainty. Anyone claiming that they know that the ârightâ answer is not to eat animals, including many EAs and animal charities, is stepping way beyond the actual state of knowledge.
Canât we make informed estimates, even if they have wide ranges? We multiply the demand shift by Ï”SÏ”SâÏ”D (based on equilibrium displacement models, or this), with long-run elasticity estimates from the literature.
(FWIW, Iâm also sympathetic to diet change being net negative in the near term, mostly because of the impacts on wild invertebrates and maybe fish. So I mostly focus on welfare.)
Iâm a professor of economics, but thanks for the link explaining elasticity :)
The answer is no, we canât just do that, since those approaches assume nontrivial changes (and/âor they assume everything is continuous, which the real world isnât). One plausible simple model of supermarket (or restaurant) purchasing behavior is that when observed demand goes above/âbelow a certain threshold relative to predicted demand, they buy more/âless of the input next cycle. From an individual point of view, the expected aggregate demand of other agents in any time period will be a Gaussian distribution (by the law of large numbers), and the threshold will be away from the mean (doesnât make sense to update every time), which implies that oneâs probability of being the marginal buyer at the threshold declines exponentially (not linearly, as it would be for macro-level shifts and as you are implicitly assuming). From the ACE link: âwe can approximate the supply and demand curves in this region by straight linesââno, you canât do that (for individual behavior) without substantive additional assumptions or a lot of legwork into how these decisions actually get made.
In any case I have no idea if thatâs the right model, because I havenât studied supermarket supply chain management. As far as I can tell (but Iâd love to see this somewhere), nobody in either the econ lit or animal welfare lit has tried to do this at the level required to make what I would consider an informed estimate; weâre not just talking about a factor of 2 or 3 here. That knowledge gap doesnât seem to stop the latter group from making very strong claims; they mostly donât even seem to understand or acknowledge the high uncertainty and strong assumptions.
This sounds like Budolfsonâs buffer model. Have you seen the response by McMullen and Halteman? They describe supply chains and management practices in the section âEfficient Responsive Supply Chains and Causal Efficacyâ.
Also, this short first-hand account for grocery stores in an older article from the EA community on the issue, quoted from a comment on a post in an EA Facebook group on the issue.
I agree that itâs probably true most people donât know the right reasons to believe that their individual purchase decisions make much difference on average, because most people know basically nothing about supply chain management.
I had seen some of this, but not the specific paper (ungated) by McMullen & Haltemanâthanks!
First of all note that the two sources you cite directly contradict one another: the first-hand anecdotal account says there is essentially no meat waste even in very small groceries, while M&H (p.12) say there is a modest constant unavoidable waste that is in fact higher in smaller /â local stores than for big outfits. Indeed M&H are internally inconsistent: they say that the market is highly competitive (although they only give a very incomplete reference for this on p.14, which I couldnât find any trace of; my googling found this source suggesting a net profit margin for farming/âagriculture of 5.7%, which is middlingâbetter than aerospace/âdefense or healthcare), but then they also state (p.23) that larger firms have up to 60% lower costs than smaller onesâso how do the latter survive if the industry is so competitive? All of these are bad signs right off the bat.
Second note that none of these sources actually do any data analysis or try to examine original data about the markets or supply chains; they are armchair papers. My whole point is that depending on which of several reasonable assumptions one makes, different conclusions will be drawn. The only way to adjudicate this is to actually figure out whatâs going on in the real world, and neither of these sources attempts to do that. Hint: neither of them gives an empirically-derived concrete estimate for individual-level elasticity.
Third (to finally answer your question!), no my hypothetical model is not the same as the way they are using the term âbufferâ (which seems to be more about maintaining a minimum level of excess in the system; mine is simply about the optimal tradeoff between stockouts vs excess/âwaste). For instance M&H say (p.25) âif there is some probability (1/ân) that any given purchase will occur on a threshold, then the threshold action will trigger a reduction in production of around n units, yielding an expected impact equal to 1âł (and from the reducing suffering page: âThe probability that any given chicken is the chicken that causes two cases instead of three to be purchased is 1/â25â). Well yesâif itâs linear then the expected effect is the same order of magnitude as the input. My model was precisely one where the probability is plausibly not linear: in any given cycle, total sales are much more likely to be near the mean than near the threshold, so every individual would correctly believe that their own actions are very unlikely to change anything, which is not inconsistent with the (obviously correct) claim that large changes in demand are roughly linear and do influence things according to whatever macro-level elasticity has been estimated for chickens.
Or my 30-second model might be wrongâIâm not claiming itâs correct. Iâm claiming that we donât know, and the fact that none of these sources seems to have even considered it (or any other ones), and donât even realize the nature of the assumptions theyâre making, and nevertheless draw such strong conclusions, is again a bad sign.
This seems fair and seems like the strongest argument here. Even M&H only say they âbriefly sketch the contours of a positive argument for consumer efficacyâ.
While I think this doesnât undermine your point that people could come to reasonable differing conclusions about this case, itâs worth pointing out the same is true about counterfactuals for basically all charity and altruistic work based on similar arguments, so this case doesnât seem categorically special. Some level of guesswork is basically always involved, although to different degrees, and levels of ârobustnessâ can differ:
GiveWell has estimates for the value of counterfactual spending by other actors, but it mostly only reflects government actors, plus the Global Fund. What about Open Phil and smaller donors? (Maybe they can be ignored based on Open Philâs own statements, and assuming smaller donors donât pay attention to these funding levels, and so donât respond.) Some of the numbers they do use are also just guesses. They go further than basically anyone else, but is it far enough? How much less cost-effective could they be?
For individuals doing altruistic work, if they didnât do it (e.g. they didnât take the job), what would others have done differently, and with what value? (âReplaceabilityâ.)
There are other effects on things we donât or canât measure. Does the charity undermine governance and do harm in the long run as a result? What about the effects on nonhuman animals, farmed and wild? What about the potential impacts much further into the future, through economic growth, climate change or space colonization? This gets into cluelessness and the McNamara fallacy.
Yes all fair, and Iâd say it goes beyond counterfactuals. Iâm not sure people fully realize how sensitive many conclusions are to all sorts of assumptions, which are often implicit in standard models. I am on record disagreeing strongly with John Halstead about the likely cost-effectiveness of advocating for economic growth, and I feel similarly about much of the longtermist agenda, so this isnât specific to animals. My personal sense is that if you can save an existing human life for a few thousand dollars (for which the evidence is very clear, although point taken that the marginal impact isnât definitively pinned downâhowever Iâd guess within a factor of two,), thatâs an extremely high bar to overcome.
Fair. I think the anecdotal account is a limiting case of M&H where the waste is very close to 0, though, so the arguments in M&H would apply to the anecdote. M&Hâs argument doesnât depend on there being modest constant unavoidable waste rather than essentially none.
This doesnât show theyâre internally inconsistent.
They probably meant the market is highly competitive in absolute terms, not among the very most competitive markets in the US. The argument they make isnât meant to depend on the relative competitiveness of the industry among industries, and it wouldnât be valid if it did.
Small farms can survive by product differentiation and competing in different submarkets. They can sell niche, specially labelled/âdescribed products, like organic, free range or locally raised, and they can charge premiums this way. They can sell in different places, like farmers markets, to small local grocers or to restaurants trying to appear more responsible/âethical, and charge more this way. Broiler farms producing fewer than 100,000 broilers/âyear only made up around 5% of the market in 2001 (Fig 2), so itâs pretty plausible and Iâd guess itâs the case that small broiler farms with much higher production costs sell differentiated products.
I wasnât gesturing toward the relative competitiveness because itâs important per se (youâre right that it isnât) but rather as a way to gauge absolute competitiveness for those who donât already know that a net profit margin of 5.7% isnât bad at all. My intuition is that people realize that both defense and healthcare firms make decent profits (as they do) and hence that this fact would help convey that farmers (whether large or small; and if your point is that they can differentiate themselves and do some monopolistic competition then youâre already on my side vs M&H) are not typically right on the edge of survival.
However I donât personally think the level of competition is crucial to anything here. M&H believe that itâs necessary for their argument (in the abstract they say their case rests on it), so I was pointing out that (a) itâs actually not that competitive; and (b) if they do think itâs truly competitive (i.e. not differentiated) then that is indeed inconsistent with their own claim on p.23, which is a bad sign for their analysis.
My main point (which you donât seem to have responded to) remains that these are all conceptual arguments making various particular assumptions rather than actually trying to estimate an individual-level impact with a combination of a concrete well-defined model and empirics.
The edge of survival is not the only relevant threshold here. Chicken farmers donât own the birds they raise and only raise them when given a contract, so itâs not entirely their choice whether or not and when they raise any chickens. From M&H:
And even if their net profit margins were 5.7% on average, many farms could still be on the edge of survival. Also from M&H:
From MacDonald, 2008:
Furthermore, the 20th percentile of household income[1] across broiler farmers was $18,782 in 2011, according to the USDA, and so close to the poverty line at the time. However, the household income for chicken farmers is relatively high recently, in 2020 (USDA).
Also, about differentiation, I donât see what the existence of some small high-cost farms selling to small niche submarkets tells you about the behaviour or competitiveness of the conventional large farms, which account for almost all of the combined market. I donât think itâs a case of monopolistic competition; these are just a few separate submarkets, like free range and organic. Maybe those selling locally are acting nearly monopolistically, with the âlocalâ label or by selling to farmers markets, but it also doesnât really matter, because theyâre selling to a tiny submarket and their supply is very limited. If a kid sets up a lemonade stand in their neighbourhood and sells lemonade above grocery store prices, you wouldnât conclude from this that an individual lemonade company can set higher prices for grocery stores (or distributors?), where almost all of the lemonade is bought, without being pushed out of the market.
The USDAâs definition:
Sorry, I could have been more explicit in my comment. I wasnât referring to the rest of the Reducing Suffering article, and I didnât mean that any of that article referred to your model. M&H refer to a model similar to yours (Budolfsonâs buffer model), but not in the section that I referred to (and from which you quote). What I meant is that both propose more plausible models of markets (more plausible based on observations of how grocery stores behave), and I was pointing to those alternative proposals.
M&H summarizes the main takeaway from Budolfsonâs buffer model:
This is an illustration of Budolfsonâs buffer model, directly from Budolfson, 2018:
Presumably there could also be a conceivable decrease in sales that would cause Richard to produce fewer T-shirts, too. Richard has a historical monthly demand range that serves essentially the same purpose as your predicted demand, with thresholds for setting alternative future procurement/âproduction decisions far enough away from the centre of the historical range, or in your case, predicted demand.
EDIT: so your last paragraph seems wrong:
Interestingâthanks for the extra info re Budolfson. I did in fact read all of M&H, and they give two interpretations of the buffer model, neither of which is related to my model, so thatâs what I was referring to. [Thatâs also what I was referring to in my final paragraph: none of the sources you cited on that side of the causal efficacy argument seems to have considered anything like my model, which remains true given my current knowledge.] In fact if Budolfson was saying something more like my model, which does seem to be the case, then thatâs an even worse sign for M&H because they must not have understood it.
The paragraph you quote from Budolfson is indeed more similar to my model, except that in my case the result follows from profit-maximizing behavior (in a competitive industry if you like!) rather than ad hoc and unusual assumptions.
Suppose that I consider a threshold (for increasing or decreasing production next cycle) right at the mean of expected sales (15,000 in the example): half the time Iâll stockout and have disappointed customers; half the time Iâll have extra stock and have to sell it on a secondary market, or give it away, or waste it. Which is worse for business? Plausibly stocking out is worse. So my threshold will be higher than the mean, reducing the probability of stocking out and increasing the prob of excess. The optimal level will be set just so that at the margin, the badness of stocking out (larger) multiplied by the prob of stocking out (smaller) will exactly offset the badness of excess times the prob of excess. Because it is above the mean, which is in fact the true best-guess state of the world (ignoring any individual consumer), and because the distribution around the mean will plausibly be Gaussian (normal), which declines exponentially from the meanânot linearly! - every individual consumer should rationally believe that their decision is less than 1/ân likely to be taking place at the threshold. QED.
Iâm not sure what you mean by M&H not understanding Budolfson. They give a brief overview of the model, but the section from M&H I referred to (âEfficient Responsive Supply Chains and Causal Efficacyâ) describes the market as they understand it, in a way thatâs not consistent with Budolfson. The implicit reply is that Budolfsonâs model does not match their observations of how the market actually works.
I think how theyâd respond to your model is:
stores do use explicit demand predictions to decide procurement,
they are constantly making new predictions,
these predictions are in fact very sensitive to recent individual purchase decisions, and actually directly so.
Suppose the store makes stocking decisions weekly. If demand is lower one week than it would have otherwise been, their predictions for the next week will be lower than they would have otherwise been. Of course, thereâs still a question of how sensitive: maybe they give little weight to their actual recent recorded purchases[1] relative to other things, like othersâ market forecasts or sales the same time in past years.[2] But M&H would contend that actually they are very sensitive to recent purchases, and I would guess thatâs the case, too, because it probably is one of the most predictive pieces of information they can use, and plausibly the most predictive. They donât provide direct estimates of the sensitivity based on empirical data and maybe they donât back these claims with strong enough evidence at all (i.e. maybe stores donât actually usually work this way), and itâs fair to point out these kinds of holes in their arguments if someone wants to use their paper to make a strong case.
Here are relevant quotes:
I would correct the one sentence to âWhen a person decides to stop purchasing chickens, the result is that their local grocery store automatically starts ordering chickens more slowly than they otherwise would have, to reflect the lower than otherwise rate of sale.â
Or, indirectly, through leftover stocks or stockouts.
Although eventually that should get picked up.
I still havenât read Budolfson, so Iâm not claiming that M&H misinterpret him. As I said, I did read their entire paper, and in the section specifically about him they describe two interpretations of âbufferâ, neither of which matches my model. So if his model is similar to mine, they got it wrong. If his model is different than mine, then they donât seem to have ever considered a model like mine. Either way a bad sign.
Everything you write about how you think they might respond to me (i.e. your three bullet points and the subsequent paragraph) is 100% consistent with my model and doesnât change any of its implications. In my model stores use predicted demand and can update it as often as they want. The point is that purchasing is in bulk (at least at some level in the supply chain); therefore there is a threshold; and the optimal threshold (every single time) will be chosen to be away from the mean prediction. This can still be extremely sensitive, and may well be. [Apologies if my brief descriptions were unclear, but please do take another look at it before responding if you donât see why all this is the case.]
To the final point, yes of course if someone decides to stop purchasing then the store [probabilistically] starts ordering fewer chickens [than otherwise]; I didnât disagree with that sentence of theirs, and it is also 100% consistent with my model. The question is the magnitude of that change and whether it is linear or not, crucial points to which they have nothing to contribute.
EDIT: I did misunderstand at this point, as you pointed out in your reply.
Ok, I think I get your model, but I donât really see why a grocery store in particular would follow it, and it seems like a generally worse way to make order decisions for one. I think itâs more plausible for earlier parts of the supply chain, where businesses may prefer to produce consistent volumes, because there are relevant thresholds (in revenue) for shutting down, downsizing, expanding and entering the market, and itâs costly to make such a decision (selling/âbuying capital, hiring/âfiring staff) only to regret it later or even flip-flop.[1] It takes work to hire someone, so hiring and firing (in either order) is costly. Capital assets lose value once you purchase or use them, so buying and selling (in either order) is costly. If changes in a businessâ production levels often require such a decision, that business has reason to try to keep production more consistent or stick with their plans to avoid accumulating such costs. But not all changes to production levels require such decisions.
(I donât mean to imply you donât understand all of the above; this is just me thinking through it, checking my understanding and showing others interested.)
I donât think a grocery store has to adjust its capital or staff to order more or less, or at least not for the vast majority of marginal changes in order size. Same for distributors/âwholesalers.
Iâm not sure about broiler farms. Theyâd sometimes just have to wait longer for a contract (or never get one again), or maybe theyâd get a smaller contract and raise fewer broilers (the market is contract-based in the US, and the farms donât own the broilers[2]), so it often just wouldnât be their decision. But on something like your model, if a farm was planning to enter the market or expand, and contracts or revenues (or market reports) come only slightly worse than expected (still above the threshold in your model, and which is far more likely than coming below the threshold), theyâd enter/âexpand anyway. For farms not planning to expand/âenter the market, maybe theyâd even take on a contract they donât expect to pay for its variable costs, just to get more favour from the companies contracting them in the future or to push out competitors. Or, just generally, the contracts would very disproportionately be above their thresholds for shutdown, as they expect them to be. Also, many individual farmers are probably subject to the sunk cost fallacy.
Then there are the integrator/âprocessor companies like Tyson that contract the farms. A small number of companies control a large shares of this part of the supply chain, and theyâve been caught price-fixing (see here and here), which undermines the efficiency (and of course competitiveness) of the market. Below their predictions, maybe theyâd want to keep giving farms contracts in order to keep them from shutting down or to keep them from switching to competitors, because itâll be harder/âslower to replace them if demand recovers, or just to hurt competitors. Or, if they were already planning to expand production, but sales come in below expectation, theyâd do it anyway for similar reasons.
Hereâs an example for a grocery store:
Suppose, to avoid stockouts (like you propose they should), as a rule, they order 7 more units than (the expected value of) their predicted sales.
Suppose they would have predicted 123 sales for the next period had you not abstained. Because you abstained, they instead predict 122. So, as a result of your abstention, they order 129 instead of 130, and you make a difference, at least at this level.
Now, maybe they need to order in specific multiples of units. Say they need to order in multiples of 10, and they order the minimum multiple of 10 thatâs at least 7 over what they predict.
In the above case, your abstention makes no difference, and they would order 130 either way, but thatâs just one case. The threshold to order 10 fewer is when the prediction modulo 10 would have been 4 and your abstention drops it below that.[3] If you look at a randomly sampled period where they need to order, thereâs not really any reason to believe that their prediction modulo 10 will be especially unlikely to be 4 compared to any of the other digits.[4]
I see papers on sunk-cost hysteresis and entry and exist decisions under uncertainty, like Baldwin, 1989, Dixit, 1989, Gschwandtner and Lambson, 2002.
Also:
For their prediction x, if x mod10=4, then they order x+16. If x mod10=3, then they order x+7.
I guess one way would be if they have sufficiently consistent purchases and choose a supplier based on the multiple to get their prediction modulo the multiple away from the threshold. I think itâs very unlikely theyâd switch suppliers just to get their predictions in a better spot with respect to multiples.
Hiâthanks again for taking more time with this, but I donât think you do understand my model. It has nothing to do with capital assets, hiring/âfiring workers, or switching suppliers. All that it requires is that some decisions are made in bulk, i.e. at a level of granularity larger than the impact of any one individual consumer. I agree this is less likely for retail stores (possibly some of them order in units of 1? wouldnât it be nice if someone actually cared enough to look into this rather than us all arguing hypothetically...), but it will clearly happen somewhere back up the supply chain, which is all that my model requires.
Your mistake is when you write âSay they need to order in multiples of 10, and they order the minimum multiple of 10 thatâs at least 7 over what they predict.â Thatâs not what my model predicts (I think itâs closer to M&Hâs first interpretation of buffers?), nor does it make economic sense, and it builds in linearity. What a profit-maximizing store will do is to balance the marginal benefit and marginal cost. Thus if they would ideally order 7 extra, but they have to order in multiples of 10 and x=4 mod10, theyâll order x+6 not x+16 (small chance of one extra stock-out vs large chance of 10 wasted items). They may not always pick the multiple-of-10 closest to 7 extra, but they will balance the expected gains and losses rather than using a minimum. From there everything that Iâm suggesting (namely the exponential decline in probability, which is the key point where this differs from all the others) follows.
And a quick reminder: Iâm not claiming that my model is the right one or the best one, however it is literally the first one that I thought of and yet no one else in this literature seems to have considered it. Hence my conclusion that theyâre making far stronger claims than are possibly warranted.
(Iâve edited this comment, but the main argument about grocery stores hasnât changed, only some small additions/âcorrections to it, and changes to the rest of my response.)
Thanks for clarifying again. Youâre right that I misunderstood. The point as I now understand is that they expect the purchases (or whatever theyâd ideally order, if they could order by individual units) to fall disproportionately in one order size and away from each threshold for lower and higher order sizes, i.e. much more towards the middle, and theyâve arranged for their order sizes to ensure this.
Iâll abandon the specific procedure I suggested for the store, and make my argument more general. For large grocery stores, I think my argument at the end of my last comment is still basically right, though, and so you should expect sensitivity, as I will elaborate further here. In particular, this would rule out your model applying to large grocery stores, even if they have to order in large multiples, assuming a fixed order frequency.
Letâs consider a grocery store. Suppose they make purchase predictions p (point estimates or probability distributions), and they have to order in multiples of K, but Iâll relax this assumption later. We can represent this with a function f from predictions to order sizes so that f(p)=Kâg(p), where g is an integer-valued function.f can be the solution to an optimization problem, like yours. Iâm ignoring any remaining stock they could carry forward for simplicity, but they could just subtract it from p and put that stock out first. Iâm also assuming a fixed order frequency, but M&H mention the possibility of âa threshold at which a delivery of meat comes a day laterâ. I think your model is a special case of this, ignoring what Iâm ignoring and with the appropriate relaxations below.
I claim the following:
Assuming the store is not horrible at optimizing, f should be nondecreasing and scale roughly linearly with p. What I mean by âroughly linearly with pâ is that for (the vast majority of possible values of) p, we can assume that f(p+K)=f(p)+K, and that values of p where f(p+1)=f(p)+K, i.e. the thresholds, are spaced roughly K apart. Even if different order sizes didnât differ in multiples of some fixed number, something similar should hold, with spacing between thresholds roughly reflecting order size differences.
A specific store might have reason to believe their predictions are on a threshold much less than 1/K of the time across order decisions, but only for one of a few pretty specific reasons:
They were able to choose K the first time to ensure this, intentionally or not, and stick with it and f regardless of how demand shifts.
The same supplier for the store offers different values of K (or the store gets the supplier to offer another value of K), and the store switches K or uses multiple values of K simultaneously in a way that avoids the thresholds. (So f defined above isnât general enough.)
They switch suppliers or products as necessary to choose K in a way that avoids the thresholds. Maybe they donât stop offering a product or stop ordering from the same supplier altogether, but optimize the order(s) for it and a close substitute (or multiple substitutes) or multiple suppliers in such a way that the thresholds are avoided for each. (So f defined above isnât general enough.)
If none of these specific reasons hold, then you shouldnât expect to be on the threshold much less than 1/K of the time,[1] and you should believe E[f(pâ1)]âE[f(p)]â1, where the expectation is taken over your probability distribution for the storeâs prediction p.
How likely are any of these reasons to hold, and what difference should they make to your expectations even if they did?
The first reason wouldnât give you far less than 1/K if the interquartile range of their predictions across orders over time isnât much smaller than K, but they prefer or have to keep offering the product anyway. This is because the thresholds are spaced roughly K apart, p will have to cross thresholds often with such a large interquartile range, and if p has to cross thresholds often, it canât very disproportionately avoid them.[2]
Most importantly, however, if K is chosen (roughly) independently of p, your probability distribution for p mod K for a given order should be (roughly) uniform over 0,..., Kâ1,[3] so p should hit the threshold with probability (roughly) 1/K. It seems to me that K is generally chosen (roughly) independently of p. In deciding between suppliers, the specific value of K seems less important than the cost per unit, shipping time, reliability and a lower value of K.[4] In some cases, especially likely for stores belonging to large store chains, there isnât a choice, e.g. Walmart stores order from Walmart-owned distributors, or chain stores will have agreements with the same supplier company across stores. Then, having chosen a supplier, a store could try to arrange for a different value of K to avoid thresholds, but I doubt theyâd actually try this, and even if they did try, suppliers seem likely to refuse without a significant increase in the cost per unit for the store, because suppliers have multiple stores to ship to and donât want to adjust K by the store.
Stores similarly probably wouldnât follow the strategies in the second and third reasons because they wouldnât be allowed to, or even if they could, other considerations like cost per unit, shipping time, reliability and stocking the same product would be more important. Also, if the order quantities vary often enough between orders based on such strategies, youâd actually be more likely to make a difference, although smaller when you do.
So, I maintain that for large stores, you should believe E[f(pâ1)]âE[f(p)]â1.
Fair. I donât think they should necessarily have considered it, though, in case observations they make would have ruled it out, but it seems like they didnât make such observations.
I donât think this is obvious either way. This seems to be a stronger claim than youâve been making elsewhere about your model. I think youâd need to show that itâs possible and worth it for those at one step of the supply chain to choose K or suppliers like in a way I ruled out for grocery stores and without making order sizes too sensitive to predictions. Or something where my model wasnât general enough, e.g. I assumed a fixed order frequency.
It could be more than 1/K, because weâve ruled out being disproportionately away from the threshold by assumption, but still allowed the possibility of disproportionately hitting it.
For realistic distributions of p across orders over time.
I would in fact expect lower numbers within 0, âŠ, Kâ1 to be slightly more likely, all else equal. Basically Benfordâs law and generalizations to different digit positions. Since these are predictions and people like round numbers, if K is even or a multiple of 5, I wouldnât be surprised if even numbers and multiples of 5 were more likely, respectively.
Except maybe if the minimum K across suppliers is only a few times less than p, closer to p or even greater, and they canât carry stock forward past the next time they would otherwise receive a new shipment.
I agree. Sorry for not being clear in my previous reply. By âI sympathise with animal-welfare advocates pushing for veganismâ, I meant that I can see from where they are coming, not that I rationally endorse veganism.
This seems to rest heavily on Rethink Prioritiesâ Welfare Estimates. While their expected value for the âwelfare rangeâ of chickens is 0.332 that of humans, their 90% confidence for that number spans 0.002 to 0.869, which is so wide that we canât make much use of it.
Seems to be a tendency in EA to try to use expected values when just admitting âI have no ideaâ is more honest and truthful.
I mean to be fair to OP (edit: I meant original poster) they make their uncertainty really clear throughout and the conditionals it entails. I donât think itâs fair to say theyâre not being honest and truthful.
Hi zchuang,
I agree OPâs writings have high reasoning transparency (certainly much more than my posts). In the very 1st bullet of their post on worldview diversification, they write:
However, after RPâs moral weight project, I do not think it is reasonable to assume (in expectation) that âchickens have essentially no moral significance compared to that of humansâ. In general, OPâs decision-making around how much should be allocated to each worldview remains unclear to me.
Sorry I meant OP as in original poster not OpenPhil. But nice response nonetheless!
Iâd suggest editing your top-level post (with brackets, like this: [the original poster, originally wrote âOPâ which was ambiguous])
Hi Henry,
Thanks for engaging!
Note that:
So, using RPâs 5th percentile welfare range instead of the median one, corporate campaigns for broiler welfare are still 10.3 (= 1.71*10^3*0.002/â0.332) times as effective. However, there is also large uncertainty in how bad are the lives of broilers and human relative to their median welfare ranges. This means the true 5th percentile will tend to be lower than the 10.3 I just calculated. I guess the uncertainty stemming from the median welfare range is similar to that from the mean experience relative to the median welfare range, so I think there is less than 10 % chance that corporate campaings for broiler welfare are less effective than the lowest cost to save a life among GWâs top charities. I suppose RP will look into building on their moral weight project.
I am also concerned about acting as if expect values are resilient, i.e. assuming they will not easily change in the future in response to new information. On the other hand, large uncertainty in the welfare range of chickens does not necessarily imply the median welfare range lacks resilience. My understanding is that RPâs research tried to integrate most of the available evidence, which means narrowing the interval of possible values may be difficult.
Hey Vasco, you make lots of good points here that are worth considering at length. These are topics weâve discussed on and off in a fairly unstructured way on the research team at FP, and Iâm afraid Iâm not sure whatâs next when it comes to tackling them. We donât currently have a researcher dedicated to animal welfare, and our recommendations in that space have historically come from partner orgs.
Just as context, the reason for this is that FP has historically separated our recommendations into three âworldviewsâ (longtermism, current generations, and animal welfare). The idea is that itâs a lot easier to shift member grantmaking across causes within a worldview (e.g. from rare diseases to malaria, for instance) than across worldviews (e.g. to get people to care much more about chickens). The upshot of this, for better or for worse, is that we end up spending a lot of time prioritizing causes within worldviews, and avoiding the question of how to prioritize across worldviews.
This is also part of the reason we donât have a dedicated animal welfare researcher â we havenât historically moved as much money within that worldview as within our others. But itâs actually not sure which way the causality flows in that case, so your post is a good nudge to think more seriously about this, as well as the ways we might be able to incorporate animal welfare considerations into our GHD calculations, worldview separations notwithstanding.
Thanks for sharing your thought, Matt!
For balance and completeness⊠Would it make sense to add something (or another piece) considering the impact of chicken welfare improvement funding on wild animal welfare?
Hi David,
Thanks for the comment. I think that would make sense (in another piece)! Somewhat relatedly, I wrote that:
I liked this post. It was thought provoking.
I just wanted to note that you are correct in highlighting the âhumanâ part in my post on the capability approach. To me, capabilities are the best way to think about human welfare but some variant of utilitarianism is the best way to think about the welfare of (most?) animals, but Iâve no good way to exchange between those and I find that unsatisfying.
Thanks, Ryan!
Interestingly, I have recently listened to Martha Nussbaum on the Clearer Thinking podcast, and it looks like her book Justice for Animals: Our Collective Responsibility attempts to extend the capability approach to non-human animals.
That makes some sense to me. She should have an easier time of this (than Sen-ish people like me) because sheâs willing to just write a list of the eg 10 most important capabilities for humans. If youâre willing to do that, then it almost seems easier to do it for animals. Iâll listen to the podcast and should read the book. Thanks for the pointer.
âAccording to CEâs weighted animal welfare indexââthe link seems brokenâI think the bit after the final â/ââ needs to be removed
Thanks! I have corrected it now. This is the link.