I have a PhD in finance and am the strategist at Affinity Impact, the impact initiative of a Taiwanese family that makes both grants and impact investments.
Wayne_Chang
Thanks so much for such a thorough and great summary of all the various considerations! This will be my go-to source now for a topic that I’ve been thinking about and wrestling with for many years.
I wanted to add a consideration that I don’t think you explicitly discussed. Most investment decisions done by philanthropists (including the optimal equity/bond split) are outsourced to someone else (financial intermediary, advisor, or board). These advisors face career risk (i.e. being fired) when making such decisions. If the advisor recommends something that deviates too far from consensus practice, they have to worry about how they can justify this decision if things go sour. If you are recommending 100% equities and the market tanks (like it did last year), it’s hard to say ‘But that’s what the theory says,’ when the reflective response by the principal is that you are a bad advisor because you don’t understand risk. Many advisors have been fired this way, and no one wants to be in that position. This means tilting toward consensus is likely the rational thing to recommend as financial advisors. There are real principal-agent issues at play, and this is something acutely felt by practitioners even if it’s less discussed among academics.
I suspect the EA community is subject to this dynamic too. It’s rarely the asset owners themselves who decide the equity mix. Asset allocation decisions are recommended by OpenPhil, Effective Giving, EA financial advisors, etc. to their principals, and it’s dangerous to recommend anything that deviates too far from practice. This is especially so when EA’s philanthropy advice is already so unconventional and is arguably the more important battle to fight. It can be impact-optimal over the long term to tilt toward asset allocation consensus when not doing so risks losing the chance to make future grant recommendations. The ability to survive as an advisor and continue to recommend over many periods can matter more than a slightly more optimal equity tilt in the short term.
Keynes comes to mind: “Worldly wisdom teaches that it is better for reputation to fail conventionally than to succeed unconventionally.”
Thanks for posting this, Jonathan! I was going to share it on the EA Forum too but just haven’t gotten around to it.
I think GIF’s impact methodology is not comparable to GiveWell’s. My (limited) understanding is that their Practical Impact approach is quite similar to USAID’s Development Innovation Ventures’ impact methodology. DIV’s approach was co-authored by Michael Kremer so it has solid academic credentials. But importantly, the method takes credit for the funded NGO’s impact over the next 10 years, without sharing that impact with subsequent funders. The idea is that the innovation would fail without their support so they can claim all future impact if the NGO survives (the total sum of counterfactual impact need not add to 100%). This is not what GiveWell does. GiveWell takes credit for the long-term impact of the beneficiaries it helps but not for the NGOs themselves. So this is comparing apples to oranges. It’s true that GiveWell Top Charities are much more likely to survive without GiveWell’s help but this leads to my next point.
GiveWell also provides innovation grants through their All Grants Fund (formerly called Incubation Grants). They’ve been funding a range of interventions that aren’t Top Charities and in many cases, are very early, with GiveWell support being critical to the NGO’s survival. According to GiveWell’s All Grants Fund page, “As of July 2022, we expect to direct about three-quarters of our grants to top charity programs and one-quarter to other programs, so there’s a high likelihood that donations to the All Grants Fund will support a top charity grant.” This suggests that in GiveWell’s own calculus, innovation grants as a whole cannot be overwhelmingly better than Top Charities. Otherwise, Top Charities wouldn’t account for the majority of the fund.
When thinking about counterfactual impact, the credit one gets for funding innovation should depend on the type of future donors the NGO ends up attracting. If these future donors would have given with low cost-effectiveness otherwise (or not at all), then you deserve much credit. But if they would have given to equally (or even more) cost-effective projects, then you deserve zero (or even negative) credit. So if GIF is funding NGOs that draw money from outside EA (whereas GiveWell isn’t), it’s plausible their innovations have more impact and thus are more ‘cost-effective’. But we are talking about leverage now, so again, I don’t think the methodologies are directly comparable.
Finally, I do think GIF should be more transparent about their impact calculations when making such a claim. It would very much benefit other donors and the broader ecosystem if they can make public their 3x calculation (just share the spreadsheet please!). Without such transparency, we should be skeptical and not take their claim too seriously. Extraordinary claims require extraordinary evidence.
Thanks for your response, Joel!
Stepping back, CEARCH’s goal is to identify cause areas that have been missed by EA. But to be successful, you need to compare apples with apples. If you’re benchmarking everything to GiveWell Top Charities, readers expect your methodology to be broadly consistent with GiveWell’s and their conservative approach (and for other cause areas, consistent with best-practice EA approaches). The cause areas that are standing out for CEARCH should be because they are actually more cost-effective, not because you’re using a more lax measuring method.
Coming back to the soda tax intervention, CEARCH’s finding that it’s 1000x GiveWell Top Charities raised a red flag for me so it seemed that you must somehow be measuring things differently. LEEP seems comparable since they also work to pass laws that limit a bad thing (lead paint), but they’re at most ~10x GiveWell Top Charities. So where’s the additional 100x coming from? I was skeptical that soda taxes would have greater scale, tractability, or neglectedness since LEEP already scores insanely high on each of these dimensions.
So I hope CEARCH can ensure cost-effectiveness comparability and if you’re picking up giant differences w/ existing EA interventions, you should be able to explain the main drivers of these differences (and it shouldn’t be because you’re using a different yardstick). Thanks!
Hi Joel, I skimmed your report really quickly (sorry) but suspect that you did not account for soda taxes being eventually passed anyway. So the modeled impact of any intervention shouldn’t be going to 2100 or beyond but out only a few years (I’d think <10 years) when soda taxes would eventually be passed without any active intervention. You are trying to measure the impact of a counterfactual donated dollar in the presence of all the forces already at play that are pushing for soda taxes (how some countries already have them). This makes for a more plausible model, and I believe is how LEEP or OpenPhil model policy intervention cost-effectiveness (I could be wrong though).
New phrasing works well!
Got it. But I think the phrasing for the number of animals that die is confusing then. Since you say “100 other human [sic] would probably die with me in that minute,” the reference is to how many animals would also do during that minute. I think what you want to say is for every human death, how many animals would die, but that’s not the current phrasing (and by that logic, the number of humans that would die per human death would be 1, not 100).
I’d suggest making everything consistent on a per-second basis as smaller numbers are more relatable. So 1 other human would die with you that second, along with 10 cows, etc.
Thanks for writing this! The very last sentence seems off. Did you mean to say every second (instead of minute)? Also, the number of farm animals that die every second should be 1⁄60 (not 1⁄120) of that in the “minute” table above.
This last sentence was quite shocking for me to read. It’s sad…but very powerful.
Minor suggestion: in your title and summary, please just write out “10 k” as 10,000. No need to abbreviate when people may be unsure that it’s actually 10,000 (given that it’s such a large difference).
I agree with Michael that concrete examples would be very helpful, even for researchers. A post should be informative and persuasive, and examples almost always help with that. In this case, examples can also make clear the underlying logic, and where the explanation can be confusing.
For example, let’s think about investing in alternative protein companies as a way to tackle animal welfare. Assume that in a future state where lots more people eat real meat (bad world state), the returns for alt-proteins in that state are low but cost-effectiveness is high. This could be because alt proteins have faced lower rates of adoption (low returns) but it’s now easier to persuade meat eaters to switch (search costs are now low since more willing-switchers can be efficiently targetted). The opposite situation is true too. In a good future state with few meat-eaters, alt protein returns are high but cost-effectiveness is low. So this scenario should put us in your table’s upper left quadrant (negative correlation btw/ World State and Cost-Effectiveness + negative correlation btw/ Return and Cost-Effectiveness).
This example illustrates how some of your quadrant descriptions may be confusing or even inappropriate:
“Underweight investment”: I agree with this one since to have a greater EV, you want investments with a positive correlation between returns and cost-effectiveness. This isn’t true for alt proteins here, so you should avoid them.
“Divest from evil to do good”: I don’t think this makes sense because alt proteins are not “evil” (but you should avoid them given the scenario).
“Mission leveraging”: I was quite confused initially because I was assuming that the comparison is to no investment at all. If so, then investing in alt proteins can lead to an ambiguous impact on volatility (depending on the relative magnitude of return changes versus cost-effectiveness changes). It could in fact be mission hedging (with an improvement in the bad state) if the low returns end up producing more total good because of the state’s high cost-effectiveness. However, I eventually realized that the comparison is to a fixed grant within the animal welfare space (although this was never made explicit in the post and may not be what most people would assume). If so, then indeed this is always mission leveraging since a positive correlation between the world state and returns does ensure lower volatility.
So as you can see, an example makes clear where table descriptions may be inappropriate and where a clearer description can be helpful. It also makes more concrete what various correlation signs mean and how to think about them.
This post (and the series it summarizes) draws on the scientific literature to assess different ways of considering and classifying animal sentience. It persuasively takes the conversation beyond an all-or-nothing view and is a significant advancement for thinking about wild animal suffering as well farm animal welfare beyond just cows, pigs, and chickens.
Thanks for the clarification, Owen! I had mis-understood ‘investment-like’ as simply having return compounding characteristics. To truly preserve optionality though, these grants would need to remain flexible (can change cause areas if necessary; so grants to a specific cause area like AI safety wouldn’t necessarily count) and liquid (can be immediately called upon; so Founder’s Pledge future pledges wouldn’t necessarily count). So yes, your example of grants that result “in more (expected) dollars held in a future year (say a decade from now) by careful thinking people who will be roughly aligned with our values” certainly qualifies, but I suspect that’s about it. Still, as long as such grants exist today, I now understand why you say that the optimal giving rate is implausibly (exactly) 0%.
Hi Owen, even if you’re confident today about identifying investment-like giving opportunities with returns that beat financial markets, investing-to-give can still be desirable. That’s because investing-to-give preserves optionality. Giving today locks in the expected impact of your grant, but waiting allows for funding of potentially higher impact opportunities in the future.
The secretary problem comes to mind (not a perfect analogy but I think the insight applies). The optimal solution is to reject the initial ~37% of all applicants and then accept the next applicant that’s better than all the ones we’ve seen. Given that EA has only been around for about a decade, you would have to think that extinction is imminent for a decade to count for ~37% of our total future. Otherwise, we should continue rejecting opportunities. This allows us to better understand the extent of impact that’s actually possible, including opportunities like movement building and global priorities research. Future ones could be even better!
I highly recommend the Founder’s Pledge report on Investing to Give. It goes through and models the various factors in the giving-now vs giving-later decision, including the ones you describe. Interestingly, the case for giving-later is strongest for longtermist priorities, driven largely by the possibility that significantly more cost-effective grants may be available in the future. This suggests that the optimal giving rate today could very well be 0%.
Have you compared your analysis to this previous EA Forum post? Are there different takeaways? Have you done anything differently and if so, why?
Here’s the math on moral/financial fungibility:
...
You’re probably better off eating cow beef and donating the $6.03/kg to the Good Food Institute
Is refraining from killing really morally fungible to killing + offsetting? Would it be morally permissible for someone to engage in murder if they agreed to offset that life by donating $5,000 to Malaria Consortium? I don’t mean to be offensive with this analogy, but if we are to take seriously the pain/suffering that factory farming inflicts on animals, we should morally regard it in a similar lens to inflicting pain/suffering on humans.
So, no, moral acts are not necessarily fungible. It is better to not eat meat in the first place than to eat meat and donate the savings to farm animal charities (even if you could save more animals). This is obvious from a rights moral framework but even consequentialists would consider financial offsetting dangerous and unpalatable. The consequences of allowing people to engage in immoral acts + offsetting would be a treacherous and ultimately inferior world.
So your calculations are not the cost of eating meat but rather, the cost of saving animals. You have not estimated the cost of chicken/cow suffering (which would require estimating utility functions and animal preferences), but rather, the cost of alleviating suffering. Your low-cost numbers don’t imply that eating meat is inconsequential, but rather, that it’s very cost-effective to help chickens and cows. GiveWell’s $5,000 per human life doesn’t make human life cheap or murder trivial, it means we have an extraordinary opportunity to help others at a very low cost to ourselves.
Thanks, Sanjay, I’m sharing a basic model I’ve written that highlights the trade-off for impact investments that seek both social impact and financial returns. This isn’t specifically about ESG but the key ideas still apply. The upshot: the investment must produce annually one percent of a same-sized grant’s social benefit for every one percent concession on its financial return. I construct impact investing’s version of the Security Market Line and quantitatively define what ‘impact alpha’ means.
This model was written a couple of years ago but since then, I actually haven’t applied it much. That’s because it’s hard to quantify impact, which is a key input that the model requires (and an input that any model will obviously require). There’s no established and easy way to monetize impact, especially given impact’s tremendous heterogeneity. Comparing the value of a year’s education versus a year’s health is hard enough. What about quantifying the counterfactual impact that a business has? Or that of the investor investing into the business? So modeling is helpful but at this stage, I think data is what we actually need most.
I agree with Michael that a 70% allocation to US stocks is way too high. US stocks’ outperformance against international developed stocks can almost entirely be explained by the increase in the US market’s valuation (which shouldn’t be assumed to continue and indeed, is more likely to reverse). See AQR’s analysis on pg 6 here. Also, what about Emerging Market stocks? This should certainly get some allocation as well, especially if you’re focused on the next 100 years. China and India will increasingly be key economic players and have capital markets that will outgrow the US in importance. In fact, 6 of the 7 largest economies in the world in 2050 are likely to be emerging economies. When it comes to investing, beware of simply extrapolating the past into the future! The US markets have done well because the US has been the dominant country in the 20th century. This is unlikely to continue during this century.
A 10% global bonds/90% global stocks portfolio is likely to be more robust and not suffer from a USD/US historical bias. Keep it simple and avoid picking bond/stock market winners.
This paper is relevant to your question.
Abstract: This article asks how sustainable investing (SI) contributes to societal goals, conducting a literature review on investor impact—that is, the change investors trigger in companies’ environmental and social impact. We distinguish three impact mechanisms: shareholder engagement, capital allocation, and indirect impacts, concluding that the impact of shareholder engagement is well supported in the literature, the impact of capital allocation only partially, and indirect impacts lack empirical support. Our results suggest that investors who seek impact should pursue shareholder engagement throughout their portfolio, allocate capital to sustainable companies whose growth is limited by external financing conditions, and screen out companies based on the absence of specific ESG practices that can be adopted at reasonable costs. For rating agencies, we outline steps to develop investor impact metrics. For policymakers, we highlight that SI helps to diffuse good business practices, but is unlikely to drive a deeper transformation without additional policy measures.
I don’t think it makes sense to compound the model distributions (e.g. from 1 year to 10 years). Doing so leads to non-intuitive results that are difficult to justify.
1) Compounded model results (e.g. 10x impact in 10 years) are highly sensitive to the arbitrarily assumed shape, range, and skewness parameters of the variable distributions. Also, these results will vary wildly from simulation to simulation depending on the sequence of random draws. This points to the model’s fragility and leads to unnecessary confusion.
2) The parameter estimates may use annualized growth rates, but they need not correspond to an annual time frame. Indeed, it is more realistic to make estimates for longer horizons because short-term noise averages out (i.e. Law of Large Numbers). In other words, it is far easier to estimate a variable’s expected mean than its underlying distribution. Estimates for the expected mean will already be highly uncertain. I don’t think it’s possible to reasonably defend distribution assumptions of the variables themselves.
The exercise is to compare giving-today vs. investing-to-give-later. The post usefully identifies key variables in this consideration. I think the most it can do is propose useful estimates of these variables’ expectations over the long run (i.e. their averages over time) and their key uncertainties (i.e. Knighting uncertainty and not quantifiable distribution parameters). If the expectations’ net sum is above 1, it makes sense to give later. If it falls below 1, it makes sense to give now. Reasonable areas of uncertainty can be further discussed and debated. Already, there will be much irreconcilable (rational) disagreement. Compounding returns using arbitrary distribution parameters won’t (and shouldn’t) reconcile any differences and likely confuses the matter.
This criticism seems unfair to me:
It seems applicable to any type of advocacy. Those who promote global health and poverty are likely biased toward foreign people. Those who promote longtermism are likely biased toward future people. Those who advocate for effective philanthropy are likely biased toward effectiveness and/or philanthropy.
There’s no effective counter-argument since, almost by definition, any engagement is possibly biased. If one responds with, “I don’t think I’m biased because I didn’t have these views to begin with,” the response can always be, “Well, you engaged in this topic and had a positive response, so surely, you must be biased somehow because most people don’t engage at all.” It seems then that only criticisms of the field are valid.
This is reminiscent of an ad hominem attack. Instead of engaging in the merits of the argument, the critique tars the person instead.
Even if the criticism is valid, what is to be done? Likely nothing as it’s unclear what the extent of the bias would be anyway. Surely, we wouldn’t want to silence discussion of the topic. So just as we support free speech regardless of people’s intentions and biases, we should support any valid arguments within the EA community. If one is unhappy with the arguments, the response should be to engage with them and make valid counterarguments, not speculate on people’s initial intuitions or motivations.