To clarify, are you saying that, in retrospect, the process through which people in EA did research on epidemiology, public health, and related topics looks any better to you now than it looked to you back in April 2020 when you did this interview?
I think I understand your point that it would probably be nearly impossible to score the conclusions in a way that people in EA would agree is convincing or fair; there's tons of ambiguity and uncertainty, hence tons of wiggle room. (I hope I'm understanding that right.)
But in the April 2020 interview, you said that many of these conclusions were akin to calling a coin flip. Crudely, many interventions that experts were still debating could be seen as roughly having a 50-50 chance of being good or bad (or maybe it's anywhere from 70-30 to 30-70; it doesn't really matter), so any conclusion that an intervention is good or bad has a roughly 50-50 chance of being right. You said a stopped clock is right twice a day, and it may turn out that Donald Trump got some things right about the pandemic, but if so, it will be through dumb luck rather than good science.
So, I'm curious: leaving aside the complicated and messy question of scoring the conclusions, do you now think the EA community's approach to the science, particularly the extent to which they wanted to do it themselves, as non-experts, rather than just trying to find the expert consensus on any given topic, or even seeing if any expert would talk to them about it (e.g. in 2020, you suggested some names of experts to have on the 80,000 Hours Podcast), was any less bad than you saw it in 2020?
I'd say my views now are roughly the same as they were then. Perhaps a bit milder, although I am not sure how much of this is "The podcast was recorded at a time I was especially/unduly annoyed at particular EA antics, which coloured my remarks despite my best efforts (such as they were, and alas remain) at moderation" (the compliments in the preamble to my rant were sincere; I saw myself as hectoring a minority), vs. "Time and lapses of memory have been a salve for my apoplexy, but if I could manage a full recounting, I would reprise my erstwhile rage".
But at least re. epistemic modesty vs. "EA/rationalist exceptionalism", what is ultimately decisive is overall performance: ~"Actually, we don't need to be all that modest, because when we strike out from 'expert consensus' or hallowed authorities, we tend to be proven right". Litigating this is harder still than re. COVID specifically (even if "EA land" spanked "credentialed expertise land" re. COVID, its batting average across fields could still be worse, or vice versa).
Yet if I were arguing against my own position, what happened during COVID facially looks like fertile ground to make my case. Perhaps it would collapse on fuller examination, but it certainly doesn't seem compelling evidence in favour of my preferred approach on its face.
Thanks, that's very helpful.
I'm curious why you say that about the accuracy/performance of the conclusions of the EA community with regard to COVID. Are you saying it's just overly complicated and messy to evaluate these conclusions now, even to your own satisfaction? Or do you personally have a sense of how good/bad overall the conclusions were, and you just don't think you could convince people in EA of your sense of things?
The comparison that comes to mind for me is how amateur investors (including those who don't know the first thing about investing, how companies are valued, GAAP accounting, and so on) always seem to think they're doing a great job. Part of this is they typically don't even benchmark their performance against market indexes like the S&P 500. Or, if they do, they do it in a really biased, non-rigorous way, e.g. oh, my portfolio of 3 stocks went up a lot recently, let me compare it to the S&P 500 year-to-date now. So, they're not even measuring their performance properly in the first place, yet they seem to believe this is a great idea and they're doing a great job anyway.
Studies of even professional investors find it's rare for an investor to beat the market over a 5-year period, and even rarer for an investor who beats the market in a 5-year period to beat the market again in the next 5-year period. There actually seems to be surprisingly weak correlation between beating the market in one period and the next. Using your coin flip analogy, if every stock trade is a bet on a roughly 50-50 proposition, i.e., "this stock will beat the market" or "this stock won't beat the market", then you need a large sample size of trades to rule out the influence of chance. It's so easy for amateurs to cherry-pick trades, prematurely declare victory (e.g. say they beat the market the moment a stock goes up a lot, rather than waiting until the end of the quarter or the end of the year), become overconfident on too small a number of trades (e.g. just bought Apple stock), or not even benchmark their performance against the market at all.
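To make the sample-size point concrete, here is a quick back-of-envelope calculation of my own (not anything from the original discussion): if every call really is a coin flip, how improbable is a given hit rate? The function below computes the exact binomial tail probability using only the standard library.

```python
from math import comb

def p_at_least(k, n, p=0.5):
    """P(X >= k) for X ~ Binomial(n, p): the chance of getting k or
    more calls right out of n if every call is a pure coin flip."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

# Getting 7 of 10 calls right is easily explained by luck alone:
print(p_at_least(7, 10))    # ~0.172, nowhere near significant

# The same 70% hit rate over 100 calls is another story entirely:
print(p_at_least(70, 100))  # well below 0.001; luck stops being plausible
```

The same 70% hit rate that proves nothing over ten calls becomes overwhelming evidence over a hundred, which is why a handful of contrarian wins cannot settle the question either way.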
Seeing these irrationalities so often and so viscerally, and even seeing how hard it is to talk people out of them even when you can show them the research and expert opinion, or explain these concepts, I'm extremely skeptical of people who have just an intuitive, gut feeling that they've outperformed experts on making calls or predictions with a statistically significant sample size of calls, in the absence of any kind of objective accounting of their performance. It just seems too tempting, feels too good, to feel like one is winning, to take a moment of sober second thought and double-check that feeling against an objective measure (in the case of stocks, checking a market index), wonder if you can rule out luck (e.g. just buying Apple and that's it), and wonder if you can rule out bias in your assessment of performance (e.g., checking the S&P 500 when your favourite stock has just gone up a lot).
If the process was as bad as you say, as in, people who have done a few weeks of reading on the relevant science and medicine making elementary mistakes, then I'm very skeptical, given the amount of psychological bias involved in people recalling and subjectively assessing their own track record, of any sense of confidence they have about it. It seems like if we don't need people who understand science and medicine to do science and medicine properly, then a lot of our education system and scientific and medical institutions are a waste. Given that it's just common sense that understanding a subject better should lead you to make better calls on that subject (overall, over the long term, statistically), we should not violate common sense on the basis of a few amateurs guessing a few coin flips better than experts, and we should especially not violate common sense when we can't even confirm whether that actually happened.