Honestly I don't really like "astronomically cost-effective" framings; I think they're misleading, because they imply too much equivalence with standard cost-effectiveness analysis, whereas if they're taken seriously then it's probably the case that many, many actions have astronomical expected impact.
On the intuition pump about human life expectancy:
Suppose that you in fact avert a death that would have occurred in any period, however small
Then this does have a pretty big impact on life expectancy!
Unless you're deferring the death to mere moments later (like the fact that you were maybe about to die would be evidence that you were in a dangerous situation, and so maybe we care about getting to a non-dangerous situation)
Thanks for the comment, Owen!
Agreed[1].
You kind of anticipated my reply with your last point. Once I condition on a given human dying in the next e.g. 1.80*10^-35 seconds (calculation in the post), I should expect there is an astronomically high cost of extending the life expectancy to the baseline e.g. 100 years (e.g. because it is super hard to avoid simulation shutdown or vacuum decay). With reasonable costs, I would only be able to increase it infinitesimally (e.g. by moving a few kilometers away from the location from which the universal collapse wave comes).
A toy example:
Suppose that we could exogenously introduce a new risk which has a 1% chance of immediately ending the universe, otherwise nothing happens
That definitely decreases the expected value of the future by 1% (assuming non-existence is zero)
Therefore, eliminating that risk definitely increases the expected value of the future by 1%
If you think the expected value of the future is astronomical, then eliminating that risk would also have astronomical value
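A minimal numerical sketch of this toy example, assuming for illustration an expected value of the future of 10^52 lives (the figure is a placeholder, not part of the example):

```python
# Toy example: an exogenous risk with a 1% chance of immediately ending
# the universe (value 0), otherwise nothing happens.
EV_FUTURE = 1e52  # placeholder expected value of the future, in lives
p = 0.01          # 1% chance the new risk fires

ev_with_risk = (1 - p) * EV_FUTURE  # the 1% branch contributes 0
gain_from_eliminating = EV_FUTURE - ev_with_risk

print(f"{gain_from_eliminating:.2e}")  # 1.00e+50, i.e. 1% of EV_FUTURE
```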
Agreed.
But it seems to me like a wild assumption to say that all of the probability mass goes onto such ["slightly-more valuable"] worlds
I know you italicised "all", but, just to clarify, I do not think all the probability mass would go to such worlds. Some non-negligible mass would go to astronomically valuable worlds, but I think it would be so small that the increase in the expected value of these would be negligible. Do you have a preferred function describing the difference between the PDF of the value of the future after and before the intervention? It is unclear to me why assuming a decaying exponential, as I did in my calculations for illustration, would be wild.
If you want to avoid the conclusion that avoiding nearterm extinction risk has astronomical value, you therefore need to take one of three branches:
Claim that nearterm extinction risk is astronomically small
I think you're kind of separately making this claim, but that it's not the main point of this post?
Claim that the situation is importantly disanalogous from the toy example I give above
Maybe you think this? But I don't understand what the mechanisms of difference would be
Claim that the future does not have astronomical expected value
Honestly I think there's some plausibility to this line, but you seem not to be exploring it
I sort of take branch 2. I agree eliminating a 1 % instantaneous risk of the universe collapsing would have astronomical benefits, but I would say it would be basically impossible. I think the cost/difficulty of eliminating a risk tends to infinity as the duration of the risk tends to 0[2], in which case the cost-effectiveness of mitigating the risk also goes to 0. In addition, eliminating a risk becomes harder as its potential impact increases, such that eliminating a risk which endangers the whole universe is astronomically harder than eliminating one that only endangers humans.
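A purely illustrative parameterisation of that claim (the functional form, the constants, and the linear scaling in the value at stake are all assumptions for the sketch, not taken from the post):

```python
# Toy model (an assumption for illustration): the cost of eliminating a
# risk blows up as the time T available to act shrinks, and grows with
# the value V at stake.
def cost_to_eliminate(T, V, T0=1.0, k=1.0, c0=1.0):
    return c0 * (T0 / T) ** k * V

def cost_effectiveness(T, V, risk=0.01):
    benefit = risk * V  # eliminating the risk recovers risk * V in expectation
    return benefit / cost_to_eliminate(T, V)

for T in (1.0, 1e-3, 1e-9, 1.80e-35):
    print(f"T = {T:.2e} -> cost-effectiveness = {cost_effectiveness(T, 1e52):.2e}")
# Cost-effectiveness falls linearly with T here, going to 0 as T -> 0.
# If cost grew superlinearly in V, larger-scope risks (e.g. the whole
# universe rather than just humans) would be even less cost-effective
# to eliminate.
```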
As the duration and scope of the risk decrease, postulating that the relative increase in the expected value of the future matches the (absolute) reduction in risk becomes increasingly less valid.
If a risk is described as binary, as in your example[3], it is more natural to assume that the probability mass which is taken out of the risk is e.g. loguniformly distributed across all the other worlds, thus meaningfully increasing the chance of astronomically valuable worlds, and resulting in risk mitigation being astronomically cost-effective. However, if a risk is described as continuous (in time and scope), it is more natural to suppose that the change in its duration and impact will be continuous, and this does not obviously result in risk mitigation being astronomically cost-effective.
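A minimal sketch of that contrast under placeholder assumptions (the loguniform support, the 1 % of relocated mass, and the 10^-10 nudge are illustrative, not from the post):

```python
import math

# Placeholder world-model (an assumption for illustration): values of
# possible futures span [1, 10^52] lives, loguniformly distributed.
v_max = 1e52
mean_value = (v_max - 1) / math.log(v_max)  # mean of a loguniform on [1, v_max]

# Binary description: mitigation moves 1% of probability mass off
# value-0 worlds and spreads it loguniformly over all other worlds,
# so some of it lands on astronomically valuable ones.
p = 0.01
gain_binary = p * mean_value

# Continuous description: mitigation instead nudges every world's value
# up by a tiny relative amount, with no mass relocated to the far tail.
epsilon = 1e-10
gain_continuous = epsilon * mean_value

print(f"binary-style gain:     {gain_binary:.2e}")      # ~8.4e+47 lives
print(f"continuous-style gain: {gain_continuous:.2e}")  # ~8.4e+39 lives
```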
On 1, I guess the probability of humans going extinct over the next 10 years is 10^-7, so nowhere near as small as needed to result in non-astronomically large cost-effectiveness when multiplied by the expected value of the future. However, you are right this is not the main point of the post.
On 3, I guess you find a non-astronomical expected value of the future more plausible due to your guess for the nearterm risk of human extinction being orders of magnitude higher than mine.
[1] Firstly, I do not think reducing the nearterm risk of human extinction being astronomically cost-effective implies it is astronomically more cost-effective than interventions not explicitly focussing on tail risk, like ones in global health and development and animal welfare.
[2] I say in the post that: "In general, any task (e.g. risk mitigation) can be made arbitrarily costly/hard by decreasing the time in which it has to be performed. At the limit, the task becomes impossible when there is no time to complete it."
[3] Either it is on or off. Either it affects the whole universe or nothing at all.
Trying to aim at the point of most confusion about why we're seeing things differently:
Why doesn't your argument show that medical intervention during birth can have only a negligible effect on life expectancy?
For the situation to be analogous, one would have to consider an intervention decreasing the risk of a death over 1.80*10^-35 seconds (see calculations in the post), in which case it does feel intuitive to me that changing the life expectancy would be quite hard. Either the risk would occur over a much longer period, and therefore the intervention would only be mitigating a minor fraction of it, or it would be a very binary risk like simulation shutdown that would be super hard to reduce.
Of course it's not intended to be a strict analogy. Rather, I take the whole shape of your argument to be a strong presumption against interventions during a short period X having an expected effect on total lifespan that is >>X.
But I claim that this happens with birth. Birth is something like 0.0001% of the duration of the whole life, but interventions during birth can easily have impacts on life expectancy which are much greater than 0.0001%. Clearly there's some mechanism in operation here which means these don't need to be tightly coupled. What is it, and why doesn't it apply to the case of extinction risk?
Obviously the numbers involved are much more extreme in the case of human extinction, but I think the birth:life duration ratio is already big enough that we could hope to learn useful lessons from that.
Thanks for clarifying. My argument depends on the specific numbers involved. Reducing the risk of human extinction over the next year only directly affects around 10^10 lives, i.e. 10^-42 (= 10^(10 - 52)) of my estimate for the expected value of the future. I see such a reduction as astronomically harder than decreasing a risk which only directly affects 10^-6 (= 0.0001 %) of the value. My main issue is that these arguments are often analysed informally, whereas I think they require looking into maths like how fast the tail of counterfactual effects decays. I have now added the following bullets to the post, which were initially not imported:
I cannot help noticing that arguments for reducing the nearterm risk of human extinction being astronomically cost-effective might share some similarities with (supposedly) logical arguments for the existence of God (e.g. Thomas Aquinas' Five Ways), although they are different in many aspects too. Their conclusions seem to mostly follow from:
Cognitive biases. In the case of the former, the following come to mind:
Authority bias. For example, in Existential Risk Prevention as Global Priority, Nick Bostrom interprets a reduction in (total/cumulative) existential risk as a relative increase in the expected value of the future, which is fine, but then treats the former as independent of the latter, which I would argue is misguided given the dependence between the value of the future and the increase in its PDF. "The more technologically comprehensive estimate of 10^54 human brain-emulation subjective life-years (or 10^52 lives of ordinary length) makes the same point even more starkly. Even if we give this allegedly lower bound on the cumulative output potential of a technologically mature civilisation a mere 1 per cent chance of being correct, we find that the expected value of reducing existential risk by a mere one billionth of one billionth of one percentage point is worth a hundred billion times as much as a billion human lives."
Nitpick. The maths just above is not right. Nick meant 10^21 (= 10^(52 - 2 - 2*9 - 2 - 9)) times as much just above, i.e. a thousand billion billion times, not a hundred billion times (10^11); see the check after this list.
Binary bias. This can manifest not only in assuming the value of the future is binary, but also in assuming that interventions reducing the nearterm risk of human extinction mostly move probability mass from worlds with value close to 0 to ones which are astronomically valuable, as opposed to just slightly more valuable.
Scope neglect. I agree the expected value of the future is astronomical, but it is easy to overlook that the increase in the probability of the astronomically valuable worlds driving that expected value can be astronomically low too, thus making the increase in the expected value of the astronomically valuable worlds negligible (see my illustration above).
Little use of empirical evidence and detailed quantitative models to catch the above biases. In the case of the former:
As far as I know, reductions in the nearterm risk of human extinction, as well as their relationship with the relative increase in the expected value of the future, are always directly guessed.
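A quick check of the arithmetic in the nitpick above, with each exponent spelled out:

```python
# Exponents from Bostrom's passage: 10^52 lives of ordinary length,
# a 1% chance of that estimate being correct, a risk reduction of one
# billionth of one billionth of one percentage point, and a benchmark
# of a billion human lives.
expected_lives = 52 - 2          # 10^52 lives x 1% chance -> 10^50
risk_reduction = -(9 + 9 + 2)    # 10^-9 * 10^-9 * 10^-2 -> 10^-20
benchmark = 9                    # a billion lives -> 10^9

print(expected_lives + risk_reduction - benchmark)
# 21: a thousand billion billion times, not a hundred billion (10^11)
```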
Sorry, I don't find this is really speaking to my question?
I totally believe that people make mistakes in thinking about this stuff for reasons along the lines of the biases you discuss. But I also think that you're making some strong assumptions about things essentially cancelling out that I think are unjustified, and talking about mistakes that other people are making doesn't (it seems to me) work as a justification.
Sorry, I don't find this is really speaking to my question?
I do not think the difficulty of decreasing a risk is independent of the value at stake. It is harder to decrease a risk when a larger value is at stake. So, in my mind, decreasing the nearterm risk of human extinction is astronomically easier than decreasing the risk of not achieving 10^50 lives of value, such that decreasing the former by e.g. 10^-10 leads to a relative increase in the latter much smaller than 10^-10.
I also think that you're making some strong assumptions about things essentially cancelling out
Could you elaborate on why you think I am making a strong assumption in terms of questioning the following?
In light of the above, I expect what David Thorstad calls rapid diminution. I see the difference between the PDF after and before an intervention reducing the nearterm risk of human extinction as quickly decaying to 0, thus making the increase in the expected value of the astronomically valuable worlds negligible. For instance:
If the difference between the PDF after and before the intervention decays exponentially with the value of the future v, the increase in the value density caused by the intervention will be proportional to v*e^-v[4].
The above rapidly goes to 0 as v increases. For a value of the future equal to my expected value of 1.40*10^52 human lives, the increase in value density will be multiplied by a factor of 1.40*10^52*e^(-1.40*10^52) = 10^(log10(1.40) + 52 - log10(e)*1.40*10^52) ≈ 10^(-6.08*10^51), i.e. it will be basically 0.
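Verifying that factor in log space (e^(-1.40*10^52) underflows ordinary floating point, so the check works with base-10 logarithms):

```python
import math

v = 1.40e52  # the expected value of the future used above, in human lives

# log10(v * e^-v) = log10(v) - v * log10(e)
log10_factor = math.log10(v) - v * math.log10(math.e)
print(f"factor = 10^({log10_factor:.3e})")  # 10^(-6.080e+51): basically 0
```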
Do you think I am overestimating how fast the difference between the PDF after and before the intervention decays? As far as I can tell, the (posterior) counterfactual impact of interventions whose effects can be accurately measured, like ones in global health and development, decays to 0 as time goes by. I do not have a strong view on the particular shape of the difference, but exponential decay is quite typical in many contexts.
I do not think the difficulty of decreasing a risk is independent of the value at stake. It is harder to decrease a risk when a larger value is at stake.
This makes sense as a kind of general prior to come in with. Although note:
It's surely observational, not causal: there's no magic at play which means that if you keep a scenario fixed except for changing the value at stake, this should impact the difficulty
One of the plausible generating mechanisms is having a broad altruistic market which takes the best opportunities, leaving no free lunches; but for some of the cases we're discussing it's unclear the market could have made it efficient
So, in my mind, decreasing the nearterm risk of human extinction is astronomically easier than decreasing the risk of not achieving 10^50 lives of value, such that decreasing the former by e.g. 10^-10 leads to a relative increase in the latter much smaller than 10^-10.
Now it looks to me as though you're dogmatically sticking with the prior. Having come across the (kinda striking) observation which says "if there's a realistic chance of spreading to the stars, then premature human extinction would forgo astronomical value", it seems like you're saying "well, that would mean that the prior was wrong, so that observation can't be quite right", and then reasoning from your prior to try to draw conclusions about the causal relationships there.
Whereas I feel that the prior reasonably justifies more scepticism in cases where more lives are at stake (and indeed, I do put a bunch of probability on "averting near-term extinction doesn't save astronomical value for some reason or another", though the reasons tend to be ones where we never actually had a shot of an astronomically big future in the first place, and I think that that's sort of the appropriate target for scepticism), but it doesn't give you anything strong enough to be confident about things.
(I certainly wouldn't be surprised if I'm somehow misunderstanding what you're doing; I'm just responding to the picture I'm getting from what you've written.)
Now it looks to me as though you're dogmatically sticking with the prior.
Are there any interventions whose estimates of (posterior) counterfactual impact do not decay to 0 in at most a few centuries? From my perspective, their absence establishes a strong prior against persistent longterm effects.
I do put a bunch of probability on "averting near-term extinction doesn't save astronomical value for some reason or another", though the reasons tend to be ones where we never actually had a shot of an astronomically big future in the first place, and I think that that's sort of the appropriate target for scepticism
This makes a lot of sense to me too.
In general our ability to measure long term effects is kind of lousy. But if I wanted to look for interventions which don't have that decay pattern it would be most natural to think of conservation work saving species from extinction. Once we've lost biodiversity, it's essentially gone (maybe taking millions of years to build up again naturally). Conservation work can stop that. And with rises in conservation work over time it's quite plausible that saving species early won't just lead to them going extinct slightly later, but to them being preserved indefinitely.
I was not clear above, but I meant (posterior) counterfactual impact under expected total hedonistic utilitarianism. Even if a species is counterfactually preserved indefinitely due to actions now, which I think would be very hard, I do not see how it would permanently increase wellbeing. In addition, I meant to ask for actual empirical evidence as opposed to hypothetical examples (e.g. of one species being saved and making an immortal conservationist happy indefinitely).
I think this is something where our ability to measure is just pretty bad, and in particular our ability to empirically detect whether the types of things that plausibly have long-lasting counterfactual impacts actually do is pretty terrible.
I respond to that by saying "ok, I guess empirics aren't super helpful for the big picture question; let's try to build mechanistic understanding of things, grounded wherever possible in empirics, as well as priors about what types of distributions occur when various different generating mechanisms are at play", whereas it sounds like you're responding by saying something like "well, as a prior we'll just use the parts of the distribution we can actually measure, and assume that generalizes unless we get contradictory data"?
I respond to that by saying "ok, I guess empirics aren't super helpful for the big picture question; let's try to build mechanistic understanding of things, grounded wherever possible in empirics, as well as priors about what types of distributions occur when various different generating mechanisms are at play", whereas it sounds like you're responding by saying something like "well, as a prior we'll just use the parts of the distribution we can actually measure, and assume that generalizes unless we get contradictory data"?
Yes, that would be my reply. Thanks for clarifying.
Yeah, so I basically think that that response feels "spiritually frequentist", and is more likely to lead you to large errors than the approach I outlined (which feels more "spiritually Bayesian"), especially in cases like this where we're trying to extrapolate significantly beyond the data we've been able to gather.