Error
Unrecognized LW server error:
Field "fmCrosspost" of type "CrosspostOutput" must have a selection of subfields. Did you mean "fmCrosspost { ... }"?
Unrecognized LW server error:
Field "fmCrosspost" of type "CrosspostOutput" must have a selection of subfields. Did you mean "fmCrosspost { ... }"?
It is not obvious that non-extinction is an attractor state. If there is some minimal background risk of extinction that we can not get below (whether due to asteroids, false vacuum decay, nuclear war, everyone becoming a negative utilitarian and stops reproducing, whatever) then it is the nature of exponential discounting that the very long-term future can quickly become essentially unimportant.
I think expansive space colonization would reduce the risk asymptotically, since it’s unlikely for all of a large number of very distant civilizations to go extinct around the same time. The more distant the civilizations, the more roughly independent their risks should be. And the more civilizations there are, the more likely at least one is around at any time.
It’s certainly not as strong an attractor state as extinction, but I still think it’s an attractor state to some extent. Certainly wild animals (especially when you consider aquatic life) have and likely will exist for a very long time unless we take extreme action to get rid of them or there’s a particularly intense catastrophe.
Also I agree with Michael on the relevance of space colonisation. Many total utilitarians can’t wait for space colonisation as it will significantly reduce x-risk. I get this thinking, but I hope we don’t bring non-human animals. As I say in the post, it seems safer to make them go extinct.
On discounting, uncertainty over future discount rates (perhaps due to uncertainty about future x-risk which may become lower than it is now) leads to a declining discount rate over time and the result that we should discount the longterm future as if we were in the safest world among those we find plausible. This is known as Weitzman discounting. From Greaves’ paper Discounting for Public Policy:
Therefore we can’t really wave away the very long-term future, assuming of course that Weitzman is correct (he may not be, see the “Weitzman-Gollier puzzle”).
I think you’re accurately reflecting what these authors say, but my independent impression is that this point is mistaken. I.e., I think the idea of a qualitative distinction between simple cluelessness and complex cluelessness doesn’t make sense.
I describe my position, and why I think the examples typically used don’t support the points the authors want them to support, here. Here I’ll just briefly suggest some reasons why we can say something about the long-term effects of choosing to conceive a child on a particular day:
One day later means the child should be expected to be born roughly one day later, and thus will be roughly one day younger at any future point. This probably very slightly slows down GDP growth, intellectual progress, population growth (via the child later having their own children), growth in carbon emissions due to what the child does themselves, maybe also cuts in carbon emission due to tech advancement or policy change or whatever, etc. Then this could be good or bad for the long-term future based on whether things like GDP growth, intellectual progress, population growth, etc. are good or bad for the long-term future, which it also seems we can say something about (see e.g. Differential Progress).
I don’t want to start a pointless industry of alternatively ‘shooting down’ & refining purported cases of simple cluelessness, but just for fun here is another reason for why our cluelessness regarding “conceiving a child on Tuesday vs. Wednesday” really is complex:
Shifting the time of conception by one day (ignoring the empirical complication pointed out by Denise below) also shifts the probability distribution of birth date by weekday, e.g. whether the baby’s birth occurs on a Tuesday or Wednesday. However, for all we know the weekday of birth has a systematic effect on birth-related health outcomes of mother or child. For instance, consider some medical complication occurring during labor with weekday-independent probability, which needs to be treated in a hospital. We might then worry that on a Wednesday healthcare workers will tend to be more overworked, and so slightly more likely to make mistakes, than on a Tuesday (because many of them will have had the weekend off and so on Wednesday they’ve been through a larger period of workdays without significant time off). On the other hand, we might think that people are reluctant to go to a hospital on a weekend such that there’ll be a “rush” on hospitals on Mondays, which takes until Wednesday to “clear”—making in fact Monday or Tuesday more stressful for healthcare workers. And so on and so on …
(This is all made up, but if I google for relevant terms I pretty quickly find studies such as Weekday of Surgery Affects Postoperative Complications and Long-Term Survival of Chinese Gastric Cancer Patients after Curative Gastrectomy or Outcomes are Worse in US Patients Undergoing Surgery on Weekends Compared With Weekdays or Influence of weekday of surgery on operative
complications. An analysis of 25.000 surgical procedures or …
I’m sure many of these studies are terrible but their existence illustrates that it might be pretty hard to justify an epistemic state that is committed to the effect of different weekdays exactly canceling out.)
((It doesn’t help if we could work out the net effect on all health outcomes at birth, say b/c we can look at empirical data from hospitals. Presumably some non-zero net effect on e.g. whether or not we increase the total human population by 1 at an earlier time would remain, and then we’re caught in the ‘standard’ complex cluelessness problem of working out whether the long-term effects of this are net positive or net negative etc.))
I’m wondering if a better definition of simple cluelessness would be something like: “While the effects don’t ‘cancel out’, we are justified in believing that their net effect will be small compared to differences in short-term effects.”
I think that that’s clearly a good sort of sentence to say. But:
I don’t think we need the simple vs complex cluelessness” idea to say that
I really don’t want us to use the term “clueless” for that! That sounds very absolute, and I think was indeed intended by Greaves to be absolute (see her saying “utterly unpredictable” here).
I don’t want us to have two terms that (a) sound like they’re meant to sharply distinct, and (b) were (if I recall correctly) indeed originally presented as sharply distinct.
(I outlined my views on this a bit more in this thread, which actually happens to have been replies to you as well.)
Why can’t we simply talk in terms of having more or less “resilient” or “justified” credences, in terms of of how large the value of information from further information-gathering or information-analysis would be, and in terms of the value of what we could’ve done with that time or those resources otherwise?
It seems like an approach that’s more clearly about quantitative differences in degree, rather than qualitative differences in kind, would be less misleading and more useful.
It’s been a year since I thought about this much, and I only read 2 of the papers and a bunch of the posts/comments (so I didn’t e.g. read Trammell’s paper as well). But from memory, I think there are at least two important ways in which standard terms and framing of simple vs complex cluelessness has caused issues:
Many people seem to have taken the cluelessness stuff as an argument that we simply can’t say anything at all about the long-term future, whereas we can say something about the near-term future, so we should focus on the near-term future.
Greaves seems to instead want to argue that we basically, at least currently, can’t say anything at all about the long-term effects of interventions like AMF, whereas we can say something about the long-term effects of a small set of interventions chosen for their long-term effects (e.g., some x-risk reduction efforts), so we should focus on the long-term future.
See e.g. here, where Greaves says the long-term effects of short-termist interventions are “utterly unpredictable”.
My independent impression is that both of these views are really problematic, and that the alternative approach used in Tarsney’s epistemic challenge paper is just obviously far better. We should just think about how predictable various effects on various timelines from various interventions are. We can’t just immediately say that we should definitely focus on neartermist interventions or that we should definitely focus on longtermist interventions; it really depends on specific questions that we actually can improve our knowledge about (through efforts like building better models or collecting more evidence about the feasibility of long-range forecasting).
Currently, this is probably the main topic in EA where it feels to me like there’s something important that’s just really obviously true and that lots of other really smart people are missing. So I should probably find time to collect my thoughts from various comments into a single post that lays out the arguments better.
When this post went up, I wrote virtually the same comment, but never sent it! Glad to see you write it up, as well as your below comments. I have the impression that in each supposed example of ‘simple cluelessness’ people just aren’t being creative enough to see the ‘complex cluelessness’ factors, as you clarify with chairs in other comment.
My original comment even included saying how Phil’s example of simple cluelessness is false, but it’s false for different reasons than you think: If you try to conceive a child a day later, this will not in expectancy impact when the child will be born. The impact is actually much stronger than that. It will affect whether you are able to conceive in this cycle at all, since eggs can only be fertilized during a very brief window of time (12-24 hours). If you are too late, no baby.
To be honest I’m not really sure how important there being a distinction between simple and complex cluelessness actually is. The most useful thing I took from Greaves was to realise there seems to be an issue of complex cluelessness in the first place—where we can’t really form precise credences in certain instances where people have traditionally felt like they can, and that these instances are often faced by EAs when they’re trying to do the most good.
Maybe we’re also complexy clueless about what day to conceive a child on, or which chair to sit on, but we don’t really have our “EA hat on” when doing these things. In other words, I’m not having a child to do the most good, I’m doing it because I want to. So I guess in these circumstances I don’t really care about my complex cluelessness. When giving to charity, I very much do care about any complex cluelessness because I’m trying to do the most good and really thinking hard about how to do so.
I’m still not sure if I would class myself as complexly clueless when deciding which chair to sit on (I think from a subjective standpoint I at least feel simply clueless), but I’m also not sure this particular debate really matters.
I’m also inclined to agree with this. I actually only very recently realized that a similar point had also been made in the literature: in this 2019 ‘discussion note’ by Lok Lam Yim, which is a reply to Greaves’s cluelessness paper:
Interesting—that’s fairly similar to the counterarguments I gave for the same case here:
Your critique of the conception example might be fair actually. I do think it’s possible to think up circumstances of genuine ‘simple cluelessness’ though where, from a subjective standpoint, we really don’t have any reasons to think one option may be better or worse than the alternative.
For example we can imagine there being two chairs in front of us and making a choice of which chair to sit on. There doesn’t seem to be any point stressing about this decision (assuming there isn’t some obvious consideration to take into account), although it is certainly possible that choosing the left chair over the right chair could be a terrible decision ex post. So I do think this decision is qualitatively different to donating to AMF.
However I think the reason why Greaves introduces the distinction between complex and simple cluelessness is to save consequentialism from Lenman’s cluelessness critique (going by hazy memory here). If a much wider class of decisions suffer from complex cluelessness than Greaves originally thought, this could prove problematic for her defence. Having said that, I do still think that something like working on AI alignment probably avoids complex cluelessness for the reasons I give in the post, so I think Greaves’ work has been useful.
So far, I feel I’ve been able to counter any proposed example, and I predict I would be able to do so for any future example (unless it’s the sort of thing that would never happen in real life, or the information given is less than one would have in real life).
Some off-the-top-of-my-head reasons we might not have perfect evidential symmetry here:
One chair might be closer, so walking to it expends less energy and/or takes less time, which has various knock-on effects
One chair will be closer to some other object in the world, making it easier for you to hear what’s going on over there and for people over there to hear you, which could have various knock-on effects
One chair might look very slightly older, and thus very slightly more likely to have splinters or whatever
I totally agree, but this is a very different claim from there being a qualitative, absolute distinction between simple and complex cluelessness.
My independent impression is that, for the purpose of evaluating longtermism and things like that, we could basically replace all discussion of simple vs complex cluelessness with the following points:
You’ll typically do a better job achieving an objective (in expectation) if you choose a plan that was highlighted in an effort to try to achieve that objective, rather than choosing a plan that was highlighted in an effort to try to achieve some other objective
This seems like commonsense, and also is in line with the “suspicious convergence” idea
Plans like “donate to AMF” were not highlighted to improve the very long-term future
Plans like “donate to reduce AI x-risk” were highlighted largely to improve the very long-term future
A nontrivial fraction of people highlighted this plan for other reasons (e.g., because they wanted to avoid extinction for their own sake or the sake of near-term generations), but a large fraction highlighted it for approximately longtermist reasons (e.g., Bostrom)
On the object-level, it also seems like existing work makes a much more reasonable case for reducing AI x-risk as a way to improve the long-term future than for AMF as a way to improve the long-term future
But then there’s also the fact that those far-future effects are harder to predict that nearer-future effects, and nearer-future effects do matter at least somewhat, so it’s not immediately obvious whether we should focus on the long-term or the short-term. This is where work like “The Epistemic Challenge to Longtermism” and “Formalising the ‘Washing Out Hypothesis’” become very useful.
Also, there are many situations where it’s not worth trying to work out which of two actions are better, due to some mixture of that being very hard to work out and the stakes not being huge
E.g., choosing which chair to sit on; deciding which day to try to conceive a child on
This is basically just a point about value of information and opportunity cost; it doesn’t require a notion of absolute evidential symmetry
(I used AMF and AI x-risk as the examples because you did; we could also state the points in a more general form.)
FWIW, I also think other work of Greaves has been very useful. And I think most people. -though not everyone—who’ve thought about the topic think the cluelessness stuff is much more useful than I think it is (I’m just reporting my independent impression here), so my all-things-considered belief is that that work has probably been more useful than seems to be true to me.
I think simple cluelessness is a subjective state. In reality one chair might be slightly older, but one can be fairly confident that it isn’t worth trying to find out (in expected value terms). So I think I can probably just modify my example to one where there doesn’t seem to be any subjectively salient factor to pull you to one chair or the other in the limited time you feel is appropriate to make the decision, which doesn’t seem too far-fetched to me (let’s say the chairs look the same at first glance, they are both in front row and either side of the aisle etc.).
I think invoking simple cluelessness in the case of choosing which chair to sit on is the only way a committed consequentialist can feel OK making a decision one way or the other—otherwise they fall prey to paralysis. Admittedly I haven’t read James Lenman closely enough to know if he does in fact invoke paralysis as a necessary consequence for consequentialists, but I think it would probably be the conclusion.
EDIT: To be honest I’m not really sure how important there being a distinction between simple and complex cluelessness actually is. The most useful thing from Greaves was to realise there seems to be an issue of complex cluelessness in the first place where we can’t really form precise credences.
For me, Greaves’ work on cluelessness just highlighted to me a problem I didn’t think was there in the first place. I do feel the force of her claim that we may no longer be able to justify certain interventions (for example giving to AMF), and I think this should hold even for shorttermists (provided they don’t discount indirect effects at a very high-rate). The decision-relevant consequence for me is trying to find interventions that don’t fall prey to problem, which might be the longtermist ones that Greaves puts forward (although I’m uncertain about this).
I haven’t read the relevant papers since last year, but I think I recall the idea being not just that we currently don’t have a sense of what the long-term effects of an action are, but also that we basically can’t gain information about that. In line with that memory of mine, Greaves writes here that the long-term effects of short-termist interventions are “utterly unpredictable”—a much stronger claim that just that we currently have no real prediction.
(And I think that that idea is very problematic, as discussed elsewhere in this thread..)
(I’m writing this while a little jetlagged, so it might be a bit incoherent or disconnected from what you were saying.)
I don’t think this is right. I think the key thing is to remember that doing more analysis (thinking, discussing, researching, whatever) is itself a choice, and itself has a certain expected value (which is related to how long it will take, how likely it is to change what other decision you make, and how much of an improvement that change might be). Sometimes that expected value justifies the opportunity cost, and sometimes it doesn’t. This can be true whether you can or can’t immediately see any difference in the expected value of the two “concrete choices” (this is a term I’m making up to exclude the choice to do further analysis).
E.g., I don’t spend time deciding which of two similar chairs to sit in, and this is the right decision for me to make from a roughly utilitarian perspective, and this is because:
It seems like that, even after quite a while spent analysing which chair I should sit in, the expected value I assign to each choice would be quite similar
There are other useful things I can do with my time
The expected value of just choosing a chair right away and then doing certain other things is higher than the expected value of first spending longer deciding which chair to sit in
(Of course, I don’t explicitly go through that whole thought process each time I implicitly make a mundane decision.)
But there are also some cases where the expected values we’d guess each of two actions would have are basically the same and yet we should engage in further analysis. This is true when the opportunity cost of the time spent on that analysis seems justified, in expectation, by the probability that that analysis would cause us to change our decision and the extent to which that change might be an improvement.
So I don’t think the concept of “simple cluelessness” is necessary, and I think it’s unhelpful in that:
It sounds absolute and unchangeable, whereas in many cases one either already has or could come to have a belief about which action would have higher expected value
It implies that there’s something special about certain cases where one has extremely little knowledge, whereas really what’s key is how much information value various actions (e.g., further thinking) would provide and what opportunity cost those actions have
I think another genuine issue for longtermism is complex cluelessness/deep uncertainty and moral uncertainty, although it’s not specific to longtermism. Even if you identify an intervention that you think has predictably large effects on the far future, you may not be able to weigh the arguments and evidence in such a way as to decide that it’s actually net positive in expectation.
It’s easy to forget the possibility that you’ll do more harm than good, or give it too little weight, and I suspect this is worse when we get into very small probabilities of making a difference, and especially Pascalian cases, since we’re especially bad at estimating such (differences in) probabilities and considering many very small (differences in) probabilities.
There seems to be some bias at some influential EA orgs against writing about the idea that the far future could be bad (or worse conditional on the survival of humanity or our descendants), which can lead us to systematically underestimating the risks of backfire in this way or related ways. There are other ways seemingly good interventions can backfire, e.g. the research we publish could be used for harm (even if they they’re actually doing good according to their own views!), or some AI safety work could be accelerating the development (and adoption) of AGI. Without good feedback loops, such biases and blindspots can persist more easily, and shortermist interventions tend to have better feedback loops than longtermist ones.
I think that the claim you make it plausible, but I don’t think the post you link to provides good evidence of it. If readers were going to read and update on that post, I’d encourage them to also read the commentary on it here. (I read the post myself and found it very unconvincing and strange.)
I think the guidelines and previous syllabi/reading lists are/were biased against downside-focused views, practically pessimistic views, and views other than total symmetric and classical utilitarianism (which are used most to defend work against extinction) in general, as discussed in the corresponding sections of the post. This is both on the normative ethics side and discussion of how the future could be bad or extinction could be good. I discussed CLR’s guidelines with Jonas Vollmer here. CLR’s guidelines are here, and the guidelines endorsed by 80,000 Hours, CEA, CFAR, MIRI, Open Phil and particular influential EAs are here. (I don’t know if these are current.)
On the normative ethics side, CLR is expected to discuss moral uncertainty and non-asymmetric views in particular to undermine asymmetric views, and while the other side is expected to discuss moral uncertainty and s-risks, they are not expected to discuss asymmetric views in particular, so this biases us away from asymmetric views, according to which the future may be bad and extinction may be good.
On discussion of how the future could be bad or extinction could be good, from CLR’s guidelines:
(emphasis mine)
So, CLR associates are discouraged from arguing that the future could be bad and extinction could be good, biasing us against theses hypotheses.
I’m not sure that the guidelines for CLR are actually bad overall, though, since I think the arguments for them are plausible, and I agree that people with pessimistic or downside-focused views should not seek to cause extinction, except possibly through civil discussion and outreach causing people to deprioritize work on preventing extinction. But the guidelines rule out ways of doing the latter, too.
I have my own (small) personal example related to normative ethics, too. The coverage of the asymmetry on this page, featured on 80,000 Hours’ Key Ideas page, is pretty bad:
The article does not cite any literature making positive cases for the asymmetry (although they discuss the repugnant conclusion as being a reason for person-affecting views). I cite some in this thread.
There are asymmetric views to which this argument does not apply, some published well before this page, e.g. this and this. Also, the conclusion may not be so bizarre if the lives are equally content/satisfied, in line with negative accounts of welfare (tranquilism/Buddhist axiology, antifrustrationism, negative utilitarianism, etc.).
Over a year ago, I criticized this for being unfair in the comments section of that page, linking to comments in my own EA Forum shortform and other literature with arguments for the asymmetry, and someone strong downvoted the comments in my shortform with a downvote strength of 7 and without any explanation. There was also already another comment criticizing the discussion of the asymmetry.
FWIW, I think that the specific things you point to in this comment do seem like some evidence in favour of your claim that some influential EA orgs have some bias against things broadly along the lines of prioritising s-risks or adopting suffering-focused ethical views. And as mentioned in my other comment, I also did already see that claim as plausible.
(I guess more specifically, I see it as likely that at least some people at EA orgs have this bias, and likely that there’s at least a little more of this bias than of an “opposite” bias, but not necessarily likely—just plausible—that there’s substantially more of that bias than of the “opposite” bias.)
Also, on reflection, I think I was wrong to say “I don’t think the post you link to provides good evidence [for your claim].” I think that the post you link to does contain some ok evidence for that claim, but also overstates the strength of this evidence, makes other over-the-top claims, and provides as evidence some things that don’t seem worth noting at all, really.
And to put my own cards on the table on some related points:
I’d personally like the longtermist community to have a bit of a marginal shift towards less conflation of “existential risk” (or the arguments for existential risk reduction) with “extinction risk”, more acknowledgement that effects on nonhumans should perhaps be a key consideration for longtermists, and more acknowledgement of s-risks as a plausible longtermist priority
But I also think we’re already moving in the right direction on these fronts, and that we’re already in a fairly ok place
From what I’ve read moral uncertainty tends to work in favour of longtermists, provided you’re happy to do something like maximising expected choice-worthiness. E.g. see here for moral uncertainty about population axiology implying we should choose options preferred by total utilitarianism (disclaimer—I’ve only read the abstract!). If Greaves and MacAskill’s claim about the robustness of longtermism to different moral views is fair, it seems longtermism should remain fairly robust in the face of moral uncertainty.
In terms of complex cluelessness in a more empirical sense, I admit I haven’t properly considered the possibility that something like “researching AI alignment” may have realistic downsides. I do however find it a tougher sell that we’re complexly clueless about working on AI alignment in the same way that we are about giving to AMF.
Fantastic post, thanks for taking the time to write it :)
Thank you, glad you liked it!
tl;dr for this comment:
I appreciate you highlighting that longtermists doesn’t necessarily entail ultimately focusing on humans.
But I think it’d be better to broaden your discussion to humans, non-human animals, and other types of non-humans that might be moral patients (e.g., artificial sentiences)
See also posts tagged Non-Humans and the Long-Term Future
And I think you imply that, when it comes to non-humans, we must necessarily focus on suffering-reduction
I think it’d be better to broaden your discussion so that it’s open to other goals regarding non-humans, such as increasing happiness
---
This was also a nitpick I had. Here’s a relevant part of the notes I wrote on their paper (which I should be posting in full soon):
It seems like the paper implicitly assumes that humans are the only moral patients
I think it makes sense for the paper to focus on humans, since it makes sense for many papers to tackle just one thorny issue at a time
But I think it would’ve been good for the paper to at least briefly acknowledge that this is just a simplifying assumption
Perhaps just in a footnote
Otherwise the paper is kind-of implying that the authors really do take it as a given that humans are the only moral patients (which I think wouldn’t actually match the authors’ views)
---
I appreciate you highlighting that longtermism doesn’t have to focus on humans. But I personally think your framing is still narrower than it should be: You could be interpreted as implying that longtermists should focus on either humans, or on the suffering of non-human animals, or on some combination of those goals.
But I think it’s also quite important to consider other possible moral patients that are neither humans nor animals, such as artificial sentiences. (Perhaps arguably some artificial sentiences would be considered by some people to be effectively humans or non-human animals, but this may not be the case, and other artificial sentiences might be more starkly different.)
And it could also be a moral priority to decrease bad things other than suffering among non-humans, such as death or a lack of freedom. (This would of course require that utilitarianism be false, or at least that we be quite uncertain about it.) And it could also be a moral priority to increase good things for non-humans (e.g., allow there to be large numbers of happy non-human beings).
I do think that this is all plausible. But I think people have sometimes jumped on this option too quickly, with too little critical consideration. See also this section of a post and this doc.
Also, I think “Non-human animals are moral patients” (emphasis added) is too strong; I’m not sure we should be practically certain that any nonhuman animals are moral patients, and I definitely don’t think we should be practically certain that all are (e.g., insects, crabs).
To be clear, I’m vegan, and broadly supportive of people focusing on animal welfare, and I think due to moral uncertainty / expected value society should pay far more attention to animals than it does. But I still think it’s quite unclear which animals are moral patients. And my impression is that people who’ve looked into this tend to feel roughly similar (see e.g. Muehlhauser’s report).
My impression is that people interested in wild animal welfare early on jumped a bit too quickly to being confident that wild animal lives are net negative, possibly because one of the pioneers of this area (Brian Tomasik) is morally suffering-focused (hence the early arguments tended to focus on suffering).
I don’t have a strong view on whether wild animal lives tend to be net negative, but it seems to me that more uncertainty is warranted.
See also the EAG talk Does suffering dominate enjoyment in the animal kingdom? | Zach Groff.
I don’t think this undermines the idea that maybe longtermists should focus on non-humans, but it suggests that maybe it’s unwise to place much much emphasis on reducing suffering and maybe even causing extinction than on other options (e.g., improving their lives or increasing their population). I think we should currently see both options as plausible priorities.
tl;dr: It’s plausible to me that the future will involve far more nonbiological sentience (e.g., whole brain emulations) than biological sentience, which might make farm animals redundant and wild animals vastly outnumbered.
You write:
I’m not sure why you think it’s highly likely that wild animals will continue to exist for a long-time, and in particular in large numbers relative to other types of beings (which you seem to imply, though you don’t state it outright)? It seems plausible to me that the future will involve something like massive expansion into space by (mostly) humans, whole-brain emulations, or artificial sentiences, without spreading wild animals to these places. (We might spread simulated wild animals without spreading biological ones, but we also might not.)
Relatedly, I think another reason farmed animals might be made redundant is that humanity may simply move from biological to digital form, such that there is no need to eat actual food of any kind. (Of course, it seems hard to say whether this’ll happen, but over a long time-scale I wouldn’t say it’s highly unlikely.)
For arguments for and against these sorts of points I’m making, see Should Longtermists Mostly Think About Animals? and the comments there.
You list as a misconception that “Longtermists have to predict the future”. I think what you intend to say is a valuable point, but that your current phrasing is a bit off. I think what you really mean is something like that it’s a misconception that “Longtermists have to make very precise predictions about the far future”.
I agree with this point, and think it’s important:
But longtermists still do need to predict the future—in particular, the case for the most popular longtermist interventions require:
predicting events like existential risks or trajectory changes in the coming years, decades, or centuries
predicting that those events would cause the world to be “locked-in” to a particular state/trend, or something like that
Predicting the future is also necessary to make the case for neartermist/shortermist interventions, but that tends to require predicting things that are closer to the present and for which we have more precedents/data to go on.
Thanks for this post! I expect this will indeed clarify some important points for people, and I’ve already recommended this post to a couple people interested in these sorts of topics.
I’d suggest editing the post to put the misconceptions in the headings in quote marks, or something like that. Otherwise on some level people’s minds might initially process the headings as claims you’re making—especially if they’re skimming or just glancing at the post.
(This suggestion is informed by my previous reading on topics like how misinformation can accidentally emerge, spread, and be sticky. A particularly relevant paper is The effects of subtle misinformation in news headlines. I haven’t checked for replications of this stuff, but the basic takeaway here seems to make intuitive sense to me as well.)
Thanks for all your comments Michael, and thanks for recommending this post to others!
I have read through your comments and there is certainly a lot of interesting stuff to think about there. I hope to respond but I might not be able to do that in the very near future.
Great suggestion thanks, I have done that.
Thank you for this Jack.
Floating an additional idea here, in the terms of another misconception that I sometimes see. Very interested in your feedback:
Possible misconception: Someone has made a thorough case for “strong longtermism”
Axiological strong longtermism (AL) is the idea that: “In a wide class of decision situations, the option that is ex ante best is contained in a fairly small subset of options whose ex ante effects on the very long-run future are best.”
The colloquial use of strong longtermism on this forum (CL) is something like “In most of the ethical choices we face today we can focus primarily on the far-future effects of our actions”.
Now there are a few reasons why this might not follow (why CL might not follow from AL):
The actions that are best in the short run are the same as the ones that are best in the long run (this is consistent with AL, see p10 of the Case for Strong Longtermism paper) in which case focusing attention on the more certain short term could be sufficient.
Making decisions solely by evaluating ex ante effects is not a useful way of making decisions or otherwise interacting with the world.
Etc
Whether or not you agree with these reasons it should at least be acknowledged that the Case for Strong Longtermism paper focuses on making a case for AL – it does not actually try to make a case for CL. This does not mean there is no way to make a case for CL but I have not seen anyone try to and I expect it would be very difficult to do, especially if aiming for philosophical-level rigour.
– –
This misconception can be used in discussions for or against longtermism. If you happen to be a super strong believer that we should focus mainly on the far future it would whisper caution and if you think that Greaves and MacAskill’s arguments are poor it would suggest being careful not to overstate their claims.
(PS. Both 1 and 2 seem likely to be true to me)
Thanks for this! I guess I agree with your overall point that the case isn’t as airtight as it could be. It’s for that reason that I’m happy that the Global Priorities Institute has put longtermism front and centre of their research agenda. I’m not sure I agree with your specific points though.
I’m sceptical of this. It would seem to me to be surprising and suspicious convergence that the actions that are best in terms of short-run effects are also the actions that are best in terms of long-run effects. We should be predisposed to thinking this is very unlikely to be the case.
Even if there are some cases where the actions that have the best short run effects are also the ones that have the best long-run effects, I think it would be important for us to justify doing them based on their long-run effects (so I disagree that colloquial longtermism would be undermined as you say). This is because, if axiological strong longtermism is true, the vast majority of the value of these actions will in fact be coming from the long-run effects. Ignoring this fact and just doing them based on their short-run effects wouldn’t seem to me to be a great idea, as if we were to come across evidence or otherwise conclude that the action isn’t in fact good from a long-run perspective, we wouldn’t be able to correct for this (and correcting for it would be very important). So I’m not convinced that AL doesn’t imply CL.
I would need to know more about your proposed alternative to comment. I would just point out (something I didn’t mention in my post), that Greaves and MacAskill also argue for “deontic strong longtermism” in their paper. I.e. that we ought to be driven by far future effects. They argue that the deontic longtermist claim follows from the axiological claim as, if axiological strong longtermism is true it is true by a large margin, and that a plausible non-consequentialist theory has to be sensitive to the axiological stakes, becoming more consequentialist in output as the axiological stakes get higher.
I hope this doesn’t come across as snarky, but “etc.” makes it sound like there is a long list of obvious problems but, to be honest, I’m not sure what these are beyond the ones I mention in my post so it would probably be helpful for you to specify these.
Hi Jack, Thank you for your thoughts. Always a pleasure to get your views on this topic.
I think that was the main point I wanted to make (the rest was mostly to serve as an example). The case is not yet made with rigour, although maybe soon. Glad you agree.
I would also expect (although cant say for sure) that if you go hang out with GPI academics and ask how certain they are about x y and z about longtermism you would perhaps find less certainty than it comes across from the outside or that you might find on this forum and that it is useful for people to realise that.
Hence thought it might be one for your list.
– –
The specific points 1. and 2. were mostly to serve as examples for the above (the “etc” was entirely in that vein, just to imply that there maybe things that a truly rigorous attempt to prove CL would throw up).
Main point made, and even roughly agreed on :-), so happy to opine a few thoughts on the truth or 1. and 2. anyway:
– –
Please assume that by short-term I mean within 100 years, not within 10 years.
A few reasons you might think this is true:
Convergence: See your section on “Longtermists won’t reduce suffering today”. Consider some of the examples in the paper, speeding up progress, preventing climate change, etc are quite possibly the best things you would do to maximise benefit over the next 100 years. AllFed justify working on extreme global risks based on expected lives saved in the short-run. (If this is suspicious convergence it goes both ways, why are many of the examples in the paper so suspiciously close to what is short-run best).
Try it: Try making the best plan you can accounting for all the souls in the next 1x10^100 years, but no longer. Great done. Now make the best plan but only take into account the next 1X10^99 years. Done? does it look any different? Now try 1x10^50 years. How different does that look? What about the best plan for 100000 years? Does that plan look different? What about 1000 years or 100 years? At what point does it look different? Based on my experience of working with governments on long-term planning my guess would be it would start to differ significantly after about 50-100 years. (Although it might well be the case that this number is higher for philanthropists rather than policy makers.)
Neglectedness: Note that the two thirds of the next century (after 33 years) is basically not featured in almost any planning today. That means most of the next 100 years is almost as neglected as the long-term future (and easier to impact).
On:
I think I agree with this (at least intuitively agree, not given it deep though). I raised 1. as it as I think it is a useful example of where the Case for Strong Longtermism paper focuses on AL rather than CL. See section 3 p9 – the authors say if short-term actions are also the best long-term actions then AL is trivially true and then move on. The point you raise here is just not raised by the authors as it is not relevant to the truth of AL.
– –
I agree that AL leads to ‘deontic strong longtermism’.
I don’t think expected value approach (which is the dominant approach used in their paper) or the other approaches they discuss fully engages with how to make complex decisions about the far future. I don’t think we disagree much here (you say more work could be done on decisions theoretic issues, and on tractability).
Unfortunately, I am running out of time and weekend to go into this in too much depth on this so I hope you don’t mind that instead of a lengthy answer here if I just link you to some reading.
I have recently been reading the following that you might find an interesting introduction to how one might go about thinking about these topics and is fairly close to my views:
https://blog.givewell.org/2014/06/10/sequence-thinking-vs-cluster-thinking/
https://www.givewell.org/modeling-extreme-model-uncertainty
– –
Always happy to hear your views. Have a great week
I just don’t really see a meaningful / important distinction between AL and CL to be honest. Let’s consider that AL is true, and also that cultivated meat happens to be the best intervention from both a shortermist and longtermist perspective.
A shortermist might say: I want cultivated meat so that people stop eating animals reducing animal suffering now
A longtermist might say: I want cultivated meat so that people stop eating animals and therefore develop moral concern for all animals. This will reduce risks of us locking in persistent animal suffering in the future
In this case, if AL is true, I think we should also be colloquial longtermists and justify cultivated meat in the way the longtermist does, as that would be the main reason cultivated meat is good. If evidence were to come out that stopping eating meat doesn’t improve moral concern for animals, cultivated meat may no longer be great from a longtermist point of view—and it would be important to reorient based on this fact. In other words, I think AL should push us to strive to be colloquial longtermists.
Otherwise, thanks for the reading, I will have a look at some point!
I think I essentially agree, and I think that these sorts of points are too often ignored. But I don’t 100% agree. In particular, I wouldn’t be massively surprised if, after a few years of relevant research, we basically concluded that there’s a systematic reason why the sort of things that are good for the short-term will tend to also be good for the long-term, and that we can basically get no better answers to what will be good for the long-term than that. (This would also be consistent with Greaves and MacAskill’s suggestion of speeding up progress as a possible longtermist priority.)
I’d bet against that, but not with massive odds. (It’d be better for me to operationalise my claim more and put a number on it, rather than making these vague statements—I’m just taking the lazy option to save time.)
And then if that was true, it could make sense to most of the time just focus on evaluating things based on short-term effects, because that’s easier to evaluate. We could have most people focusing on that proxy most of the time, while a smaller number of people continuing checking whether that seems a good proxy and whether we can come up with better ones.
I think most longtermists are already doing something that’s not massively different from that: Most of us focus most of the time on reducing existential risk, or some specific type of existential risk (e.g., extinction caused by AI), as if that’s our ultimate, terminal goal. Or we might even most of the time focus on an even more “proximate” or “merely instrumental” proxy, like “improving institutions’ ability and motivation to respond effectively to [x]”, again as if that’s a terminal goal.
(I mean this to stand in contrast to consciously focusing on “improving the long-term future as much as possible”, and continually re-deriving what proxies to focus on based on that goal. That would just be less efficient.)
Then we sometimes check in on whether the proxies we focus on are actually what’s best for the future.
I think this approach makes sense, though it’s also good to remain aware of what’s a proxy and what’s an ultimate goal, and to recognise our uncertainty about how good our proxies are. (This post seems relevant, and in any case is quite good.)
Yeah, this is also what came to mind for me when I read weeatquince’s comment. I’d add that Greaves and MacAskill also discuss some possible decision-theoretic objections, including objections to the idea that one should simply make decisions based on what seems to have the highest expected value, and argue that the case for longtermism seems robust to these objections. (I’m not saying they’re definitely right, but rather that they do seem to engage with those potential counterarguments.)
I agree that CL may or may not follow from AL depending on one’s other ethical and empirical views.
However, I’m not sure I understand if and why you think this is a problem for longtermism specifically, as opposed to effective altruism more broadly. For instance, consider the typical EA argument for donating to more rather than less effective global health charities. I think that argument essentially is that donating to a more effective charity has better ex-ante effects.
Put differently, I think many EAs donate to AMF because they believe that GiveWell has established that marginal donations to AMF have pretty good ex-ante effects compared to other donation options (at least if we only look at a certain type of effect, namely short-term effects on human beneficiaries). But I haven’t seen many people arguing on the EA Forum that, actually, it is a misconception that someone has made a thorough case for donating to AMF because maybe making decisions solely by evaluating ex-ante effects is not a useful way of interacting with the world. [1]
So you directing a parallel criticism at longtermism specifically leaves me a little confused. Perhaps I’m misunderstanding you?
(I’m setting aside your potential empirical defeater ‘1.’ since I largely agree with the discussion on it in the other responses to your comment. I.e. I think it is countered strongly, though not absolutely decisively, by the ‘beware suspicious convergence’ argument.)
[1] People have claimed that there isn’t actually a strong case for donating to AMF; but usually such arguments are based on types of effects (e.g. on nonhuman animals or on far-future outcomes) that the standard pro-AMF case allegedly doesn’t sufficiently consider rather than on claims that, actually, ex-ante effects are the wrong kind of thing to pay attention to in the first place.
tl;dr – The case for giving to GiveWell top charities is based on much more more than just expected value calculations.
The case for longtermism (CL) is not based on much more than expected value calculations, in fact many non-expected value arguments currently seem to point the other way. This has lead to a situation where there are many weak arguments against longtermsim and one very strong argument for longtermism. This is hard to evaluate.
We (longtermists) should recognise that we are new and there is still work to be done to build a good theoretical base for longtermism.
Hi Max,
Good question. Thank you for asking.
– –
The more I have read by GiveWell (and to a lesser degree by groups such as Charity Entrepreneurship and Open Philanthropy) the more it is apparent to me that the case for giving to the global poor is not based solely on expected value but is based on a very broad variety of arguments.
For example I recommend reading:
https://blog.givewell.org/2014/06/10/sequence-thinking-vs-cluster-thinking
https://blog.givewell.org/2011/08/18/why-we-cant-take-expected-value-estimates-literally-even-when-theyre-unbiased/
https://www.givewell.org/modeling-extreme-model-uncertainty
https://forum.effectivealtruism.org/posts/h6uXkwFzqqr2JdZ4e/joey-savoie-tools-for-decision-making
The rough pattern of these posts is that taking a broad variety of different decision making tools and approaches and seeing where they all converge and point too is better than just looking at expected value (or using any other single tool). That expected value calculations are not the only way to make decisions and that the arguments for giving to the global poor would be unconvincing if solely based on expected value cautions and not on historical evidence, good feedback loops, expert views, strategic considerations, etc, etc. then the authors would not be convinced.
For example in [1.] Holden describes how he was initially sceptical that:
”donations can do more good when targeting the developing-world poor rather than the developed-world poor “
but he goes onto says that:
”many (including myself) take these arguments more seriously on learning things like “people I respect mostly agree with this conclusion”; “developing-world charities’ activities are generally more robustly evidence-supported, in addition to cheaper”; “thorough, skeptical versions of ‘cost per life saved’ estimates are worse than the figures touted by charities, but still impressive”; “differences in wealth are so pronounced that “hunger” is defined completely differently for the U.S. vs. developing countries“; “aid agencies were behind undisputed major achievements such as the eradication of smallpox”; etc.”
– –
Now I am actually somewhat sceptical of some of this writing. I think much of it is a pushback against longtermism. Remember the global development EAs have had to weather the transition from “give to global health, it has the highest expected value” to “give to global health, it doesn’t have the highest expected value (longtermism has that) but is good for many other reasons”. So it is not surprising that they have gone on to express that there are many other reasons to care about global health that are not based in expected value calculations.
– –
But that possible “status quo bias” does not mean they are wrong. It is still the case that GiveWell have made a host of arguments for global health beyond expected value and that the longtermsim community has not done so. The longtermism community has not produced historical evidence or highlighted successful feedback loops or demonstrated that their reasoning is robust to a broad variety of possible worldviews or built strong expert consensus. (Although the case has been made that preventing extreme risks is robust to very many possible futures, so that at least is a good longtermist argument that is not based on expected value.)
In fact to some degree the opposite is the case. People who argue against longtermism have pointed to cases were long-term type planning historically led to totalitarianism or to the common-sense weirdness of longtermist conclusions etc. My own work into risk management suggests that especially when planning for disasters it is good to not put too much weight on expected value but to assume that something unexpected will happen.
The fact is that the longtermist community has much more weird conclusions than the global health community yet has put much less effort into justifying those conclusions.
– –
To me it looks like all this has lead to a situation where there are many weak arguments against longtermsim (CL) and one very strong argument for longtermism (AL->CL). This is problematic as it is very hard to compare one strong argument against many weak arguments and which side you fall on will depend largely on your empirical views and how you weigh up evidence. This ultimately leads to unconstructive debate.
– –
I think the longtermist view is likely roughly correct. But I think that the case for longtermism has not be made rigorously or even particularly well (certainly it does not stand up well to Holden’s “cluster thinking” ideals). I don’t see this as a criticism of the longtermist community as the community is super new and the paper arguing the case even just from the point of view of expected value is still in draft! I just think it is a misconception worth adding to the list that the community has finished making the case for longtermism – we should recognise our newness and that there is still work to be done and not pretend we have all the answers. The EA global health community has build this broad theoretical bases beyond expected value and so can we, or we can at least try.
– –
I would be curious to know the extent to which you agree with this?
Also, I think this way of mapping situation is a bit more nuanced here than in my previous comment so I want to acknowledge a subtle changing of views between by earlier comment and this one, ask that if you respond you respond to the views as set out here rather than above and of course thank you for your insightful comment that lead to my views evolving – thank you Max!
– –
– –
(PS. On the other topic you mention. [Edited: I am not yet sure of the extent to which I think] the ‘beware suspicious convergence’ counter-argument [applies] in this context. Is it suspicious that if you make a plan for 1000 years it looks very similar to if you make a plan for 10000 years? Is it suspicious that if I plan for 100000 years or 100 years what I do in the next 10 years looks the same? Is it suspicious that if I want to go from my house in the UK to Oslo the initial steps are very similar to if I want to go from my house to Australia – ie. book ticket, get bus to train station, get train to airport? Etc? [Would need to give this more thought but it is not obvious] )
Hi Sam, thank you for your thoughtful reply.
Here are some things we seem to agree on:
The cases for specific priorities or interventions that are commonly advocated based on a longtermist perspective (e.g. “work on technical AI safety”) are usually far from watertight. It could be valuable to improve them, by making them more “robust” or otherwise.
Expected-value calculations that are based on a single quantitative model have significant limitations. They can be useful as one of many inputs to a decision, but it would usually be bad to use them as one’s sole decision tool.
(I am actually a big fan of the GiveWell/Holden Karnofsky posts you link to. When I disagree with other people it often comes down to me favoring more “cluster thinking”. For instance, these days this happens a lot to me when talking to people about AI timelines, or other aspects of AI risk.)
However, I think I disagree with your characterization of the case for CL more broadly, at least for certain uses/meanings of CL.
Here is one version of CL which I believe is based on much more than just expected-value calculations within a single model: This is roughly the claim that (i) in our project of doing as much good as possible we should at the highest level be mostly guided by very long-run effects and (ii) this makes an actual difference for how we plan and prioritize at intermediate levels.
Here are I have a picture in mind that is roughly as follows:
Lowest level: Which among several available actions should I take right now?
Intermediate levels:
What are the “methods” and inputs (quantitative models, heuristics, intuitions, etc.) I should use when thinking about the lowest level?
What systems, structures, and incentives should we put in place to “optimize” which lowest-level decision situations I and other agents find ourselves in in the first place?
How do I in turn best think about which methods, systems, structures, etc. to use for answering these intermediate-level questions?
Etc.
Highest level: How should I ultimately evaluate the intermediate levels?
So the following would be one instance of part (i) of my favored CL claim: When deciding whether to use cluster thinking or sequence thinking for a decision, we should aim to choose whichever type of thinking best helps us find the option with most valuable long-run effects. For this it is not required that I make the choice between sequence thinking or cluster thinking by an expected-value calculation, or indeed any direct appeal to any long-run effects. But, ultimately, if I think that, say, cluster thinking is superior to sequence thinking for the matter at hand, then I do so because I think this will lead to the best long-run consequences.
And these would be an instances of part (ii): That often we should decide primarily based on the proxy of “what does most reduce existential risk?”; that it seems good to increase the “representation” of future generations in various political contexts; etc.
Regarding what the case for this version of CL rests on:
For part (i), I think it’s largely a matter of ethics/philosophy, plus some high-level empirical claims about the world (the future being big etc.). Overall very similar to the case for AL. I think the ethics part is less in need of “cluster thinking”, “robustness” etc. And that the empirical part is, in fact, quite “robustly” supported.
[This point made me most want to push back against your initial claim about CL:] For part (ii), I think there are several examples of proxy goals, methods, interventions, etc., that are commonly pursued by longtermists which have a somewhat robust case behind them that does not just rely on an expected value estimate based on a single quantitative model. For instance, avoiding extinction seems very important from a variety of moral perspectives as well as common sense, there are historical precedents of research and advocacy at least partly motivated by this goal (e.g. nuclear winter, asteroid detection, perhaps even significant parts of environmentalism), there is a robust case for several risks longtermists commonly worry about (including AI), etc. More broadly, conversations involving explicit expected value estimates, quantitative models, etc. are only a fraction of the longtermist conversations I’m seeing. (If anything I might think that longtermists, at least in some contexts, make too little use of these tools.) E.g. look at the frontpage of LessWrong, or their curated content. I’m certainly not among the biggest fans of LessWrong or the rationality community, but I think it would be fairly inaccurate to say that a lot of what is happening there is people making explicit expected value estimates. Ditto for longtermist content featured in the EA Newsletter, etc. etc. I struggle to think of any example I’ve seen where a longtermist has made an important decision based just on a single EV estimate.
Rereading your initial comment introducing AL and CL, I’m less sure if by CL you had in mind something similar to what I’m defending above. There certainly are other readings that seem to hinge more on explicit EV reasoning or that are just absurd, e.g. “CL = never explicitly reason about anything happening in the next 100 years”. However, I’m less interested in these versions since they to me would seem to be a poor description of how longtermists actually reason and act in practice.
Not sure this “many weak arguments” way of looking at it is quite correct either had a quick look at the arguments given against longtermism and there are not that many of them. Maybe a better point is that there are many avenues and approaches that remain unexplored.
‘This seems repugnant’ doesn’t seem like a justifiable objection to me, so not something an advocate of SLT should be obliged to take on directly.
If I said “this doctor’s theory of liver deterioration suggests that I should reduce my alcohol intake, which seems repugnant to me”, you would not feel compelled to respond that “actually, some of the things the doctor is advocating could allow you to drink more alcohol”.
(I suspect that beyond the “this seems repugnant” there is a more coherent critique—and that is the critique we should focus on.)
In response, you stated:
It might be worth re-stating this. Thinking about objective functions and constraints, either
R1. SLT implies that resources should be devoted in a way that does less to reduce current suffering (i.e., implies more current suffering than absent SLT) or
R2. SLT does not change our objective function, or it coincidentally implies an allocation that does has no differential effect on current suffering (a ‘measure zero’, i.e., coincidental result)
R3. SLT implies that resources should be devoted in a way that leads to less current suffering
R3 seems unlikely to be the case, particularly if we imagine bounds on altruistic capacity. And, if there were an approach that could use the same resources to reduce current suffering even more, it already should have been chosen in the absence of SLT.
If R2 is the case then SLT is not important for our resource decision so we can ignore it.
If R1 holds (which seems most likely to me), then following SLT does imply an increase in current suffering, and we are back to the main objection
Very balanced assessment! Nicely done :)
Thanks!