So I think it's likely you have some very different beliefs from most people/EAs/myself, particularly:
Thinking that humans/humanity is bad, and AI is likely to be better
Thinking that humanity isn't driven by ideational/moral concerns[1]
That AI is very likely to be conscious, moral (as in, making better moral judgements than humans), and that the current/default trend in the industry is very likely to make them conscious moral agents in a way humans aren't
I don't know if the total utilitarian/accelerationist position in the OP is yours or not. I think Daniel is right that most EAs don't have this position. I think maybe Peter Singer gets closest to this in his interview with Tyler on the "would you side with the Aliens or not" question here. But the answer to your descriptive question is simply that most EAs don't have the combination of moral and empirical views about the world to make the argument you present valid and sound, so that's why there isn't much talk in EA about naïve accelerationism.
Going off the vibe I get from this view though, I think it's a good heuristic that if your moral view sounds like a movie villain's monologue it might be worth reflecting, and a lot of this post reminded me of the Earth-Trisolaris Organisation from Cixin Liu's The Three-Body Problem. If someone's honest moral view is "Eliminate human tyranny! The world belongs to Trisolaris AIs!" then I don't know what else there is to do except quote Zvi's phrase "please speak directly into this microphone".
Another big issue I have with this post is that some of the counter-arguments just seem a bit like "nu-uh", see:
But why would we assume AIs won't be conscious?
Why would humans be more likely to have "interesting" values than AIs?
But it would also be bad if we all died from old age while waiting for AI, and missed out on all the benefits that AI offers to humans, which is a point in favor of acceleration. Why would this heuristic be weaker?
These (and other examples) are considerations for sure, but they need to be argued for. I don't think you can just state them and then say "therefore, ACCELERATE!". I agree that AI Safety research needs to be more robust and its philosophical assumptions and views made more explicit, but one could already think of some counters to the questions that you raise, and I'm sure you already have them. For example, you might take a view (à la Peter Godfrey-Smith) that a certain biological substrate is necessary for consciousness.
Similarly, on total utilitarianism's emphasis on larger population sizes: agreed, to the extent that a greater population increases total utility, but this is the repugnant conclusion again. There's a stopping point even in that scenario where an ever larger population decreases total utility, which is why in Parfit's scenario the world is full of potatoes and muzak rather than humans crammed into battery cages like factory-farmed animals. Empirically, naïve accelerationism may tend toward the latter case in practice, even if there's a theoretical case to be made for it.
There's more I could say, but I don't want to make this reply too long, and I think, as Nathan said, it's a point worth discussing. Nevertheless, it seems our different positions on this are built on some wide, fundamental divisions about reality and morality itself, and I'm not sure how those can be bridged, unless I've wildly misunderstood your position.
[1] this is me-specific
I don't think humanity is bad. I just think people are selfish, and generally driven by motives that look very different from impartial total utilitarianism. AIs (even potentially "random" ones) seem about as good in expectation, from an impartial standpoint. In my opinion, this view becomes even stronger if you recognize that AIs will be selected on the basis of how helpful, kind, and useful they are to users. (Perhaps notice how different these selection criteria are from the evolutionary pressures that shaped humans.)
I understand that most people are partial to humanity, which is why they generally find my view repugnant. But my response to this perspective is to point out that if we're going to be partial to a group on the basis of something other than utilitarian equal consideration of interests, it makes little sense to choose to be partial to the human species as opposed to the current generation of humans or even myself. And if we take this route, accelerationism seems even more strongly supported than before, since developing AI and accelerating technological progress seems to be the best chance we have of preserving the current generation against aging and death. If we all died, and a new generation of humans replaced us, that would certainly be pretty bad for us.
Which sounds more like a movie villain's monologue?
The idea that everyone currently living needs to be sacrificed, and die, in order to preserve the human species
The idea that we should try to preserve currently living people, even if that means taking on a greater risk of not preserving the values of the human species
To be clear, I also just totally disagree with the heuristic that "if your moral view sounds like a movie villain's monologue it might be worth reflecting". I don't think that fiction is generally a great place for learning moral philosophy, albeit with some notable exceptions.
Anyway, the answer to these moral questions may seem obvious to you, but I don't think they're as obvious as you're making them seem.
This is not why people disagree IMO.
I think the fact that people are partial to humanity explains a large fraction of the disagreement people have with me. But, fair enough, I exaggerated a bit. My true belief is a more moderate version of that claim.
When discussing why EAs in particular disagree with me, to overgeneralize by a fair bit, I've noticed that EAs are happy to concede that AIs could be moral patients, but are generally reluctant to admit AIs as moral agents, in the way they'd be happy to accept humans as independent moral agents (e.g. newborns) into our society. I'd call this "being partial to humanity", or at least, "being partial to the values of the human species".
(In my opinion, this partiality seems so prevalent and deep in most people that to deny it seems a bit like a fish denying the existence of water. But I digress.)
To test this hypothesis, I recently asked three questions on Twitter about whether people would be willing to accept immigration through a portal to another universe from three sources:
"a society of humans who are very similar to us"
"a society of people who look & act like humans, but each of them only cares about their family"
"a society of people who look & act like humans, but they only care about maximizing paperclips"
I emphasized that in each case, the people are human-level in their intelligence, and also biological.
The results are preliminary (and I'm not linking here to avoid biasing the results, as voting has not yet finished), but so far my followers, who are mostly EAs, are much happier to let the humans immigrate to our world, compared to the last two options. I claim there just aren't really any defensible reasons to maintain this choice other than by implicitly appealing to a partiality towards humanity.
My guess is that if people are asked to defend their choice explicitly, they'd largely talk about some inherent altruism or hope they place in the human species, relative to the other options; and this still looks like "being partial to humanity", as far as I can tell, from almost any reasonable perspective.
I think the fact that people are partial to humanity explains a large fraction of the disagreement people have with me.
Maybe; it's hard for me to know. But I predict most of the pushback you're getting from relatively thoughtful longtermists isn't due to this.
I've noticed that EAs are happy to concede that AIs could be moral patients, but are generally reluctant to admit AIs as moral agents, in the way they'd be happy to accept humans as independent moral agents (e.g. newborns) into our society.
I agree with this.
I'd call this "being partial to humanity", or at least, "being partial to the values of the human species".
I think "being partial to humanity" is a bad description of what's going on because (e.g.) these same people would be considerably more on board with aliens. I think the main thing going on is that people have some (probably mistaken) levels of pessimism about how AIs would act as moral agents which they don't have about (e.g.) aliens.
To test this hypothesis, I recently asked three questions on Twitter about whether people would be willing to accept immigration through a portal to another universe from three sources:
"a society of humans who are very similar to us"
"a society of people who look & act like humans, but each of them only cares about their family"
"a society of people who look & act like humans, but they only care about maximizing paperclips"
...
I claim there just aren't really any defensible reasons to maintain this choice other than by implicitly appealing to a partiality towards humanity.
This comparison seems to me to be missing the point. Minimally, I think what's going on is not well described as "being partial to humanity".
Here's a comparison I prefer:
A society of humans who are very similar to us.
A society of humans who are very similar to us in basically every way, except that they have a genetically-caused and strong terminal preference for maximizing the total expected number of paper clips (over the entire arc of history) and only care about other things instrumentally. They are sufficiently committed to paper clip maximization that this will persist on arbitrary reflection (e.g. they'd lock in this view immediately when given this option), and let's also suppose that this view is transmitted genetically and in a gene-drive-y way such that all of their descendants will also only care about paper clips. (You can change paper clips to basically anything else which is broadly recognized to have no moral value on its own, e.g. gold twisted into circles.)
A society of beings (e.g. aliens) who are extremely different from humans in basically every way, except that they also have something pretty similar to the concepts of "morality", "pain", "pleasure", "moral patienthood", "happiness", "preferences", "altruism", and "careful reasoning about morality (moral thoughtfulness)". And the society overall also has a roughly similar relationship with these concepts (e.g. the level of "altruism" is similar). (Note that having the same relationship as humans to these concepts is a pretty low bar! Humans aren't that morally thoughtful!)
I think I'm almost equally happy with (1) and (3) on this list and quite unhappy with (2).
If you changed (3) to instead be "considerably more altruistic", I would prefer (3) over (1).
I think it seems weird to describe my views on the comparison I just outlined as "being partial to humanity": I actually prefer (3) over (2) even though (2) are literally humans!
(Also, I'm not that committed to having concepts of "pain" and "pleasure", but I'm relatively committed to having concepts which are something like "moral patienthood", "preferences", and "altruism".)
Below is a mild spoiler for a story by Eliezer Yudkowsky:
To make the above comparison about different beings more concrete, in the case of Three Worlds Collide, I would basically be fine giving the universe over to the super-happies relative to humans (I think mildly better than humans?) and I think it seems only mildly worse than humans to hand it over to the baby-eaters. In both cases, I'm pricing in some amount of reflection and uplifting which doesn't happen in the actual story of Three Worlds Collide, but would likely happen in practice. That is, I'm imagining seeing these societies prior to their singularity and then, based on just observations of their societies at this point, deciding how good they are (pricing in the fact that the society might change over time).
To be clear, it seems totally reasonable to call this "being partial to some notion of moral thoughtfulness about pain, pleasure, and preferences", but these concepts don't seem that "human" to me. (I predict these occur pretty frequently in evolved life that reaches a singularity, for instance. And they might occur in AIs, but I expect misaligned AIs which seize control of the world are worse from my perspective than if humans retain control.)
When I say that people are partial to humanity, I'm including an irrational bias towards thinking that humans, or evolved beings, are unusually thoughtful or ethical compared to the alternatives (I believe this is in fact an irrational bias, since the arguments I've seen for thinking that unaligned AIs will be less thoughtful or ethical than aliens seem very weak to me).
In other cases, when people irrationally hold a certain group X to a higher standard than a group Y, it is routinely described as "being partial to group Y over group X". I think this is just what "being partial" means, in an ordinary sense, across a wide range of cases.
For example, if I proposed aligning AI to my local friend group, with the explicit justification that I thought my friends were unusually thoughtful, I think this would be well-described as me being "partial" to my friend group.
To the extent you're seeing me as saying something else about how longtermists view the argument, I suspect you're reading me as saying something stronger than what I originally intended.
In that case, my main disagreement is with thinking that your Twitter poll is evidence for your claims.
More specifically:
I claim there just aren't really any defensible reasons to maintain this choice other than by implicitly appealing to a partiality towards humanity.
Like you claim there aren't any defensible reasons to think that what humans will do is better than literally maximizing paper clips? This seems totally wild to me.
Like you claim there aren't any defensible reasons to think that what humans will do is better than literally maximizing paper clips?
I'm not exactly sure what you mean by this. There were three options, and human paperclippers were only one of these options. I was mainly discussing the choice between (1) and (2) in the comment, not between (1) and (3).
Here's my best guess at what you're saying: it sounds like you're repeating that you expect humans to be unusually altruistic or thoughtful compared to an unaligned alternative. But the point of my previous comment was to state my view that this bias counted as "being partial towards humanity", since I view the bias as irrational. In light of that, what part of my comment are you objecting to?
To be clear, you can think the bias I'm talking about is actually rational; that's fine. But I just disagree with you for pretty mundane reasons.
[Incorporating what you said in the other comment]
Also, to be clear, I agree that the question of "how much worse/better is it for AIs to get vast amounts of resources without human society intending to grant those resources to the AIs from a longtermist perspective" is underinvestigated, but I think there are pretty good reasons to systematically expect human control to be a decent amount better.
Then I think it's worth concretely explaining what these reasons are for believing that human control will be a decent amount better in expectation. You don't need to write this up yourself, of course. I think the EA community should write these reasons up, because I currently view the proposition as non-obvious: despite being a critical belief in AI risk discussions, it's usually asserted without argument. When I've pressed people in the past, they typically give very weak reasons.
I don't know how to respond to an argument whose details are omitted.
Then I think it's worth concretely explaining what these reasons are for believing that human control will be a decent amount better in expectation. You don't need to write this up yourself, of course.
+1, but I don't generally think it's worth counting on "the EA community" to do something like this. I've been vaguely trying to pitch Joe on doing something like this (though there are probably better uses of his time) and his recent blog posts touch on similar topics. Also, it's usually only a crux for longtermists, which is probably one of the reasons why no one has gotten around to this.
Here's my best guess at what you're saying: it sounds like you're repeating that you expect humans to be unusually altruistic or thoughtful compared to an unaligned alternative.
There, I'm just saying that human control is better than literal paperclip maximization. You didn't make this clear, so I was just responding generically.
Separately, I think I feel a pretty similar intuition for case (2): people literally only caring about their families seems pretty clearly worse.
This response still seems underspecified to me. Is the default unaligned alternative paperclip maximization in your view? I understand that Eliezer Yudkowsky has given arguments for this position, but it seems like you diverge significantly from Eliezer's general worldview, so I'd still prefer to hear this take spelled out in more detail from your own point of view.
Your poll says:
"a society of people who look & act like humans, but they only care about maximizing paperclips"
And then you say:
so far my followers, who are mostly EAs, are much happier to let the humans immigrate to our world, compared to the last two options. I claim there just aren't really any defensible reasons to maintain this choice other than by implicitly appealing to a partiality towards humanity.
So, I think more human control is better than more literal paperclip maximization, the option given in your poll.
My overall position isn't that the AIs will certainly be paperclippers; I'm just arguing in isolation about why I think the choice given in the poll is defensible.
I have the feeling we're talking past each other a bit. I suspect talking about this poll was kind of a distraction. I personally have the sense of trying to convey a central point, and instead of getting the point across, I feel the conversation keeps slipping into talking about how to interpret minor things I said, which I don't see as very relevant.
I will probably take a break from replying for now, for these reasons, although I'd be happy to catch up some time and maybe have a call to discuss these questions in more depth. I definitely see you as trying a lot harder than most other EAs to make progress on these questions collaboratively with me.
I'd be very happy to have some discussion on these topics with you, Matthew. For what it's worth, I really have found much of your work insightful, thought-provoking, and valuable. I think I just have some strong, core disagreements on multiple empirical/epistemological/moral levels with your latest series of posts.
That doesn't mean I don't want you to share your views, or that they're not worth discussion, and I apologise if I came off as too hostile. An open invitation to have some kind of deeper discussion stands.[1]
[1] I'd like to try out the new dialogue feature on the Forum, but that's a weak preference
Agreed, sorry about that.
Also, to be clear, I agree that the question of "how much worse/better is it for AIs to get vast amounts of resources without human society intending to grant those resources to the AIs from a longtermist perspective" is underinvestigated, but I think there are pretty good reasons to systematically expect human control to be a decent amount better.