Will MacAskill stated in a recent 80,000 Hours podcast that he believes marginal work on trajectory change (toward a best possible future rather than a mediocre one) seems likely to be significantly more valuable than marginal work on extinction risk.
Could you explain what the key crucial considerations are for this claim to be true, and give a basic argument for why you think each of them resolves in favor of the claim?
Would also love to hear if others have any other crucial considerations they think weigh in one direction or the other.
Will is thinking about this much more actively and will give the best answer, but here are some key crucial considerations:
How tractable is extinction risk reduction and trajectory change work?
As a part of that, are there ways that we can have a predictable and persistent effect on the value of the long-term future other than by reducing extinction risk?
How good is the future by default?
How good are the best attainable futures?
These are basically Tractability and Importance from the INT framework.
Some of the biggest disagreements in the field are over how likely we are to achieve eutopia by default (or what % of eutopia we will achieve) and what, if anything, can be done to predictably shape the far future. Populating and refining a list of answers to this last question has been a lot of the key work of the field over the past few years.
I think Will MacAskill and Fin Moorhouse’s paper rests on the crucial consideration that aligning ASI is possible (by anyone at all). They haven’t established this (EDIT: by this I mean they don’t cite any supporting arguments for it, rather than that they failed to come up with the arguments themselves. But as far as I know, there aren’t any supporting arguments for the assumption, and in fact there are good arguments on the other side for why aligning ASI is fundamentally impossible).
This seems like a really critical issue, and I’d be very interested in hearing whether this is disputed by @tylermjohn / @William_MacAskill.
I think there is a large minority chance that we will successfully align ASI this century, so I definitely think it is possible.
To clarify, do you think there’s a large minority chance that it is possible to align an arbitrarily powerful system, or do you think there is a large minority chance that it is going to happen with the first such arbitrarily powerful system, such that we’re not locked into a different future / killed by a misaligned singleton?
Why do you think this? What makes you think that it’s possible at all?[1] And what do you mean by “large minority”? Can you give an approximate percentage?
Or to paraphrase Yampolskiy: what makes it possible for a less intelligent species to indefinitely control a more intelligent species (when this has never happened before)?
To respond to Yampolskiy without disagreeing with the fundamental point, I think it’s definitely possible for a less intelligent species to align or even indefinitely control a boundedly and only slightly more intelligent species, especially given greater resources, speed, and/or numbers, and sufficient effort.
The problem is that humans aren’t currently trying to limit such systems, or trying much to monitor them, much less robustly align or control them.
Fair point. But AI is indeed unlikely to top out at merely “slightly more” intelligent. And it has the potential for a massive speed/numbers advantage too.
Yes, by default self-improving AI goes very poorly, but this is a plausible case where we could have aligned AGI, if not ASI.