I think there is a large minority chance that we will successfully align ASI this century, so I definitely think it is possible.
To clarify, do you think there’s a large minority chance that it is possible to align an arbitrarily powerful system, or do you think there is a large minority chance that it is going to happen with the first such arbitrarily powerful system, such that we’re not locked into a different future / killed by a misaligned singleton?
Why do you think this? What makes you think that it’s possible at all?[1] And what do you mean by “large minority”? Can you give an approximate percentage?
Or to paraphrase Yampolskiy: what makes it possible for a less intelligent species to indefinitely control a more intelligent species (when this has never happened before)?
To respond to Yampolskiy without disagreeing with the fundamental point, I think it’s definitely possible for a less intelligent species to align or even indefinitely control a boundedly and only slightly more intelligent species, especially given greater resources, speed, and/or numbers, and sufficient effort.
The problem is that humans aren’t currently trying to limit such systems, or trying much to monitor them, much less robustly align or control them.
Fair point. But AI is indeed unlikely to top out at merely “slightly more” intelligent. And it has the potential for a massive speed/numbers advantage too.
Yes, by default self-improving AI goes very poorly, but this is a plausible case where we could have aligned AGI, if not ASI.