Two major reasons/considerations:
1- I'm unconvinced of the tractability of non-extinction-risk-reducing longtermist interventions.
2- Perhaps this is self-defeating, but I feel uncomfortable substantively shaping the future in ways that aren't merely making sure it exists. Visions of the future that I would have found unobjectionable a century ago would probably seem bad to me today. In short, this consideration is basically "moral uncertainty". I think extinction-risk reduction, though not recommended by every moral framework, is at least recommended by most. I haven't seen other ideas for shaping the future that are as widely recommended.
I am curious about (1).
Do you think that changing the moral values/goals of the ASIs humanity would create is not a tractable way to influence the value of the future?
If yes, is that because we are not able to change them, because we don't know which moral values to input, or something else?
In the second case, what about inputting the goal of figuring out which goals to pursue ("long reflection")?
I think yes, and for all of those reasons. I'm a bit sceptical that we can change the values ASIs will have: we don't understand present models that well, and there are good reasons not to treat how a model outputs text as representative of its goals (it could be hallucinating, it could be deceptive, or its outputs might simply not be isomorphic to its reward structure).
And even if we could, I don't know of any non-controversial value to instill in an ASI that isn't already included in basic attempts to control it (which I'd be doing mostly for extinction-related reasons).
I'm going to press on point 2: I think it is self-defeating, since it suggests the future will just be bad, in which case by this line of reasoning we shouldn't even try to reduce extinction risk.