This doesn’t seem like a strong enough argument to justify a high probability of existential catastrophe (if that’s what you intended?).
At vastly superhuman capabilities (including intelligence and rationality), it should be easier to reduce existential-level mistakes to tiny levels. They would have vastly more capability for assessing and mitigating risks and for moral reflection (not that this would converge to some moral truth; I don’t think there is any).
If you think this has a low chance of success (if we could delay AGI long enough to actually do it), then alignment seems pretty hopeless to me on that view, and a temporary pause only delays the inevitable doom.
I do think we could do better (for upside-focused views) by ensuring more value pluralism and preventing particular values from dominating, e.g. by uploading and augmenting multiple minds.
> At vastly superhuman capabilities (including intelligence and rationality), it should be easier to reduce existential-level mistakes to tiny levels. They would have vastly more capability for assessing and mitigating risks and for moral reflection
They are still human, though, and humans are famous for making mistakes, even the most intelligent and rational of us. Many even regard fallibility as part of what it is to be human. That's not (too much of) a problem at current power differentials, but it is when we're talking about solar-system-rearranging powers sustained for millions of subjective years without a single catastrophic error...
> a temporary pause only delays the inevitable doom.
Yes. The pause should be indefinite, or at least last until there is global consensus to proceed, with democratic acceptance of whatever risk remains.