Caveats:
I endorse the argument that we should figure out how to use LLM-based systems without accidentally torturing them, because they're more likely to take catastrophic actions if we're torturing them.
I haven't tried to understand the argument that we should pay AIs to [not betray us / tell on traitors / etc.], and that working on AI-welfare stuff would help us offer AIs payment more effectively; there might be something there.
I don’t understand the decision theory mumble mumble argument; there might be something there.
(Other than that, it seems hard to tell a story about how “AI welfare” research/interventions now could substantially improve the value of the long-term future.)
(My impression is that these arguments are important to very few AI-welfare prioritizers / that most AI-welfare prioritizers have the wrong reasons.)
FWIW, these motivations seem reasonably central to me personally, though not my only motivations.
Among your friends, I agree; among EA Forum users, I disagree.
Yes, I meant central to me personally, edited the comment to clarify.