I feel like the word âvaluesâ makes this sound more complex than it is, and Iâd say we instead want the agent to understand and act in line with what the human wants /â intends.
Doesnât âwants /â intendsâ makes this sound less complex than it is? To me this phrasing connotes (not to say you actually believe this) that the goal is for AIs to understand short-term human desires, without accounting for ways in which our wants contradict what we would value in the long term, or ways that individualsâ wants can conflict. Once we add caveats like âwhat we would want /â intend after sufficient rational reflection,â my sense is that âvaluesâ just captures that more intuitively. I havenât surveyed people on this, though, so this definitely isnât a confident claim on my part.
Once we add caveats like âwhat we would want /â intend after sufficient rational reflection,â my sense is that âvaluesâ just captures that more intuitively.
I in fact donât want to add in those caveats here: Iâm suggesting that we tell our AI system to do what we short-term want. (Of course, we can then âshort-term wantâ to do more rational reflection, or to be informed of true and useful things that help us make moral progress, etc.)
I agree that âvaluesâ more intuitively captures the thing with all the caveats added in.
Doesnât âwants /â intendsâ makes this sound less complex than it is? To me this phrasing connotes (not to say you actually believe this) that the goal is for AIs to understand short-term human desires, without accounting for ways in which our wants contradict what we would value in the long term, or ways that individualsâ wants can conflict. Once we add caveats like âwhat we would want /â intend after sufficient rational reflection,â my sense is that âvaluesâ just captures that more intuitively. I havenât surveyed people on this, though, so this definitely isnât a confident claim on my part.
I in fact donât want to add in those caveats here: Iâm suggesting that we tell our AI system to do what we short-term want. (Of course, we can then âshort-term wantâ to do more rational reflection, or to be informed of true and useful things that help us make moral progress, etc.)
I agree that âvaluesâ more intuitively captures the thing with all the caveats added in.