Title: Paul Christiano on how you might get consequentialist behavior from large language modelsAuthor: Paul ChristianoURL: https://forum.effectivealtruism.org/posts/dgk2eLf8DLxEG6msd/how-would-a-language-model-become-goal-directed?commentId=cbJDeSPtbyy2XNr8EWhy it’s good: I think lots of people are very wrong about how LLMs might lead to consequentialist behavior, and Paul’s comment here is my favorite attempt at answering this question. I think that this question is extremely important.
Title: Paul Christiano on how you might get consequentialist behavior from large language models
Author: Paul Christiano
URL: https://forum.effectivealtruism.org/posts/dgk2eLf8DLxEG6msd/how-would-a-language-model-become-goal-directed?commentId=cbJDeSPtbyy2XNr8E
Why it’s good: I think lots of people are very wrong about how LLMs might lead to consequentialist behavior, and Paul’s comment here is my favorite attempt at answering this question. I think that this question is extremely important.