Buck comments on What are the most underrated posts & comments of 2022, according to you?

Buck 1 Jan 2023 22:49 UTC
16 points
2 ∶ 0
Title: Paul Christiano on how you might get consequentialist behavior from large language models
Author: Paul Christiano
URL: https://forum.effectivealtruism.org/posts/dgk2eLf8DLxEG6msd/how-would-a-language-model-become-goal-directed?commentId=cbJDeSPtbyy2XNr8E
Why it’s good: I think lots of people are very wrong about how LLMs might lead to consequentialist behavior, and Paul’s comment here is my favorite attempt at answering this question. I think that this question is extremely important.