Lauro Langosco answers What are the coolest topics in AI safety, to a hopelessly pure mathematician?

Lauro Langosco 7 May 2022 11:17 UTC
2 points
0 ∶ 0
You might be interested in this great intro sequence to embedded agency. There’s also corrigibility and MIRI’s other work on agent foundations.

Also, coherence arguments and consequentialist cognition.

AI safety is a young field; for most open problems we don’t yet know of a way to crisply state them in a way that can be resolved mathematically. So if you enjoy taking messy questions and turning them into neat math you’ll probably find much to work on.

ETA: oh and of course ELK.