Why Not Try to Build Safe AGI?

Copy-pasting from my one-on-ones with AI Safety researchers:

Why mechanistic interpretability does not and cannot contribute to long-term AGI safety (from messages with a friend)

List #1: Why stopping the development of AGI is hard but doable

List #2: Why coordinating as humans to align on not developing AGI is a lot easier than, well… coordinating as humans with AGI to be aligned with humans

List #3: Why not to assume on priors that AGI-alignment workarounds are available