RSS

Joe_Carlsmith

Karma: 3,538

Senior advisor at Open Philanthropy. Doctorate in philosophy at the University of Oxford. Opinions my own.

A frame­work for think­ing about AI power-seeking

Joe_CarlsmithJul 24, 2024, 10:41 PM
44 points
11 comments16 min readEA link

Lov­ing a world you don’t trust

Joe_CarlsmithJun 18, 2024, 7:31 PM
65 points
7 comments33 min readEA link

On “first crit­i­cal tries” in AI alignment

Joe_CarlsmithJun 5, 2024, 12:19 AM
29 points
3 comments14 min readEA link

On attunement

Joe_CarlsmithMar 25, 2024, 12:47 PM
28 points
0 comments22 min readEA link

Video and tran­script of pre­sen­ta­tion on Schem­ing AIs

Joe_CarlsmithMar 22, 2024, 3:56 PM
23 points
1 comment32 min readEA link

On green

Joe_CarlsmithMar 21, 2024, 5:38 PM
61 points
3 comments31 min readEA link

On the abo­li­tion of man

Joe_CarlsmithJan 18, 2024, 6:17 PM
71 points
4 comments41 min readEA link

Be­ing nicer than Clippy

Joe_CarlsmithJan 16, 2024, 7:44 PM
26 points
3 comments27 min readEA link

An even deeper atheism

Joe_CarlsmithJan 11, 2024, 5:28 PM
26 points
2 comments15 min readEA link

Does AI risk “other” the AIs?

Joe_CarlsmithJan 9, 2024, 5:51 PM
23 points
3 comments8 min readEA link

When “yang” goes wrong

Joe_CarlsmithJan 8, 2024, 4:35 PM
57 points
1 comment13 min readEA link

Deep athe­ism and AI risk

Joe_CarlsmithJan 4, 2024, 6:58 PM
65 points
4 comments27 min readEA link

Gentle­ness and the ar­tifi­cial Other

Joe_CarlsmithJan 2, 2024, 6:21 PM
90 points
2 comments11 min readEA link

Oth­er­ness and con­trol in the age of AGI

Joe_CarlsmithJan 2, 2024, 6:15 PM
37 points
1 comment7 min readEA link

Em­piri­cal work that might shed light on schem­ing (Sec­tion 6 of “Schem­ing AIs”)

Joe_CarlsmithDec 11, 2023, 4:30 PM
7 points
1 comment19 min readEA link