
Joe_Carlsmith

Karma: 3,117

Senior research analyst at Open Philanthropy. Doctorate in philosophy at the University of Oxford. Opinions my own.

What is it to solve the alignment problem?

Joe_Carlsmith · 24 Aug 2024 21:19 UTC
32 points
1 comment · 1 min read · EA link

Value fragility and AI takeover

Joe_Carlsmith · 5 Aug 2024 21:28 UTC
35 points
3 comments · 1 min read · EA link

A framework for thinking about AI power-seeking

Joe_Carlsmith · 24 Jul 2024 22:41 UTC
44 points
11 comments · 1 min read · EA link

Loving a world you don’t trust

Joe_Carlsmith · 18 Jun 2024 19:31 UTC
65 points
7 comments · 1 min read · EA link

On “first critical tries” in AI alignment

Joe_Carlsmith · 5 Jun 2024 0:19 UTC
29 points
3 comments · 1 min read · EA link

On attunement

Joe_Carlsmith · 25 Mar 2024 12:47 UTC
27 points
0 comments · 1 min read · EA link

Video and transcript of presentation on Scheming AIs

Joe_Carlsmith · 22 Mar 2024 15:56 UTC
23 points
1 comment · 1 min read · EA link

On green

Joe_Carlsmith · 21 Mar 2024 17:38 UTC
61 points
3 comments · 1 min read · EA link

On the abolition of man

Joe_Carlsmith · 18 Jan 2024 18:17 UTC
71 points
4 comments · 1 min read · EA link

Being nicer than Clippy

Joe_Carlsmith · 16 Jan 2024 19:44 UTC
25 points
3 comments · 1 min read · EA link

An even deeper atheism

Joe_Carlsmith · 11 Jan 2024 17:28 UTC
25 points
2 comments · 1 min read · EA link

Does AI risk “other” the AIs?

Joe_Carlsmith · 9 Jan 2024 17:51 UTC
22 points
3 comments · 1 min read · EA link

When “yang” goes wrong

Joe_Carlsmith · 8 Jan 2024 16:35 UTC
56 points
1 comment · 1 min read · EA link

Deep atheism and AI risk

Joe_Carlsmith · 4 Jan 2024 18:58 UTC
64 points
4 comments · 1 min read · EA link

Gentleness and the artificial Other

Joe_Carlsmith · 2 Jan 2024 18:21 UTC
89 points
2 comments · 1 min read · EA link