Joe_Carlsmith

Karma: 3,167

Senior research analyst at Open Philanthropy. Doctorate in philosophy at the University of Oxford. Opinions my own.

Incentive design and capability elicitation

Joe_Carlsmith · 12 Nov 2024 20:56 UTC
9 points
0 comments · 1 min read · EA link

Option control

Joe_Carlsmith · 4 Nov 2024 17:54 UTC
11 points
0 comments · 1 min read · EA link

Motivation control

Joe_Carlsmith · 30 Oct 2024 17:15 UTC
18 points
0 comments · 1 min read · EA link

How might we solve the alignment problem? (Part 1: Intro, summary, ontology)

Joe_Carlsmith · 28 Oct 2024 21:57 UTC
11 points
0 comments · 1 min read · EA link

Video and transcript of presentation on Otherness and control in the age of AGI

Joe_Carlsmith · 8 Oct 2024 22:30 UTC
18 points
1 comment · 1 min read · EA link

What is it to solve the alignment problem?

Joe_Carlsmith · 24 Aug 2024 21:19 UTC
32 points
1 comment · 1 min read · EA link

Value fragility and AI takeover

Joe_Carlsmith · 5 Aug 2024 21:28 UTC
38 points
3 comments · 1 min read · EA link

A framework for thinking about AI power-seeking

Joe_Carlsmith · 24 Jul 2024 22:41 UTC
44 points
11 comments · 1 min read · EA link

Loving a world you don’t trust

Joe_Carlsmith · 18 Jun 2024 19:31 UTC
65 points
7 comments · 1 min read · EA link

On “first critical tries” in AI alignment

Joe_Carlsmith · 5 Jun 2024 0:19 UTC
29 points
3 comments · 1 min read · EA link

On attunement

Joe_Carlsmith · 25 Mar 2024 12:47 UTC
27 points
0 comments · 1 min read · EA link

Video and transcript of presentation on Scheming AIs

Joe_Carlsmith · 22 Mar 2024 15:56 UTC
23 points
1 comment · 1 min read · EA link

On green

Joe_Carlsmith · 21 Mar 2024 17:38 UTC
61 points
3 comments · 1 min read · EA link

On the abolition of man

Joe_Carlsmith · 18 Jan 2024 18:17 UTC
71 points
4 comments · 1 min read · EA link

Being nicer than Clippy

Joe_Carlsmith · 16 Jan 2024 19:44 UTC
25 points
3 comments · 1 min read · EA link

An even deeper atheism

Joe_Carlsmith · 11 Jan 2024 17:28 UTC
25 points
2 comments · 1 min read · EA link