RSS

Joe_Carlsmith

Karma: 2,927

Senior research analyst at Open Philanthropy. Doctorate in philosophy at the University of Oxford. Opinions my own.

On attunement

Joe_Carlsmith25 Mar 2024 12:47 UTC
27 points
0 comments1 min readEA link

Video and tran­script of pre­sen­ta­tion on Schem­ing AIs

Joe_Carlsmith22 Mar 2024 15:56 UTC
23 points
1 comment1 min readEA link

On green

Joe_Carlsmith21 Mar 2024 17:38 UTC
61 points
3 comments1 min readEA link

On the abo­li­tion of man

Joe_Carlsmith18 Jan 2024 18:17 UTC
71 points
4 comments1 min readEA link

Be­ing nicer than Clippy

Joe_Carlsmith16 Jan 2024 19:44 UTC
25 points
3 comments1 min readEA link

An even deeper atheism

Joe_Carlsmith11 Jan 2024 17:28 UTC
25 points
2 comments1 min readEA link

Does AI risk “other” the AIs?

Joe_Carlsmith9 Jan 2024 17:51 UTC
22 points
3 comments1 min readEA link

When “yang” goes wrong

Joe_Carlsmith8 Jan 2024 16:35 UTC
56 points
1 comment1 min readEA link

Deep athe­ism and AI risk

Joe_Carlsmith4 Jan 2024 18:58 UTC
64 points
4 comments1 min readEA link

Gentle­ness and the ar­tifi­cial Other

Joe_Carlsmith2 Jan 2024 18:21 UTC
89 points
2 comments1 min readEA link

Oth­er­ness and con­trol in the age of AGI

Joe_Carlsmith2 Jan 2024 18:15 UTC
28 points
1 comment1 min readEA link

Em­piri­cal work that might shed light on schem­ing (Sec­tion 6 of “Schem­ing AIs”)

Joe_Carlsmith11 Dec 2023 16:30 UTC
6 points
1 comment1 min readEA link

Sum­ming up “Schem­ing AIs” (Sec­tion 5)

Joe_Carlsmith9 Dec 2023 15:48 UTC
9 points
1 comment1 min readEA link

Speed ar­gu­ments against schem­ing (Sec­tion 4.4-4.7 of “Schem­ing AIs”)

Joe_Carlsmith8 Dec 2023 21:10 UTC
6 points
0 comments1 min readEA link

Sim­plic­ity ar­gu­ments for schem­ing (Sec­tion 4.3 of “Schem­ing AIs”)

Joe_Carlsmith7 Dec 2023 15:05 UTC
6 points
1 comment1 min readEA link

The count­ing ar­gu­ment for schem­ing (Sec­tions 4.1 and 4.2 of “Schem­ing AIs”)

Joe_Carlsmith6 Dec 2023 19:28 UTC
9 points
1 comment1 min readEA link

Ar­gu­ments for/​against schem­ing that fo­cus on the path SGD takes (Sec­tion 3 of “Schem­ing AIs”)

Joe_Carlsmith5 Dec 2023 18:48 UTC
7 points
1 comment1 min readEA link

Non-clas­sic sto­ries about schem­ing (Sec­tion 2.3.2 of “Schem­ing AIs”)

Joe_Carlsmith4 Dec 2023 18:44 UTC
12 points
1 comment1 min readEA link

Does schem­ing lead to ad­e­quate fu­ture em­pow­er­ment? (Sec­tion 2.3.1.2 of “Schem­ing AIs”)

Joe_Carlsmith3 Dec 2023 18:32 UTC
6 points
1 comment1 min readEA link

The goal-guard­ing hy­poth­e­sis (Sec­tion 2.3.1.1 of “Schem­ing AIs”)

Joe_Carlsmith2 Dec 2023 15:20 UTC
6 points
1 comment1 min readEA link