Joe_Carlsmith

Karma: 3,540

Senior advisor at Open Philanthropy. Doctorate in philosophy at the University of Oxford. Opinions my own.

Video and transcript of talk on “Can goodness compete?”

Joe_Carlsmith Jul 17, 2025, 5:59 PM

30 points

4 comments, 34 min read, EA link

(joecarlsmith.substack.com)

Video and transcript of talk on AI welfare

Joe_Carlsmith May 22, 2025, 4:15 PM

22 points

1 comment, 28 min read, EA link

(joecarlsmith.substack.com)

The stakes of AI moral status

Joe_Carlsmith May 21, 2025, 6:20 PM

54 points

9 comments, 14 min read, EA link

(joecarlsmith.substack.com)

Video and transcript of talk on automating alignment research

Joe_Carlsmith Apr 30, 2025, 5:43 PM

11 points

1 comment, 24 min read, EA link

(joecarlsmith.com)

Can we safely automate alignment research?

Joe_Carlsmith Apr 30, 2025, 5:37 PM

13 points

1 comment, 48 min read, EA link

(joecarlsmith.com)

AI for AI safety

Joe_Carlsmith Mar 14, 2025, 3:00 PM

34 points

1 comment, 17 min read, EA link

(joecarlsmith.substack.com)

Paths and waystations in AI safety

Joe_Carlsmith Mar 11, 2025, 6:52 PM

22 points

2 comments, 11 min read, EA link

(joecarlsmith.substack.com)

When should we worry about AI power-seeking?

Joe_Carlsmith Feb 19, 2025, 7:44 PM

21 points

2 comments, 18 min read, EA link

(joecarlsmith.substack.com)

Joe_Carlsmith Feb 19, 2025, 8:13 AM
4 points
in reply to: Lizka’s comment on: Fake thinking and real thinking
Very glad to hear it, Lizka :) -- and thanks for letting me know.

What is it to solve the alignment problem?

Joe_Carlsmith Feb 13, 2025, 6:42 PM

25 points

1 comment, 19 min read, EA link

(joecarlsmith.substack.com)

How do we solve the alignment problem?

Joe_Carlsmith Feb 13, 2025, 6:27 PM

28 points

1 comment, 6 min read, EA link

(joecarlsmith.substack.com)

Fake thinking and real thinking

Joe_Carlsmith Jan 28, 2025, 8:05 PM

77 points

3 comments, 38 min read, EA link

Takes on “Alignment Faking in Large Language Models”

Joe_Carlsmith Dec 18, 2024, 6:22 PM

72 points

1 comment, 62 min read, EA link

Incentive design and capability elicitation

Joe_Carlsmith Nov 12, 2024, 8:56 PM

9 points

0 comments, 12 min read, EA link

Option control

Joe_Carlsmith Nov 4, 2024, 5:54 PM

11 points

0 comments, 54 min read, EA link

Motivation control

Joe_Carlsmith Oct 30, 2024, 5:15 PM

18 points

0 comments, 52 min read, EA link

How might we solve the alignment problem? (Part 1: Intro, summary, ontology)

Joe_Carlsmith Oct 28, 2024, 9:57 PM

18 points

0 comments, 32 min read, EA link

Video and transcript of presentation on Otherness and control in the age of AGI

Joe_Carlsmith Oct 8, 2024, 10:30 PM

18 points

1 comment, 27 min read, EA link

What is it to solve the alignment problem? (Notes)

Joe_Carlsmith Aug 24, 2024, 9:19 PM

32 points

1 comment, 53 min read, EA link

Value fragility and AI takeover

Joe_Carlsmith Aug 5, 2024, 9:28 PM

39 points

3 comments, 30 min read, EA link
