RSS

ThomasW

Karma: 3,683

AI governance/​grantmaking. Formerly at the Center for AI Safety and Yale EA organizer.

An Overview of Catas­trophic AI Risks

Center for AI Safety15 Aug 2023 21:52 UTC
37 points
1 comment13 min readEA link
(www.safe.ai)

[MLSN #9] Ver­ify­ing large train­ing runs, se­cu­rity risks from LLM ac­cess to APIs, why nat­u­ral se­lec­tion may fa­vor AIs over humans

ThomasW11 Apr 2023 16:05 UTC
18 points
0 comments6 min readEA link
(newsletter.mlsafety.org)

ThomasW’s Quick takes

ThomasW28 Mar 2023 12:17 UTC
7 points
1 comment1 min readEA link

[MLSN #8]: Mechanis­tic in­ter­pretabil­ity, us­ing law to in­form AI al­ign­ment, scal­ing laws for proxy gaming

ThomasW20 Feb 2023 16:06 UTC
25 points
0 comments4 min readEA link
(newsletter.mlsafety.org)

What’s the deal with AI con­scious­ness?

ThomasW11 Jan 2023 16:37 UTC
33 points
0 comments1 min readEA link

“AI” is an indexical

ThomasW3 Jan 2023 22:00 UTC
23 points
2 comments1 min readEA link

Is EA an ad­vanced, plan­ning, strate­gi­cally-aware power-seek­ing mis­al­igned mesa-op­ti­mizer?

ThomasW16 Dec 2022 17:07 UTC
39 points
5 comments7 min readEA link

ML Safety Schol­ars Sum­mer 2022 Retrospective

ThomasW1 Nov 2022 3:09 UTC
56 points
2 comments21 min readEA link

“A Creepy Feel­ing”: Nixon’s De­ci­sion to Disavow Biolog­i­cal Weapons

ThomasW30 Sep 2022 15:17 UTC
48 points
3 comments11 min readEA link

Cover story on EA in Time magazine

ThomasW10 Aug 2022 15:26 UTC
94 points
19 comments1 min readEA link
(time.com)

An­nounc­ing the In­tro­duc­tion to ML Safety Course

ThomasW6 Aug 2022 2:50 UTC
136 points
4 comments7 min readEA link

$20K in Boun­ties for AI Safety Public Materials

ThomasW5 Aug 2022 2:57 UTC
45 points
11 comments6 min readEA link

Dialec­tic of Enlightenment

ThomasW15 Jun 2022 4:58 UTC
5 points
1 comment2 min readEA link
(monoskop.org)

You Don’t Need To Jus­tify Everything

ThomasW12 Jun 2022 18:36 UTC
139 points
11 comments3 min readEA link

Open Prob­lems in AI X-Risk [PAIS #5]

ThomasW10 Jun 2022 2:22 UTC
44 points
1 comment36 min readEA link

Perform Tractable Re­search While Avoid­ing Ca­pa­bil­ities Ex­ter­nal­ities [Prag­matic AI Safety #4]

ThomasW30 May 2022 20:37 UTC
33 points
1 comment26 min readEA link

Com­plex Sys­tems for AI Safety [Prag­matic AI Safety #3]

ThomasW24 May 2022 0:04 UTC
49 points
6 comments21 min readEA link

EA Hous­ing Slack

ThomasW14 May 2022 20:16 UTC
38 points
8 comments2 min readEA link

Look Out The Window

ThomasW12 May 2022 21:16 UTC
74 points
3 comments2 min readEA link

A Bird’s Eye View of the ML Field [Prag­matic AI Safety #2]

ThomasW9 May 2022 17:15 UTC
97 points
2 comments36 min readEA link