TW123

Karma: 3,698

An Overview of Catastrophic AI Risks

Center for AI Safety · 15 Aug 2023 21:52 UTC
37 points
1 comment · 13 min read · EA link
(www.safe.ai)

[MLSN #9] Verifying large training runs, security risks from LLM access to APIs, why natural selection may favor AIs over humans

TW123 · 11 Apr 2023 16:05 UTC
18 points
0 comments · 6 min read · EA link
(newsletter.mlsafety.org)

ThomasW’s Quick takes

TW123 · 28 Mar 2023 12:17 UTC
7 points
1 comment · 1 min read · EA link

[MLSN #8]: Mechanistic interpretability, using law to inform AI alignment, scaling laws for proxy gaming

TW123 · 20 Feb 2023 16:06 UTC
25 points
0 comments · 4 min read · EA link
(newsletter.mlsafety.org)

What’s the deal with AI consciousness?

TW123 · 11 Jan 2023 16:37 UTC
33 points
0 comments · 1 min read · EA link

“AI” is an indexical

TW123 · 3 Jan 2023 22:00 UTC
23 points
2 comments · 1 min read · EA link

ML Safety Scholars Summer 2022 Retrospective

TW123 · 1 Nov 2022 3:09 UTC
56 points
2 comments · 21 min read · EA link

“A Creepy Feeling”: Nixon’s Decision to Disavow Biological Weapons

TW123 · 30 Sep 2022 15:17 UTC
48 points
3 comments · 17 min read · EA link

Cover story on EA in Time magazine

TW123 · 10 Aug 2022 15:26 UTC
94 points
19 comments · 1 min read · EA link
(time.com)

Announcing the Introduction to ML Safety Course

TW123 · 6 Aug 2022 2:50 UTC
136 points
4 comments · 7 min read · EA link

$20K in Bounties for AI Safety Public Materials

TW123 · 5 Aug 2022 2:57 UTC
45 points
11 comments · 6 min read · EA link

Dialectic of Enlightenment

TW123 · 15 Jun 2022 4:58 UTC
5 points
1 comment · 2 min read · EA link
(monoskop.org)

You Don’t Need To Justify Everything

TW123 · 12 Jun 2022 18:36 UTC
139 points
11 comments · 3 min read · EA link

Open Problems in AI X-Risk [PAIS #5]

TW123 · 10 Jun 2022 2:22 UTC
44 points
1 comment · 36 min read · EA link

Perform Tractable Research While Avoiding Capabilities Externalities [Pragmatic AI Safety #4]

TW123 · 30 May 2022 20:37 UTC
33 points
1 comment · 25 min read · EA link

Complex Systems for AI Safety [Pragmatic AI Safety #3]

TW123 · 24 May 2022 0:04 UTC
49 points
6 comments · 21 min read · EA link

EA Housing Slack

TW123 · 14 May 2022 20:16 UTC
38 points
8 comments · 2 min read · EA link

Look Out The Window

TW123 · 12 May 2022 21:16 UTC
77 points
3 comments · 2 min read · EA link

A Bird’s Eye View of the ML Field [Pragmatic AI Safety #2]

TW123 · 9 May 2022 17:15 UTC
97 points
2 comments · 35 min read · EA link

Introduction to Pragmatic AI Safety [Pragmatic AI Safety #1]

TW123 · 9 May 2022 17:02 UTC
68 points
0 comments · 6 min read · EA link