RSS

Center for AI Safety

Karma: 992

State­ment on AI Ex­tinc­tion—Signed by AGI Labs, Top Aca­demics, and Many Other Notable Figures

Center for AI Safety30 May 2023 9:06 UTC
427 points
28 comments1 min readEA link
(www.safe.ai)

Model­ing the im­pact of AI safety field-build­ing programs

Center for AI Safety10 Jul 2023 17:22 UTC
81 points
0 comments7 min readEA link

AI Safety Newslet­ter #5: Ge­offrey Hin­ton speaks out on AI risk, the White House meets with AI labs, and Tro­jan at­tacks on lan­guage models

Center for AI Safety9 May 2023 15:26 UTC
60 points
0 comments4 min readEA link
(newsletter.safe.ai)

Cost-effec­tive­ness of stu­dent pro­grams for AI safety research

Center for AI Safety10 Jul 2023 17:23 UTC
53 points
6 comments15 min readEA link

$250K in Prizes: SafeBench Com­pe­ti­tion An­nounce­ment

Center for AI Safety3 Apr 2024 22:07 UTC
46 points
0 comments1 min readEA link

An Overview of Catas­trophic AI Risks

Center for AI Safety15 Aug 2023 21:52 UTC
37 points
1 comment13 min readEA link
(www.safe.ai)

Cost-effec­tive­ness of pro­fes­sional field-build­ing pro­grams for AI safety research

Center for AI Safety10 Jul 2023 17:26 UTC
36 points
2 comments18 min readEA link

AI Safety Newslet­ter #4: AI and Cy­ber­se­cu­rity, Per­sua­sive AIs, Weaponiza­tion, and Ge­offrey Hin­ton talks AI risks

Center for AI Safety2 May 2023 16:51 UTC
35 points
2 comments5 min readEA link
(newsletter.safe.ai)

AI Safety Newslet­ter #6: Ex­am­ples of AI safety progress, Yoshua Ben­gio pro­poses a ban on AI agents, and les­sons from nu­clear arms control

Center for AI Safety16 May 2023 15:14 UTC
32 points
1 comment6 min readEA link
(newsletter.safe.ai)

AISN #12: Policy Pro­pos­als from NTIA’s Re­quest for Com­ment and Re­con­sid­er­ing In­stru­men­tal Convergence

Center for AI Safety27 Jun 2023 15:25 UTC
30 points
3 comments7 min readEA link
(newsletter.safe.ai)

AISN #31: A New AI Policy Bill in Cal­ifor­nia Plus, Prece­dents for AI Gover­nance and The EU AI Office

Center for AI Safety21 Feb 2024 21:55 UTC
27 points
0 comments6 min readEA link
(newsletter.safe.ai)

AISN#14: OpenAI’s ‘Su­per­al­ign­ment’ team, Musk’s xAI launches, and de­vel­op­ments in mil­i­tary AI use

Center for AI Safety12 Jul 2023 16:58 UTC
26 points
0 comments4 min readEA link
(newsletter.safe.ai)

AISN #13: An in­ter­dis­ci­plinary per­spec­tive on AI proxy failures, new com­peti­tors to ChatGPT, and prompt­ing lan­guage mod­els to misbehave

Center for AI Safety5 Jul 2023 15:33 UTC
25 points
0 comments9 min readEA link
(newsletter.safe.ai)

AI Safety Newslet­ter #7: Dis­in­for­ma­tion, Gover­nance Recom­men­da­tions for AI labs, and Se­nate Hear­ings on AI

Center for AI Safety23 May 2023 21:42 UTC
23 points
0 comments6 min readEA link
(newsletter.safe.ai)

AISN #25: White House Ex­ec­u­tive Order on AI, UK AI Safety Sum­mit, and Progress on Vol­un­tary Eval­u­a­tions of AI Risks

Center for AI Safety31 Oct 2023 19:24 UTC
21 points
0 comments6 min readEA link
(newsletter.safe.ai)

AISN #33: Re­assess­ing AI and Biorisk Plus, Con­soli­da­tion in the Cor­po­rate AI Land­scape, and Na­tional In­vest­ments in AI

Center for AI Safety12 Apr 2024 16:11 UTC
19 points
0 comments9 min readEA link
(newsletter.safe.ai)

AISN #28: Cen­ter for AI Safety 2023 Year in Review

Center for AI Safety23 Dec 2023 21:31 UTC
17 points
1 comment5 min readEA link
(newsletter.safe.ai)

Catas­trophic Risks from AI #5: Rogue AIs

Center for AI Safety27 Jun 2023 22:06 UTC
16 points
1 comment1 min readEA link

AI Safety Newslet­ter #8: Rogue AIs, how to screen for AI risks, and grants for re­search on demo­cratic gov­er­nance of AI

Center for AI Safety30 May 2023 11:44 UTC
16 points
3 comments6 min readEA link
(newsletter.safe.ai)

AISN #24: Kiss­inger Urges US-China Co­op­er­a­tion on AI, China’s New AI Law, US Ex­port Con­trols, In­ter­na­tional In­sti­tu­tions, and Open Source AI

Center for AI Safety18 Oct 2023 17:03 UTC
16 points
1 comment6 min readEA link
(newsletter.safe.ai)