Center for AI Safety

Karma: 992

Statement on AI Extinction—Signed by AGI Labs, Top Academics, and Many Other Notable Figures

Center for AI Safety, 30 May 2023 9:06 UTC

427 points

28 comments, 1 min read, EA link

(www.safe.ai)

Modeling the impact of AI safety field-building programs

Center for AI Safety, 10 Jul 2023 17:22 UTC

81 points

0 comments, 7 min read, EA link

AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models

Center for AI Safety, 9 May 2023 15:26 UTC

60 points

0 comments, 4 min read, EA link

(newsletter.safe.ai)

Cost-effectiveness of student programs for AI safety research

Center for AI Safety, 10 Jul 2023 17:23 UTC

53 points

6 comments, 15 min read, EA link

$250K in Prizes: SafeBench Competition Announcement

Center for AI Safety, 3 Apr 2024 22:07 UTC

46 points

0 comments, 1 min read, EA link

An Overview of Catastrophic AI Risks

Center for AI Safety, 15 Aug 2023 21:52 UTC

37 points

1 comment, 13 min read, EA link

(www.safe.ai)

Cost-effectiveness of professional field-building programs for AI safety research

Center for AI Safety, 10 Jul 2023 17:26 UTC

36 points

2 comments, 18 min read, EA link

AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks

Center for AI Safety, 2 May 2023 16:51 UTC

35 points

2 comments, 5 min read, EA link

(newsletter.safe.ai)

AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control

Center for AI Safety, 16 May 2023 15:14 UTC

32 points

1 comment, 6 min read, EA link

(newsletter.safe.ai)

AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental Convergence

Center for AI Safety, 27 Jun 2023 15:25 UTC

30 points

3 comments, 7 min read, EA link

(newsletter.safe.ai)

AISN #31: A New AI Policy Bill in California Plus, Precedents for AI Governance and The EU AI Office

Center for AI Safety, 21 Feb 2024 21:55 UTC

27 points

0 comments, 6 min read, EA link

(newsletter.safe.ai)

AISN #14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI use

Center for AI Safety, 12 Jul 2023 16:58 UTC

26 points

0 comments, 4 min read, EA link

(newsletter.safe.ai)

AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave

Center for AI Safety, 5 Jul 2023 15:33 UTC

25 points

0 comments, 9 min read, EA link

(newsletter.safe.ai)

AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs, and Senate Hearings on AI

Center for AI Safety, 23 May 2023 21:42 UTC

23 points

0 comments, 6 min read, EA link

(newsletter.safe.ai)

AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks

Center for AI Safety, 31 Oct 2023 19:24 UTC

21 points

0 comments, 6 min read, EA link

(newsletter.safe.ai)

AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI Landscape, and National Investments in AI

Center for AI Safety, 12 Apr 2024 16:11 UTC

19 points

0 comments, 9 min read, EA link

(newsletter.safe.ai)

AISN #28: Center for AI Safety 2023 Year in Review

Center for AI Safety, 23 Dec 2023 21:31 UTC

17 points

1 comment, 5 min read, EA link

(newsletter.safe.ai)

Catastrophic Risks from AI #5: Rogue AIs

Center for AI Safety, 27 Jun 2023 22:06 UTC

16 points

1 comment, 1 min read, EA link

AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI

Center for AI Safety, 30 May 2023 11:44 UTC

16 points

3 comments, 6 min read, EA link

(newsletter.safe.ai)

AISN #24: Kissinger Urges US-China Cooperation on AI, China’s New AI Law, US Export Controls, International Institutions, and Open Source AI

Center for AI Safety, 18 Oct 2023 17:03 UTC

16 points

1 comment, 6 min read, EA link

(newsletter.safe.ai)