Center for AI Safety

Karma: 1,008

AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks

Center for AI Safety2 May 2023 16:51 UTC

35 points

2 comments5 min readEA link

(newsletter.safe.ai)

AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models

Center for AI Safety9 May 2023 15:26 UTC

60 points

0 comments4 min readEA link

(newsletter.safe.ai)

AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control

Center for AI Safety16 May 2023 15:14 UTC

32 points

1 comment6 min readEA link

(newsletter.safe.ai)

AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs, and Senate Hearings on AI

Center for AI Safety23 May 2023 21:42 UTC

23 points

0 comments6 min readEA link

(newsletter.safe.ai)

Statement on AI Extinction—Signed by AGI Labs, Top Academics, and Many Other Notable Figures

Center for AI Safety30 May 2023 9:06 UTC

427 points

28 comments1 min readEA link

(www.safe.ai)

AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI

Center for AI Safety30 May 2023 11:44 UTC

16 points

3 comments6 min readEA link

(newsletter.safe.ai)

AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level?

Center for AI Safety6 Jun 2023 15:56 UTC

12 points

2 comments7 min readEA link

(newsletter.safe.ai)

AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental Convergence

Center for AI Safety27 Jun 2023 15:25 UTC

30 points

3 comments7 min readEA link

(newsletter.safe.ai)

Catastrophic Risks from AI #5: Rogue AIs

Center for AI Safety27 Jun 2023 22:06 UTC

16 points

1 comment1 min readEA link

Catastrophic Risks from AI #6: Discussion and FAQ

Center for AI Safety27 Jun 2023 23:23 UTC

10 points

0 comments1 min readEA link

AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave

Center for AI Safety5 Jul 2023 15:33 UTC

25 points

0 comments9 min readEA link

(newsletter.safe.ai)

Modeling the impact of AI safety field-building programs

Center for AI Safety10 Jul 2023 17:22 UTC

81 points

0 comments7 min readEA link

Cost-effectiveness of student programs for AI safety research

Center for AI Safety10 Jul 2023 17:23 UTC

53 points

6 comments15 min readEA link

Cost-effectiveness of professional field-building programs for AI safety research

Center for AI Safety10 Jul 2023 17:26 UTC

36 points

2 comments18 min readEA link

AISN#14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI use

Center for AI Safety12 Jul 2023 16:58 UTC

26 points

0 comments4 min readEA link

(newsletter.safe.ai)

AISN#15: China and the US take action to regulate AI, results from a tournament forecasting AI risk, updates on xAI’s plan, and Meta releases its open-source and commercially available Llama 2

Center for AI Safety19 Jul 2023 1:40 UTC

5 points

0 comments6 min readEA link

(newsletter.safe.ai)

AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from Oppenheimer

Center for AI Safety25 Jul 2023 16:45 UTC

7 points

0 comments6 min readEA link

(newsletter.safe.ai)

AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight

Center for AI Safety1 Aug 2023 15:24 UTC

15 points

0 comments8 min readEA link

AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety

Center for AI Safety8 Aug 2023 15:52 UTC

12 points

0 comments5 min readEA link

(newsletter.safe.ai)

An Overview of Catastrophic AI Risks

Center for AI Safety15 Aug 2023 21:52 UTC

37 points

1 comment13 min readEA link

(www.safe.ai)