AI Safety NewsletterCenter for AI Safety16 May 2023 15:22 UTCAI Safety Newsletter #1 [CAIS Linkpost]Akash10 Apr 2023 20:18 UTC38 points0 comments1 min readEA linkAI Safety Newsletter #2: ChaosGPT, Natural Selection, and AI Safety in the MediaOliver Z18 Apr 2023 18:36 UTC56 points1 comment4 min readEA link(newsletter.safe.ai)AI Safety Newsletter #3: AI policy proposals and a new challenger approachesOliver Z25 Apr 2023 16:15 UTC35 points1 comment4 min readEA link(newsletter.safe.ai)AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risksCenter for AI Safety2 May 2023 16:51 UTC35 points2 comments5 min readEA link(newsletter.safe.ai)AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language modelsCenter for AI Safety9 May 2023 15:26 UTC60 points0 comments4 min readEA link(newsletter.safe.ai)AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms controlCenter for AI Safety16 May 2023 15:14 UTC32 points1 comment6 min readEA link(newsletter.safe.ai)AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs, and Senate Hearings on AICenter for AI Safety23 May 2023 21:42 UTC23 points0 comments6 min readEA link(newsletter.safe.ai)AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AICenter for AI Safety30 May 2023 11:44 UTC16 points3 comments6 min readEA link(newsletter.safe.ai)AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level?Center for AI Safety6 Jun 2023 15:56 UTC12 points2 comments7 min readEA link(newsletter.safe.ai)AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental ConvergenceCenter for AI Safety27 Jun 2023 15:25 UTC30 points3 comments7 min readEA link(newsletter.safe.ai)AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehaveCenter for AI Safety5 Jul 2023 15:33 UTC25 points0 comments9 min readEA link(newsletter.safe.ai)AISN#14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI useCenter for AI Safety12 Jul 2023 16:58 UTC26 points0 comments4 min readEA link(newsletter.safe.ai)AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from OppenheimerCenter for AI Safety25 Jul 2023 16:45 UTC7 points0 comments6 min readEA link(newsletter.safe.ai)AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI OversightCenter for AI Safety1 Aug 2023 15:24 UTC15 points0 comments8 min readEA linkAISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI SafetyCenter for AI Safety8 Aug 2023 15:52 UTC12 points0 comments5 min readEA link(newsletter.safe.ai)AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI CapabilitiesCenter for AI Safety29 Aug 2023 15:03 UTC12 points0 comments8 min readEA link(newsletter.safe.ai)AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI PolicyCenter for AI Safety5 Sep 2023 14:59 UTC13 points0 comments5 min readEA link(newsletter.safe.ai)AISN #22: The Landscape of US AI Legislation - Hearings, Frameworks, Bills, and LawsCenter for AI Safety19 Sep 2023 14:43 UTC15 points1 comment5 min readEA link(newsletter.safe.ai)AISN #23: New OpenAI Models, News from Anthropic, and Representation EngineeringCenter for AI Safety4 Oct 2023 17:10 UTC7 points0 comments5 min readEA link(newsletter.safe.ai)AISN #24: Kissinger Urges US-China Cooperation on AI, China’s New AI Law, US Export Controls, International Institutions, and Open Source AICenter for AI Safety18 Oct 2023 17:03 UTC16 points1 comment6 min readEA link(newsletter.safe.ai)AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI RisksCenter for AI Safety31 Oct 2023 19:24 UTC21 points0 comments6 min readEA link(newsletter.safe.ai)AISN #26: National Institutions for AI Safety, Results From the UK Summit, and New Releases From OpenAI and xAICenter for AI Safety15 Nov 2023 16:03 UTC11 points0 comments6 min readEA link(newsletter.safe.ai)AISN #27: Defensive Accelerationism, A Retrospective On The OpenAI Board Saga, And A New AI Bill From Senators Thune And KlobucharCenter for AI Safety7 Dec 2023 15:57 UTC10 points0 comments6 min readEA link(newsletter.safe.ai)AISN #28: Center for AI Safety 2023 Year in ReviewCenter for AI Safety23 Dec 2023 21:31 UTC17 points1 comment5 min readEA link(newsletter.safe.ai)AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright Infringement, and Congressional Questions about Research Standards in AI SafetyCenter for AI Safety4 Jan 2024 16:03 UTC5 points0 comments6 min readEA link(newsletter.safe.ai)AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety InstitutesCenter for AI Safety24 Jan 2024 19:38 UTC7 points1 comment6 min readEA link(newsletter.safe.ai)AISN #31: A New AI Policy Bill in California Plus, Precedents for AI Governance and The EU AI OfficeCenter for AI Safety21 Feb 2024 21:55 UTC27 points0 comments6 min readEA link(newsletter.safe.ai)AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs Plus, Forecasting the Future with LLMs, and Regulatory MarketsCenter for AI Safety7 Mar 2024 16:37 UTC15 points2 comments8 min readEA link(newsletter.safe.ai)AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI Landscape, and National Investments in AICenter for AI Safety12 Apr 2024 16:11 UTC19 points0 comments9 min readEA link(newsletter.safe.ai)