aogara

Karma: 3,158

Research Engineering Intern at the Center for AI Safety. Helping to write the AI Safety Newsletter. Studying CS and Economics at the University of Southern California, and running an AI safety club there.

AISN #34: New Military AI Systems Plus, AI Labs Fail to Uphold Voluntary Commitments to UK AI Safety Institute, and New AI Policy Proposals in the US Senate

Center for AI Safety · 2 May 2024 16:12 UTC

21 points

5 comments · 8 min read · EA link

(newsletter.safe.ai)

AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI Landscape, and National Investments in AI

Center for AI Safety · 12 Apr 2024 16:11 UTC

19 points

0 comments · 9 min read · EA link

(newsletter.safe.ai)

AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs Plus, Forecasting the Future with LLMs, and Regulatory Markets

Center for AI Safety · 7 Mar 2024 16:37 UTC

15 points

2 comments · 8 min read · EA link

(newsletter.safe.ai)

AISN #31: A New AI Policy Bill in California Plus, Precedents for AI Governance and The EU AI Office

Center for AI Safety · 21 Feb 2024 21:55 UTC

27 points

0 comments · 6 min read · EA link

(newsletter.safe.ai)

AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes

Center for AI Safety · 24 Jan 2024 19:38 UTC

7 points

1 comment · 6 min read · EA link

(newsletter.safe.ai)

AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright Infringement, and Congressional Questions about Research Standards in AI Safety

Center for AI Safety · 4 Jan 2024 16:03 UTC

5 points

0 comments · 6 min read · EA link

(newsletter.safe.ai)

AISN #28: Center for AI Safety 2023 Year in Review

Center for AI Safety · 23 Dec 2023 21:31 UTC

17 points

1 comment · 5 min read · EA link

(newsletter.safe.ai)

AISN #27: Defensive Accelerationism, A Retrospective On The OpenAI Board Saga, And A New AI Bill From Senators Thune And Klobuchar

Center for AI Safety · 7 Dec 2023 15:57 UTC

10 points

0 comments · 6 min read · EA link

(newsletter.safe.ai)

AISN #26: National Institutions for AI Safety, Results From the UK Summit, and New Releases From OpenAI and xAI

Center for AI Safety · 15 Nov 2023 16:03 UTC

11 points

0 comments · 6 min read · EA link

(newsletter.safe.ai)

AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks

Center for AI Safety · 31 Oct 2023 19:24 UTC

21 points

0 comments · 6 min read · EA link

(newsletter.safe.ai)

AISN #24: Kissinger Urges US-China Cooperation on AI, China’s New AI Law, US Export Controls, International Institutions, and Open Source AI

Center for AI Safety · 18 Oct 2023 17:03 UTC

16 points

1 comment · 6 min read · EA link

(newsletter.safe.ai)

AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering

Center for AI Safety · 4 Oct 2023 17:10 UTC

7 points

0 comments · 5 min read · EA link

(newsletter.safe.ai)

AISN #22: The Landscape of US AI Legislation - Hearings, Frameworks, Bills, and Laws

Center for AI Safety · 19 Sep 2023 14:43 UTC

15 points

1 comment · 5 min read · EA link

(newsletter.safe.ai)

MLSN: #10 Adversarial Attacks Against Language and Vision Models, Improving LLM Honesty, and Tracing the Influence of LLM Training Data

Center for AI Safety · 13 Sep 2023 18:02 UTC

7 points

0 comments · 5 min read · EA link

(newsletter.mlsafety.org)

AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy

Center for AI Safety · 5 Sep 2023 14:59 UTC

13 points

0 comments · 5 min read · EA link

(newsletter.safe.ai)

AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities

Center for AI Safety · 29 Aug 2023 15:03 UTC

12 points

0 comments · 8 min read · EA link

(newsletter.safe.ai)

AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety

Center for AI Safety · 8 Aug 2023 15:52 UTC

12 points

0 comments · 5 min read · EA link

(newsletter.safe.ai)

AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight

Center for AI Safety · 1 Aug 2023 15:24 UTC

15 points

0 comments · 8 min read · EA link

AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from Oppenheimer

Center for AI Safety · 25 Jul 2023 16:45 UTC

7 points

0 comments · 6 min read · EA link

(newsletter.safe.ai)

AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave

Center for AI Safety · 5 Jul 2023 15:33 UTC

25 points

0 comments · 9 min read · EA link

(newsletter.safe.ai)