Center for AI Safety

Karma: 992

AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks

Center for AI Safety2 May 2023 16:51 UTC

35 points

2 comments5 min readEA link

(newsletter.safe.ai)

AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models

Center for AI Safety9 May 2023 15:26 UTC

60 points

0 comments4 min readEA link

(newsletter.safe.ai)

AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control

Center for AI Safety16 May 2023 15:14 UTC

32 points

1 comment6 min readEA link

(newsletter.safe.ai)

AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs, and Senate Hearings on AI

Center for AI Safety23 May 2023 21:42 UTC

23 points

0 comments6 min readEA link

(newsletter.safe.ai)

Statement on AI Extinction—Signed by AGI Labs, Top Academics, and Many Other Notable Figures

Center for AI Safety30 May 2023 9:06 UTC

427 points

28 comments1 min readEA link

(www.safe.ai)

AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI

Center for AI Safety30 May 2023 11:44 UTC

16 points

3 comments6 min readEA link

(newsletter.safe.ai)

AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level?

Center for AI Safety6 Jun 2023 15:56 UTC

12 points

2 comments7 min readEA link

(newsletter.safe.ai)

AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental Convergence

Center for AI Safety27 Jun 2023 15:25 UTC

30 points

3 comments7 min readEA link

(newsletter.safe.ai)

Catastrophic Risks from AI #5: Rogue AIs

Center for AI Safety27 Jun 2023 22:06 UTC

16 points

1 comment1 min readEA link

Catastrophic Risks from AI #6: Discussion and FAQ

Center for AI Safety27 Jun 2023 23:23 UTC

10 points

0 comments1 min readEA link

AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave

Center for AI Safety5 Jul 2023 15:33 UTC

25 points

0 comments9 min readEA link

(newsletter.safe.ai)

Modeling the impact of AI safety field-building programs

Center for AI Safety10 Jul 2023 17:22 UTC

81 points

0 comments7 min readEA link

Cost-effectiveness of student programs for AI safety research

Center for AI Safety10 Jul 2023 17:23 UTC

53 points

6 comments15 min readEA link

Cost-effectiveness of professional field-building programs for AI safety research

Center for AI Safety10 Jul 2023 17:26 UTC

36 points

2 comments18 min readEA link

AISN#14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI use

Center for AI Safety12 Jul 2023 16:58 UTC

26 points

0 comments4 min readEA link

(newsletter.safe.ai)

AISN#15: China and the US take action to regulate AI, results from a tournament forecasting AI risk, updates on xAI’s plan, and Meta releases its open-source and commercially available Llama 2

Center for AI Safety19 Jul 2023 1:40 UTC

5 points

0 comments6 min readEA link

(newsletter.safe.ai)

AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from Oppenheimer

Center for AI Safety25 Jul 2023 16:45 UTC

7 points

0 comments6 min readEA link

(newsletter.safe.ai)

Center for AI Safety 26 Jul 2023 12:59 UTC
1 point
0 ∶ 0
in reply to: Linch’s comment on: Cost-effectiveness of student programs for AI safety research
This is broadly correct!
Regarding
I’m still not sure how they ended up with 84x and 8.7x
the answer is discounting for time and productivity. Consider the 84x for research scientists. With a 20% annual research discount rate, the average value of otherwise-identical research relative to the present is a bit less than 0.9. And productivity relative to peak is very slightly less than 1. These forces move the 100 to 84.
Regarding
why the two numbers [84x and 8.7x] are so different from each other
the answer is mainly differences in productivity relative to peak and scientist-equivalence. As in the plots in this section, PhD students midway through their PhD are ~0.5x as productive as they will be at their career peak. And, as in this section, we value PhD student research labor at 0.1x that of research scientists. The other important force is the length of a PhD—the research scientist is assumed to be working for 1 year whilst the PhD student is funded for 5 years, which increases the duration of the treatment effect and decreases the average time value of research.
Very roughly: 100x baseline you identified * ~0.5x productivity * 0.1x scientist-equivalence * 5 years * ~0.5 average research discount rate = 12.5. (Correcting errors in these rough numbers takes us to 8.4.)

AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight

Center for AI Safety1 Aug 2023 15:24 UTC

15 points

0 comments8 min readEA link

AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety

Center for AI Safety8 Aug 2023 15:52 UTC

12 points

0 comments5 min readEA link

(newsletter.safe.ai)