Owain Evans on AI alignment (situational awareness in LLM, benchmarking truthfulness)
Ben Garfinkel on AI policy (best practices in AI governance, open source, the UK’s AI efforts)
Anthony Aguirre on AI governance, forecasting, cosmology
Beth Barnes on dangerous capability evals (GPT-4′s and Claude’s eval)
+1 to Beth Barnes on dangerous capability evals
Owain Evans on AI alignment (situational awareness in LLM, benchmarking truthfulness)
Ben Garfinkel on AI policy (best practices in AI governance, open source, the UK’s AI efforts)
Anthony Aguirre on AI governance, forecasting, cosmology
Beth Barnes on dangerous capability evals (GPT-4′s and Claude’s eval)
+1 to Beth Barnes on dangerous capability evals