The AI Safety Atlas is an amazing resource, and just the type I think you are looking for (understandable by me, a pianist with zero STEM background beyond high-school).
For your purposes I’d recommend Chapter 1.4 + maybe one or two extra chapters specifically around capabilities and ‘the bitter lesson’, then perhaps this video by Rational Animations about goal misgeneralisation.
Since you’re in Oxford, I’d also recommend reaching out to the Oxford AI Safety Initiative, a student-led group doing amazing work to educate around issues of AI Safety.
I’ve been doing their Core Fellowship this term and it has been amazing.
Hi, fellow Oxford neighbour here!
The AI Safety Atlas is an amazing resource, and just the type I think you are looking for (understandable by me, a pianist with zero STEM background beyond high-school).
For your purposes I’d recommend Chapter 1.4 + maybe one or two extra chapters specifically around capabilities and ‘the bitter lesson’, then perhaps this video by Rational Animations about goal misgeneralisation.
Since you’re in Oxford, I’d also recommend reaching out to the Oxford AI Safety Initiative, a student-led group doing amazing work to educate around issues of AI Safety.
I’ve been doing their Core Fellowship this term and it has been amazing.