Anecdotally, approximately everyone who’s now working on AI safety with Russian origins got into it because of HPMOR. Just a couple of days ago, an IOI gold medalist reached out to me, they’ve been going through ARENA.
HPMOR tends to make people with that kind of background act more on trying to save the world. It also gives some intuitive sense for some related stuff (up to “oh, like the mirror from HPMOR?”), but this is a lot less central than giving people the ~EA values and making them actually do stuff.
(Plus, at this point, the book is well-known enough in some circles that some % of future Russian ML researchers would be a lot easier to alignment-pill and persuade to not work on something that might kill everyone or prompt other countries to build something that kills everyone.
I’m not sure how universal this is- the kind of Russian kid who is into math/computer science is the kind of kid who would often be into the HPMOR aesthetics- but it seems to work.
I think many past IMO/IOI medalists are generally very capable and can help, and it’s worth looking at the list of them and reaching out to people who’ve read HPMOR (and possibly The Precipice/Human Compatible) and getting them to work on AI safety.
Anecdotally, approximately everyone who’s now working on AI safety with Russian origins got into it because of HPMOR. Just a couple of days ago, an IOI gold medalist reached out to me, they’ve been going through ARENA.
HPMOR tends to make people with that kind of background act more on trying to save the world. It also gives some intuitive sense for some related stuff (up to “oh, like the mirror from HPMOR?”), but this is a lot less central than giving people the ~EA values and making them actually do stuff.
(Plus, at this point, the book is well-known enough in some circles that some % of future Russian ML researchers would be a lot easier to alignment-pill and persuade to not work on something that might kill everyone or prompt other countries to build something that kills everyone.
Like, the largest Russian broker decided to celebrate the New Year by advertising HPMOR and citing Yudkowsky.)
I’m not sure how universal this is- the kind of Russian kid who is into math/computer science is the kind of kid who would often be into the HPMOR aesthetics- but it seems to work.
I think many past IMO/IOI medalists are generally very capable and can help, and it’s worth looking at the list of them and reaching out to people who’ve read HPMOR (and possibly The Precipice/Human Compatible) and getting them to work on AI safety.