Distributed computing seems to be a skill in high demand among AI safety organizations. Does anyone have recommendations for resources to learn about it? Would it look like using the PyTorch Distributed package or something like a microservices architecture?
To answer your question: my best idea is to work on a production distributed system that has bugs of that kind and debug them, and/or to set up such a system yourself, perhaps with k8s.
This answer is pretty bad: all of these ideas are very hard and will take a long time to implement.
My prior is that you might have a bottleneck that is easier to tackle than this one. Wanna share what your goal is, where you stand, and what you’re planning to do?
I have a better answer:
Is distributed computing your only bottleneck, so that once you have it you could get [the AI Safety job you spoke about] tomorrow? Did you interview with such a company to make sure this is all you’re missing?
If not, I’d probably focus on other skills, mainly because distributed computing (1) is not a must and (2) is pretty hard to learn compared to other skills, many of which are more important, at least given the ideas I have on how to learn it.
See also the section on preparing for AI Safety jobs, if you haven’t seen it already.