I attended this talk not knowing much about mechanistic interpretability at all and came away quite excited about the idea of working on it. Particularly, I found that there were concepts and intuitions around MI that overlap or have similarities with fluid mechanics and turbulence, which were the focus of my PhD. This surprised me and I’ve since been looking into MI further as something I could work on in the future.
I also think there could be similar transferable intuitions from other fields of physical engineering which I’m interested in exploring further to help other engineers transition into the field (as part of my work at High Impact Engineers).
Thanks for giving this talk and sharing such a comprehensive write-up, Neel!
I attended this talk not knowing much about mechanistic interpretability at all and came away quite excited about the idea of working on it. Particularly, I found that there were concepts and intuitions around MI that overlap or have similarities with fluid mechanics and turbulence, which were the focus of my PhD. This surprised me and I’ve since been looking into MI further as something I could work on in the future.
I also think there could be similar transferable intuitions from other fields of physical engineering which I’m interested in exploring further to help other engineers transition into the field (as part of my work at High Impact Engineers).
Thanks for giving this talk and sharing such a comprehensive write-up, Neel!