Hi, I’m Rohin Shah! I work as a Research Scientist on the technical AGI safety team at DeepMind. I completed my PhD at the Center for Human-Compatible AI at UC Berkeley, where I worked on building AI systems that can learn to assist a human user, even if they don’t initially know what the user wants.
I’m particularly interested in big picture questions about artificial intelligence. What techniques will we use to build human-level AI systems? How will their deployment affect the world? What can we do to make this deployment go better? I write up summaries and thoughts about recent work tackling these questions in the Alignment Newsletter.
In the past, I ran the EA groups at UC Berkeley and the University of Washington.
As someone sympathetic to many of Habryka’s positions, while also disagreeing with many of them, my immediate reaction to this was “well, that seems like a bad thing”, cf.
I’d feel differently if you had said “people feel obliged to take criticism seriously if it points at a real problem” or something like that, but I agree with you that the mechanism is more like “people are unable to ignore criticism irrespective of its quality”. (The popularity of the criticism matters, but sadly popularity is only weakly correlated with quality.)