replaceability misses the point (with why EAs skew heavily on not liking protests). it’s way more an epistemics issue—messaging and advocacy are just deeply corrosive under any reasonable way of thinking about uncertainty.
In my sordid past I did plenty of “finding the three people for nuanced logical mind-changing discussions amidst a dozens of ‘hey hey ho ho outgroup has got to go’”, so I’ll do the same here (if I’m in town), but selection effects seem deeply worrying (for example, you could go down to the soup kitchen or punk music venue and recruit all the young volunteers who are constantly sneering about how gentrifying techbros are evil and can’t coordinate on whether their “unabomber is actually based” argument is ironic or unironic, but you oughtn’t. The fact that this is even a question, that if you have a “mass movement” theory of change you’re constantly temped to lower your standards in this way, is so intrinsically risky that no one should be comfortable that ML safety or alignment is resorting to this sort of thing).
replaceability misses the point (with why EAs skew heavily on not liking protests). it’s way more an epistemics issue—messaging and advocacy are just deeply corrosive under any reasonable way of thinking about uncertainty.
In my sordid past I did plenty of “finding the three people for nuanced logical mind-changing discussions amidst a dozens of ‘hey hey ho ho outgroup has got to go’”, so I’ll do the same here (if I’m in town), but selection effects seem deeply worrying (for example, you could go down to the soup kitchen or punk music venue and recruit all the young volunteers who are constantly sneering about how gentrifying techbros are evil and can’t coordinate on whether their “unabomber is actually based” argument is ironic or unironic, but you oughtn’t. The fact that this is even a question, that if you have a “mass movement” theory of change you’re constantly temped to lower your standards in this way, is so intrinsically risky that no one should be comfortable that ML safety or alignment is resorting to this sort of thing).