The HLI discussion on the Forum recently felt off to me, bad vibes all around. It seems very heated, not a lot of scout mindset, and reading the various back-and-forth chains I felt like I was “getting Eulered”, as Scott once described.
I’m not an expert on evaluating charities, but I followed a lot of links to previous discussions and found this discussion involving one of the people running an RCT on StrongMinds (which a lot of people are waiting for the final results of), who was highly sceptical of SM efficacy. But the counterarguments offered in the thread seem just as valid to me? My current position, for what it’s worth,[1] is:
the initial StrongMinds results of 10x cash transfers should raise a sceptical response; most things aren’t that effective
it’s worth exploring what the SWB approach would recommend as the top charities (think of this as pulling other arms in a multi-armed-bandit charity-evaluation problem; see the sketch after this list)
it’s very difficult to do good social science, and the RCT won’t give us dispositive evidence about the effectiveness of StrongMinds (especially at scale), but it may help us update. In general we should be mindful of how far we can make rigorous empirical claims in the social sciences
HLI has used language too loosely in the past and overclaimed/been overconfident, which Michael has apologised for, though perhaps some critics would like a stronger signal of neutrality (this links to the “epistemic probation” comments)
GiveWell’s own “best guess” analysis seems to be that StrongMinds is 2.3x as cost-effective as GiveDirectly,[2] i.e. $1 to StrongMinds would do roughly as much good as $2.30 given directly as cash. I’m generally a big fan of the GiveDirectly approach for reasons of autonomy, so even if StrongMinds’ efficacy were revised down to ~1x GD, it’d still be a good intervention? I’m much more concerned with what this number is than with the tone of HLI’s or Michael’s claims tbh (though not at the expense of epistemic rigour).
The world is rife with actively wasted or even negative action, spending, and charity. The integrity of EA research, and holding charity evaluators to account, is important to both the EA mission and EA’s identity. But HLI seems to have been singled out for very harsh criticism[3] when so much of the world is worse.
I’m also quite unsettled by a lot of what I call “drive-by downvoting”. While writing a comment is a lot more effort than clicking to vote on a comment/post, it carries a lot more signal, and would help those involved in debates reach consensus. Some people with high-karma accounts seem to be making some very strong votes on that thread, and very few are making their reasoning clear (though I salute those who are, in either direction).
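Since the multi-armed-bandit analogy above is doing real work, here is a minimal epsilon-greedy sketch of it. Everything in it (the approach names, payoff values, and epsilon) is hypothetical and purely illustrative, not a claim about actual cost-effectiveness:

```python
import random

# Hypothetical evaluation approaches ("arms") with unknown true payoffs;
# the numbers are made up purely for illustration.
ARMS = {"cash-benchmark": 1.0, "SWB-based": 1.2, "status-quo-picks": 0.8}

EPSILON = 0.1  # fraction of effort spent exploring non-favourite arms
estimates = {name: 0.0 for name in ARMS}
pulls = {name: 0 for name in ARMS}

def evaluate(name):
    """One noisy observation of an arm's value (think: one study or RCT)."""
    return ARMS[name] + random.gauss(0, 0.5)

for _ in range(1000):
    if random.random() < EPSILON:
        arm = random.choice(list(ARMS))           # explore
    else:
        arm = max(estimates, key=estimates.get)   # exploit current best
    reward = evaluate(arm)
    pulls[arm] += 1
    estimates[arm] += (reward - estimates[arm]) / pulls[arm]  # running mean

print(estimates, pulls)
```

The point is just value of information: without occasionally pulling the less-explored arm, you never find out whether the SWB approach actually beats the incumbent.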
So I’m very unsure how to feel. It’s an important issue, but I’m not sure the Forum has shown itself in a good light in this instance.
And I stress this isn’t worth much: in this area, I generally defer to evaluators
In the table at the top of the link, see the column “GiveWell best guess” and the row “Cost-effectiveness, relative to cash”
Again, I don’t think I have the ability to adjudicate here, which is part of why I’m so confused.
I think this is a significant datum in favor of being able to see the strong-up/up/down/strong-down spread for each post/comment. If it appeared that much of the karma activity was the result of a handful of people strongvoting each comment in the same direction, that would influence how I read the karma count as evidence when trying to discern the community’s viewpoint. More importantly, it would probably inform HLI’s takeaways: in its shoes, I would treat evidence of a broad consensus of support for certain negative statements much, much more seriously than evidence of carpet-bomb voting on those statements by a small group.
Indeed, our new reacts system separates them. But our new reacts system also doesn’t have strong votes. A problem with displaying the count of each vote type when strong votes are involved is that it much more easily allows deanonymization if there are only a few people in the thread.
That makes sense. On the karma side, I think some of my discomfort comes from the underlying operationalization of post/comment karma as a simple sum of individual karma weights.
True opinion of the value of most posts/comments probably lies on a bell curve, so I would expect most posts/comments to have significantly more upvotes than strong upvotes if voters are “honestly” conveying preferences and those preferences are fairly representative of the user base. Where the karma comes predominantly from strongvotes, the odds are much higher that the displayed total reflects the opinion of a smallish minority that feels passionately. That can be problematic if it gives the impression of community consensus where no such consensus exists.
If it were up to me, I would probably favor a rule along the lines of: a post/comment can’t get more than X% of its net positive karma from strongvotes, to ensure that a high karma count reflects some breadth of community support rather than voting by a small handful of people with powerful strongvotes. Downvotes are trickier, because the strong downvote is an effective way of quickly pushing down norm-breaking and otherwise problematic content, and I think putting posts into deep negative territory is generally used for that purpose.
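As a rough sketch of how such a cap could work (the 50% threshold, the vote-weight encoding, and the function name are all hypothetical assumptions, not the Forum’s actual karma system):

```python
def capped_karma(votes, max_strong_share=0.5):
    """Cap the share of a net-positive score that strongvotes can supply.

    `votes` is a list of signed karma weights; here any |v| > 1 is assumed
    to be a strongvote. Both the encoding and the 50% cap are illustrative.
    """
    base = sum(v for v in votes if abs(v) <= 1)   # regular up/downvotes
    strong = sum(v for v in votes if abs(v) > 1)  # strongvotes
    total = base + strong
    if total <= 0 or strong <= 0:
        # Leave net-negative scores and strong downvoting untouched, so
        # strong downvotes can still quickly sink norm-breaking content.
        return total
    # Largest strongvote contribution s with s <= share * (base + s),
    # i.e. s <= share / (1 - share) * base.
    allowed = max_strong_share / (1 - max_strong_share) * base
    return base + min(strong, max(allowed, 0))

votes = [1, 1, 1, 7, 7]      # three regular upvotes, two strong upvotes
print(capped_karma(votes))   # 6.0, where the naive sum would be 17
```

One consequence of this formulation: a comment whose positive karma comes almost entirely from a handful of strongvotes would display little more than its regular-vote total, which is exactly the breadth-of-support property described above.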
Looks like this feature is being rolled out on new posts. Or at least one post: https://forum.effectivealtruism.org/posts/gEmkxFuMck8SHC55w/introducing-the-effective-altruism-addiction-recovery-group
EA is just a few months out from a massive scandal caused in part by socially enforced artificial consensus (FTX), but judging by this post nothing has been learned and the “shut up and just be nice to everyone else on the team” culture is back again, even when truth gets sacrificed in the process. No one thinks HLI is stealing billions of dollars, of course, but the charge that they keep quasi-deliberately stacking the deck in StrongMinds’ favour is far from outrageous and should be discussed honestly and straightforwardly.
JWS’ quick take has often been in negative agreevote territory and is +3 at this writing. Meanwhile, the comments of the lead HLI critic suggesting potential bad faith have seen consistent patterns of high upvotes/agreevotes. I don’t see much evidence of a “shut up and just be nice to everyone else on the team” culture here.
Hey Sol, some thoughts on this comment:
I don’t think the Forum’s reaction to the HLI post has been “shut up and just be nice to everyone else on the team”, as Jason’s response suggested.
I don’t think mine suggests that either! In fact, my first bullet point has a similar sceptical prior to the one you express in this comment.[1] I also literally say “holding charity evaluators to account is important to both the EA mission and EA’s identity”, and point out that I don’t want to sacrifice epistemic rigour. In fact, one of my main points is that people, even those disagreeing with HLI, are shutting up too much! I think disagreement without explanation is bad, and I salute the thorough critics on that post who have made their reasoning for putting HLI in “epistemic probation” clear.
I don’t suggest “sacrificing the truth”. My position is that the truth about StrongMinds’ efficacy is hard to get a strong signal on, and that HLI should therefore have been more modest early in its history, instead of framing StrongMinds as the most effective way to donate.
As for the question of whether HLI was “quasi-deliberately stacking the deck”, well, I was quite open that I am confused about where the truth lies, and find it difficult to adjudicate what the correct takeaway should be.
I don’t think we really disagree that much, and I definitely agree that the HLI discussion should proceed transparently and that EA has a lot to learn from the last year, including FTX. If you re-read my Quick Take, I think you’ll see I’m not taking the position you think I am.
That’s my interpretation, of course; please correct me if I’ve misunderstood.