Pronouns: she/her or they/them.
I got interested in effective altruism back before it was called effective altruism, back before Giving What We Can had a website. Later on, I got involved in my university EA group and helped run it for a few years. Now I'm trying to figure out where effective altruism can fit into my life these days and what it means to me.
Yarrow
Have you tried Discord? Discord seems absurdly casual for any kind of business or serious use, but that's more about Discord's aesthetics, brand, and reputation than its actual functionality.
My impression when Discord came out was that it copied Slack pretty directly. But Slack was a product for teams at companies to talk to each other and Discord was a tool to make it easier for friends or online communities to play video games together.
Slack is still designed for businesses and Discord is still designed primarily for gamers. But Discord has been adopted by many other types of people for many other purposes.
Discord has voice chat and makes it super easy to switch between servers. Back when people were using Slack as a meeting place for online communities (whereas today they all use Discord), one of my frustrations was switching between teams, as you described.
I think Discord is functionally much better than Slack for many use cases, but asking people to use Discord in a business context or a serious context feels absurd, like holding a company meeting over Xbox Live. If you can get over using a gaming app with a cartoon mascot, then it might be the best solution.
I played it the other way around, where I asked o4-mini to come up with a word that I would try to guess. I tried this twice and it made the same mistake both times.
The first word was "butterfly". I guessed "B" and it said, "The letter B is not in the word."
Then, when I lost the game and o4-mini revealed the word, it said, "Apologies – I mis-evaluated your B guess earlier."
The second time around, I tried to help it by saying: "Make a plan for how you would play hangman with me. Lay out the steps in your mind but don't tell me anything. Tell me when you're ready to play."
It made the same mistake again. I guessed the letters A, E, I, O, U, and Y, and it told me none of the letters were in the word. That exhausted the number of wrong guesses I was allowed, so it ended the game and revealed the word was "schmaltziness".
This time, it didn't catch its own mistake right away. I prompted it to review the context window and check for mistakes. At that point, it said that A, E, and I are actually in the word.[1]
Related to this: François Chollet has a great talk from August 2024, which I posted here, that includes a section on some of the weird, goofy mistakes that LLMs make.
He argues that when a new mistake or category of mistake is discovered and becomes widely known, LLM companies fine-tune their models to avoid these mistakes in the future. But if you change up the prompt a bit, you can still elicit the same kind of mistake.
So, the fine-tuning may give the impression that LLMs' overall reasoning ability is improving, but really this is a patchwork approach that can't possibly scale to cover the space of all human reasoning, which is impossibly vast and can only be mastered through better generalization.
[1] I edited my comment to add this footnote on 2025-05-03 at 16:33 UTC. I just checked and o4-mini got the details on this completely wrong. It said:
But the final word SCHMALTZINESS actually contains an A (in position 5), an I (in positions 10 and 13), and two E's (in positions 11 and 14).
What it said about the A is correct. It said that one letter, I, was in two positions, and neither of the positions it gave contains an I. It said there are two E's, but there is only one E. It gets the position of that E right (position 11), but says there is a second E in position 14, which doesn't exist (the word is only 13 letters long).
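For anyone who wants to double-check, here's a quick, throwaway Python sketch (purely for illustration) that lists the 1-indexed positions of the letters in question:

```python
# List the 1-indexed positions of the letters o4-mini made claims about.
word = "schmaltziness"  # 13 letters

for letter in "aei":
    positions = [i + 1 for i, ch in enumerate(word) if ch == letter]
    print(letter.upper(), positions)

# Prints:
# A [5]
# E [11]
# I [9]
# So: one A at position 5, one E at position 11, one I at position 9,
# and no "position 14" exists at all.
```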
Good Lord! Thanks for this information!
The Twitter thread by Toby Ord is great. Thanks for linking that. This tweet helps put things in perspective:
For reference, these are simple puzzles that my 10-year-old child can solve in about 4 minutes.
Thank you!
When you say people in EA have failed at the Circle of Hell Test, what do you mean? Do you literally mean in that specific situation of people standing and talking in a circle and someone trying to join the circle?
Kudos on a well-written, well-researched, and well-argued post!
The part about "AI researchers" (which don't actually exist; there is no such thing as an "AI researcher") vs. human researchers gets at a simple mistake, which is confusing inputs and outputs. For example, I believe that a GPU running a large language model (LLM) uses a lot more energy than a human brain.[1] Is this evidence that LLMs are smarter than humans?
No, of course not.
If anything, people have interpreted this as a weakness of AI. Maybe it means the way current AI systems solve problems is too "brute force" and we're missing something fundamental about the nature of intelligence.
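As a rough back-of-envelope illustration (ballpark figures, not measurements of any particular LLM deployment): the human brain is usually estimated to run on around 20 watts, while a single data-centre GPU such as an NVIDIA H100 is rated at roughly 700 watts. A minimal sketch of that comparison:

```python
# Rough back-of-envelope comparison of power draw.
# Ballpark assumptions, not measurements of any particular LLM deployment.
BRAIN_WATTS = 20   # commonly cited estimate for the human brain
GPU_WATTS = 700    # rated power of a single NVIDIA H100; actual draw varies with load

print(f"One GPU draws roughly {GPU_WATTS / BRAIN_WATTS:.0f}x the power of a human brain.")
# -> roughly 35x, and a large model is typically served across many GPUs
```

Of course, the ratio by itself tells us nothing about which system is more intelligent; that's the whole point.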
Saying that AI systems are using more and more compute over time doesn't directly say anything about their actual intelligence, any more than saying that AI systems are using more and more energy over time does.
How intelligent are AI systems compared to humans? How much has that been changing over time? How do we measure that? These are the key questions and, as you pointed out in your section on benchmarks, a lot of AI benchmarks don't seem like they really measure this.
One potential way to measure the intelligence of AI vs. humans is the extent to which AI can displace human labour. You nicely covered this in your section on real-world adoption.
One study I can contribute to the discussion is this one I found about a customer support centre adopting LLMs. The outcome was mixed, with the study finding that LLMs harmed the productivity of the centre's most experienced employees.
I think this post does a great job of pointing out the ways in which the arguments for near-term artificial general intelligence (AGI) rely on hand-wavy reasoning and make simple, fundamental mistakes. The mistakes are so simple and so fundamental that I'm confused why people like Will MacAskill don't catch them; surely they can see these mistakes clearly themselves and don't need people like you or me to point them out.
[1] Here's one estimate I found after a quick Google, but I don't know if it's accurate.
I don't understand what you're trying to say here. By "the trend", do you mean Nvidia's revenue growth? And what do you mean by "have diverted normal compute expenditures into the AI boom"?
I know very little about the context of the disagreements between Oliver Habryka and Dustin Moskovitz. I've read one of their back-and-forths in comments on the EA Forum and it was almost impossible to follow what they were talking about, partly due to both of their writing styles, but also probably due to there being a lot of context and background they weren't trying to explain to people like me who weren't already in the know.
I think Dustin may also have purposely been trying to be a bit vague because he was sick of being criticized by people on the EA Forum and felt that the more he said, the more he would be criticized (he made a comment to that effect).
So, I really don't know all the details here and could be getting this all wrong. This is just my impression of things, knowing as little as I do right now.
One of the things Oliver has done a lot in his comments on the EA Forum that has bothered me is to shift a debate about what the right thing to do is on a specific topic (e.g., should EA buy a castle, should EA-related organizations invite people with extreme racist views to their conferences) into questioning the motives of people who disagree with him, accusing them of being too concerned with reputation rather than doing the right thing. Oliver seems to think he prioritizes doing the right thing over having a good reputation, but that other people do it the other way around.
For example, Oliver holds views and is willing to take actions that I would categorize as racist and that I find morally objectionable for that reason. I'm not nearly the first person to express this. But Oliver's response is not "some people disagree with me because they have different opinions about racism", it's more like "people pretend to disagree with me because they're scared about what people will think and aren't willing to speak the truth". (Just to be clear, these are not real quotes. I'm just paraphrasing what I understood from reading some of Oliver's comments.)
It's a lot less compelling, rhetorically, to say "me and Dustin disagree about what constitutes racism" than to say "Dustin is overly concerned about his personal reputation (and I'm not)". (Again, these are not real quotes.) But it's also dishonest and mean-spirited.
I think part of the reason some of the discussions about racism in EA get diverted into discussions about EA's reputation is that people are trying to leave a quick comment without getting dragged into an interminable and stressful debate about racism. LessWrong users have an inexhaustible capacity for getting into protracted, technical, and verbose forum debates. In general, people are averse to getting into debates about politics, race and racism, and social justice online. It's tempting to try to get around a 100,000-word debate on the definition of racism by saying "these kinds of words and actions will alienate many people from effective altruism and worsen our reputation".
Maybe that kind of response makes it seem like reputation is the primary concern. But it's not the primary concern. The primary concern is that racism is evil and the racist words and actions of Oliver, et al. are evil. And you don't want Oliver to write a 5,000-word comment, citing Astral Codex Ten seven times and LessWrong fourteen times, arguing that holding racist views is actually smart, which you're going to feel obligated to read and respond to. So, instead you'll just say "this kind of thing is really off-putting to many people, and damaging to our community". And yet Oliver still found a way to respond to this that is about equally as annoying as the thing you were hoping to avoid. He says, "Aha! You care about reputation! I care about truth!" (Again, just to be clear, this is a fake quote.)
Let me repeat the caveat that I get the sense that there's a whole lot of context and background to Dustin and Oliver's disagreements that I don't understand and I'm giving my impression of their disagreements despite this limited understanding. So, I could be getting Dustin's perspective wrong and I could be getting Oliver's perspective wrong.
But, with this limited understanding, my interpretation is that Dustin thinks that Lightcone Infrastructure's and the rationalist community's views and actions are racist and immoral and doesn't want to be morally responsible for funding or supporting racism, either directly or indirectly. That, I think, is his primary reason for cutting ties with Lightcone and the rationalist community, not reputation. Reputation is one thing he's considered, but it's not the only thing and I don't think it's the primary thing. The primary thing is that racism is evil.
Wow. This is my first time reading this post.
The last section, "Problems with the EA community/movement", was surprising. I was surprised that people in leadership positions at EA or EA-related organizations seemed to feel dismayed and irritated by the EA Forum and the online EA community for many of the same reasons I do.
I guess I feel relieved that they also see these problems, but, on the other hand, this survey is from 2019. I wonder how many respondents have since left EA because these things put them off too much, or because of other things, like the sexual harassment, the racism, etc. If they stayed in EA, I wonder why I don't get a sense of any leaders in the EA community/movement trying to address these problems.
We should promote AI safety ideas more than other EA ideas
AGI is probably a long time away. No one knows when AGI will be created. No one knows how to create AGI. AGI safety is such a vague, theoretical concept that there's essentially nothing you can do about it today or in the near future.
A comment from François Chollet on this topic posted on Bluesky on January 6, 2025:
I don't think people really appreciate how simple ARC-AGI-1 was, and what solving it really means.
It was designed as the simplest, most basic assessment of fluid intelligence possible. Failure to pass signifies a near-total inability to adapt or problem-solve in unfamiliar situations.
Passing it means your system exhibits non-zero fluid intelligence – you're finally looking at something that isn't pure memorized skill. But it says rather little about how intelligent your system is, or how close to human intelligence it is.
o3 gets 3% on ARC-AGI-2.
How much advance notice would be appropriate in an ordinary case?
I don't have a strong opinion on this, but I put my icon where I imagined 2 weeks would be. This is just an off-the-cuff stab at what a good rule of thumb might be.
More than 2 weeks feels like an onerous amount of time to wait to publish something.
2 weeks also seems like a reasonable amount of time for an organization to draft at least a short response. I don't think we should expect organizations to write a detailed, comprehensive response to every piece of criticism they receive, either immediately or ever. (How much of a response feels warranted depends on how harsh the criticism is and how convincing it comes across.)
But 2 weeks is plenty of time to write a short reply of a few sentences or a few paragraphs, which can do a lot to defuse criticism if it's convincing enough. For example, if you can point out a specific, provable error in the criticism that is actually important to the case it's making (i.e., not just nitpicking), that might be enough to defuse the criticism as much as you care to defuse it, or it might be enough to convince people to withhold judgment while you take time to write a longer response.
But as I said, this is just my attempt to come up with a good rule of thumb, and, as with the other question, the real answer is "it depends".
Giving meaningful advance notice of a post that is critical of an EA person or organization should be
I put my answer at the midway point between neutral on the question and 100% agreeing with "almost always done" because the answer is "it depends". It depends, for example, on how much money the organization being criticized has, how much criticism it is already getting, and how harsh your criticism is.
OpenAI's o3 model scores 3% on the ARC-AGI-2 benchmark, compared to 60% for the average human
"Truthseeking" is a strange piece of jargon. I'm not sure what purpose it serves. It seems like the meaning of "truthseeking" is ambiguous between "practicing good epistemology" and "being intellectually honest", as you describe. So, why not use one of those terms instead?
One thing that annoys me about the EA Forum (which I previously wrote about here) is that there's way too much EA Forum-specific jargon. One negative effect of this is that it makes it harder to understand what people are trying to say. Another negative effect is that it elevates a lot of interesting conjecture to the level of conventional wisdom. If you have some interesting idea in a blog post or a forum post, and then people are quick to incorporate that into the lingo, you've made that idea part of the culture, part of the conventional wisdom. And it seems like people do this too easily.
If you see someone using the term "truthseeking" on the EA Forum, then:
- There is no clear definition of this term anywhere that you can easily Google or search on the forum. There is a vague definition on the Effective Altruism Australia website. There is no entry for "truthseeking" in the EA Forum Wiki. The Wikipedia page for truth-seeking says, "Truth-seeking processes allow societies to examine and come to grips with past crimes and atrocities and prevent their future repetition. Truth-seeking often occurs in societies emerging from a period of prolonged conflict or authoritarian rule. The most famous example to date is the South African Truth and Reconciliation Commission, although many other examples also exist."
- To the extent EA Forum users even have a clear definition of this term in their heads, they may be bringing along their own quirky ideas about epistemology or intellectual honesty or whatever. And are those good ideas? Who knows? Probably some are and a lot aren't. Making "truthseeking" a fundamental value and then defining "truthseeking" in your own quirky way elevates something you read on an obscure blog last year to the level of an idea that has been scrutinized and debated by a diverse array of scholars across the world for decades and stood the test of time. That's a really silly, bad way to decide which ideas are true and which are false (or dubious, or promising, or a mixed bag, or whatever).
- Chances are the person is using it passive-aggressively, or with the implication that they're more truthseeking than someone else. I've never seen someone say, "I wasn't being truthseeking enough and changed my approach." This kinda makes it feel like the main purpose of the word is to be passive-aggressive and act superior.
So, is this jargon anything but a waste of time?
This post is about criticism of EA organizations, so it doesn't apply to OpenAI or the U.S. government.
I interpreted this post as mostly being about charities with a small number of employees and relatively small budgets that either actively associate themselves with EA or that fall into a cause area EA generally supports, such as animal welfare or global poverty.
For example, if you wanted to criticize 80,000 Hours, New Harvest, or one of these charities focusing on mental health in poor countries, then I'd say you should send them a copy of your criticism before publishing and give them a chance to prepare a reply before you post. These organizations are fairly small in terms of their staff, have relatively little funding, and aren't very well-known. So, I think it's fair to give them more of an opportunity to defend their work.
If you wanted to criticize Good Ventures, Open Philanthropy, GiveWell, GiveDirectly, or the Against Malaria Foundation, then I think you could send them a courtesy email if you wanted, but they have so much funding and (in the case of Open Philanthropy at least) a large staff. They're also already the subject of media attention and public discourse. With one of the smaller charities, you could plausibly hurt them with your post, so I think more caution is warranted. With these larger charities with more resources that are already getting debated and criticized a lot, an EA Forum post has a much lower chance of doing accidental harm.
I think you behaved inappropriately, as I and others explained in the comments on that post about the dubious "fraud" accusation. I completely understand why Sinergia said they don't want to engage with your criticism anymore.
Upvotes/downvotes are not a meaningless number in this context, but a sign of EA Forum users' opinion on whether you behaved appropriately and whether your claim that Sinergia committed fraud was true or misleading. You can see this in the comments on that post as well. It seems like there is, so far, unanimous agreement that you behaved inappropriately and that your claim was misleading or false.
I'm not sure if Vetted Causes is a salvageable project at this point. Its reputation is badly damaged. It might be best to put the project to an end and move on to something else.
Speaking for myself, I will never trust any evaluation that Vetted Causes ever publishes about any charity, and I would feel an obligation to warn people in this community that Vetted Causes is an untrustworthy and unreliable source for charity evaluations.
I'm not familiar with the context, but my comment might address this sort of situation.
I'm guessing this is probably a response to the post that unfairly accused a charity of fraud? (The post I'm thinking of currently has -60 karma, 0 agree votes, 6 disagree votes, and 4 top-level comments that are all critical.)
Some criticism might be friendly and constructive enough that giving the organization a chance to write a reply before publishing is not that important. Or if the organization is large, powerful, and has lots of money, like Open Philanthropy, and especially if your criticisms are of a more general or a more philosophical kind, it might not be important to send them a copy before you publish. This depends partly on how influential you are in EA and on how harsh your criticisms are.
Accusing a small charity of fraud is definitely something you should run by the charity beforehand. In that case, though, the charity was already so frustrated with the critic's poor-quality criticism that they had publicly stated (before the fraud accusation) that they didn't want to engage with it anymore.
This is the kind of idea that has a superficial sheen of plausibility but begins to look plainly absurd when you take time to reflect on the deep reasons our moral views are the way they are.
Here are a few reasons this argument doesn't make sense.
1. It's not an apples-to-apples comparison
The Nazis also didn't donate money to help with global poverty, so to make this comparison you have to count the deaths they could have averted by donating but didn't, on top of the people they actively killed.
2. Even if we are focused just on outcomes, intent still matters
Intent is important to outcome.
If you give money to one of GiveWell's top charities, like the Against Malaria Foundation, you are relying on the intent of the people who work there to produce a good outcome: more money leads to more people helped, such as more anti-malarial bednets deployed. If you somehow found out the people who work at a charity secretly have bad intent (like that they wanted to abscond with the money), you wouldn't donate because your prediction of the outcome would change. Your view that the charity is good would change.
Part of what made the Nazis bad was not just what they did (although that, of course, was among the worst things anyone's ever done, among the worst things imaginable) but also what they intended to do if they had won World War II and gained more power. The outcome would have been bad. One of the ways to assess someone's moral character is to ask what the outcome would be if they had a lot more money, power, or influence.
3. Actively killing is morally worse than passively letting die
Moral agency, moral responsibility, and moral luck are complex and vexing topics. But I will still say that actively killing someone, deliberately and violently, is morally worse than failing to save someone's life by spending money on personal consumption rather than donating to charity.
Even if you think of things in a pure, rigid consequentialist or utilitarian way, it makes sense to believe that directly, violently killing people is worse because of what I just said about the connection between intent and outcomes.
What kind of world would we live in if we elevated people prone to violently killing others to positions of wealth, power, and influence, rather than treating this as morally evil? The proclivity to actively kill people is strongly correlated with a failure to respond to global poverty humanely, and it's also strongly correlated with everything else bad.
4. Altruistic self-sacrifice is part of moral assessment
A factor that seems important in assessing the morality of an action or the moral character of a person is altruistic self-sacrifice.
In thought experiments (what the philosopher Daniel Dennett would call "intuition pumps") like Peter Singer's parable of the drowning child,[1] the amount of sacrifice the person is required to make to save a life is typically stipulated to be minimal. In reality, the amount of personal sacrifice required to save a life is probably greater: more like 3,000 to 5,000 USD rather than the cost of ruined shoes or clothes in Singer's hypothetical.
And these thought experiments don't directly address the question of (don't directly pump our intuitions about) the morality of a person's actions if they are faced with millions of drowning children and now face real trade-offs between living their own life normally and saving the children. In Singer's parable, the dilemma is small because the sacrifice is small. Extrapolating from that parable to an argument or an intuition that larger sacrifices should also be required may or may not be justified, but we need to consider how you get from the drowning child parable to that conclusion.
But more importantly to the topic at hand, how much self-sacrifice a person should be willing to endure in order to altruistically help others and whether a person has sacrificed enough is a different topic entirely than what the Nazis did. The Nazis went out of their way to kill people.
This point, #4, builds on points #2 and #3. The willingness to engage in altruistic self-sacrifice matters even if we're just focused on outcomes. Going out of your way to kill people seems much worse than failing to engage in "enough" altruistic self-sacrifice, and if someone insists on it, we can even justify that on consequentialist grounds.
[1] Someone somewhere once came up with a clever variation of this parable where you are holding your phone and have to drop it on the ground, smashing it, to catch a child falling off a building.