I see, thanks! I'm not sure exactly what you'd consider as evidence here, but e.g. here's citation count on papers from the past year vs. AI Lab Watch safety rating[1]
[1] Raw data. Note that Anthropic doesn't use arXiv, which affects their citation counts. This is just coming from a dumb search of Semantic Scholar; I expect a lot of disagreement could be had over the exact criteria for considering something "interpretability," but I expect the Anthropic/GDM > OAI >> * ordering to be true for almost any definition.
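For concreteness, here's a rough sketch of the kind of search I mean (illustrative only: the query string, year filter, and keyword-based lab matching below are stand-ins, not the exact criteria behind the chart):

```python
# Illustrative sketch only: count recent "interpretability" papers and citations
# per lab via the Semantic Scholar Graph API. The naive keyword query is an
# assumption, not the actual methodology behind the numbers above.
import requests

LABS = ["Anthropic", "Google DeepMind", "OpenAI"]

def interpretability_stats(lab: str) -> tuple[int, int]:
    """Return (paper_count, total_citations) for one lab from a naive keyword search."""
    resp = requests.get(
        "https://api.semanticscholar.org/graph/v1/paper/search",
        params={
            "query": f"interpretability {lab}",  # crude match; real affiliation data would be better
            "year": "2024-2025",                 # roughly "the past year"
            "fields": "title,citationCount",
            "limit": 100,
        },
        timeout=30,
    )
    resp.raise_for_status()
    papers = resp.json().get("data") or []
    return len(papers), sum((p.get("citationCount") or 0) for p in papers)

for lab in LABS:
    n_papers, citations = interpretability_stats(lab)
    print(f"{lab}: {n_papers} papers, {citations} citations")
```

Any serious version would want real affiliation data and a better definition of "interpretability," which is exactly where I'd expect the disagreement to be.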
I suspect that I'm still misunderstanding you, but: e.g. interpretability tools are empirically able to identify misalignment, which feels like (a somewhat simple example of) the thing we want. Neel Nanda's 80k podcast goes over the state of the field; the tl;dr is roughly that there are pretty meaningful advances, but also he's skeptical that it will be a silver bullet.
I agree with Ben Stewart that there's a galaxy-brain argument that these positive impacts are outweighed by accelerating progress, but it seems hard to argue that things like interpretability aren't making progress on their own terms.
Wiblin does not explain where his estimate of "hundreds of billions of dollars" of revenue comes from, but it reads to me like pure marketing for potential investors.
You quote him as observing that their revenue tripled over the past 3 months, and some basic math tells us that another ~tripling gets them to ~$100B (3 × $30B).
I'm in favor of rigor and would also have preferred him to share a more detailed model, but "pure marketing for potential investors" seems like an unfair characterization of a "predict trends will continue unchanged" forecast.
Edit: I've listened to the podcast and now think your framing is unfair to the point of being misleading. He says:
And also keep in mind that on Monday, the day before Anthropic published all of this, we learned that their annualised revenue run rate had grown from $9 billion at the end of December to $30 billion just three months later. That's 3.3x growth in a single quarter, perhaps the fastest revenue growth rate for a company of that size ever recorded.
That exploding revenue is a pretty good proxy for how much more useful the previous release, Opus 4.6, has become for real-world tasks. If the past relationship between capability measures and usefulness continues to hold, the economic impact of Mythos once it becomes available is going to dwarf everything that came before it, which is part of why Anthropic's decision not to release it is a serious one, and actually quite a costly one for them.
They're sitting on something that would likely push their revenue run rate into the hundreds of billions, but they've decided it's simply not worth the risk.
He seems to me to be very straightforwardly explaining where his estimate comes from?
Hmm, but in a "success without dignity" world, making interpretability a bit better, or governments a bit more interested, is relevant, right?
Maybe, but "if EA had just stuck to Earning To Give and malaria nets and decaging chickens then the impact would have been greater" doesn't clearly follow. Malaria nets look a lot worse if we all die in a few years from AI anyway, and cage-free pledges have ~0 value if humanity ends before the pledge can be fulfilled.
Are you asking just about recent graduates, or all graduates?
Your conflict of interest here feels enormous (even if declared), and it's hard to read this and not feel like it might be a bid to directly protect your own interests by asking others not to step onto your turf as a lobbyist.
I think you could also read it as him attempting to solve the problem he's describing.
I would be keen to hear if you think you have any solutions to this bifurcation.
Huh, this feels like prime EA territory to me. We need disagreement so that people can engage in key EA activities like "making persnickety critiques of footnote #237 on someone's 10k-word forum post."
The case for EA feels much weaker to me if we are all confident that X is the best thing to do; then you should just do X and not worry about cause prio etc.
I'm sorry you had to go through this, Fran.
Congrats to everyone who worked on this!
Thanks for doing this. This question is marked as required, but I think it should either be optional or have a "none" option:
To decompose your question into several sub-questions:
Should you defer to price signals for cause prioritization?
My rough sense is that price signals are about as good as the 80th percentile EA's cause prio, ranked by how much time they've spent thinking about cause prioritization.
(This is mostly because most EAs do not think about cause prio very much. I think you could outperform by spending ~1 week thinking about it, for example.)
Should you defer to price signals for choosing between organizations within a given cause?
This mostly seems decent to me. For example, CG struggled to find organizations better than GiveWell's top charities for near-termist, human-centric work.
Notable exceptions here are work that people don't want to fund for non-effectiveness reasons, like politics or adversarial campaigning.
Should you defer to price signals for choosing between roles within an organization?
Yes, I mostly trust organizations to price appropriately, although I also think you can just ask the hiring manager.
Thanks, that's helpful. Do you have a sense of where we are on the current S-curve? E.g., if capabilities continue to progress in a straight line through the end of this year, is that evidence that we have found a new S-curve to stack on the current one?
the strength of this tail-wind that has driven much of AI progress since 2020 will halve
I feel confused about this point because I thought the argument you were making implies a non-constant "tailwind." E.g., for the next generation these factors will be 1/2 as important as before, then the one after that 1/4, and so on. Am I wrong?
Interesting ideas! For Guardian Angels, you say "it would probably be at least a major software project". Maybe we are imagining different things, but I feel like I have this already.
E.g. I don't need a "heated-email guard plugin" which catches me in the middle of writing a heated email and redirects me, because I don't write my own emails anyway. I would just ask an LLM to write the email, and 1) it's unlikely that the LLM would say something heated, and 2) for the kinds of mistakes that LLMs might make, it's easy enough to put something in the agents.md to ask it to check for these things before finalizing the draft.
(I think software engineering might be ahead of the curve here, where a bunch of tools have explicit guardian angels. E.g. when you tell the LLM "build feature X", what actually happens is that agent 1 writes the code, then agent 2 reviews it for bugs, agent 3 reviews it for security vulns, etc.)
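A minimal sketch of that pattern (`call_llm` is a placeholder for whatever model API a given tool uses; this is not how any particular product actually implements it):

```python
# Sketch of a "guardian angel" pipeline: one agent drafts, others review, the
# drafter revises. Purely illustrative; call_llm is a stand-in, not a real API.
def call_llm(system: str, prompt: str) -> str:
    """Placeholder for a real chat-completion call."""
    raise NotImplementedError

def build_feature(spec: str) -> str:
    draft = call_llm("You are a software engineer.", f"Implement this feature:\n{spec}")
    bug_review = call_llm("You review code for bugs.", draft)
    security_review = call_llm("You review code for security vulnerabilities.", draft)
    # The reviews act as the guardian angels: the drafter must address them before finishing.
    return call_llm(
        "You are a software engineer.",
        f"Revise the code to address the reviews.\n\nCode:\n{draft}\n\n"
        f"Bug review:\n{bug_review}\n\nSecurity review:\n{security_review}",
    )
```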
Related, from an OAI researcher.
The AI Eval Singularity is Near
- AI capabilities seem to be doubling every 4-7 months
- Humanity's ability to measure capabilities is growing much more slowly
- This implies an "eval singularity": a point at which capabilities grow faster than our ability to measure them (toy illustration after this list)
- It seems like the singularity is ~here in cybersecurity, CBRN, and AI R&D (supporting quotes below)
- It's possible that this is temporary, but the people involved seem pretty worried
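A toy calculation of that dynamic (both doubling times are assumptions for illustration: ~5 months is the middle of the capability range above, and the 18-month figure for eval coverage is made up):

```python
# Toy illustration: if capabilities double every ~5 months but our ability to
# measure them doubles much more slowly, the gap compounds within a year or two.
capability_doubling_months = 5    # middle of the 4-7 month range above
eval_doubling_months = 18         # made-up slower rate, for illustration only

capability = eval_coverage = 1.0
for month in range(0, 25, 6):
    print(f"month {month:2d}: capability x{capability:5.1f}, eval coverage x{eval_coverage:4.1f}")
    capability *= 2 ** (6 / capability_doubling_months)
    eval_coverage *= 2 ** (6 / eval_doubling_months)
```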
Appendix: quotes on eval saturation
"For AI R&D capabilities, we found that Claude Opus 4.6 has saturated most of our automated evaluations, meaning they no longer provide useful evidence for ruling out ASL-4 level autonomy. We report them for completeness, and we will likely discontinue them going forward. Our determination rests primarily on an internal survey of Anthropic staff, in which 0 of 16 participants believed the model could be made into a drop-in replacement for an entry-level researcher with scaffolding and tooling improvements within three months."

"For ASL-4 evaluations [of CBRN], our automated benchmarks are now largely saturated and no longer provide meaningful signal for rule-out (though as stated above, this is not indicative of harm; it simply means we can no longer rule out certain capabilities that may be pre-requisites to a model having ASL-4 capabilities)."
It also saturated ~100% of the cyber evaluations
"We are treating this model as High [for cybersecurity], even though we cannot be certain that it actually has these capabilities, because it meets the requirements of each of our canary thresholds and we therefore cannot rule out the possibility that it is in fact Cyber High."
Table 1 shows the techniques used; the teams which were allowed to use SAEs (an interpretability technique) used them; the one which was prohibited from using them searched the data.
Also note that "training data" does not mean "instructions". Section 3 describes their training process.