Non-EA interests include chess and TikTok (@benthamite). Formerly @ CEA, METR + a couple now-acquired startups.
Ben_West🔸
I also see this, to a lesser extent, in animal sentience arguments
+1, "it would be very easy for me to ignore the possibility that nematodes might be conscious" is a major impediment to thinking clearly about animal sentience (including for me).
I don't disagree, it's more that this feels a bit like privileging the hypothesis? I think the modal reason I've heard from people who did capabilities work and now regret it is something like "I knew I was misaligned with leadership but I thought leaving would be even worse."
If, for some reason, Anthropic asked me how to prevent people from regretting working for them, I would focus much more on "have a thing for people to do once they realize their colleague is corrupt" instead of "have a more nuanced way of telling if their colleague is corrupt."
Downvoted; I think this comment was unnecessarily rude.
Thanks! I only know a handful of people in this category, but for what it's worth, it again feels like people who were predisposed to thinking that working on pretraining would be okay rather than them being "corrupted."
E.g., I recently talked to someone who told me that their main takeaway from a safety fellowship was realizing that they didn't fit in because they actually weren't worried about existential risk in the same way that the other attendees were.
People seem surprised and bewildered when AI folks defect away from AI safety towards capabilities. People trust that as AI companies grow, those gaining power and money from shares will not be adversely influenced by that power and money.
fwiw I don't actually know many examples of this, and the ones I hear cited often seem uncompelling to me. E.g.:
- Greg Brockman doesn't seem like a true believer in OpenAI's nonprofit mission who got corrupted, but rather someone who went into it wanting to make a profit.
- Mechanize's founders don't seem like EAs who got corrupted by AI money, but rather EAs with unusual moral and empirical views which result in them thinking that the best course of action is the exact opposite of what most EAs think.
(Counterexamples appreciated, though!)
And credit to the AI skeptics that they seem to mostly have updated in light of the new evidence (or at least claimed that they never actually believed in long timelines, which is maybe less noble, but ends up in the same place).
Yeah, I agree that if you only have one bit of detail that you can store, then saying it is "hard" rather than "easy" is probably the correct bit. However, I would suggest that for something as important as your career you should investigate in substantially more detail. If you do so, I expect you will come up with a range of needed skills/attributes for these jobs, some of which you might find easy, others of which you might find hard.
I no longer work at METR. I would guess that they'd be excited about applicants who have done this, but don't want to speak for them.
Many people said they wanted to work for METR. I made what I thought was a good offer: take one of the benchmarks we give AIs; if you get a good score, then I guarantee that I will fly you out for an interview, even if you have no work history, no money to pay for the trip, or any other barrier one might have to employment.
Exactly zero people took me up on this.[1]
How is it possible for there to be sky-high rejection rates yet also zero people sending me applications?
I think the answer is that raw rejection rates aren't a very useful metric. After all, an 80% rejection rate means that the AI safety jobs are 1/10th as selective as Walmart!
I would suggest ignoring raw rejection rates in favor of just looking at the criteria for the jobs you want. Particularly for something like s-risks, the criteria are going to be unusual and specific, meaning that even generically qualified people will often have to dedicate substantial time to skilling up, but if you're able to do so, then your odds are pretty good.[2]
1. ^ I wouldn't be surprised to learn that some people tried this, failed, and then were too embarrassed about failing to tell me. But, to the best of my recollection, literally zero people have told me that they even attempted this task.
2. ^ I say this even with the knowledge that you are 19. I don't want to pretend that the deck isn't stacked against younger people (it totally is), but we employ some 19-year-olds, as do other AI safety orgs. If a 19-year-old had sent me a good solution to that METR challenge, for example, I would have been happy to hire them.
Cool! Impressive numbers.
Table 1 shows the techniques used; the teams which were allowed to use SAEs (an interpretability technique) used them; the one which was prohibited from using them searched the data.
Also note that "training data" does not mean "instructions". Section 3 describes their training process.
I see, thanks! I'm not sure exactly what you'd consider as evidence here, but e.g. here are citation counts on papers from the past year vs. AI Lab Watch safety rating[1]
1. ^ Raw data. Note that Anthropic doesn't use arXiv, which affects their citation counts. This is just coming from a dumb search of Semantic Scholar; I expect a lot of disagreement could be had over the exact criteria for considering something "interpretability", but I expect the Ant/GDM > OAI >> * ordering to be true for almost any definition.
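For anyone who wants to poke at this themselves, below is a rough sketch of the kind of "dumb search" I mean, using the public Semantic Scholar Graph API. The query strings and lab names here are illustrative guesses, not the exact criteria behind the numbers above.

```python
# Sketch of a crude Semantic Scholar search for recent "interpretability" papers,
# summing citation counts per lab. Illustrative only: keyword matching on the lab
# name is a rough proxy, not real affiliation data.
import requests

SEARCH_URL = "https://api.semanticscholar.org/graph/v1/paper/search"
LABS = ["Anthropic", "Google DeepMind", "OpenAI"]  # assumed lab names


def interpretability_citations(lab: str, year: str = "2024") -> int:
    """Sum citation counts of papers matching 'interpretability' plus a lab name."""
    resp = requests.get(
        SEARCH_URL,
        params={
            "query": f"interpretability {lab}",  # crude keyword match
            "year": year,
            "fields": "title,citationCount",
            "limit": 100,  # only the first page of results, for simplicity
        },
        timeout=30,
    )
    resp.raise_for_status()
    papers = resp.json().get("data", []) or []
    return sum(p.get("citationCount", 0) for p in papers)


if __name__ == "__main__":
    for lab in LABS:
        print(f"{lab}: {interpretability_citations(lab)} citations")
```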
I suspect that I'm still misunderstanding you, but: e.g. interpretability tools are empirically able to identify misalignment, which feels like a (somewhat simple) example of the thing we want. Neel Nanda's 80k podcast goes over the state of the field; tldr is roughly that there are pretty meaningful advances but also he's skeptical that it will be a silver bullet.
I agree with Ben Stewart that there's a galaxy-brain argument that these positive impacts are outweighed by accelerating progress, but it seems hard to argue that things like interpretability aren't making progress on their own terms.
Wiblin does not explain where his estimate of "hundreds of billions of dollars" of revenue comes from, but it reads to me like pure marketing for potential investors
You quote him as observing that their revenue tripled over the past 3 months, and some basic math tells us that another ~tripling gets them to $100B.
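Spelling out that basic math with the run-rate figures he gives below (my arithmetic, not a model he states explicitly):

$$\$9\text{B} \times 3.3 \approx \$30\text{B}, \qquad \$30\text{B} \times 3.3 \approx \$100\text{B}$$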
I'm in favor of rigor and would also have preferred him to share a more detailed model, but "pure marketing for potential investors" seems like an unfair characterization of a "predict trends will continue unchanged" forecast.
Edit: I've listened to the podcast and now think your framing is unfair to the point of being misleading. He says:
> And also keep in mind that on Monday (the day before Anthropic published all of this) we learned that their annualised revenue run rate had grown from $9 billion at the end of December to $30 billion just three months later. That's 3.3x growth in a single quarter, perhaps the fastest revenue growth rate for a company of that size ever recorded.
>
> That exploding revenue is a pretty good proxy for how much more useful the previous release, Opus 4.6, has become for real-world tasks. If the past relationship between capability measures and usefulness continues to hold, the economic impact of Mythos once it becomes available is going to dwarf everything that came before it, which is part of why Anthropic's decision not to release it is a serious one, and actually quite a costly one for them.
>
> They're sitting on something that would likely push their revenue run rate into the hundreds of billions, but they've decided it's simply not worth the risk.
He seems, to me, to be very straightforwardly explaining where his estimate comes from?
Hmm, but in a "success without dignity" world, making interpretability a bit better, or governments a bit more interested, is relevant, right?
Maybe, but "if EA had just stuck to Earning To Give and malaria nets and decaging chickens then the impact would have been greater" doesn't clearly follow. Malaria nets look a lot worse if we all die in a few years from AI anyway, and cage-free pledges have ~0 value if humanity ends before the pledge can be fulfilled.
Are you asking just about recent graduates, or all graduates?
Your conflict of interest here feels enormous (even if declared), and it's hard to read this and not feel like it might be a bid to directly protect your own interests by asking others to not step into your turf here as a lobbyist.
I think you could also read it as him attempting to solve the problem he's describing.
I would be keen to hear if you think you have any solutions to this bifurcation.
Huh, this feels like prime EA territory to me. We need disagreement so that people can engage in key EA activities like "making persnickety critiques of footnote #237 on someone's 10k-word forum post."
The case for EA feels much weaker to me if we are all confident that X is the best thing to do; then you should just do X and not worry about cause prio etc.
I think the AI ethics crowd is the subject of attacks (though arguably this is because they tried to seek power and influence).