Meta: I count 25 question marks in this “quick” poll, and many of the questions appear to be seriously confused. A proper response here would take many hours.
Take your scenario number 5, for instance. Is there any serious literature examining this? Are there any reasons why anyone would assign that scenario >epsilon probability? Do any decisions hinge on this?
Søren Elverlin
We should slow AI down.
Otherwise I expect AI will kill us.
>The mean estimate was [that bees suffer] around 15% as intensely as people.
To clarify: when comparing
A horrible prison keeping 10,000 humans
A beehive with 70,000 bees
does the estimate imply that some people consider the beehive the worse moral problem? This strongly contradicts my moral intuitions.
This seems to be of questionable effectiveness. Brief answers/challenges:
Evaluations are a key input to ineffective governance. The safety frameworks presented by the frontier labs are “safety-washing”, more appropriately considered roadmaps towards an unsurvivable future.
Disagreement on AI capabilities underpins performative disagreements on AI Risk. As far as I know, no substantial such disagreement has been published recently; I’d like sources for your claim, please.
We don’t need more situational awareness of what current frontier models can and cannot do in order to respond appropriately. No decision-relevant conclusions can be drawn from evaluations in the style of Cybench and Re-Bench.
I’m also practicing how to give good presentations and introductions to AI Safety. You can see my YouTube channel here:
You might also be interested in one of my older presentations, number 293, which is closer to what you are working on.
Feel free to book a half-hour chat about this topic with me on this link:
The provided source doesn’t show PauseAI affiliated people calling Sam Altman and Dario Amodei evil.
I do in fact believe that delaying AI by 5 years reduces existential risk by something like 10 percentage points.
Probably this thread isn’t the best place to hash it out, however.
Another org in the same space, made up of highly competent, experienced, and plugged-in people, would certainly be welcome, and could plausibly be more effective.
>PauseAI suffers from the same shortcomings most lobbying outfits do...
I’m confused about this section: yes, this kind of lobbying is hard, and the impact of a marginal dollar is very unclear. The acc side also has far more resources (probably; we should be wary of this becoming a Bravery Debate). This doesn’t feel like a criticism of PauseAI. Limited tractability is easily outweighed by a very high potential impact.
I strongly agree. Almost all of the criticism in this thread seems to start from assumptions about AI that are very far from those held by PauseAI. This thread really needs to be split up to factor that out.
As an example: If you don’t think shrimp can suffer, then that’s a strong argument against the Shrimp Welfare Project. However, that criticism doesn’t belong in the same thread as a discussion about whether the organization is effective, because the two subjects are so different.
Your link is broken—it looks like it’s been pasted twice.
>Without Delay: There is a 10% chance of catastrophe.
>With a Cautious Delay of 70 Years: The risk of catastrophe is reduced to 5%.
Could you post a link to anyone who has something like these probabilities? I would be quite surprised to learn that this was not an extremely niche set of assumptions.
It’s a long post, and it starts by talking about consciousness.
Does it contain any response to the classic case for AI Risk, e.g. Bostrom’s Superintelligence or Yudkowsky’s List of Lethalities?
Ajeya Cotra posted an essay on schlep in the context of AI 2 weeks ago:
https://www.planned-obsolescence.org/scale-schlep-and-systems/
I find that many of the topics she suggests as ‘schlep’ are actually very exciting and lots of fun to work on. This is plausibly why we see so much open source effort in the space of LLM-hacking.
What would you think of as examples of schlep in other EA areas?
The 2016 Caplan-Yudkowsky debate (https://www.econlib.org/archives/2016/03/so_far_my_respo.html) fizzled out, with Bryan not answering Eliezer’s last question. I’d like to know his answer.
The Budapest Memorandum provided security assurances, not security guarantees. And I believe this war has already caused enough damage to Russia that we can’t talk about Russia “getting away with” the invasion.
The destruction of the Russian military should be expected to make the world safer, primarily because it will prevent future Russian aggression.
The police are not bound by the “No drama” rule. If you steal money, you can expect the police to be “dramatic” about it.
A single data point: At a party at EAG, I met a developer who worked at Anthropic. I asked for his p(DOOM), and he said 50%. He told me he was working on AI capability.
I inquired politely about his views on AI safety, and he frankly did not seem to have given the subject much thought. I do not recall making any joke about “selling out”, but I may have asked what effect he thought his actions would have on X-risk. I don’t recall anyone listening, so this was probably not the situation OP is referring to.
I appreciate cultural works creating common knowledge that the AGI labs are behaving strongly unethically.
As for the specific scenario, point 17 seems to be contradicted by the orthogonality thesis / lack of moral realism.
=Confusion in What mildest scenario do you consider doom?=
My probability distribution looks like what you call the MIRI Torch, and what I call the MIRI Logo: Scenarios 3 to 9 aren’t well described in the literature because they are not stable equilibria. In the real world, once you are powerless, worthless, and an obstacle to those in power, you just end up dead.
=Confusion in Minimum P(doom) that is unacceptable to develop AGI?=
For non-extreme values, the concrete estimate and most of the considerations you mention are irrelevant. The question is morally isomorphic to “What percentage of the world’s population am I willing to kill in expectation?”. Answers such as “10^6 humans” and “10^9 humans” are both monstrous, even though your poll would rate them very differently.
These possible answers don’t become moral even if you think that it’s really positive that humans don’t have to work any longer. You aren’t allowed to do something worse than the Holocaust in expectation, even if you really really like space travel or immortality, or ending factory farming, or whatever. You aren’t allowed to unilaterally decide to roll the dice on omnicide even if you personally believe that global warming is an existential risk, or that it would be good to fill the universe with machines of your creation.
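The expected-value arithmetic behind this can be sketched in a few lines (assuming, purely for illustration, a world population of roughly 8 billion and treating “doom” as killing everyone):

```python
# Expected deaths implied by accepting a given p(doom), treating doom as
# the death of everyone. The ~8 billion population figure is an assumption.
WORLD_POPULATION = 8_000_000_000

def expected_deaths(p_doom: float) -> float:
    """Expected number of deaths from rolling the dice once at probability p_doom."""
    return p_doom * WORLD_POPULATION

# Even a seemingly tiny p(doom) of 0.0125% corresponds to a million
# expected deaths, and 12.5% corresponds to a billion.
print(round(expected_deaths(0.000125)))  # 1000000
print(round(expected_deaths(0.125)))     # 1000000000
```

The point of the sketch is that the poll’s scale (percentage points of doom) and the moral scale (expected human deaths) differ by a factor of the world population, which is why answers the poll rates very differently can all be monstrous.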