In general, it seems quite fruitful to me to examine in more detail whether, in fact, multipolarity of various kinds might alleviate concerns about value fragility. And to those who have the intuition that it would (especially in cases like Multipolar value fragility, where agent A’s exact values aren’t had by any of agents 1-n), I’d be curious to hear the case spelled out in more detail.
Here’s a case that I roughly believe: multipolarity raises the likelihood that one’s own values will be represented, because it gives agents the opportunity to literally live in the world, and to act in it to bring about the outcomes they personally want.
This case is simple enough, and it’s consistent with the ordinary multipolarity the world already experiences. Consider an entirely selfish person. Now, divide the world into two groups: the selfish person (which we call Group A) and the rest of the world (which we call Group B).
Group A and Group B have very different values, even “upon reflection”. Group B is also millions or billions of times more powerful than Group A (as it comprises the entire world minus the selfish individual). Therefore, on a naive analysis, you might expect Group B to “take over the world” and then implement its values without any regard whatsoever to Group A. Indeed, because of the vast power differential, it would be “very easy” for Group B to achieve this world takeover. And such an outcome would indeed be very bad according to Group A’s values.
Of course, this naive analysis is flawed, because the real world is multipolar in an important respect: usually, Group B will let Group A (the individual) have some autonomy, and let them receive a tiny fraction of the world’s resources, rather than murdering Group A and taking all their stuff. They will do this because of laws, moral norms, and respect for one’s fellow humans. This multipolarity therefore sidesteps the issues with value fragility, and allows Group A to achieve a pretty good outcome according to their values.
This is also my primary hope with misaligned AI. Even if misaligned AIs are collectively millions or billions of times more powerful than humans (or aligned AIs), I would hope they would still allow us to have some autonomy, leave us alone, and let us receive a sufficient fraction of resources that we can enjoy an OK outcome, according to our values.
Speaking generally, it is true that humans are frequently hesitant to change the status quo, and economic shocks can be quite scary to people. This provides one reason to think that people will try to stop explosive growth, and slow down the rate of change.
On the other hand, it’s important to recognize the individual incentives involved here. On an individual, personal level, explosive growth is equivalent to a dramatic rise in real income over a short period of time. Suppose you were given the choice of increasing your current income several-fold over the next few years. For example, if your real income is currently $100,000/year, you would see it increase to $300,000/year within two years. Would you push back against this change? Would this rise in your personal income be too fast for your tastes? Would you try to slow it down?
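To make the example above concrete, here is a minimal sketch computing the annual growth rate implied by the dollar figures in the text (the figures are from the example; the calculation is just the standard compound-growth formula):

```python
# Implied compound annual growth rate (CAGR) for the income example:
# real income rising from $100,000/year to $300,000/year over two years.
start_income = 100_000
end_income = 300_000
years = 2

# CAGR: the constant annual rate that turns start_income into end_income.
cagr = (end_income / start_income) ** (1 / years) - 1
print(f"Implied annual growth rate: {cagr:.1%}")  # roughly 73% per year
```

That is, even this "modest" version of the scenario corresponds to personal income growth of roughly 73% per year, far above historical norms of a few percent.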
Even if explosive growth is dramatic and scary on a collective and abstract level, it is not clearly bad on an individual level. Indeed, it seems quite clear to me that most people would be perfectly happy to see their incomes rise dramatically, even at a rate far exceeding historical norms, unless they recognized a substantial and grave risk accompanying this rise in their personal income.
If we assume that people collectively follow what is in each of their individual interests, then we should conclude that incentives are pretty strongly in favor of explosive growth (at least when done with low risk), despite the fact that this change would be dramatic and large.