As potentially relevant here, the differential particularly includes bipolar spectrum disorders, but also major depression, schizophrenia, attention-deficit/hyperactivity disorder, and posttraumatic stress disorder.
If one of the ways a person is acting unusually is holding grudges against people they once thought highly of (or against movements they were formerly a part of), I’d also consider NPD and pathological narcissism for the differential diagnosis (the latter has a vulnerable subtype that has some overlap with BPD but is a separate construct). I’m adding this to underscore your point that a specific diagnosis is difficult without a lot of context.
I also agree with not wanting to add to the stigma against people with personality disorders. A stigma means some commonly held association that is either wrong or unfairly negative. I think the risk with talking about diagnoses instead of specific symptoms is that this can unfairly harm the reputation of other people with the same diagnosis. BPD in particular has 9 symptom criteria, of which people only have to meet 5 in order to be diagnosed. So, you can have two people with BPD who share as few as 1 symptom out of 9.
Another way in which talk about personality disorders can be stigmatizing is if the implication or connotation is something like “this person is irredeemable.” To avoid this connotation (if we were to armchair-diagnose people at all), I would add caveats like “untreated” or “and they seem to lack insight.” Treatment success for BPD without comorbid narcissism is actually high, and for NPD it’s more difficult but I wouldn’t completely give up hope.
Edit: Overall, I should say that I still agree with the comments that sometimes it can make sense to highlight that a person’s destructive behavior makes up a pattern and is more unusual than what you see in conflicts between people without personality disorders. However, I don’t know if it is ever necessary for forum users to make confident claims about what specific type of Cluster B personality disorder (or other, related condition) someone may have. More generally, for the reasons I mentioned in the discussion around stigma, I would prefer if this subject were handled with more care than SuperDuperForecasting was giving it. I didn’t downvote their initial comment because I think something in the vicinity of what they said is an important hypothesis to put out there, but SuperDuperForecasting is IMO hurting their own cause/camp with the way they were talking about it.
No need to reply to my musings below, but this post prompted me to think about what different distinctions I see under “making things go well with powerful AI systems in a messy world.”
First of all, I like this framing! Since quite a lot of factors feed into making things go well in such a messy world, I also like highlighting “cooperative intelligence” as a subset of factors you maybe want to zoom in on with the specific research direction of Cooperative AI.
As you point out, a lot of what goes under “cooperative intelligence” sounds dual-use. For differential development to have a positive impact, we of course want to select aspects of it that robustly reduce risks of conflict (and escalation thereof). CLR’s research agenda lists rational crisis bargaining and surrogate goals/safe Pareto improvements. Those seem like promising candidates to me! I wonder at what level it would be best to intervene with the goal of instilling these skills and highlighting these strategies. Would it make sense to put together a “peaceful bargaining curriculum” for deliberate practice/training? (If so, should we add assumptions like availability of safe commitment devices to any of the training episodes?) Is it enough to just describe the strategies in a “bargaining manual?” Do they also intersect with an AI’s “values” and therefore have to be considered early on in training (e.g., when it comes to surrogate goals/safe Pareto improvements)? (I feel very uncertain about these questions.)
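To make the “peaceful bargaining curriculum” idea a bit more concrete, here is a minimal sketch of what scoring a single training episode against a standard benchmark could look like. All names here (`nash_bargaining_split`, `episode_score`) are hypothetical illustrations, not an existing curriculum or API; I’m using the Nash bargaining solution (maximize the product of gains over the disagreement point) only as one possible reference point, not claiming it is the right target.

```python
# Hypothetical sketch: score one "bargaining episode" against the Nash
# bargaining solution. Two agents split a surplus; failing to agree
# (conflict) leaves each with a disagreement payoff d1, d2.

def nash_bargaining_split(surplus: float, d1: float, d2: float, steps: int = 1000):
    """Return the split (x, surplus - x) maximizing (x - d1) * (surplus - x - d2)."""
    best_x, best_product = 0.0, float("-inf")
    for i in range(steps + 1):
        x = surplus * i / steps
        u1, u2 = x - d1, (surplus - x) - d2
        if u1 >= 0 and u2 >= 0 and u1 * u2 > best_product:
            best_product, best_x = u1 * u2, x
    return best_x, surplus - best_x

def episode_score(proposed_x: float, surplus: float, d1: float, d2: float) -> float:
    """Score a proposed split by its distance from the Nash solution (0 = ideal)."""
    ideal_x, _ = nash_bargaining_split(surplus, d1, d2)
    return abs(proposed_x - ideal_x)

# Symmetric disagreement payoffs -> the even split maximizes the Nash product.
print(nash_bargaining_split(10, 2, 2))  # (5.0, 5.0)
```

Note that this toy version bakes in the assumption that both sides’ payoffs and disagreement points are common knowledge; realistic “crisis bargaining” episodes would presumably have to drop that assumption, which is exactly where things get hard.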
I can think of more traits that can fit into, “What specific traits would I want to see in AIs, assuming they don’t all share the same values/goals?,” but many of the things I’m thinking of are “AI psychologies”/“AI character traits.” They arguably lie closer to “values” than (pure) “capabilities/intelligence,” so I’m not sure to what degree they aren’t already covered by alignment research. (But maybe Cooperative AI could be a call for alignment research to pay special attention to desiderata that matter in messy multi-agent scenarios.)
To elaborate on the connection to values, I think of “agent psychologies” as something that is in between (or “has components of both”) capabilities and values. On one side, there are “pure capabilities,” such as the ability to guess what other agents want, what they’re thinking, what their constraints are. Then, there are “pure values,” such as caring terminally about human well-being and/or the well-being (or goal achievement) of other AI agents. Somewhere in between, there are agent psychologies/character traits that arose because they were adaptive (in people it was during evolution, in AIs it would be during training) for a specific niche. These are “capabilities” in the sense that they allow the agent to excel at some skills beneficial in its niche. For instance, consider the cluster of skills around “being good at building trust” (in an environment composed of specific other agents). It’s a capability of sorts, but it’s also something that’s embodied, and it comes with tradeoffs. For comparison, in role-playing games, you often have only a limited number of character points to allocate to different character dimensions. Likewise, the AI that’s best-optimized for building trust probably cannot also be the one best at lying. (We can also speculate about training with interpretability tools and whether it has an effect on an agent’s honesty or propensity to self-deceive, etc.)
To give some example character traits that would contribute towards peaceful outcomes in messy multi-agent settings:
(I’m mostly thinking about human examples, but for many of these, I don’t see why “AI versions” of these traits wouldn’t also be helpful in AIs.)
Traits that predispose agents to steer away from unnecessary conflicts/escalation:
Having an aversion to violence, suffering, other “typical costs of conflict.”
‘Liking’ to see others succeed alongside you (without necessarily caring directly about their goal achievement).
A general inclination to be friendly/welcoming/cosmopolitan. A lack of spiteful or (needlessly) belligerent instincts.
Agents with these traits will have a comparatively stronger interest in re-framing real-world situations with prisoner’s-dilemma (PD) characteristics into different, more positive-sum terms.
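The “liking to see others succeed” trait from the list above can be made precise with a toy payoff transformation (my own illustration, not something from the post): if each agent’s effective utility adds a weight w times the other agent’s payoff, then for a large enough w, mutual cooperation in a standard PD stops being exploitable.

```python
# Toy illustration: a trait like "liking to see others succeed" modeled
# as each player's effective utility mixing in w times the other's payoff.
# With a big enough w, (C, C) becomes a Nash equilibrium of the PD.

# Standard PD payoffs (row player, column player), with T > R > P > S.
PD = {("C", "C"): (3, 3), ("C", "D"): (0, 5),
      ("D", "C"): (5, 0), ("D", "D"): (1, 1)}

def effective_payoffs(game, w):
    """Mix each player's payoff with the other player's, weighted by w."""
    return {acts: (p1 + w * p2, p2 + w * p1) for acts, (p1, p2) in game.items()}

def is_nash(game, a1, a2):
    """True if neither player gains by unilaterally deviating from (a1, a2)."""
    u1, u2 = game[(a1, a2)]
    actions = {"C", "D"}
    return all(game[(d, a2)][0] <= u1 for d in actions) and \
           all(game[(a1, d)][1] <= u2 for d in actions)

print(is_nash(PD, "C", "C"))                          # False: defection pays
print(is_nash(effective_payoffs(PD, 0.8), "C", "C"))  # True: (C, C) is stable
```

In this particular matrix the deviation incentive vanishes once 3 + 3w ≥ 5, i.e., w ≥ 2/3, which is why w = 0.8 suffices. Of course, this is re-framing by changing the agents, not the situation; the harder and more interesting versions involve restructuring the external payoffs.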
Traits around “being a good coalition partner” or “being good at building peaceful coalitions” (these have considerable overlap with the bullet points above):
Integrity, solid communication, honesty, charitable, not naive (i.e., is aware of deceptive or reckless agent phenotypes, is willing to dish out altruistic punishment if necessary), self-aware/low propensity to self-deceive, able to accurately see others’ perspectives, etc.
“Good social intuitions” about other agents in one’s environment:
In humans, there are also intuition-based skills like “being good at noticing when someone is lying” or “being good at noticing when someone is trustworthy.” Maybe there could be AI equivalents of these skills. That said, presumably AIs would learn these skills if they’re being trained in multi-agent environments that also contain deceptive and reckless AIs, which opens up the question: Is it a good idea to introduce such potentially dangerous agents solely for training purposes? (The answer might well be yes, but it obviously depends on the ways this can backfire.)
Lastly, there might be trust/cooperation-relevant procedures or technological interventions that become possible with future AIs, but cannot be done with humans:
Inspecting source code.
Putting AIs into sandbox settings to see/test what they would do in specific scenarios.
Interpretability, provided it makes sufficient advances. (In theory, neuroscience could make similar advances, but my guess is that mind-reading technology will arrive earlier in ML, if it arrives at all.)
…
To sum up, here are a couple of questions I’d focus on if I were working in this area:
To what degree (if any) does Cooperative AI want to focus on things that we can think of as “AI character traits?” If this should be a focus, how much conceptual overlap is there with alignment work in theory, and how much actual overlap is there with alignment work in practice as others are doing it at the moment?
For things that go under the heading of “learnable skills related to cooperative intelligence,” how much of it can we be confident is more likely good than bad? And what’s the best way to teach these skills to AI systems (or make them salient)?
How good or bad would it be if AI training regimes stay the way they are with current LLMs (solo training, with the AI scored by human evaluators) vs. become multi-agent or “league-based” (AIs competing with close copies, making training more analogous to human evolution)? If AI developers do go into multi-agent training despite its risks (such as the possibility for spiteful instincts to evolve), what are important things to get right?
Does it make sense to deliberately think about features of bargaining among AIs that will be different from bargaining among humans, and zoom in on studying those (or practicing with those)?
A lot of the things I pointed out are probably outside the scope of “Cooperative AI” the way you think about it, but I wasn’t sure where to draw the boundary, and I thought it could be helpful to collect my thoughts about this entire cluster of things in one place/comment.