How does Rationalist Community Attention/Consensus compare? I’d like to mention a paper of mine published at the top AI theory conference which proves that when a certain parameter of a certain agent is set sufficiently high, the agent will not aim to kill everyone, while still achieving at least human-level intelligence. This follows from Corollary 14 and Corollary 6. I am quite sure most AI safety researchers would have confidently predicted that no such theorems would ever appear in the academic literature. And yet there are no traces of any minds being blown. The associated Alignment Forum post only has 22 upvotes and one comment, and I bet you’ve never heard any of your EA friends discuss it. It hasn’t appeared, to my knowledge, in any AI safety syllabuses. People don’t seem to bother investigating or discussing whether their concerns with the proposal are surmountable. I’m reluctant to bring up this example since it has the air of a personal grievance, but I think the disinterest from the Rationality Community is enough of an error that it calls for an autopsy. (To be clear, I’m not saying everyone should be hailing this as an answer to AI existential risk, only that it should definitely be of significant interest.)
I’m someone who has read your work (this paper and FGOIL, the latter of which I have included in a syllabus), and who would like to see more work in a similar vein, as well as more formalism in AI safety. I say this to establish my bona fides, the way you established your AI safety bona fides.
I don’t think this paper is mind-blowing, and I would call it representative of one of the ways in which tailoring theoretical work for the peer-review process can go wrong. In particular, you don’t show that “when a certain parameter of a certain agent is set sufficiently high, the agent will not aim to kill everyone”; you show something more like “when you can design and implement an agent that acts and updates its beliefs in a certain way, can restrict its initial beliefs to a set containing the desired ones, and can incorporate into the process a human who has access to the ground truth of the universe, then you can set a parameter high enough that the agent will not aim to kill everyone” [edit: Michael disputes this last point, see his comment below and my response], which is not at all the same thing. The standard academic failure mode is to make a number of assumptions for tractability that severely lower the relevance of the results (and the more pernicious failure mode is to hide those assumptions).
You’d be right if you said that most AI safety people did not read the paper and come to that conclusion themselves, and even if you said that most weren’t even aware of it. Very few in the community have the relevant background for it (and I would like to see a shift in that direction), especially the newcomers who are the targets of syllabi. All that said, I’m confident that you got enough qualified eyes on it that, if you had shown what you said in your summary, it would have had an impact similar in scale to what you think is appropriate.
This comment is somewhat of a digression from the main post, but I am concerned that if someone took your comments about the paper at face value, they would come away with an overly negative perception of how the AI safety community engages with academic work.
I thought the paper itself was poorly argued, largely as a function of biting off too much at once. Several times the case against the TUA was not actually argued but merely asserted to exist, along with one or two citations that are hard to evaluate as representative of any consensus. Then, while I thought the original description of the TUA was accurate, the TUA response to criticisms was entirely ignored. Statements like “it is unclear why a precise slowing and speeding up of different technologies...across the world is more feasible or effective than the simpler approach of outright bans and moratoriums” were egregious, and made it seem like you did not do your research. You spoke to 20+ reviewers, half of whom were sought out to disagree with you, and not a single one could make the case for differential technological development? Not a single mention of the difficulty of incorporating future generations into the democratic process?
Ultimately, I think the paper would have been better served by focusing on a single section and leaving the rest to future work. The style of asserting rather than arguing, and of skipping over potential responses, comes across as more polemical than evidence-seeking. I believe that was the major contributor to the blowback you have received.
I agree that more diversity in funders would be beneficial. It is harmful to all researchers if access to future funding depends on the results of their work. Overall, though, the actual extent of the blowback is unclear from your post. What does “tried to prevent the paper being published” mean? Is the threat of withdrawn funding real or imagined? Were the authors whose work was criticized angry, and did they take any action to retaliate?
Finally, I would like to abstract away from this specific paper. If criticism of the dominant paradigm limits future funding and career opportunities, that is a sign of terrible epistemics in a field. However, if poor criticism of the dominant paradigm limits future funding and career opportunities, that is completely valid. The one line you wrote that I think all EAs would agree with is “This is not a game. Fucking it up could end really badly”. If wrong arguments would cause harm when believed, it is not only the right but the responsibility of funders to reduce their reach. Of course, the difficulty lies in distinguishing criticisms that are wrong from criticisms that merely run against the current paradigm, while evaluating them from within that paradigm. The responsibility of the researcher is to make their case as bulletproof as possible and to design it to convince believers in the current paradigm. Otherwise, even if their claims are correct, they won’t make an impact. The “Effective” part of EA includes making the right arguments to convince the right people, rather than the arguments that are cathartic to unleash.