Max Nadeau

Karma: 299

Max Nadeau Feb 28, 2025, 2:31 AM
4 points
0 ∶ 0
in reply to: ThaoOnEarth🔹’s comment on: Open Philanthropy Technical AI Safety RFP - $40M Available Across 21 Research Areas
Some common failure modes:
- Not reading the eligibility criteria
- Not clearly distinguishing your project from prior work on the topic you’re interested in
- Not demonstrating a good understanding of prior work (would be good to read some/all of the papers we link to in this doc for whatever section you’re applying within)
- Not demonstrating that you/your team has prior experience doing ML projects. If you don’t have such experience, then it’s good to work with/be mentored by someone who does.
“Research expenses” does not include stipends, but you can apply for a project grant, which does.
If you’re looking for money to spend on ML experiments or to pay people who are spending their time doing ML research, then that may fall within this RFP. If you’re looking for money to do other things (e.g. reading groups, events, etc), then that may fall under the capacity-building team’s RFPs.

Open Philanthropy Technical AI Safety RFP - $40M Available Across 21 Research Areas

Jake Mendel Feb 6, 2025, 6:59 PM

95 points

3 comments · 1 min read · EA link

(www.openphilanthropy.org)

Max Nadeau Nov 8, 2024, 1:42 AM
3 points
1 ∶ 3
in reply to: Austin’s comment on: Has your organisation lost funding due to the Good Ventures funding shift? Have you managed to replace it?
https://www.openphilanthropy.org/focus/global-aid-policy/

“Build right-of-center support for aid, such as Civita’s work to create and discuss development policy recommendations with conservative Norwegian lawmakers.”

Max Nadeau Jun 27, 2024, 9:51 PM
12 points
3 ∶ 0
on: Detecting Genetically Engineered Viruses With Metagenomic Sequencing
I love seeing posts from people making tangible progress towards preventing catastrophes—it’s very encouraging!
I know nothing about this area, so excuse me if my question doesn’t make sense or was addressed in your post. I’m curious what the returns are on spending more money on sequencing, e.g. running the machine more than once a week or running it on more samples. If we were spending $10M a year instead of $1.5M on sequencing, how much less than 0.2% of people would have to be infected before an alert was raised?
Some other questions:
- How should I feel about 0.2%? Where is 0.2% on the value spectrum between no alert system and an alert system that triggers on a single infection?
- How many people’s worth of wastewater can be tested with $1.5M of sequencing?
Thanks for the update; it was interesting even as a layperson.

Max Nadeau Jan 12, 2024, 8:18 PM
6 points
0 ∶ 0
on: I’m interviewing Vitalik Buterin about ‘my techno-optimism’, E/acc and D/acc. What should I ask him?
I’d love to hear his thoughts on defensive measures for “fuzzier” threats from advanced AI, e.g. manipulation, persuasion, “distortion of epistemics”, etc. Since it seems difficult to delineate when these sorts of harms are occurring (as opposed to benign forms of advertising/rhetoric/expression), it seems hard to construct defenses.
This is related to mechanisms for collective epistemics like prediction markets or community notes, which Vitalik praises here. But the harms from manipulation are broader, and could route through “superstimuli”, addictive platforms, etc. beyond just the spread of falsehoods. See manipulation section here for related thoughts.

Max Nadeau Oct 24, 2023, 6:27 PM
8 points
0 ∶ 0
on: AMA: Six Open Philanthropy staffers discuss OP’s new GCR hiring round
Disclaimer: I joined OP two weeks ago in the Program Associate role on the Technical AI Safety team. I’m leaving some comments describing questions I wanted to know to assess whether I should take the job (which, obviously, I ended up doing).
What sorts of personal/career development does the PA role provide? What are the pros and cons of this path over e.g. technical research (which has relatively clear professional development in the form of published papers, academic degrees, high-status job titles that bring public credibility)?

Max Nadeau Oct 24, 2023, 6:23 PM
1 point
0 ∶ 0
on: AMA: Six Open Philanthropy staffers discuss OP’s new GCR hiring round
Disclaimer: I joined OP two weeks ago in the Program Associate role on the Technical AI Safety team. I’m leaving some comments describing questions I wanted to know to assess whether I should take the job (which, obviously, I ended up doing).
How inclined are you/would the OP grantmaking strategy be towards technical research with theories of impact that aren’t “researcher discovers technique that makes the AI internally pursue human values” → “labs adopt this technique”? Some examples of other theories of change that technical research might have:
- Providing evidence for the dangerous capabilities of current/future models (should such capabilities emerge) that can more accurately inform countermeasures/policy/scaling decisions.
- Detecting/demonstrating emergent misalignment from normal training procedures. This evidence would also serve to more accurately inform countermeasures/policy/scaling decisions.
- Reducing the ease of malicious misuse of AIs by humans.
- Limiting the reach/capability of models instead of ensuring their alignment.

Max Nadeau Oct 24, 2023, 5:43 PM
1 point
0 ∶ 0
on: AMA: Six Open Philanthropy staffers discuss OP’s new GCR hiring round
Disclaimer: I joined OP two weeks ago in the Program Associate role on the Technical AI Safety team. I’m leaving some comments describing questions I wanted to know to assess whether I should take the job (which, obviously, I ended up doing).
How much do the roles on the TAIS team involve engagement with technical topics? How do the depth and breadth of “keeping up with” AI safety research compare to being an AI safety researcher?

Max Nadeau Oct 24, 2023, 1:37 AM
8 points
1 ∶ 0
on: AMA: Six Open Philanthropy staffers discuss OP’s new GCR hiring round
Disclaimer: I joined OP two weeks ago in the Program Associate role on the Technical AI Safety team. I’m leaving some comments describing questions I wanted to know to assess whether I should take the job (which, obviously, I ended up doing).
What does OP’s TAIS funding go to? Don’t professors’ salaries already get paid by their universities? Can (or can’t) PhD students in AI get no-strings-attached funding (at least, can PhD students at prestigious universities)?

Max Nadeau Oct 24, 2023, 1:36 AM
7 points
0 ∶ 0
on: AMA: Six Open Philanthropy staffers discuss OP’s new GCR hiring round
Disclaimer: I joined OP two weeks ago in the Program Associate role on the Technical AI Safety team. I’m leaving some comments describing questions I wanted to know to assess whether I should take the job (which, obviously, I ended up doing).
Is it way easier for researchers to do AI safety research within AI scaling labs (due to: more capable/diverse AI models, easier access to them (i.e. no rate limits/usage caps), better infra for running experiments, maybe some network effects from the other researchers at those labs, not having to deal with all the logistical hassle that comes from being a professor/independent researcher)?
Does this imply that the research ecosystem OP is funding (which is ~all external to these labs) isn’t that important/cutting-edge for AI safety?

Max Nadeau Sep 13, 2023, 9:59 PM
45 points
7 ∶ 0
on: Who should we interview for The 80,000 Hours Podcast?
Sampled from my areas of personal interest, and not intended to be at all thorough or comprehensive:

AI researchers (in no particular order):
- Prof. Jacob Steinhardt: author of multiple fascinating pieces on forecasting AI progress and contributor/research lead on numerous AI safety-relevant papers.
- Dan Hendrycks: director of the multi-faceted and hard-to-summarize research and field-building non-profit CAIS.
- Prof. Sam Bowman: has worked on many varieties of AI safety research at Anthropic and NYU
- Ethan Perez: researcher doing fascinating work to display and address misalignments in today’s AIs.
- Toby Shevlane: Model Evaluations for Extreme Risks
- Jess Whittlestone: head of AI policy at Center for Long-Term Resilience, much research here
- Plenty of others: Jade Leung (AI governance and evaluations at OpenAI), Prof. David Krueger (varied AI safety research), Prof. Percy Liang (evaluating models), Prof. Roger Grosse (influence functions for interpretability), many others listed here.
Economists who have written (esp. but not only deflationary arguments contra Davidson) on AI’s economic impact:
- Chad Jones (see here)
- Ben Jones (see e.g. this, but also all his research)
- Matt Clancy (see this debate, though an episode with him should also address his non-AI work!)
- Daron Acemoglu (see Power and Progress)
- Maybe other reviewers here?
Ethicists:
- Iason Gabriel: has worked on critiques of effective altruism, AI evaluations (extreme risks, representation), and normative questions related to AI alignment. This excellent FLI interview had so many ideas that would be great to explore in more depth.
- David Thorstad: has written critiques of existential risk reduction and longtermism.
- Emma Curran: author of contractualist reply to longtermism
The three I would personally be most excited to listen to: Toby Shevlane, Matt Clancy, Iason Gabriel.

Max Nadeau May 16, 2023, 8:49 PM
5 points
4 ∶ 0
in reply to: Habiba Banu’s comment on: Habiba’s Shortform
Best of luck with your new gig; excited to hear about it! Also, I really appreciate the honesty and specificity in this post.

Max Nadeau Nov 4, 2022, 2:10 AM
1 point
0 ∶ 0
in reply to: Eli Rose’s comment on: Apply to the Redwood Research Mechanistic Interpretability Experiment (REMIX), a research program in Berkeley
From the post: “We plan to have some researchers arrive early, with some people starting as soon as possible. The majority of researchers will likely participate during the months of December and/or January.”

Max Nadeau Oct 28, 2022, 5:18 PM
13 points
0 ∶ 0
on: What should I ask Ajeya Cotra — senior researcher at Open Philanthropy, and expert on AI timelines and safety challenges?
Artir Kel (aka José Luis Ricón Fernández de la Puente) at Nintil wrote an essay broadly sympathetic to AI risk scenarios but doubtful of a particular step in the power-seeking stories Cotra, Gwern, and others have told. In particular, he has a hard time believing that a scaled-up version of present systems (e.g. Gato) would learn facts about itself (e.g. that it is an AI in a training process, what its trainers’ motivations would be, etc) and incorporate those facts into its planning (Cotra calls this “situational awareness”). Some AI safety researchers I’ve spoken to personally agree with Kel’s skepticism on this point.
Since incorporating this sort of self-knowledge into one’s plans is necessary for breaking out of training, initiating deception, etc, this seems like a pretty important disagreement. In fact, Kel claims that if he came around on this point, he would agree almost entirely with Cotra’s analysis.
Can she describe in more detail what situational awareness means? Could it be demonstrated with current/near-term models? Why does she think that Kel (and others) think it’s so unlikely?

Max Nadeau Oct 27, 2022, 6:00 PM
3 points
0 ∶ 0
in reply to: Dan Valentine’s comment on: Apply to the Redwood Research Mechanistic Interpretability Experiment (REMIX), a research program in Berkeley
No concrete plans one way or the other.

Apply to the Redwood Research Mechanistic Interpretability Experiment (REMIX), a research program in Berkeley

Max Nadeau Oct 27, 2022, 1:39 AM

95 points

5 comments · 12 min read · EA link

Max Nadeau May 9, 2022, 2:32 AM
2 points
0 ∶ 0
in reply to: SeanEngelhart’s comment on: Apply to the second ML for Alignment Bootcamp (MLAB 2) in Berkeley [Aug 15 - Fri Sept 2]
It is possible but unlikely that such a person would be a TA. Someone with little prior ML experience would be a better fit as a participant.

Max Nadeau May 8, 2022, 12:27 AM
2 points
0 ∶ 0
in reply to: SeanEngelhart’s comment on: Apply to the second ML for Alignment Bootcamp (MLAB 2) in Berkeley [Aug 15 - Fri Sept 2]
We intended that sentence to be read as: “In addition to people who plan on doing technical alignment, MLAB can be valuable to other sorts of people (e.g. theoretical researchers)”.

Apply to the second ML for Alignment Bootcamp (MLAB 2) in Berkeley [Aug 15 - Fri Sept 2]

Buck May 6, 2022, 12:19 AM

111 points

7 comments · 6 min read · EA link
