Red teaming

A red team is an independent group that challenges an organization or movement in order to help it improve. Red teaming is the practice of using red teams.

History of the term

The term “red teaming” appears to originate in the United States military. A common exercise was to pit an offensive “red team”, representing the enemy, against a defensive “blue team”, representing the U.S. The purpose of the exercise was to identify vulnerabilities and develop effective countermeasures.[1] The term was later extended to cover related practices in other fields, including information security and intelligence analysis.

Red teaming in effective altruism

Within effective altruism, “red teaming” refers to attempts to identify problems or errors in popular or prestigious views held by members of this community, such as views about the value of different causes or organizations.[2]

Related concepts include minimal-trust investigations,[3] epistemic spot-checks,[4] and hypothetical apostasy.[5]

Further reading

Räuker, Max et al. (2022) Idea: Red-teaming fellowships, Effective Altruism Forum, February 2.

Vaintrob, Lizka & Fin Moorhouse (2022) Resource for criticisms and red teaming, Effective Altruism Forum, June 1.

Zhang, Linchuan (2021) Red teaming papers as an EA training exercise?, Effective Altruism Forum, June 22.

Related entries

criticism of effective altruism | epistemology | epistemic deference | tabletop exercises

  1. ^ Johnson, Rowland (2015) How your red team penetration testers can help improve your blue team, SC Magazine, August 18.

  2. ^ Räuker, Max et al. (2022) Idea: Red-teaming fellowships, Effective Altruism Forum, February 2.

  3. ^ Karnofsky, Holden (2021) Minimal-trust investigations, Effective Altruism Forum, November 23.

  4. ^ Ravid, Yoav (2020) Epistemic spot check, LessWrong Wiki, August 7.

  5. ^ Bostrom, Nick (2009) Write your hypothetical apostasy, Overcoming Bias, February 21.

Minimal-trust investigations
Holden Karnofsky, 23 Nov 2021 18:02 UTC
159 points · 9 comments · 12 min read

Idea: Red-teaming fellowships
MaxRa, 2 Feb 2022 22:13 UTC
102 points · 12 comments · 3 min read

The motivated reasoning critique of effective altruism
Linch, 14 Sep 2021 20:43 UTC
285 points · 59 comments · 23 min read

Apply for Red Team Challenge [May 7 - June 4]
Cillian_, 18 Mar 2022 18:53 UTC
92 points · 11 comments · 2 min read

The Future Might Not Be So Great
Jacy, 30 Jun 2022 13:01 UTC
142 points · 118 comments · 34 min read
(www.sentienceinstitute.org)

$100 bounty for the best ideas to red team
Cillian_, 18 Mar 2022 18:54 UTC
48 points · 33 comments · 2 min read

Malaria vaccines: how confident are we?
Sanjay, 5 Jan 2024 0:59 UTC
62 points · 15 comments · 7 min read

[Question] What “pivotal” and useful research … would you like to see assessed? (Bounty for suggestions)
david_reinstein, 28 Apr 2022 15:49 UTC
37 points · 21 comments · 7 min read

Seeking Input for Applying Satellite Tech to Reduce Farmed Fish Suffering
haven, 10 Jul 2024 18:43 UTC
27 points · 4 comments · 3 min read

Winners of the EA Criticism and Red Teaming Contest
Lizka, 1 Oct 2022 1:50 UTC
226 points · 41 comments · 19 min read

Potatoes: A Critical Review
Pablo Villalobos, 10 May 2022 15:27 UTC
118 points · 27 comments · 9 min read
(docs.google.com)

Compilation of Profit for Good Redteaming and Responses
Brad West, 19 Sep 2023 13:33 UTC
41 points · 10 comments · 9 min read

A philosophical review of Open Philanthropy’s Cause Prioritisation Framework
MichaelPlant, 15 Jul 2022 8:21 UTC
160 points · 11 comments · 29 min read
(www.happierlivesinstitute.org)

A Critique of The Precipice: Chapter 6 - The Risk Landscape [Red Team Challenge]
Sarah Weiler, 26 Jun 2022 10:59 UTC
57 points · 2 comments · 21 min read

Issues with Futarchy
Lizka, 7 Oct 2021 17:24 UTC
63 points · 8 comments · 25 min read

Reviews of “Is power-seeking AI an existential risk?”
Joe_Carlsmith, 16 Dec 2021 20:50 UTC
71 points · 4 comments · 1 min read

Questioning the Value of Extinction Risk Reduction
Red Team 8, 7 Jul 2022 4:44 UTC
61 points · 9 comments · 27 min read

A Red-Team Against the Impact of Small Donations
AppliedDivinityStudies, 24 Nov 2021 16:03 UTC
182 points · 53 comments · 8 min read

Critique of OpenPhil’s macroeconomic policy advocacy
Hauke Hillebrandt, 24 Mar 2022 22:03 UTC
142 points · 39 comments · 24 min read

A dozen doubts about GiveWell’s numbers
JoelMcGuire, 1 Nov 2022 2:25 UTC
134 points · 18 comments · 19 min read
(www.happierlivesinstitute.org)

Independent impressions
MichaelA, 26 Sep 2021 18:43 UTC
152 points · 7 comments · 1 min read

Announcing a contest: EA Criticism and Red Teaming
Lizka, 1 Jun 2022 18:58 UTC
276 points · 64 comments · 13 min read

The Long Reflection as the Great Stagnation
Larks, 1 Sep 2022 20:55 UTC
43 points · 2 comments · 8 min read

Disagreeables and Assessors: Two Intellectual Archetypes
Ozzie Gooen, 5 Nov 2021 9:01 UTC
91 points · 20 comments · 3 min read

Resource for criticisms and red teaming
Lizka, 1 Jun 2022 18:58 UTC
60 points · 3 comments · 8 min read

A review of Our Final Warning: Six Degrees of Climate Emergency by Mark Lynas
John G. Halstead, 15 Apr 2022 13:43 UTC
176 points · 6 comments · 13 min read

Why Effective Altruists Should Put a Higher Priority on Funding Academic Research
Stuart Buck, 25 Jun 2022 19:17 UTC
118 points · 15 comments · 11 min read

Flimsy Pet Theories, Enormous Initiatives
Ozzie Gooen, 9 Dec 2021 15:10 UTC
212 points · 57 comments · 4 min read

Pre-announcing a contest for critiques and red teaming
Lizka, 25 Mar 2022 11:52 UTC
173 points · 27 comments · 2 min read

Red Teaming CEA’s Community Building Work
AnonymousEAForumAccount, 1 Sep 2022 14:42 UTC
296 points · 68 comments · 66 min read

A concern about the “evolutionary anchor” of Ajeya Cotra’s report on AI timelines.
NunoSempere, 16 Aug 2022 14:44 UTC
75 points · 40 comments · 5 min read
(nunosempere.com)

Nuclear Expert Comment on Samotsvety Nuclear Risk Forecast
Jhrosenberg, 26 Mar 2022 9:22 UTC
129 points · 13 comments · 18 min read

Can a war cause human extinction? Once again, not on priors
Vasco Grilo🔸, 25 Jan 2024 7:56 UTC
67 points · 29 comments · 18 min read

Concerns/Thoughts over international aid, longtermism and philosophical notes on speaking with Larry Temkin.
Ben Yeoh, 27 Jul 2022 19:51 UTC
35 points · 1 comment · 12 min read

The Importance of Intercausal Impacts
Sebastian Joy 樂百善, 24 Aug 2022 10:41 UTC
61 points · 2 comments · 8 min read

Against Longtermism: I welcome our robot overlords, and you should too!
MattBall, 2 Jul 2022 2:05 UTC
5 points · 6 comments · 6 min read

The Windfall Clause has a remedies problem
John Bridge 🔸, 23 May 2022 10:31 UTC
40 points · 0 comments · 17 min read

Red teaming a model for estimating the value of longtermist interventions—A critique of Tarsney’s “The Epistemic Challenge to Longtermism”
Anjay F, 16 Jul 2022 19:05 UTC
21 points · 0 comments · 30 min read

Red-teaming Holden Karnofsky’s AI timelines
Vasco Grilo🔸, 25 Jun 2022 14:24 UTC
58 points · 2 comments · 11 min read

The role of academia in AI Safety.
PabloAMC 🔸, 28 Mar 2022 0:04 UTC
71 points · 19 comments · 3 min read

Belonging
Barracuda, 10 Jun 2022 7:27 UTC
2 points · 0 comments · 1 min read

Red-teaming existential risk from AI
Zed Tarar, 30 Nov 2023 14:35 UTC
30 points · 16 comments · 6 min read

Opinioni indipendenti (Italian for “Independent opinions”)
EA Italy, 18 Jan 2023 11:21 UTC
1 point · 0 comments · 1 min read

Guiding civil servants to Improve Institutional Decision-Making through an ‘Impact Challenge’
Iftekhar, 1 Jul 2022 13:37 UTC
19 points · 0 comments · 8 min read

[Question] Prizes for EA Red Teaming
Daniel Birnbaum, 22 Jul 2024 21:13 UTC
12 points · 0 comments · 1 min read

#176 – The final push for AGI, understanding OpenAI’s leadership drama, and red-teaming frontier models (Nathan Labenz on the 80,000 Hours Podcast)
80000_Hours, 4 Jan 2024 16:00 UTC
15 points · 0 comments · 22 min read

Effective Altruism Risks Perpetuating a Harmful Worldview
Theo Cox, 20 Aug 2022 1:10 UTC
−4 points · 9 comments · 20 min read

Against AI As An Existential Risk
Daniel Birnbaum, 30 Jul 2024 19:24 UTC
6 points · 3 comments · 1 min read
(irrationalitycommunity.substack.com)