This is a sequence version of the Effective Altruism Foundation’s research agenda on Cooperation, Conflict, and Transformative Artificial Intelligence. The agenda outlines what we think are the most promising avenues for developing technical and governance interventions aimed at avoiding conflict between transformative AI systems. We draw on international relations, game theory, behavioral economics, machine learning, decision theory, and formal epistemology.
We appreciate all comments and questions. We’re also looking for people to work on the questions we outline. If you’re interested, or know people who might be, please get in touch at stefan.torges@ea-foundation.org.
[Link] EAF Research agenda: “Cooperation, Conflict, and Transformative Artificial Intelligence”
Preface to EAF’s Research Agenda on Cooperation, Conflict, and TAI
Sections 1 & 2: Introduction, Strategy and Governance
Sections 3 & 4: Credibility, Peaceful Bargaining Mechanisms
Sections 5 & 6: Contemporary Architectures, Humans in the Loop
Section 7: Foundations of Rational Agency
Acknowledgements & References