Some work that seems relevant:
https://arxiv.org/abs/2006.04948
https://futureoflife.org/2020/09/15/andrew-critch-on-ai-research-considerations-for-human-existential-safety/
https://www.alignmentforum.org/posts/EzoCZjTdWTMgacKGS/clr-s-recent-work-on-multi-agent-systems
The Andrew Critch interview is so far exactly what I’m looking for.
Some work that seems relevant:
https://arxiv.org/abs/2006.04948
https://futureoflife.org/2020/09/15/andrew-critch-on-ai-research-considerations-for-human-existential-safety/
https://www.alignmentforum.org/posts/EzoCZjTdWTMgacKGS/clr-s-recent-work-on-multi-agent-systems
The Andrew Critch interview is so far exactly what I’m looking for.