skluug answers What “defense layers” should governments, AI labs, and businesses use to prevent catastrophic AI failures?

skluug 5 Dec 2021 16:26 UTC
4 points
Four layers come to mind for me:
- Have strong theoretical reasons to think your method of creating the system cannot result in something motivated to take dangerous actions
- Inspect the system thoroughly after creation but before deployment, to make sure it looks as expected and appears incapable of making dangerous decisions
- Deploy the system in an environment where it is physically incapable of doing anything dangerous
- Monitor the system's internals closely during deployment to ensure that it operates as expected and that no dangerous actions are attempted
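
To make the "layers" framing concrete, here is a rough Python sketch of the four checks above as independent gates, where a failure in any one of them is enough to block or halt the system. Everything in it is hypothetical: the `Layer` class, the `first_failed_layer` helper, and the boolean flags are illustrative stand-ins, since in practice each layer would need real evidence rather than a flag.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Layer:
    """One defense layer: a name plus a check that returns True when it passes."""
    name: str
    check: Callable[[Dict[str, bool]], bool]


def first_failed_layer(layers: List[Layer], system: Dict[str, bool]) -> Optional[str]:
    """Return the name of the first layer whose check fails, or None if all pass."""
    for layer in layers:
        if not layer.check(system):
            return layer.name
    return None


# Hypothetical flags stand in for the evidence each layer would actually require.
# A missing flag counts as a failure, so nothing passes by default.
LAYERS = [
    Layer("theory", lambda s: s.get("design_argument_holds", False)),
    Layer("inspection", lambda s: s.get("passed_predeployment_inspection", False)),
    Layer("containment", lambda s: s.get("environment_is_contained", False)),
    Layer("monitoring", lambda s: s.get("runtime_monitor_active", False)),
]

if __name__ == "__main__":
    candidate = {
        "design_argument_holds": True,
        "passed_predeployment_inspection": True,
        "runtime_monitor_active": True,
        # "environment_is_contained" is deliberately absent here.
    }
    failed = first_failed_layer(LAYERS, candidate)
    print("blocked by:", failed)
```

The point of structuring it this way is that the layers do not depend on each other: the system only proceeds if every layer passes on its own, so a mistake in one layer can still be caught by another.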