Anthropic capture

Tag. Last edit: Jul 12, 2022, 12:12 AM by Pablo

Anthropic capture is a capability control method in which an advanced artificial intelligence assigns some probability to being in a simulation and therefore tries to behave in ways that its simulators would reward.
