Michael St Jules 🔸 comments on DeepMind: Generally capable agents emerge from open-ended play

Michael St Jules 🔸 30 Jul 2021 1:54 UTC
2 points
0 ∶ 0
It seems like this could extend naturally to cooperative inverse reinforcement learning. Basically, the real world is a new game the AI has to play, and humans decide the reward subjectively (rather than with some explicit rule). The AI has developed some general competence beforehand by playing games, but it has to learn the new rules in the real world, which are not explicit.