Also see Brian Christian briefly suggesting a cause allocation rule a bit like this towards the end of 80k’s interview with him.
We were discussing solutions to the explore-exploit problem, and one is to allocate resources to each option in proportion to your credence that it is the best one.
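To make the rule concrete, here is a minimal sketch of proportional allocation in Python. The function name, the cause labels, the credences, and the budget are all made up for illustration; nothing here comes from the interview itself, and it assumes you can already put a number on your credence that each option is best.

```python
def allocate_by_credence(budget, credences):
    """Split `budget` across options in proportion to the credence that each is best.

    `credences` maps option name -> credence that the option is the best one.
    The values are normalised, so they do not need to sum to exactly 1.
    """
    total = sum(credences.values())
    if total <= 0:
        raise ValueError("credences must sum to a positive number")
    return {option: budget * c / total for option, c in credences.items()}


# Hypothetical credences (assumed for illustration only).
credences = {"cause_a": 0.5, "cause_b": 0.3, "cause_c": 0.2}
allocation = allocate_by_credence(1000, credences)
print(allocation)
```

Under this rule an option you think is probably not the best still gets some resources, which is what keeps you exploring rather than putting everything into the current favourite.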