Digression, but I would recommend reading about Thompson sampling :) (Wikipedia, inscrutable LessWrong post). It’s a good model to have for thinking about explore-exploit tradeoffs in general.
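For anyone who wants the gist before reading, here is a minimal sketch of Thompson sampling on a Bernoulli bandit. Everything in it is an assumption made for illustration: the function name `thompson_sampling`, the uniform Beta(1, 1) prior, and the made-up payout rates. The idea is to keep a posterior over each arm's payout rate, draw one sample from each posterior, and pull the arm whose sample is highest. Uncertain arms sometimes draw high and get explored, while arms that are clearly good win most draws and get exploited.

```python
import random


def thompson_sampling(true_rates, n_rounds=2000, seed=0):
    """Run Beta-Bernoulli Thompson sampling; return pull counts per arm.

    true_rates are hypothetical payout probabilities, hidden from the
    sampler and used only to simulate rewards.
    """
    rng = random.Random(seed)
    n_arms = len(true_rates)
    wins = [0] * n_arms    # observed successes per arm
    losses = [0] * n_arms  # observed failures per arm

    for _ in range(n_rounds):
        # Draw one plausible payout rate per arm from its posterior,
        # Beta(1 + wins, 1 + losses), i.e. a uniform prior plus the data.
        samples = [
            rng.betavariate(1 + wins[i], 1 + losses[i])
            for i in range(n_arms)
        ]
        # Act greedily with respect to the sampled rates.
        arm = max(range(n_arms), key=samples.__getitem__)

        # Simulate a reward and update that arm's counts.
        if rng.random() < true_rates[arm]:
            wins[arm] += 1
        else:
            losses[arm] += 1

    return [w + l for w, l in zip(wins, losses)]


if __name__ == "__main__":
    print(thompson_sampling([0.2, 0.5, 0.8]))
```

With payout rates this far apart, most pulls should end up on the last arm, with only a handful spent ruling out the others.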