The optimizer’s curse is the phenomenon that causes an agent optimizing over a set set of options with uncertain value to systematically overestimate the value of the chosen option.
Illustration
The authors of the paper that originally introduced the optimizer’s curse provide a simple illustration of the phenomenon:[1]
suppose that we evaluate three alternatives that all have true values of exactly zero. The value of each alternative is estimated and the estimates are independent and normally distributed with mean equal to the true value of zero (they are conditionally unbiased) and a standard deviation of one. Selecting the highest value estimate then amounts to selecting the maximum of three draws from a standard normal distribution. The distribution of this maximal value estimate is easily determined by simulation or using results from order statistics and is displayed in Figure 1. The mean of this distribution is 0.85, so in this case, the expected disappointment,, is 0.85.
Further reading
Muehlhauser, Luke (2011) The optimizer’s curse and how to beat it, LessWrong, September 15.
Smith, James E. & Robert L. Winkler (2006) The optimizer’s curse: skepticism and postdecision surprise in decision analysis, Management Science, vol. 52, pp. 311–322.
Related entries
expected value | unilateralist’s curse
- ^
Smith, James E. & Robert L. Winkler (2006) The optimizer’s curse: skepticism and postdecision surprise in decision analysis, Management Science, vol. 52, pp. 312.
See also: Goodhart’s law
I’m also thinking of putting a potentially large bounty on solutions to it which work in practice or which help mitigate it. In particular, I’ve recently become more skeptical of being able to implement the canonical Bayesian adjustment in practice.
Notes: https://wikiless.org/wiki/Winner%27s_curse?lang=en
https://www.lesswrong.com/posts/5gQLrJr2yhPzMCcni/the-optimizer-s-curse-and-how-to-beat-it#tzmqXnWvTiL69H2sR