My expectation is that software without humans in the loop evaluating it will fall prey to Goodhart’s law and overfit to the metrics/measures it is given.