MichaelDickens answers Is benchmarking AI capabilities positive EV?