Yes, that is a big limitation. Even more limiting is that it is only based on a subset of METR’s data on this. That’s enough to raise the question and illustrate what an answer might look like in data like this, but not to really answer it.
I’m not aware of others exploring this question, but I haven’t done much looking.
Yes, that is a big limitation. Even more limiting is that it is only based on a subset of METR’s data on this. That’s enough to raise the question and illustrate what an answer might look like in data like this, but not to really answer it.
I’m not aware of others exploring this question, but I haven’t done much looking.