Any idea if these capabilities were made public or, for example, only used for private METR evals?
In the case of OpenDevin it seems like the grant is directly funding an open-source project that advances capabilities.
I'd like more transparency on this.