Interesting suggestion! Continuous or pseudo-continuous threshold raising isn’t something I considered. Here are some quick thoughts:
- Continuous scaling could make eval validity easier to maintain, because the jump between eval-train (n-1) and eval-deploy (n) is smaller.
- Continuous scaling encourages rushed training, because you want to get your model launched before it is outdated.
- Continuous scaling means giving up on models being evaluated side-by-side.