The crux for me is that I don’t agree compute scaling has dramatically changed, because I don’t think pre-training scaling has seen much worse returns.