technicalities comments on Intrinsic limitations of GPT-4 and other large language models, and why I’m not (very) worried about GPT-n

technicalities 4 Jun 2023 12:11 UTC
3 points
1 ∶ 0
Nitpick: It’s fairly unlikely that GPT-4 is 1tn params; this size doesn’t seem compute-optimal. I grant you the Semafor assertion is some evidence, but I’m putting more weight on compute arithmetic.