Thanks for the post. I think many of these excerpted comments are missing the point. They say ‘GPT-5 is a product release and not a much larger, newer model’. Granted. But they don’t ask why it is not a much larger, newer model. My answer is that OpenAI has tried and does not yet have the ability to build anything much bigger and more capable relative to GPT-4, despite two years and untold billions of investment. What we are seeing is massive dimishing returns relative to investment. The fact that this is the best OpenAI can do to warrant the GPT-5 label after all this time and money warrants a significant update.
But they don’t ask why it is not a much larger, newer model. My answer is that OpenAI has tried and does not yet have the ability to build anything much bigger and more capable relative to GPT-4, despite two years and untold billions of investment.
I’m not sure this is true. Two key points are made in the Sam Hammond tweet:
OpenAI has made better models internally, they just haven’t been released.
There wasn’t a big increase in compute for GPT5 because this compute isn’t yet available. Big compute projects take time. Maybe we’re in a bit of a compute slump now, but it isn’t clear this will always be the case.
Thanks for the post.
I think many of these excerpted comments are missing the point. They say ‘GPT-5 is a product release and not a much larger, newer model’. Granted. But they don’t ask why it is not a much larger, newer model. My answer is that OpenAI has tried and does not yet have the ability to build anything much bigger and more capable relative to GPT-4, despite two years and untold billions of investment. What we are seeing is massive dimishing returns relative to investment. The fact that this is the best OpenAI can do to warrant the GPT-5 label after all this time and money warrants a significant update.
I’m not sure this is true. Two key points are made in the Sam Hammond tweet:
OpenAI has made better models internally, they just haven’t been released.
There wasn’t a big increase in compute for GPT5 because this compute isn’t yet available. Big compute projects take time. Maybe we’re in a bit of a compute slump now, but it isn’t clear this will always be the case.