Depends on what assurance you need. If GPT-7 reliably provides true results in most/all settings you can find, that’s good evidence.
If GPT-7 is really Machiavellian, and is conspiring against you to make GPT-8, then it’s already too late for you, but it’s also a weird situation. If GPT-7 were seriously conspiring against you, I assume it wouldn’t need to wait until GPT-8 to take action.
How do you know it tells the truth or its best knowledge of the truth without solving the “eliciting latent knowledge” problem?
Depends on what assurance you need. If GPT-7 reliably provides true results in most/all settings you can find, that’s good evidence.
If GPT-7 is really Machiavellian, and is conspiring against you to make GPT-8, then it’s already too late for you, but it’s also a weird situation. If GPT-7 were seriously conspiring against you, I assume it wouldn’t need to wait until GPT-8 to take action.